WO2004002162A1 - Method for generating hashes from a compressed multimedia content - Google Patents
Method for generating hashes from a compressed multimedia content Download PDFInfo
- Publication number
- WO2004002162A1 WO2004002162A1 PCT/IB2003/002625 IB0302625W WO2004002162A1 WO 2004002162 A1 WO2004002162 A1 WO 2004002162A1 IB 0302625 W IB0302625 W IB 0302625W WO 2004002162 A1 WO2004002162 A1 WO 2004002162A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- bit
- stream
- hash
- predetermined parameters
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 38
- 230000005236 sound signal Effects 0.000 claims description 19
- 230000003595 spectral effect Effects 0.000 claims description 18
- 230000001052 transient effect Effects 0.000 claims description 5
- 238000004590 computer program Methods 0.000 claims description 4
- 238000004519 manufacturing process Methods 0.000 claims 1
- 230000006870 function Effects 0.000 description 25
- 238000004364 calculation method Methods 0.000 description 5
- 238000009795 derivation Methods 0.000 description 5
- 230000000007 visual effect Effects 0.000 description 5
- 238000004422 calculation algorithm Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- 238000009432 framing Methods 0.000 description 2
- 238000005284 basis set Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000013139 quantization Methods 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000010183 spectrum analysis Methods 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N1/00—Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
- H04N1/32—Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
- H04N1/32101—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/2347—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving video stream encryption
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
- H04N19/467—Embedding additional information in the video signal during the compression process characterised by the embedded information being invisible, e.g. watermarking
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/63—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using sub-band based transform, e.g. wavelets
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/835—Generation of protective data, e.g. certificates
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N2201/00—Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
- H04N2201/32—Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
- H04N2201/3201—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
- H04N2201/3225—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of data relating to an image, a page or a document
- H04N2201/3233—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of data relating to an image, a page or a document of authentication information, e.g. digital signature, watermark
- H04N2201/3236—Details of authentication information generation
Definitions
- the invention relates to a method and apparatus suitable for the generation of a hash signal representative of a multimedia signal.
- Hash functions are commonly used in the world of cryptography where they are commonly used to summarise and verify large amounts of data.
- the MD5 algorithm developed by Professor R L Rivest of MIT (Massachusetts Institute of Technology) has as an input a message of arbitrary length and produces as an output a 128- bit "finger print", "signature” or “hash” of the input. It has been conjectured that it is statistically very unlikely that two different messages have the same hash. Consequently, such cryptographic hash algorithms are a useful way to verify data integrity.
- multimedia signals can frequently be transmitted in a variety of file formats.
- file formats For instance, several different file formats exist for audio files, like WAV, MP3 and Windows Media, as well as a variety of compression or quality levels.
- Cryptographic hashes such as MD5 are based on the binary data format, and so will provide different hash values for different file formats of the same multimedia content. This makes cryptographic hashes unsuitable for summarising multimedia data, for which it is required that different quality versions of the same content yield the same hash, or at least similar hashes.
- Hashes of multimedia content that are relatively invariant to data processing are referred to as robust summaries, robust signatures, robust fingerprints, perceptual hashes or robust hashes.
- Robust hashes capture the perceptually essential parts of audio-visual content, as perceived by the Human Auditory System (HAS) and/or the Human Visual System (HVS).
- a robust hash is a function that associates with every basic time-unit of multimedia content a semi-unique bit-sequence that is continuous with respect to content similarity as perceived by the HAS/HVS.
- the HAS/HVS identifies two pieces of audio, video or image as being very similar, the associated hashes should also be very similar.
- the hashes of original content and compressed content should be similar.
- the robust hash should be able to distinguish the two signals (semi-unique). Consequently, robust hashing enables content identification, which is the basis for many applications.
- the proposed technique computes a robust hash value for basic windowed time intervals of the audio signal.
- the audio signal is thus divided into frames, and subsequently the spectral representation of each time frame computed by a Fourier transform.
- the technique aims to provide a robust hash function that mimics the behaviour of the HAS i.e. it provides a hash value mimicking the content of the audio signal as would be perceived by a listener.
- bit-stream including the encoded audio signal is received by a bit-stream decoder 110.
- the bit-stream decoder fully decodes the bit-stream, so as to produce an audio signal.
- This audio signal is then passed to the framing unit 120.
- the framing unit divides the audio signal into a series of basic windowed time intervals. Preferably, the time intervals overlap, such that the resulting hash values from subsequent frames are largely similar.
- Each of the windowed time intervals signals are then passed to a Fourier transform unit 130, which calculates a Fourier transform for each time window.
- An absolute value calculating unit 140 is then used to calculate the absolute value of the Fourier transform. This calculation is carried out as the Human Auditory System (HAS) is relatively insensitive to phase, and only the absolute value of the spectrum is retained as this corresponds to the tone that would be heard by the human ear.
- HAS Human Auditory System
- selectors, 151, 152,..., 158, 159 are used to select the Fourier coefficients corresponding to the desired bands.
- the Fourier coefficients for each band are then passed to respective energy computing stages 161, 162, ..., 168, 169.
- Each energy computing stage then calculates the energy of each of the frequency bands, and then passes the computed energy onto a bit derivation circuit 170 which computes and sends to the output 180 a hash bit (H(n,x), where x corresponds to the respective frequency band and n corresponds to the relevant time frame interval).
- the bits can be a sign indicating whether the energy is greater than a predetermined threshold.
- the perceptual features relate to those that would be viewed by the HVS i.e. it aims to produce the same (or a similar) hash signal for content that is considered the same by the HVS.
- the proposed algorithm looks to consider features extracted from either the luminance component, or alternatively the chrominance components, computed over blocks of pixels.
- the respective information (audio or visual) signal is decoded from the bit-stream, divided into frames, then the perceptual features extracted from the frames and utilised to calculate a hash signal.
- the present invention provides a method of generating a hash signal representative of a multimedia signal, the method comprising the steps of: receiving a bit-stream comprising a compressed multimedia signal; selectively reading from the bit-stream predetermined parameters; and deriving a hash function from said parameters.
- the present invention provides a hash signal representative of a multimedia signal, the hash signal having been generated by selectively reading predetermined parameters relating to perceptual properties of the multimedia signal from a bit-stream comprising a compressed version of the multimedia signal.
- the present invention provides an apparatus arranged to generate a hash signal representative of a multimedia signal, the apparatus comprising: a receiver arranged to receive a bit-stream comprising a compressed multimedia signal; a decoder arranged to selectively read from the bit-stream predetermined parameters; a processing unit arranged to derive a hash function from said parameters.
- Figure 1 is a schematic diagram of a known arrangement for extracting a hash signal from an audio signal encoded within a bit-stream
- Figure 2 is a schematic diagram of an arrangement for extracting a hash signal from an encoded multimedia signal in accordance with an embodiment of the present invention.
- Prior art robust hashing schemes require that the respective information signal is decoded from the encoded signal (i.e. the bit-stream), with the decoded information signal being sampled so as to extract the relevant perceptual information. This perceptual information is subsequently utilised to determine the hash function.
- the present inventors have realised that the complete decoding of the transmission signal is not necessary.
- the hash function can instead in many instances be directly determined from the bit-stream representation.
- Multimedia signals are typically encoded using source coding so as to form efficient descriptions of information sources. Source coded data can then be efficiently transmitted in a bit-stream.
- the encoded signal In order for the multimedia signal to be recognisable when decoded, the encoded signal must contain information relating to the perceptual features of the multimedia signal. For instance, transform, subband and parametric encoded audio signals all contain spectral representations of the audio signal.
- the chosen property can be any predetermined function of the perceptual coefficients.
- the sign of energy differences is a property that is very robust to many kinds of processing.
- the robust properties are subsequently converted into bits, each bit being indicative of the energy change within a frequency band of the respective frame, with all of the bits of a frame representing the hash for that frame.
- FIG. 2 illustrates an apparatus suitable for calculating a hash function directly from a bit-stream incorporating an encoded multimedia signal. The operation of the apparatus will now be described in conjunction with a transform encoded audio signal.
- Transform coders are typically called spectral encoders because the signal is described in terms of a spectral decomposition (in a selected basis set).
- the spectral terms are computed for overlapping (typically having a 50% overlap) successive blocks of input data.
- the output of a transform coder can be viewed as a set of time series, one series for each spectral term.
- the input audio signal when undergoing transform coding, the input audio signal will be filtered resulting in a large number of spectral coefficients.
- these coefficients are grouped in frequency bands, denoted as scale-factor bands, that resemble a non-uniform frequency division such as an ERB-grid (Equivalent Rectangular Bandwidth grid).
- ERB-grid Equivalent Rectangular Bandwidth grid
- the resulting spectral coefficients are quantized according to a perceptual model, and subsequently encoded into a bit-stream representation.
- Figure 2 shows a schematic diagram of an apparatus 200 arranged to receive such a bit-stream.
- the bit-stream is received at the input of the selective bit-stream decoder 210.
- the decoder 210 is arranged to selectively extract bits from the bit-stream relating to predetermined parameters of the multimedia signal. These predetermined parameters are then utilised to determine the hash function.
- the scale- factors (and optionally the spectral values) per scale factor band are extracted from the bit-stream.
- These scale-factors and spectral values are subsequently processed in order to obtain energies.
- the scale-factors alone give an estimate of the energies. The estimates are made more precise if the spectral values are also taken into account. In the simplest case, these values are then utilised to calculate the hash function.
- each calculation unit corresponds to a separate ERB frequency band, and is used to derive an estimate of the energies per ERB frequency band from the decoded scale-factors (and optionally from the spectral values) per scale factor band.
- the ERB bands have a logarithmic spacing, with the first band starting at 300Hz, and every successive band having a bandwidth of one musical tone up to the maximum frequency of 3000Hz (the most relevant frequency range to the HAS).
- the energies are subsequently converted into bits.
- the bits can be assigned by calculating an arbitrary function of the energies of possibly different frames, and then comparing it to a threshold value.
- the threshold itself might also be the result of another function of the energy values.
- the bit derivation circuit 270 converts the energy levels of the bands into a binary hash word.
- bits of the hash string can be formally defined as:
- the bit derivation circuit 270 comprises, for each band, a first subfractor 271, a frame delay 272, a second subfractor 273, and a comparator 274.
- a 32-bit hash word i.e. H(n,m).
- a separate hash word is calculated for each time frame in the audio signal, with a concatenation of the hash words forming the overall hash function.
- Such computed hash words of successive frames can be stored in buffers, or other memory stores, and utilised by computers to match the multimedia signal encoded in the bit-stream by comparing it with a database of hash values that have been calculated in a similar manner.
- the above embodiment has been described with reference to a particular type of coding scheme, it will be appreciated that it can be applied to any coding scheme that stores perceptual information.
- syntax description contains the structure of the bit-stream, and how to write or extract (read) encoded parameters to and from the bit-stream.
- decoder description describes how to decode these extracted parameters and subsequently generate the multimedia output.
- the encoding process is similar to that utilised in transform coders.
- the audio input signal is filtered resulting in a limited number of sub- signals.
- Each sub-signal represents signal values in a frequency band of fixed size.
- the thus obtained sub-signals are then quantized according to a perceptual model, and subsequently encoded into a bit-stream representation.
- the signal values also scale- factors, that scale the signal values, are encoded in the bit-stream.
- the scale-factors per subband are extracted from the bit-sfream.
- the signal values i.e. the actual (scaled) spectral values are extracted from the bit-stream, if a more precise estimate of the energies is required.
- the extracted parameters are subsequently converted into energies.
- the energies within subbands that correspond to a "critical" band are then grouped.
- Critical bands are those predetermined frequency bands that have been determined to contain the desired perceptual information required to form robust hashes. In the case that a critical band does not exactly match a subband border, an estimation of the energy within the critical band can be made e.g.
- this data can then be passed to a bit derivation circuit in order for the hash function to be calculated. Similar to transform coding, these scale factors could also be used to further reduce complexity.
- sinusoidal components are estimated. These sinusoidal components, at predetermined time intervals, represent the frequencies that are present in the audio signal. In the preferred scheme, the sinusoidal parameters are updated about every eight milliseconds. For coding efficiency, the sinusoidal frequencies are quantized on an ERB-grid, which resembles a logarithmic grid. The representation levels, which are obtained after quantization, are subsequently differentially encoding, both in the frequency direction as well as in the time direction, and encoded into a bit-stream representation.
- the frequencies that are contained in the parametric bit-stream are extracted, and grouped within the frequency regions used for the hash operation. For each time frame and frequency within a group (i.e. frequency band), the amplitude (and optionally the phase information) is retrieved in order to calculate the energy of all components within a frequency group. This data can then be used to calculate the hash function.
- phase information is optionally used as, for low frequencies, the phase information has an influence on the actual power contained in the sinusoid. Depending on the starting phase of the sinusoid, the power can fluctuate. For that reason it can be appropriate to include phase information, particularly if the multi-media signal includes many low frequency components.
- Each transient object is only present within a single time frame.
- the frequencies that are contained within the transient object are grouped within frequency bands, with the corresponding amplitude and phase information contributing to the total energy within a frequency band.
- this envelope function also needs to be considered when determining the energy per component. inclusion of the energies contained in the noise signal components is less straight forward, and would significantly increase the computational complexity.
- a sufficiently reliable feature signal may be obtained, thus allowing the construction of a hashing word from these sinusoidal components.
- encoding schemes will divide multimedia signals simultaneously into predetermined time frames, and blocks of perceptual features for each time frame. For instance, a video signal may, for each image, be divided into square blocks of pixels.
- an audio signal may be divided into predetermined frequency bands.
- it is desirable to calculate a hash function from time frames and/or blocks of perceptual features that do not match those used in the encoding scheme it will be appreciated that further processing may be carried out on the components relating to the perceptual features extracted from the bit stream, so as to estimate the properties of the multimedia signal falling within the desired time frames and/or perceptual blocks based upon the time frames or perceptual blocks used in the encoding scheme.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computer Security & Cryptography (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/518,264 US20050259819A1 (en) | 2002-06-24 | 2003-04-12 | Method for generating hashes from a compressed multimedia content |
JP2004515156A JP2005531024A (en) | 2002-06-24 | 2003-06-12 | How to generate a hash from compressed multimedia content |
EP03732921A EP1518414A1 (en) | 2002-06-24 | 2003-06-12 | Method for generating hashes from a compressed multimedia content |
AU2003239732A AU2003239732A1 (en) | 2002-06-24 | 2003-06-12 | Method for generating hashes from a compressed multimedia content |
KR10-2004-7021157A KR20050013630A (en) | 2002-06-24 | 2003-06-12 | Method for generating hashes from a compressed multimedia content |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP02077499 | 2002-06-24 | ||
EP02077499.8 | 2002-06-24 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2004002162A1 true WO2004002162A1 (en) | 2003-12-31 |
Family
ID=29797222
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/IB2003/002625 WO2004002162A1 (en) | 2002-06-24 | 2003-06-12 | Method for generating hashes from a compressed multimedia content |
Country Status (7)
Country | Link |
---|---|
US (1) | US20050259819A1 (en) |
EP (1) | EP1518414A1 (en) |
JP (1) | JP2005531024A (en) |
KR (1) | KR20050013630A (en) |
CN (1) | CN100380975C (en) |
AU (1) | AU2003239732A1 (en) |
WO (1) | WO2004002162A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2008520134A (en) * | 2004-11-11 | 2008-06-12 | フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ | Apparatus and method for detecting tampering of information signal |
Families Citing this family (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7824029B2 (en) | 2002-05-10 | 2010-11-02 | L-1 Secure Credentialing, Inc. | Identification card printer-assembler for over the counter card issuing |
US7606790B2 (en) | 2003-03-03 | 2009-10-20 | Digimarc Corporation | Integrating and enhancing searching of media content and biometric databases |
KR100636232B1 (en) * | 2005-04-29 | 2006-10-18 | 삼성전자주식회사 | Method and apparatus for checking proximity between devices using hash chain |
US20070162761A1 (en) | 2005-12-23 | 2007-07-12 | Davis Bruce L | Methods and Systems to Help Detect Identity Fraud |
US8224018B2 (en) | 2006-01-23 | 2012-07-17 | Digimarc Corporation | Sensing data from physical objects |
US8077905B2 (en) * | 2006-01-23 | 2011-12-13 | Digimarc Corporation | Capturing physical feature data |
US20080086311A1 (en) * | 2006-04-11 | 2008-04-10 | Conwell William Y | Speech Recognition, and Related Systems |
US8738749B2 (en) | 2006-08-29 | 2014-05-27 | Digimarc Corporation | Content monitoring and host compliance evaluation |
US8707459B2 (en) | 2007-01-19 | 2014-04-22 | Digimarc Corporation | Determination of originality of content |
US8010511B2 (en) | 2006-08-29 | 2011-08-30 | Attributor Corporation | Content monitoring and compliance enforcement |
US9179200B2 (en) | 2007-03-14 | 2015-11-03 | Digimarc Corporation | Method and system for determining content treatment |
US10242415B2 (en) | 2006-12-20 | 2019-03-26 | Digimarc Corporation | Method and system for determining content treatment |
US7984158B2 (en) * | 2007-03-20 | 2011-07-19 | Microsoft Corporation | Web service for coordinating actions of clients |
WO2009004727A1 (en) * | 2007-07-04 | 2009-01-08 | Fujitsu Limited | Encoding apparatus, encoding method and encoding program |
US8141152B1 (en) * | 2007-12-18 | 2012-03-20 | Avaya Inc. | Method to detect spam over internet telephony (SPIT) |
JP5262171B2 (en) | 2008-02-19 | 2013-08-14 | 富士通株式会社 | Encoding apparatus, encoding method, and encoding program |
US10631068B2 (en) | 2008-11-26 | 2020-04-21 | Free Stream Media Corp. | Content exposure attribution based on renderings of related content across multiple devices |
US9986279B2 (en) | 2008-11-26 | 2018-05-29 | Free Stream Media Corp. | Discovery, access control, and communication with networked services |
US8180891B1 (en) | 2008-11-26 | 2012-05-15 | Free Stream Media Corp. | Discovery, access control, and communication with networked services from within a security sandbox |
US10880340B2 (en) | 2008-11-26 | 2020-12-29 | Free Stream Media Corp. | Relevancy improvement through targeting of information based on data gathered from a networked device associated with a security sandbox of a client device |
US10567823B2 (en) | 2008-11-26 | 2020-02-18 | Free Stream Media Corp. | Relevant advertisement generation based on a user operating a client device communicatively coupled with a networked media device |
US10419541B2 (en) | 2008-11-26 | 2019-09-17 | Free Stream Media Corp. | Remotely control devices over a network without authentication or registration |
US9154942B2 (en) | 2008-11-26 | 2015-10-06 | Free Stream Media Corp. | Zero configuration communication between a browser and a networked media device |
US10334324B2 (en) | 2008-11-26 | 2019-06-25 | Free Stream Media Corp. | Relevant advertisement generation based on a user operating a client device communicatively coupled with a networked media device |
US10977693B2 (en) | 2008-11-26 | 2021-04-13 | Free Stream Media Corp. | Association of content identifier of audio-visual data with additional data through capture infrastructure |
US9519772B2 (en) | 2008-11-26 | 2016-12-13 | Free Stream Media Corp. | Relevancy improvement through targeting of information based on data gathered from a networked device associated with a security sandbox of a client device |
US9961388B2 (en) | 2008-11-26 | 2018-05-01 | David Harrison | Exposure of public internet protocol addresses in an advertising exchange server to improve relevancy of advertisements |
US8886531B2 (en) * | 2010-01-13 | 2014-11-11 | Rovi Technologies Corporation | Apparatus and method for generating an audio fingerprint and using a two-stage query |
US20140064107A1 (en) * | 2012-08-28 | 2014-03-06 | Palo Alto Research Center Incorporated | Method and system for feature-based addressing |
EP2709017B1 (en) * | 2012-09-14 | 2015-05-27 | Barcelona Supercomputing Center-Centro Nacional de Supercomputación | Device for controlling the access to a cache structure |
US10701305B2 (en) * | 2013-01-30 | 2020-06-30 | Kebron G. Dejene | Video signature system and method |
US10567489B2 (en) * | 2013-03-15 | 2020-02-18 | Time Warner Cable Enterprises Llc | System and method for seamless switching between data streams |
CN104602015A (en) * | 2014-12-31 | 2015-05-06 | 西安蒜泥电子科技有限责任公司 | Real-time video monitoring encryption and authentication method |
US10594689B1 (en) | 2015-12-04 | 2020-03-17 | Digimarc Corporation | Robust encoding of machine readable information in host objects and biometrics, and associated decoding and authentication |
CN108701158B (en) * | 2016-12-30 | 2023-03-10 | 谷歌有限责任公司 | Hash-based dynamic restriction of content on information resources |
KR20200142787A (en) * | 2019-06-13 | 2020-12-23 | 네이버 주식회사 | Electronic apparatus for recognition multimedia signal and operating method of the same |
US11922532B2 (en) | 2020-01-15 | 2024-03-05 | Digimarc Corporation | System for mitigating the problem of deepfake media content using watermarking |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5852664A (en) | 1995-07-10 | 1998-12-22 | Intel Corporation | Decode access control for encoded multimedia signals |
Family Cites Families (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2675032B2 (en) * | 1987-12-21 | 1997-11-12 | 株式会社日立製作所 | How to create compressed slips |
JP2997483B2 (en) * | 1989-11-08 | 2000-01-11 | 株式会社日立製作所 | Verification data generator |
US5568403A (en) * | 1994-08-19 | 1996-10-22 | Thomson Consumer Electronics, Inc. | Audio/video/data component system bus |
JPH06178274A (en) * | 1992-11-30 | 1994-06-24 | Sony Corp | Motion picture decoding device |
US5987181A (en) * | 1995-10-12 | 1999-11-16 | Sharp Kabushiki Kaisha | Coding and decoding apparatus which transmits and receives tool information for constructing decoding scheme |
US6205249B1 (en) * | 1998-04-02 | 2001-03-20 | Scott A. Moskowitz | Multiple transform utilization and applications for secure digital watermarking |
US8107015B1 (en) * | 1996-06-07 | 2012-01-31 | Virage, Incorporated | Key frame selection |
US6002443A (en) * | 1996-11-01 | 1999-12-14 | Iggulden; Jerry | Method and apparatus for automatically identifying and selectively altering segments of a television broadcast signal in real-time |
US5907619A (en) * | 1996-12-20 | 1999-05-25 | Intel Corporation | Secure compressed imaging |
JPH11164130A (en) * | 1997-12-01 | 1999-06-18 | Sumikin Seigyo Engineering Kk | Method for preventing tampering of image |
US6266644B1 (en) * | 1998-09-26 | 2001-07-24 | Liquid Audio, Inc. | Audio encoding apparatus and methods |
JP3768705B2 (en) * | 1998-11-27 | 2006-04-19 | キヤノン株式会社 | Digital watermark embedding device, output control device, and computer-readable storage medium |
JP2000286836A (en) * | 1999-03-30 | 2000-10-13 | Fujitsu Ltd | Certification device and recording medium |
GB9922904D0 (en) * | 1999-09-28 | 1999-12-01 | Signum Technologies Limited | Method of authenticating digital data works |
US20010032189A1 (en) * | 1999-12-27 | 2001-10-18 | Powell Michael D. | Method and apparatus for a cryptographically assisted commercial network system designed to facilitate idea submission, purchase and licensing and innovation transfer |
JP3683766B2 (en) * | 2000-01-21 | 2005-08-17 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Image processing apparatus and method |
US6675174B1 (en) * | 2000-02-02 | 2004-01-06 | International Business Machines Corp. | System and method for measuring similarity between a set of known temporal media segments and a one or more temporal media streams |
US6236341B1 (en) * | 2000-03-16 | 2001-05-22 | Lucent Technologies Inc. | Method and apparatus for data compression of network packets employing per-packet hash tables |
KR100893671B1 (en) * | 2001-02-12 | 2009-04-20 | 그레이스노트, 인크. | Generating and matching hashes of multimedia content |
EP1244221A1 (en) * | 2001-03-23 | 2002-09-25 | Sun Microsystems, Inc. | Method and system for eliminating data redundancies |
US20060047967A1 (en) * | 2004-08-31 | 2006-03-02 | Akhan Mehmet B | Method and system for data authentication for use with computer systems |
WO2007061178A1 (en) * | 2005-09-15 | 2007-05-31 | Samsung Electronics Co., Ltd. | Method and system for protecting broadcast frame |
US8788830B2 (en) * | 2008-10-02 | 2014-07-22 | Ricoh Co., Ltd. | Method and apparatus for logging based identification |
-
2003
- 2003-04-12 US US10/518,264 patent/US20050259819A1/en not_active Abandoned
- 2003-06-12 CN CNB03814669XA patent/CN100380975C/en not_active Expired - Fee Related
- 2003-06-12 JP JP2004515156A patent/JP2005531024A/en active Pending
- 2003-06-12 WO PCT/IB2003/002625 patent/WO2004002162A1/en active Application Filing
- 2003-06-12 AU AU2003239732A patent/AU2003239732A1/en not_active Abandoned
- 2003-06-12 KR KR10-2004-7021157A patent/KR20050013630A/en not_active Application Discontinuation
- 2003-06-12 EP EP03732921A patent/EP1518414A1/en not_active Withdrawn
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5852664A (en) | 1995-07-10 | 1998-12-22 | Intel Corporation | Decode access control for encoded multimedia signals |
Non-Patent Citations (11)
Title |
---|
CHENG YANG: "MACS: music audio characteristic sequence indexing for similarity retrieval", IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, XX, XX, 21 October 2001 (2001-10-21), pages 123 - 126, XP010566890 * |
FAN J ET AL: "MULTIVIEW: MULTILEVEL VIDEO CONTENT REPRESENTATION AND RETRIEVAL", JOURNAL OF ELECTRONIC IMAGING, SPIE + IS&T, US, vol. 10, no. 4, October 2001 (2001-10-01), pages 895 - 908, XP001104669, ISSN: 1017-9909 * |
GIONIS A ET AL: "Similarity search in high dimensions via hashing", VERY LARGE DATA BASES. PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, PROCEEDINGS OF 25TH INTERNATIONAL CONFERENCE ON VERY LARGE DATABASES, EDINBURGH, UK, 7-10 SEPT. 1999, 1999, Orlando, FL, USA, Morgan Kaufmann Publishers, USA, pages 518 - 529, XP002256564 * |
HAITSMA ET AL: "Robust Audio Hashing for Content Identification", PROCEEDINGS INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING, XX, XX, 19 September 2001 (2001-09-19), pages 1 - 8, XP002198245 * |
J.C. OOSTVEEN; A.A.C. KALKER; J.A. HAITSMA: "Visual Hashing of Digital Video: Applications and Techniques", SPIE, APPLICATIONS OF DIGITAL IMAGE PROCESSING, vol. XXIV, 31 July 2001 (2001-07-31) |
MANDAL M K ET AL: "A CRITICAL EVALUATION OF IMAGE AND VIDEO INDEXING TECHNIQUES IN THECOMPRESSED DOMAIN", IMAGE AND VISION COMPUTING, GUILDFORD, GB, vol. 17, 1999, pages 513 - 529, XP002936007, ISSN: 0262-8856 * |
MANDAL M K ET AL: "INTEGRATED COMPRESSION AND INDEXING OF VIDEO IN THE WAVELET DOMAIN", JOURNAL OF ELECTRONIC IMAGING, SPIE + IS&T, US, vol. 9, no. 2, April 2000 (2000-04-01), pages 85 - 100, XP000959229, ISSN: 1017-9909 * |
QIU G: "Constructing colour image databases using BTC for efficient storage and effective retrieval", PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2 April 2001 (2001-04-02), Hong Kong, pages 364 - 367, XP010544738 * |
SCHNEIDER M ET AL: "A ROBUST CONTENT BASED DIGITAL SIGNATURE FOR IMAGE AUTHENTICATION", PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) LAUSANNE, SEPT. 16 - 19, 1996, NEW YORK, IEEE, US, vol. 3, September 1996 (1996-09-01), pages 1 - 2, XP002952953, ISBN: 0-7803-3259-8 * |
SHIH-FU CHANG: "Compressed-domain techniques for image/video indexing and manipulation", PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING. (ICIP). WASHINGTON, OCT. 23 - 26, 1995, LOS ALAMITOS, IEEE COMP. SOC. PRESS, US, vol. 3, 23 October 1995 (1995-10-23), pages 314 - 317, XP010196854, ISBN: 0-7803-3122-2 * |
SWANSON M D ET AL: "CODING FOR CONTENT-BASED RETRIEVAL", 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING - PROCEEDINGS. (ICASSP). ATLANTA, MAY 7 - 10, 1996, IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING - PROCEEDINGS. (ICASSP), NEW YORK, IEEE, US, vol. 4 CONF. 21, 7 May 1996 (1996-05-07), pages 1958 - 1961, XP000681634, ISBN: 0-7803-3193-1 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2008520134A (en) * | 2004-11-11 | 2008-06-12 | フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ | Apparatus and method for detecting tampering of information signal |
US7917763B2 (en) | 2004-11-11 | 2011-03-29 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Device and method for detecting a manipulation of an information signal |
JP4689679B2 (en) * | 2004-11-11 | 2011-05-25 | フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ | Apparatus and method for detecting tampering of information signal |
US8135955B2 (en) | 2004-11-11 | 2012-03-13 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Device and method for detecting a manipulation of an information signal |
Also Published As
Publication number | Publication date |
---|---|
US20050259819A1 (en) | 2005-11-24 |
KR20050013630A (en) | 2005-02-04 |
CN100380975C (en) | 2008-04-09 |
JP2005531024A (en) | 2005-10-13 |
AU2003239732A1 (en) | 2004-01-06 |
EP1518414A1 (en) | 2005-03-30 |
CN1663281A (en) | 2005-08-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20050259819A1 (en) | Method for generating hashes from a compressed multimedia content | |
JP6728456B2 (en) | Adaptive processing by multiple media processing nodes | |
US20060013451A1 (en) | Audio data fingerprint searching | |
CN1511311A (en) | Method for comparing audio signal by characterisation based on auditory events | |
US20060173692A1 (en) | Audio compression using repetitive structures | |
EP1568010B1 (en) | Coding an audio signal | |
US20080288263A1 (en) | Method and Apparatus for Encoding/Decoding | |
JP2003316394A (en) | System, method, and program for decoding sound | |
AU2020200861B2 (en) | Adaptive Processing with Multiple Media Processing Nodes | |
KR20080112000A (en) | Encoding and decoding using the resemblance of a tonality | |
CN115312069A (en) | Audio encoding and decoding method and device, computer readable medium and electronic equipment | |
Kurth | Perceptually transparent attachment of content-based data to audio-visual documents |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2003732921 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2004515156 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 10518264 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2003814669X Country of ref document: CN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020047021157 Country of ref document: KR |
|
WWP | Wipo information: published in national office |
Ref document number: 1020047021157 Country of ref document: KR |
|
WWP | Wipo information: published in national office |
Ref document number: 2003732921 Country of ref document: EP |