WO2004102528A1 - オーディオ電子透かし装置 - Google Patents
オーディオ電子透かし装置 Download PDFInfo
- Publication number
- WO2004102528A1 WO2004102528A1 PCT/JP2003/006114 JP0306114W WO2004102528A1 WO 2004102528 A1 WO2004102528 A1 WO 2004102528A1 JP 0306114 W JP0306114 W JP 0306114W WO 2004102528 A1 WO2004102528 A1 WO 2004102528A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data
- watermark
- audio
- generation
- audio data
- Prior art date
Links
- 238000000034 method Methods 0.000 claims abstract description 57
- 210000005069 ears Anatomy 0.000 claims description 6
- 230000006835 compression Effects 0.000 abstract description 9
- 238000007906 compression Methods 0.000 abstract description 7
- 230000002441 reversible effect Effects 0.000 abstract description 3
- 230000002427 irreversible effect Effects 0.000 abstract description 2
- 230000001747 exhibiting effect Effects 0.000 abstract 1
- 230000006870 function Effects 0.000 description 64
- 238000010586 diagram Methods 0.000 description 45
- 238000010276 construction Methods 0.000 description 21
- 238000001514 detection method Methods 0.000 description 10
- 230000000694 effects Effects 0.000 description 9
- 238000005070 sampling Methods 0.000 description 9
- 238000005352 clarification Methods 0.000 description 4
- 230000002542 deteriorative effect Effects 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 3
- 239000000470 constituent Substances 0.000 description 2
- 230000006866 deterioration Effects 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/018—Audio watermarking, i.e. embedding inaudible data in the audio signal
Definitions
- the present invention relates to a device for embedding copyright identification information or the like in audio data as watermark data, and a device for detecting and decoding watermark data from audio data in which the watermark data is embedded.
- the watermark data can be embedded without deteriorating the sound quality of the original audio data, and the watermark data remains even after being deformed by irreversible compression or the like, and the watermark data can be easily converted.
- the ideal digital watermarking technology that can be detected has not yet been established.
- a first invention is an audio digital watermarking device for recording watermark data on an audio recording medium, wherein the audio data acquiring unit acquires audio data, the watermark data acquiring unit acquires watermark data, By superimposing the audio data acquired by the audio data acquiring unit and superimposing the audio data, the result of the predetermined sum of the superimposed audio data at predetermined intervals represents the watermark data acquired by the watermark data acquiring unit.
- a watermark generation data generation unit that generates watermark generation data; audio data acquired by the audio data acquisition unit; and watermark generation data generated by the watermark generation data generation unit. And a superimposed audio data generating unit for generating the digital watermark.
- the second invention relates to the audio digital watermarking device according to claim 1, wherein the watermark generation data generation unit generates low frequency watermark generation data that cannot be heard by human ears.
- the watermark generation data generation section generates the watermark generation data in which the value and the slope at the boundary where the amplitude of the function represented by the watermark generation data generated thereby changes are always zero.
- the watermark generation data generation unit is configured to represent the watermark generation data such that a result of the predetermined sum for each of the predetermined periods represents the watermark data acquired by the watermark data acquisition unit. 4.
- An audio watermarking apparatus according to claim 1, wherein the amplitude of the function is adaptively changed every half cycle.
- the result of the predetermined sum for each predetermined cycle is a sign of the sum of the superimposed audio data for each half cycle of the watermarking generation data. It relates to the audio watermarking device described.
- the result of the predetermined sum for each predetermined cycle is a sign of a difference between the sum of the superimposed audio data corresponding to the first half cycle and the second half cycle of the watermarking generation data.
- the present invention relates to the audio digital watermarking device according to any one of 1 to 4.
- a seventh invention is a method for decoding watermark data recorded on an audio recording medium.
- An audio digital watermark decoding device for obtaining a superimposed audio data, and a sum calculating a result of a predetermined sum of the superimposed audio data acquired by the superimposed audio data acquisition unit for each of the predetermined periods.
- An audio digital watermark decoding device comprising: a calculating unit; and a watermark data decoding unit that decodes the watermark data based on a result of the predetermined sum calculated by the sum calculating unit.
- the present invention relates to a digital watermark decoding device.
- the total sum calculation unit includes a total sum of the superimposed audio data acquired by the superimposed audio data acquisition unit over a time period of a first half of one cycle of the watermark generation data and a time of a second half cycle.
- the audio digital watermark decoding device according to claim 7, wherein a sign of a difference of the total sum over is calculated.
- FIG. 1 is a functional block diagram of the first embodiment.
- FIG. 2 is a diagram showing a waveform pattern of audio data acquired in the first embodiment.
- FIG. 3 is a diagram showing a waveform pattern of a primitive function of the watermark generation data generated in the first embodiment.
- FIG. 4 is a diagram showing a sampling pattern of a primitive function of the watermark generation data generated in the first embodiment.
- FIG. 5 is a diagram showing a waveform pattern of the watermark generation data of the character “C” generated in the first embodiment. 3 006114
- FIG. 6 is a diagram showing a digital watermark embedding process according to the first embodiment.
- FIG. 7 is a diagram showing a flow of processing in the first embodiment.
- FIG. 8 is a functional block diagram of the second embodiment.
- FIG. 9 is a diagram showing a flow of processing of the second embodiment.
- FIG. 10 is a functional block diagram of the third embodiment.
- FIG. 11 is a diagram showing a waveform pattern of a primitive function of watermark generation data generated in the third embodiment.
- FIG. 12 is a diagram showing a processing flow of the third embodiment.
- FIG. 13 is a functional block diagram of the fourth embodiment.
- FIG. 14 is a diagram showing a waveform pattern of a fundamental function of watermark generation data generated in the fourth embodiment.
- FIG. 15 is a diagram showing a sampling pattern of a base function of watermark generation data generated in the fourth embodiment.
- FIG. 16 is a diagram illustrating a waveform pattern of watermark generation data of the character “C” generated in the fourth embodiment.
- FIG. 17 is a diagram showing the flow of the process of the fourth embodiment.
- FIG. 18 is a functional block diagram of the fifth embodiment.
- FIG. 19 is a diagram showing a digital watermark embedding process according to the fifth embodiment.
- FIG. 20 is a diagram showing the flow of the process of the fifth embodiment.
- FIG. 21 is a functional block diagram of the sixth embodiment.
- FIG. 22 is a diagram showing a processing flow of the sixth embodiment.
- FIG. 23 is a functional block diagram of the seventh embodiment.
- FIG. 24 is a diagram showing a digital watermark detection process according to the seventh embodiment.
- FIG. 25 is a diagram showing a processing flow in the seventh embodiment.
- FIG. 26 is a functional block diagram of the eighth embodiment.
- FIG. 27 is a diagram showing a digital watermark detection process according to the eighth embodiment.
- FIG. 28 is a diagram showing a processing flow of the eighth embodiment.
- FIG. 29 is a functional block diagram of the ninth embodiment.
- FIG. 30 is a diagram showing a digital watermark detection process according to the ninth embodiment.
- FIG. 31 is a diagram showing a processing flow of the ninth embodiment.
- the relationship between the embodiment and the claims is roughly as follows.
- the first embodiment mainly describes claims 1 and 10.
- claims 2 and 11 are mainly described.
- the third embodiment mainly describes claims 3 and 12.
- claims 4 and 13 are mainly described.
- claims 5 and 14 are mainly described.
- the sixth embodiment mainly describes claims 6 and 15.
- claims 7 and 16 are mainly described.
- claims 8 and 17 are mainly described.
- the ninth embodiment mainly describes claims 9 and 18.
- the invention described in the first embodiment is obtained by acquiring copyright identification information or the like as watermark data, superimposing it on audio data to produce superimposed audio data, and using a result of a predetermined summation for each predetermined period.
- the present invention relates to an audio digital watermarking device for embedding watermark data such as chopstick crop identification information.
- the audio digital watermarking apparatus of the first embodiment Is composed of an audio data acquisition unit 0101, a watermark data acquisition unit 0102, a watermark generation data generation unit 0103, and a superimposed audio data generation unit 0104.
- the audio data acquisition unit acquires audio data.
- the watermark data acquisition unit acquires the watermark data.
- watermark data corresponds to digital data such as codes and texts for copyright identification information and IDs used for content distribution and the like.
- the watermark generation data generating unit superimposes the audio data obtained by the audio data obtaining unit on the audio data obtaining unit, and superimposes the audio data on the audio data obtaining unit. Generate watermark generation data.
- the result of the predetermined sum in each predetermined cycle refers to the result of the predetermined sum of the watermark generation data of the superimposed audio data in each of the predetermined cycles.
- the “predetermined cycle” includes half cycle, 1 cycle, 1.5 cycle, 2 cycle, 2.5 cycle, 3 cycle, and so on.
- the “predetermined sum” corresponds to a sum for a half cycle, a sum for one cycle, and the like.
- the “result of the predetermined sum” includes a half period, a sum over one period, a sign of the sum, and a sign of the difference of the sum.
- the superimposed audio data generation unit generates superimposed audio data by superimposing the audio data acquired by the audio data acquisition unit and the watermark generation data generated by the watermark generation data generation unit. Description based on specific examples>
- Embodiment 1 of the present invention will be described in detail using a specific example.
- the audio data of the waveform pattern shown in Fig. 2 is acquired.
- a digitized data such as a pulse code modulation (PCM) may be obtained, or an analog waveform may be obtained and sampled and quantized. May be converted to digital data.
- PCM pulse code modulation
- the compressed audio data may be decrypted and extracted as PCM data.
- the watermark data can be anything as long as it is digital information.
- a character string consisting of alphabets without compression convert the value to ASCII code and display the value in binary.
- Watermark generation data is represented by a function obtained by multiplying the underlying function (hereinafter referred to as the base function) by the amplitude a.
- the base function the underlying function
- FIG. 1 shows an example of the fundamental function u (t), in which a sine wave with a period R / f is moved upward and adjusted so that the maximum and minimum values are 1 and 0, respectively. .
- this function u (t) Since the value of this function u (t) is frequently used when embedding watermark data, one cycle is calculated in advance and the list of the function values is stored in memory.
- u (0), u (1),..., U (R / f-1) are stored in the memory.
- the value at the sampling point t in the i-th cycle can be obtained from the value for one cycle using the following relational expression.
- u (t) u (t-(i-l)-R / f)
- Superimposed audio data is generated by adding the watermark generation data and the original audio data. Assuming that the sample value of the original audio data is V (t), the superimposed audio data is as follows.
- ⁇ w (t) ⁇ v (t) + a; ⁇ ⁇ u (t).
- ⁇ represents the sum of one cycle of the i-th cycle of the watermark generation data.
- V i ⁇ V (t)
- U ⁇ u (t) far.
- V i generally changes every period i, but U is a constant. This constant U is also stored in the memory in advance.
- the value of a is determined so that the absolute value of the sum of the superimposed audio data is constant and the sign represents the bit value of the watermark data. Assuming that the bit value is b (0 or 1) and the absolute value of the sum of the superimposed audio data is S,
- the function representing the watermark generation data is obtained by multiplying the base function of the watermark generation data by this amplitude a;
- FIG. 5 is a schematic diagram of the watermark generation data waveform pattern corresponding to the watermark data character "C".
- the sign of amplitude a; and the sign of (1-1) b match, but depending on the size of It could be.
- the sign of the bit value is not the sign of the amplitude of the watermark generation data, but the sign of the sum of the superimposed audio data.
- w (t) V (t) + ⁇ (-1) b ⁇ S-V i ⁇ ⁇ u (t) / U
- audio data for one cycle of watermark generation data enters A 01, where the sum of the above equations is calculated and output to A 03.
- the watermark data enters A 02, where the code of (11) b in the above equation is output to A 03 every cycle of the watermark generation data according to the bit value. That is, when the bit value is 0, “positive” is output, and when the bit value is 1, “negative” is output.
- AO 3 uses these two values to generate watermark generation data according to the above equation, and outputs the data of the watermark generation data to AO 4.
- the AO 4 generates and outputs superimposed audio data containing watermark data by superimposing the data of the watermark generation data and the original audio data.
- the process of embedding the watermark data is reversible, and if there is time series data of the amplitude of the watermark generation data used at the time of embedding the watermark data, it can be completely restored. Furthermore, even if one person embeds the watermark data and another person embeds another watermark data, if there is time-series data of the amplitude of the watermark generation data used in each process, Each watermark data can be extracted and restored to the original audio data. In this way, it is possible to embed any number of layers.For example, after the copyright holder embeds information about the copyright, while the copyright is protected, the content distribution company can make unauthorized secondary distribution. For example, it is possible to embed a unique ID for the purpose of preventing the problem.
- the person who claims the copyright of the audio data takes his original, unwatermarked original audio data (which he claims) to where he controls the copyright, Using the time series data of the amplitude of the watermark generation data, it is combined with the original audio data brought in. If this matches the superimposed audio data containing the watermark data that is actually distributed, it can be determined that the data was created by that person.
- the detection of the start point of the watermark data, the error processing, and the like have been omitted, but these can be easily implemented by conventional well-known techniques.
- a specific bit pattern may be inserted before the watermark data in advance, and decoding may be started immediately after the pattern is found. Specifically, it skips the point where the amplitude is zero unconditionally, finds the point where it is synchronized with the start code while shifting the start position little by little, and then finds the watermark.
- For error processing there is a method of embedding a check sum or the like as watermark data and checking it at the time of decoding.
- FIG. 7 shows a processing flow of the first embodiment.
- the audio data acquisition section acquires audio data (step S0701).
- the watermark data acquisition unit acquires the watermark data (Step S 0 7 0 2) 0
- the watermark generation data generation unit superimposes the audio data obtained in step S7071 on the basis of the watermark data obtained in step SO702 to form superimposed audio data.
- Watermark generation data is generated in which the result of the predetermined summation for each predetermined period represents the watermark data acquired in step S7072 (step S7073).
- the superimposed audio data generation unit generates superimposed audio data by superimposing the audio data obtained in step SO701 and the watermark generation data generated in step S7073 ( Step S0704).
- copyright identification information and the like are obtained as watermark data, superimposed on audio data to be superimposed audio data, and the watermark information is expressed by a result of a predetermined summation in each predetermined cycle.
- the watermark generation data since one bit is encoded in one wavelength of the watermark generation data, even though the watermark generation data has a long wavelength, many bits are efficiently transmitted.
- the data can be embedded.
- the invention according to the second embodiment relates to the audio digital watermarking device according to the first embodiment, which generates low-frequency watermark generation data that cannot be heard by human ears.
- the audio watermarking device 0800 of the second embodiment includes an audio data acquisition unit 08001, a watermark data acquisition unit 0802, and a watermark generation data generation unit 0800. 8 0 3 and a superimposed audio data generation unit 0 8 0 4.
- the watermark generation data generation unit generates low-frequency watermark generation data that cannot be heard by the human ear.
- low frequency that cannot be heard by the human ear means a low frequency of about 20 Hz or less.
- the configuration is the same as that of the first embodiment (configuration requirement: superimposed audio data generation unit), and a description thereof will be omitted.
- Embodiment 2 of the present invention will be described in detail using a specific example.
- the watermark generation data uses a low frequency that cannot be heard by human ears.
- the sampling rate of audio data is 44.1 kHz
- a function with a frequency of 10 Hz is used as watermark generation data.
- the primitive function u (t) is a function of the period 4 4. I k Z l O.
- FIG. 9 shows a processing flow of the second embodiment.
- the audio data acquisition section acquires audio data (step S0901).
- the watermark data acquisition unit acquires the watermark data (step S0902).
- the watermark generation data generation unit superimposes the audio data obtained in step S9091 on the basis of the watermark data obtained in step S9092 to generate superimposed audio data.
- step S0903 low-frequency watermark generation data that cannot be heard by the human ear is generated, and the result of the predetermined summation for each predetermined period represents the watermark data acquired in step S0902.
- the superimposed audio data generation unit obtains the data in step SO901. W
- the superimposed audio data is superimposed on the watermark generation data generated in step S 0903 to generate superimposed audio data (step S 0904).
- the invention according to the third embodiment relates to the audio digital watermarking apparatus according to claim 1 or 2, wherein a function representing the data for watermark generation has a value and a slope at a boundary where the amplitude changes are always zero. .
- an audio watermarking apparatus 1003 includes an audio data acquisition unit 1001, a watermark data acquisition unit 1002, and data generation for watermark generation. And a superimposed audio data generation unit 1004.
- the watermark generation data generation unit generates the watermark generation data in which the value and the slope at the boundary where the amplitude of the function represented by the watermark generation data generated thereby changes are always zero.
- the configuration is the same as that of the first or second embodiment (configuration requirement: watermark generation data generation unit), and thus the description is omitted.
- Embodiment 3 of the present invention will be described in detail using a specific example.
- a function in which the value and the slope at the boundary where the amplitude changes always become zero is used.
- xn is the nth boundary point where the amplitude changes.
- watermark generation data base function generation
- Embodiment 3 except that the base function u (X) satisfies the condition of (watermark generation data: generation of base function), the same as (watermark generation data: summation calculation) of Embodiment 1 or 2. Therefore, the description is omitted.
- Embodiment 3 except that the primitive function u (X) satisfies the condition of (watermark generation data: generation of primitive function), the description is the same as (superimposed audio data generation) of Embodiment 1 or 2. Omitted.
- FIG. 12 shows the processing flow of the third embodiment.
- the audio data acquisition section acquires audio data (step S1221).
- the watermark data obtaining unit obtains the watermark data (step S122).
- the watermark generation data generation section superimposes the audio data acquired in step S1221, based on the watermark data acquired in step S1222, as superimposed audio data.
- the result of the predetermined summation for each predetermined period represents the watermark data obtained in step S122, and the watermark and the value at the boundary where the amplitude of the function represented by the watermark generation data changes are always zero.
- the generation data is generated (step S1203).
- the superimposed audio data generation unit superimposes the audio data acquired in step S1201 and the watermark generation data generated in step S1203 to superimpose the superimposed audio data. Generate (Step S1204).
- the invention according to the fourth embodiment is characterized in that the amplitude of the watermark generation data is adaptively changed every half cycle so that the result of the predetermined summation of the superimposed audio data for each predetermined cycle represents the watermark data.
- the present invention relates to the audio digital watermarking device according to any one of 1 to 3.
- the audio digital watermarking device 1300 of the fourth embodiment includes an audio data acquisition unit 1301, a watermark data acquisition unit 1302, and a watermark generation data generation unit 1. 303, and a superimposed audio data generation unit 1304.
- the watermark generation data generator adaptively adjusts the amplitude of the watermark generation data every half cycle so that the result of the predetermined sum at each predetermined cycle represents the watermark data acquired by the watermark data acquisition unit. To change.
- Embodiment 4 of the present invention will be described in detail using a specific example. .
- Watermark generation data is represented by a function obtained by multiplying the underlying function (hereinafter referred to as the base function) by the amplitude a.
- the base function the underlying function
- a the amplitude
- u (0), u (1),..., u (R / f / 2-1) are stored in memory.
- the values u (R / f / 2), u (R / f / 2 + 1),..., u (R / f-1) in the second half of the second half are inverted in the sign of the value in the first half of the period Things.
- the value at the sampling point t in the i-th cycle can be obtained from the above list of function values using the following relational expression.
- u (t) u (t-(i-l)-R / f)
- sample value at the sample point t of the superimposed audio data containing the watermark generation data is w (t)
- it can be expressed by the following equation. w (t) V + a (t) u (t) w (t) over half the i-th period of the watermark generation data Add up.
- a (t) is set to a constant value in this half cycle. If this constant value is a;
- ⁇ w (t) ⁇ v (t) + ⁇ u (where ⁇ represents the sum of half the i-th period of the watermark generation data.
- ⁇ represents the sum of half the i-th period of the watermark generation data.
- the i-th period The first half cycle of:
- V H ⁇ V (t)
- the values of a Li and a 2i are determined so that the absolute value of the sum of the audio data superimposed every half cycle of the watermark generation data is constant and the code represents the bit value of the watermark data.
- the sum of the audio data superimposed in the second half cycle of the watermark generation data is opposite in sign to the sum of the first half cycle.
- a 2i ⁇ -(— 1) b ⁇ S — V 2i ⁇ / U.
- one cycle represents one bit, so b generally differs for each i.
- the function that represents the watermarking generation data is obtained by multiplying the base function u (t) of the watermark generation data by the amplitudes a Li and a 2i , that is, the first half of the i-th cycle: ⁇ (-1) s -V u ⁇ / UJ u (t
- FIG. 16 is a schematic diagram of a waveform pattern of the generation data corresponding to the character “C” of the watermark data.
- the sign of the watermark generation data corresponds to the bit value. However, in general, the sign may be inverted depending on the magnitudes of VH and V2i .
- the sign of the bit value is not the sign of the watermark generation data, but the sign of the sum of the superimposed audio data. It is.
- the audio data w (t) synthesized with the watermark generation data is as follows.
- FIG. 17 shows the processing flow of the fourth embodiment.
- the audio data obtaining unit obtains audio data (step S1701).
- the watermark data obtaining unit obtains the watermark data (step S).
- the watermark generation data generation unit sets the amplitude of the watermark generation data so that the result of the predetermined summation of the superimposed audio data for each predetermined period represents the watermark data acquired in step S1702. It is changed adaptively every half cycle (step S1703).
- the superimposed audio data generation unit generates superimposed audio data by superimposing the audio data obtained in step S1701 and the watermark generation data generated in step S1703.
- the invention according to the fifth embodiment is characterized in that the result of the predetermined summation of the superimposed audio data for each predetermined cycle is a sign of the sum of the superimposed audio data for each half cycle of the watermark generation data.
- the present invention relates to an audio digital watermarking device described in item 1.
- the audio watermarking device 1800 of the fifth embodiment includes an audio data acquisition unit 1801, a watermark data acquisition unit 1802, and a watermark generation data generation unit. It consists of 1803 and a superimposed audio data generation unit 1804.
- the watermark generation data generation unit generates the watermark generation data so that the result of the predetermined summation of the superimposed audio data for each predetermined period represents the watermark data acquired by the watermark data acquisition unit.
- the “result of the predetermined sum in each predetermined cycle” refers to the sign of the sum of the superimposed audio data in each half cycle of the watermark generation data.
- the configuration is the same as that of the fourth embodiment (configuration requirement: superimposed audio data generation unit), and a description thereof is omitted.
- Embodiment 5 of the present invention will be described in detail using a specific example.
- audio data for a half cycle of the watermark generation data enters A 01, where the sum of the above equations is calculated and output to A 03.
- the watermark data enters A 0 2, where the sign of (1 1) b or 1 (1 1) b in the above equation is output to A 0 3 every half cycle of the watermark generation data according to the bit value. Is done. That is, in the first half period, when the bit value is 0, “positive” is output, and when it is 1, “negative” is output. In the second half period, when the bit value is 0, “negative” is output. When it is 1, “positive” is output.
- AO 3 uses these two values to generate watermark generation data according to the above equation, and outputs the watermark generation data to AO 4.
- the AO 4 generates and outputs superimposed audio data containing the watermark data by superimposing the data of the watermark generation data and the original audio data.
- FIG. 20 shows a processing flow of the fifth embodiment.
- the audio data acquisition unit acquires audio data (step S201).
- the watermark data obtaining unit obtains the watermark data (step S2002).
- the watermark generation data generation unit generates the watermark so that the sign of the sum of the superimposed audio data for each half cycle of the watermark generation data represents the watermark data acquired in step S2002. Data is generated (step S2003).
- the superimposed audio data generation unit superimposes the audio data obtained in step S2001 and the watermark generation data generated in step S2003 to convert the superimposed audio data. It is generated (step S2004).
- the sign of the sum of the superimposed audio data over half the period of the watermark generation data represents 0Z1 of each bit of the watermark data
- the watermark data can be easily detected and decoded.
- the absolute value of the sum always has a constant value different from zero, a digital watermark that is durable against deformation of audio data can be realized.
- the result of the predetermined sum for each predetermined cycle is a sign of a difference between the sum of the superimposed audio data of the first half cycle and the second half cycle of the watermarking generation data.
- the present invention relates to the audio digital watermark device according to one of the above aspects.
- the audio digital watermarking device 2100 of the sixth embodiment includes an audio data acquisition unit 2101, a watermark data acquisition unit 2102, and a watermark generation data generation unit 2. 103 and a superimposed audio data generation unit 210.
- the configuration is the same as that of the fourth embodiment (component requirement: watermark data acquisition unit), and the description is omitted.
- the watermark generation data generator adaptively adjusts the amplitude of the watermark generation data every half cycle so that the result of the predetermined summation of the superimposed audio data at each predetermined cycle represents the watermark data acquired by the watermark data acquisition unit. Change to.
- the “result of the predetermined sum in each predetermined cycle” refers to the sign of the difference between the sum of the superimposed audio data in the first half cycle and the second half cycle of the watermark generation data.
- the configuration is the same as that of the fourth embodiment (component requirement: watermark generation data generation unit), and a description thereof is omitted.
- the configuration is the same as that of the fourth embodiment (configuration requirement: superimposed audio data generation unit), and a description thereof is omitted.
- FIG. 22 shows the processing flow of the sixth embodiment.
- the audio data acquisition section acquires audio data (step S2201).
- the watermark data obtaining unit obtains the watermark data (step S2202).
- the watermark generation data generation unit calculates the difference sign of the sum of the first half period and the second half period of the watermarking generation data of the superimposed audio data by using the watermark data acquired in step S222. As shown, data for watermark generation is generated (step S2203).
- the superimposed audio data generation unit superimposes the audio data obtained in step S221 and the watermark generation data generated in step S223 to superimpose the superimposed audio data. Generate (Step S2204).
- the 0--1 of each bit of the watermark data is expressed by the sign of the difference between the sum of the predetermined first half period and the second half period of the watermarked superimposed audio data.
- Embodiment 7 is an audio digital watermark decoding that calculates a result of a predetermined sum of watermarking generation data of the acquired superimposed audio data for each predetermined period, and decodes watermark data based on the result.
- the audio digital watermark decoding apparatus 230 of Embodiment 7 includes a superimposed audio data acquiring section 2301, a sum calculating section 2302, and a watermark data decoding section 23. 0 3 and
- the superimposed audio data acquisition unit acquires superimposed audio data.
- the total sum calculation unit calculates a result of a predetermined sum of the superimposed audio data acquired by the superimposed audio data acquisition unit at predetermined intervals.
- the result of the predetermined sum in each predetermined cycle refers to the result of the predetermined sum of the superimposed audio data in each of the predetermined cycles of the watermark generation data.
- the “predetermined cycle” includes half cycle, 1 cycle, 1.5 cycle, 2 cycle, 2.5 cycle, 3 cycle, and so on.
- Predetermined sum is the sum of half cycle, 1 cycle And so on.
- the “result of the predetermined sum” includes a half cycle, a sum over one cycle, a sign of the sum, and a sign of a difference of the sum.
- the watermark data decoding unit decodes the watermark data based on the result of the predetermined sum calculated by the sum calculation unit.
- Embodiment 7 of the present invention will be described in detail using a specific example.
- the acquisition of the superimposed audio data of the seventh embodiment is assumed to acquire the superimposed audio data w (t) generated in the first embodiment.
- the difference between each bit value of the watermark data is Since the difference appears in the sign of the sum of the data, it is possible to obtain the sum of one cycle, determine the 0Z1 of each bit of the watermark data using the code, and decode the data.
- FIG. 24 A block diagram of this watermark data detection process is shown in Fig. 24.
- each sample value of the superimposed audio data containing the watermark data is represented by B01 , The sum of one cycle is calculated, and the sign is output to B02.
- B02 In B02, 0 or 1 is selected according to the code, and is output as watermark data.
- FIG. 25 shows the processing flow of the seventh embodiment.
- the superimposed audio data acquisition unit acquires superimposed audio data of the watermark generation data and the audio data (step S2501).
- the sum calculation unit calculates a result of a predetermined sum of the superimposed audio data acquired in step S2501 for each predetermined cycle of the watermark generation data (step S2501) ).
- the watermark data decoding unit determines and decodes the bit value of the watermark data based on the result of the predetermined sum calculated in step S2502 (step S2503).
- the invention described in the seventh embodiment it is possible to determine 0-1 of each bit of the watermark data based on the result of the predetermined sum of the overlapped audio data for the predetermined period of the watermark generation data, so that the watermark data can be easily detected and decoded. It is. In addition, the result of the above-mentioned predetermined summation is not affected by the deformation of the audio data. Therefore, it is possible to realize a digital watermark that is simple to use.
- the audio watermark decoding apparatus wherein the watermark data is decoded based on a sign of a value obtained by summing the superimposed audio data over a half period of the watermark generation data.
- the audio digital watermark decoding device 260 of Embodiment 8 includes a superimposed audio data acquisition unit 2601, a sum calculation unit 2602, and a watermark data decoding unit 26. 0 3 and power.
- the superimposed audio data acquisition unit acquires superimposed audio data.
- the sum calculation unit calculates the sign of the sum of the values over a half period of the watermark generation data.
- the watermark data decoding unit decodes the watermark data based on the sign of the sum calculated by the sum calculation unit.
- Embodiment 8 of the present invention will be described in detail using a specific example.
- the acquisition of the superimposed audio data according to the eighth embodiment is described as an example in the fifth embodiment. Assuming that the superimposed audio data w (t) generated by is obtained, the first half of the i-th cycle:
- the sum of the superimposed audio data in the first half and the second half of the i-th cycle is equal to the first half of the i-th cycle, £ w (t) ⁇ ,, + u
- the difference in each bit value of the watermark data appears as a difference in the sign of the sum of the superimposed audio data.
- the code can determine 0/1 for each bit of the watermark data and decode it.
- Fig. 27 shows a block diagram of this watermark data detection process.
- each sample value of the superimposed audio data containing the watermark data is shown for each half cycle of the watermark generation data (the first half). It enters B 0 1 (per cycle or every second half cycle), where the sum of the half cycle is calculated and the sign is output to B 0 2.
- B02 0 or 1 is selected according to the sign, and output as watermark data.
- FIG. 28 shows a processing flow of the eighth embodiment.
- the superimposed audio data acquisition unit acquires superimposed audio data of the watermark generation data and the audio data (step S2801).
- the sum calculation unit calculates the sign of the value obtained by summing the superimposed audio data obtained in step S2801 over the half cycle of the watermark generation data (for each first half cycle or for each second half cycle). Is calculated (step S2802).
- the watermark data decoding unit determines a bit value from the sign of the sum calculated in step S2802 and decodes the bit value (step S2803).
- Embodiment 9 ⁇ Concept of Embodiment 9>
- the invention according to the ninth embodiment wherein the watermark data is decoded based on the sign of the difference between the sum of the superimposed audio data over the first half period of the watermark generation data and the sum over the second half period of the watermark generation data.
- the present invention relates to a digital watermark decoding device.
- the audio watermark decoding apparatus 2900 of the ninth embodiment includes a superimposed audio data acquisition section 2901, a sum calculation section 2902, and a watermark data decoding section 2900. 90 3 and power.
- the superimposed audio data acquisition unit acquires superimposed audio data.
- the sum calculation unit calculates the sign of the difference between the sum of the superimposed audio data acquired by the superimposed audio data acquisition unit over the first half period of the watermark generation data and the second half of the watermark generation data.
- the watermark data decoding unit decodes the watermark data based on the sign of the sum difference calculated by the sum calculation unit.
- Embodiment 9 of the present invention will be described in detail using a specific example.
- the difference in each bit value of the watermark data appears as a difference in the sign of the above equation.
- Fig. 30 shows a block diagram of this watermark data detection process.
- the superimposed audio data containing the watermark data enters B 0 1 for each half cycle of the watermark generation data.
- the difference between the sum of the periods and the sum of the second half is calculated, and the sign is output to B02.
- B02 0 or 1 is selected according to the code, and output as watermark data.
- FIG. 31 shows a processing flow of the ninth embodiment.
- the superimposed audio data acquisition unit acquires superimposed audio data of the watermark generation data and the audio data (step S3101).
- the sum calculation unit calculates the sign of the difference between the sum of the superimposed audio data acquired in step S3101 over the time of the first half cycle of the watermark generation data and the time of the second half cycle (step S3102).
- the watermark data decoding unit decodes the watermark data such that the sign of the value calculated in step S3102 represents the bit value of the watermark data (step S3103).
- 0/1 of each bit of the watermark data is determined by the sign of the difference between the sum of the superimposed audio data over the first half cycle of the watermark generation data and the sum of the second half cycle.
- DC offset cancels and is more resistant to distortion Digital watermarking can be realized.
- the amplitude of the watermark generation data can be arbitrarily set without deteriorating the sound quality of the original audio data.
- the size can be increased, and the watermark strength can be increased.
- the watermark data can be detected / decoded.
- the process of embedding the watermark data is reversible, and if there is time series data of the amplitude of the watermark generation data used at the time of embedding the watermark data, it can be completely restored. Furthermore, even if one person embeds the watermark data and another person embeds another watermark data, if there is time series data of the amplitude of the watermark generation data used in each process. Data can be extracted and restored to the original audio data. In this way, it is possible to embed any number of layers.For example, after a copyright owner embeds copyright information, a content distributor prevents unauthorized secondary distribution while protecting the copyright. It is also possible to embed a unique ID for the purpose.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
Abstract
Description
Claims
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2004571864A JPWO2004102528A1 (ja) | 2003-05-16 | 2003-05-16 | オーディオ電子透かし装置 |
US10/542,277 US20060052887A1 (en) | 2003-05-16 | 2003-05-16 | Audio electronic watermarking device |
PCT/JP2003/006114 WO2004102528A1 (ja) | 2003-05-16 | 2003-05-16 | オーディオ電子透かし装置 |
AU2003231518A AU2003231518A1 (en) | 2003-05-16 | 2003-05-16 | Audio electronic watermarking device |
FI20050021A FI20050021A (fi) | 2003-05-16 | 2005-01-10 | Digitaaliseen äänivesileimaan liittyvä laite |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/JP2003/006114 WO2004102528A1 (ja) | 2003-05-16 | 2003-05-16 | オーディオ電子透かし装置 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2004102528A1 true WO2004102528A1 (ja) | 2004-11-25 |
Family
ID=33446551
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2003/006114 WO2004102528A1 (ja) | 2003-05-16 | 2003-05-16 | オーディオ電子透かし装置 |
Country Status (5)
Country | Link |
---|---|
US (1) | US20060052887A1 (ja) |
JP (1) | JPWO2004102528A1 (ja) |
AU (1) | AU2003231518A1 (ja) |
FI (1) | FI20050021A (ja) |
WO (1) | WO2004102528A1 (ja) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2409956A (en) * | 2004-09-01 | 2005-07-13 | Ace Records Ltd | Watermarking audio tracks by inverting a portion of the audio signal |
JP2006227330A (ja) * | 2005-02-18 | 2006-08-31 | Dainippon Printing Co Ltd | 音響信号に対する情報の埋め込み装置・方法、音響信号からの情報の抽出装置・方法 |
JP2006235359A (ja) * | 2005-02-25 | 2006-09-07 | Dainippon Printing Co Ltd | 音響信号からの情報の抽出装置 |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4197307B2 (ja) * | 2004-03-30 | 2008-12-17 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 電子透かし検出装置、その検出方法及びプログラム |
CN103137134B (zh) * | 2011-11-28 | 2015-03-11 | 鸿富锦精密工业(深圳)有限公司 | 音频设备及音频信号的水印信息加载方法 |
US11244692B2 (en) * | 2018-10-04 | 2022-02-08 | Digital Voice Systems, Inc. | Audio watermarking via correlation modification using an amplitude and a magnitude modification based on watermark data and to reduce distortion |
CN112153482B (zh) * | 2020-09-16 | 2022-02-22 | 山东科技大学 | 一种音视频匹配零水印生成方法及音视频防篡改检测方法 |
DE102024001192A1 (de) | 2024-04-13 | 2024-06-06 | Mercedes-Benz Group AG | Computerimplementiertes Verfahren zum Markieren von Daten mit einem digitalen Wasserzeichen, Computerprogrammprodukt und Recheneinheit |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2001188549A (ja) * | 1999-12-29 | 2001-07-10 | Sony Corp | 情報処理装置及びその方法並びにプログラム格納媒体 |
JP2003162288A (ja) * | 2001-11-28 | 2003-06-06 | M Ken Co Ltd | 音声情報に透かし情報を埋め込む方法及び透かし情報を埋め込んだ音声情報から透かし情報を検出する方法 |
-
2003
- 2003-05-16 AU AU2003231518A patent/AU2003231518A1/en not_active Abandoned
- 2003-05-16 WO PCT/JP2003/006114 patent/WO2004102528A1/ja active Application Filing
- 2003-05-16 JP JP2004571864A patent/JPWO2004102528A1/ja active Pending
- 2003-05-16 US US10/542,277 patent/US20060052887A1/en not_active Abandoned
-
2005
- 2005-01-10 FI FI20050021A patent/FI20050021A/fi not_active IP Right Cessation
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2001188549A (ja) * | 1999-12-29 | 2001-07-10 | Sony Corp | 情報処理装置及びその方法並びにプログラム格納媒体 |
JP2003162288A (ja) * | 2001-11-28 | 2003-06-06 | M Ken Co Ltd | 音声情報に透かし情報を埋め込む方法及び透かし情報を埋め込んだ音声情報から透かし情報を検出する方法 |
Non-Patent Citations (13)
Title |
---|
BASSIA P. ET AL.: "Robust audio watermarking in the time domain", IEEE TRANSACTIONS ON MULTIMEDIA, vol. 3, no. 2, June 2001 (2001-06-01), pages 232 - 241, XP002982757 * |
HUANG J. ET AL.: "A blind audio watermarking algorithm with self-synchronization", PROC. OF 2002 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, vol. 3, May 2002 (2002-05-01), pages 627 - 630, XP002982759 * |
IKEDA M. ET AL.: "Taiiki seigen sareta ransu keiretsu o mochiita gaukon eno joho umekomi", DENSI JOHO TSUSHIN GAKKAI KENKYU HOKOKU UONSEI], vol. 98, no. 264, 11 September 1998 (1998-09-11), pages 63 - 68, XP002982761 * |
INOUE A.: "Denshi sukashi multi media jidai no ango system", KABUSHIKI KAISHA KOBOSHA, 20 November 1997 (1997-11-20), pages 23 - 53, XP002982768 * |
ISHII M. ET AL.: "Iso joho o riro shita onkyo joho ni taisuru denshi sukashi", 2001 NEN ANGO TO JOHO SECURITY SYMPOSIUM YOKOSHU, vol. 2, 23 January 2001 (2001-01-23), pages 909 - 913, XP002982765 * |
IWAKARI M., MATSUI K.: "Taiiki bunkatsu to spectrum kakusan o mochiita digital ongaku eno denshi sukashi", 2001 NEN ANGO TO JOHO SECURITY SYMPOSIUM YOKOSHU, vol. 2, 23 January 2001 (2001-01-23), pages 921 - 926, XP002982767 * |
IWAKIRI M. ET AL.: "Gaukon fugo eno denshi sukashi ni kansuru ichiteian", 2001 NEN ANGO TO JOHO SECURITY SYMPOSIUM YOKOSHU, vol. 2, 23 January 2001 (2001-01-23), pages 915 - 920, XP002982766 * |
IWAKIRI M., MATSUI K.: "Digital ongaku eno denshi sukashi ni kansuru ichiteian", COMPUTER SECURITY SYMPOSIUM 2000, 26 October 2000 (2000-10-26), pages 91 - 96, XP002982763 * |
IWAKIRI M., MATSUI K.: "Kacho taiiki o koryo shita ongaku soft eno kahengata denshi sukashi", TRANSACTIONS OF INFORMATION PROCESSING SOCIETY OF JAPAN, vol. 42, no. 5, 15 May 2001 (2001-05-15), pages 1254 - 1262, XP002982758 * |
KOIZUMI S., NISHI T.: "Onkyo shingo eno data umekomi.kenshutsu shuho no ichikento", DENSHI JOHO TSUSHIN GAKKAI KENKYU HOKOKU UOYO ONKYO], vol. 98, no. 679, 19 March 1999 (1999-03-19), pages 21 - 26, XP002982762 * |
MATSUI K.: "Denshi sukashi no kiso", MORIKITA SHUPPAN CO. LTD., 21 August 1998 (1998-08-21), pages 146 - 194, XP002982769 * |
NAKAYAMA A. ET AL.: "Jikan.shuhasu masking in motozuku audio shingo eno denshi sukashi", DENSHI JOHO TSUSHIN GAKKAI KENKYU HOKOKUUONSEI], vol. 98, no. 264, 11 September 1998 (1998-09-11), pages 57 - 62, XP002982760 * |
NAKAYAMA A. ET AL.: "Shinri onkyo model ni motozuita audio shingo no denshi sukashi", THE TRANSACTIONS OF THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS D-II, vol. J83-D-II, no. 11, 25 November 2000 (2000-11-25), pages 2255 - 2263, XP002982764 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2409956A (en) * | 2004-09-01 | 2005-07-13 | Ace Records Ltd | Watermarking audio tracks by inverting a portion of the audio signal |
GB2409956B (en) * | 2004-09-01 | 2005-12-07 | Ace Records Ltd | Audio watermarking |
JP2006227330A (ja) * | 2005-02-18 | 2006-08-31 | Dainippon Printing Co Ltd | 音響信号に対する情報の埋め込み装置・方法、音響信号からの情報の抽出装置・方法 |
JP2006235359A (ja) * | 2005-02-25 | 2006-09-07 | Dainippon Printing Co Ltd | 音響信号からの情報の抽出装置 |
JP4713180B2 (ja) * | 2005-02-25 | 2011-06-29 | 大日本印刷株式会社 | 音響信号からの情報の抽出装置 |
Also Published As
Publication number | Publication date |
---|---|
FI20050021A (fi) | 2005-01-10 |
US20060052887A1 (en) | 2006-03-09 |
AU2003231518A1 (en) | 2004-12-03 |
JPWO2004102528A1 (ja) | 2006-07-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1684265B1 (en) | Method of embedding a digital watermark in a useful signal | |
EP1256086B1 (en) | Methods and apparatus for multi-layer data hiding | |
Wu et al. | Robust and efficient digital audio watermarking using audio content analysis | |
Yeo et al. | Modified patchwork algorithm: A novel audio watermarking scheme | |
US7664958B2 (en) | Optimization methods for the insertion, protection and detection of digital watermarks in digital data | |
US5889868A (en) | Optimization methods for the insertion, protection, and detection of digital watermarks in digitized data | |
US20040059581A1 (en) | Audio watermarking with dual watermarks | |
JP2008529046A5 (ja) | ||
WO2001005075A1 (en) | Improved audio watermarking with covert channel and permutations | |
Dhar et al. | A new DCT-based watermarking method for copyright protection of digital audio | |
Dhar et al. | Digital watermarking scheme based on fast Fourier transformation for audio copyright protection | |
KR20020022131A (ko) | 디지털 권리들 관리를 위한 신호 프로세싱 방법들,디바이스들 및 응용들 | |
WO2004102528A1 (ja) | オーディオ電子透かし装置 | |
US7114071B1 (en) | Method and apparatus for embedding digital watermarking into compressed multimedia signals | |
Singh et al. | Enhancement of LSB based steganography for hiding image in audio | |
JP2003044067A (ja) | 位相の周期偏移によるディジタルデータの埋めこみ・検出装置 | |
Loytynoja et al. | Audio protection with removable watermarking | |
Kaabneh et al. | Muteness-based audio watermarking technique | |
Jenkins et al. | Steganography in audio | |
Hatada et al. | Digital watermarking based on process of speech production | |
Aboelezz | Watermarking audio files with copyrights | |
JP5486839B2 (ja) | 小型検出窓を利用した電子透かし埋め込み検出方法 | |
Steinebach et al. | Evaluation of robustness and transparency of multiple audio watermark embedding | |
WO2001043422A1 (fr) | Procede de traitement d'information et support enregistre | |
De Li et al. | Research on Copyright Protection Technology Based on MIDI Music Structural Features |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
WWE | Wipo information: entry into national phase |
Ref document number: 20050021 Country of ref document: FI |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2004571864 Country of ref document: JP |
|
ENP | Entry into the national phase |
Ref document number: 2006052887 Country of ref document: US Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 10542277 Country of ref document: US |
|
WWP | Wipo information: published in national office |
Ref document number: 10542277 Country of ref document: US |
|
122 | Ep: pct application non-entry in european phase |