WO2001031629A1 - Dispositif de traitement de signaux et procede associe et support de stockage de programme - Google Patents
Dispositif de traitement de signaux et procede associe et support de stockage de programme Download PDFInfo
- Publication number
- WO2001031629A1 WO2001031629A1 PCT/JP2000/007593 JP0007593W WO0131629A1 WO 2001031629 A1 WO2001031629 A1 WO 2001031629A1 JP 0007593 W JP0007593 W JP 0007593W WO 0131629 A1 WO0131629 A1 WO 0131629A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- input signal
- watermark
- digital watermark
- signal
- Prior art date
Links
- 238000000034 method Methods 0.000 title abstract description 25
- 238000007906 compression Methods 0.000 claims abstract description 72
- 230000006835 compression Effects 0.000 claims abstract description 66
- 238000004458 analytical method Methods 0.000 claims abstract description 24
- 230000005236 sound signal Effects 0.000 claims description 58
- 230000000873 masking effect Effects 0.000 claims description 17
- 238000003672 processing method Methods 0.000 claims 6
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 46
- 238000001228 spectrum Methods 0.000 description 22
- 238000010586 diagram Methods 0.000 description 10
- 230000003287 optical effect Effects 0.000 description 10
- 238000001514 detection method Methods 0.000 description 9
- 230000008569 process Effects 0.000 description 8
- 238000013139 quantization Methods 0.000 description 8
- 230000006870 function Effects 0.000 description 6
- 239000008186 active pharmaceutical agent Substances 0.000 description 4
- 230000007423 decrease Effects 0.000 description 4
- 244000078127 Eleusine coracana Species 0.000 description 2
- 235000013499 Eleusine coracana subsp coracana Nutrition 0.000 description 2
- 230000002238 attenuated effect Effects 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 230000006866 deterioration Effects 0.000 description 2
- JEIPFZHSYJVQDO-UHFFFAOYSA-N iron(III) oxide Inorganic materials O=[Fe]O[Fe]=O JEIPFZHSYJVQDO-UHFFFAOYSA-N 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 235000002079 ragi Nutrition 0.000 description 2
- JOCBASBOOFNAJA-UHFFFAOYSA-N N-tris(hydroxymethyl)methyl-2-aminoethanesulfonic acid Chemical compound OCC(CO)(CO)NCCS(O)(=O)=O JOCBASBOOFNAJA-UHFFFAOYSA-N 0.000 description 1
- 102100031798 Protein eva-1 homolog A Human genes 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000000875 corresponding effect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 230000012447 hatching Effects 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/00086—Circuits for prevention of unauthorised reproduction or copying, e.g. piracy
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/018—Audio watermarking, i.e. embedding inaudible data in the audio signal
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/00086—Circuits for prevention of unauthorised reproduction or copying, e.g. piracy
- G11B20/00884—Circuits for prevention of unauthorised reproduction or copying, e.g. piracy involving a watermark, i.e. a barely perceptible transformation of the original data which can nevertheless be recognised by an algorithm
- G11B20/00891—Circuits for prevention of unauthorised reproduction or copying, e.g. piracy involving a watermark, i.e. a barely perceptible transformation of the original data which can nevertheless be recognised by an algorithm embedded in audio data
Definitions
- the present invention relates to a signal processing device and method, and a program storage medium, and is suitably applied to, for example, a case where copyright of music data or the like is protected.
- the computer user can create a music software in which favorite music is recorded with high sound quality at will. it can.
- the present invention has been made in consideration of the above points, and for example, a signal processing apparatus and a method thereof capable of effectively suppressing copyright infringement of contents distributed to an unspecified number of terminal devices via a network It also proposes a program storage medium.
- the input signal when the input signal is compressed Psychoacoustic analysis of the remaining state of the force signal corresponding to the compression process, outputs the analysis result as psychoacoustic coded information, and superimposes electronic watermarking information on the input signal based on the psychoacoustic coded information
- digital watermark information with high compression resistance and digital watermark information with low compression resistance can be easily generated.
- FIG. 1 is a block diagram showing the overall configuration of a copy control system using a watermark according to the present invention.
- FIG. 2 is a schematic diagram illustrating an application example of copy control in a copy control system.
- FIG. 3 is a block diagram illustrating a configuration of a watermark encoder according to an embodiment of the present invention.
- FIG. 4 is a schematic diagram for explaining the watermark superimposition principle.
- FIG. 5 is a schematic diagram used to explain the processing of the psychological hearing analysis unit.
- FIG. 6 is a schematic diagram used to explain the processing of the psychological hearing analysis unit.
- FIG. 7 is a schematic diagram used for describing psychoacoustic coding.
- c 9 is a schematic diagram illustrating a Enko one de example of ⁇ O one Tamaku according to the invention is a Furochiya one preparative illustrating a processing procedure of the watermark encoder
- FIG. 10 is a block diagram showing the configuration of the digital mark decoder.
- FIG. 11 is a flowchart showing a processing procedure of the watermark decoder.
- FIG. 12 is a block diagram showing an internal configuration of the computer. BEST MODE FOR CARRYING OUT THE INVENTION
- reference numeral 10 denotes a content copy control system as a whole.
- a digital audio signal DA which is an audio source
- a watermark (electronic watermark information) encoder 1 uses a Robust signal.
- (Strong) Water mark DC 1 and Fragi 1 e (Weak level) Water mark DC 2 is added to digital audio signal DA and recorded on optical disc 2.
- R obust water mark DC 1 is composed of information such as the copyright holder of the audio source and whether or not copying is permitted.
- Fragile water mark DC 2 is composed of information such as a trigger for detecting the compression history. Be composed.
- R obust watermark DC 1 does not disappear even if it passes through a compression system such as MP3. It is an watermark, and fragile watermark DC 2 is quantized when it passes through a compression system such as MP 3. It is a water mark that disappears due to errors (remains low).
- the optical discs 2 on which the digital audio signal DA including the Robust watermark DC 1 and the Fragi 1 e-marker DC 2 is recorded for example, MP 3 (MPEG Audio Layer 3)
- MP 3 MPEG Audio Layer 3
- the personal computer 5 When the optical disk 2 on which the digital audio signal DA including the water mark DC 2 is recorded and the optical disk 2 circulating without any compression processing is loaded into the personal computer 5, first, the personal computer 5 The input signal is an uncompressed digital audio signal DA or a compressed stream based on an input signal input to a codec (Code and Code) constituting a modem (modulator / demodulator). It determines whether the signal is a signal DS or a compressed stream signal DS encoded according to which compression standard, and selectively switches the switch SW according to the result of the determination.
- a codec Code and Code
- the embedded mark information is detected from the input signal of the personal computer 5 by the water mark decoder 6, and the input signal (digital audio signal) to a PD (Potable Device) 8 such as a memory card is detected.
- Signal DA, compressed stream signal DS, etc. That is, the data-mark decoder 6 detects the robust water mark DC 1 and the Fraggi 1 e data mark DC 2.
- the L CM (L icensed SDM I (Secure Digital Music Initiative) Compliant Module) 7 is composed of two types of data marks (Ro bust) detected by the watermark decoder 6.
- the watermark DC 1 and the Fragi 1 e-watermark DC 2) control whether music content is copied to the PD 8.
- FIG. 2 shows an example in which LCM7 controls whether or not music content is copied to PD 8 using two types of detected watermarks.
- LCM7 is a Robust data mark DC1. Is detected and the Fragile Watermark DC2 is not detected, it is determined that the music content has a compression history, and copying of the music content to the pD8 is prohibited.
- the LCM 7 determines that the music content does not have a compression history, and proceeds to the PD 8 of the music content. Allow copying of.
- LCM7 does not have information about the copyright holder of the music content and whether or not to permit copying. Copying of the music content to PD 8 is permitted. Further, in the LCM 7, the case where the Robust water mark DC1 is not detected and the Fragile water mark DC2 is not detected is an exception because it is technically impossible.
- FIG. 3 shows the configuration of the watermark encoder 1.
- the digital audio signal DA1 which is sequentially input, is analyzed using a psychological auditory model, and a watermark having a R obust watermark superimposition section and a Fragi 1 e watermark superimposition section is performed. Controls the superposition unit 16.
- the digital audio signal DA 1 is converted to the MDCT (Modified Discrete Cosine Transform:
- the processing unit 14 generates MDCT coefficients D 14, and outputs this to the watermark superimposition processing unit 16.
- the water mark superimposition processing section 16 converts the R obust water mark DC 1 and the Fragi 1 e watermark DC 2 into a frequency spectrum obtained by performing MDCT conversion on a time series sample block of the digital audio signal DA 1. Of these, the spectrum is superimposed on the spectrum of the frequency band determined based on the analysis result of the psychological auditory analysis unit 11 described later.
- the watermark superimposition processing unit 16 is configured to perform a RCT water mark one DC 1 and a Fraggi e water among frequency spectra obtained by performing MDCT conversion on a time-series sample block of the input audio signal DA 1.
- the spectrum f1 in the frequency band where the single mark DC2 is superimposed is extracted (Fig. 4 (A)).
- the watermark superimposition processing section 16 performs scaling on the frequency spectrum f1 at a fixed ratio to generate an attenuated frequency spectrum f2 (FIG. 4 (B)).
- the polarity of the data to be embedded is "1”
- the data is used as it is
- the polarity of the data to be embedded is "0”
- the frequency spectrum f2 obtained by inverting the polarity is used.
- the watermark superimposition processing unit 16 shifts the frequency spectrum f2 in the direction in which the frequency increases or decreases, for example, so that four spectrums are separated (FIG. 4 (C)).
- FIG. 4C four lines are shifted in the direction of increasing frequency to generate a frequency spectrum # 3.
- the water mark superimposition processing section 16 performs the frequency spectrum shifted in this manner.
- the frequency spectrum f 3 (Fig. 4 (O))
- the original frequency spectrum f 1 (Fig. 4 (A)
- the obtained frequency spectrum f 4 (Fig. 4 (D) ) Is superimposed with Ro bust watermark DC 1 and Fragi 1 e-marker DC 2 (Fig. 4 (D)).
- processing is performed so that the polarity correlation between a certain frequency spectrum and a frequency spectrum four distances apart is biased positively or negatively. That is, processing is performed only when the sign of the frequency spectrum f4 is inverted by adding the frequency spectrum f3 to the frequency spectrum f1 and when no other sign inversion is involved. Does not perform any processing.
- the white frame is the signal component removed from the frequency spectrum f 1 (FIG. 4 (A)), and the hatching frame is the signal component obtained as a result of the addition.
- the level of the first frequency component f 1 (FIG. 4 (A)), which is determined to embed the watermark, is attenuated at a fixed ratio, and the level obtained by multiplying the polarity of the watermark to be embedded is Is added to the level of the first frequency component f 2 (FIG. 4 (B)) and the second frequency component f 3 (FIG. 4 (C)) separated by, for example, four frequency components.
- the first frequency component fl (Fig. 4 (A)) and the second frequency component f3 (Fig. 4 (C)) are correlated, the polarity is the same after embedding the watermark. (Positive or negative) increases, and the watermark mark decoder described later uses this to detect watermarks.
- the watermark superimposition processing unit 16 generates the R obust watermark DC 1 and the Fraggi e watermark based on the analysis result D 11 analyzed in the psychological auditory analysis unit 11.
- DC 2 is assigned to each The embedding is made so that the frequency band and sound pressure level take into account the minimum audible limit and masking / threshold effect described later in the number of frequency bands.
- the psychological hearing analysis unit 11 analyzes the input digital audio signal DA1, and calculates a masking threshold, which is a limit that can be perceived while actually hearing the original sound, from the curve of the original sound and the minimum audible limit.
- a masking threshold which is a limit that can be perceived while actually hearing the original sound, from the curve of the original sound and the minimum audible limit.
- the masking ability for a given digital audio signal DA1 depends on its frequency band and sound pressure level
- an encoder that performs a compression process determines the best method for expressing an input audio signal with a limited bit resolution based on information on a frequency band and a sound pressure level.
- the psychological auditory model used for compression embedding the watermark with the position and energy of the frequency that is easily cut by the compression processing, it is possible to realize the Fragi 1 e watermark DC 2 that is weak in compression. it can.
- Figure 5 shows the minimum audibility limit.
- the minimum audibility limit in silence is the minimum level of sound that can be detected by hearing, and is related to the limit of noise that can be heard when hearing is quiet. As shown in Fig. 5, sound with a sound pressure level higher than the minimum audible limit during silence, such as tone A, can be heard, but sound pressure below the minimum audible limit during confusion, such as tone B. Level sounds cannot be heard. Also, as shown in Fig. 5, the minimum audible limit in silence depends on the frequency, and even if the tones A and B have the same sound pressure level, they can be heard at the frequency of the sound. There are times when it can be done and times when it cannot.
- Figure 6 shows the masking effect.
- the masking effect is mainly related to the detection limit of quantization distortion and background noise, and the detection limit of a specific sound is heard simultaneously. It changes greatly with other sounds. As shown in Fig. 6, other sounds are difficult to hear within a certain frequency range with respect to tone C. For example, tone D is difficult to hear even a pure tone with a relatively high sound pressure level. On the other hand, tone E is audible.
- the masking effect works stronger as the frequencies of the masking sound (masker) and the masked sound that cannot be heard (masky) are closer.
- the digital mark encoder 1 utilizes these principles to take advantage of these principles to consider the compression characteristics of the compression processing of signals such as MP3 (R obust ust water mark DC 1 and Fragi 1 e DC By the way, the embedding process of 2) is performed.
- Fig. 7 shows the encoding (psychological auditory encoding considering psychological auditory sense) in the MP3 encoder 3 (Fig. 1).
- the MP3 encoder 3 converts the input audio signal DA into 3 After dividing into two frequency bands (subbands), psychoacoustic coding is performed so that the quantization noise falls below the minimum audible limit.
- the solid line in Fig. 7 represents the frequency distribution of the audio signal DA, and the thick line represents the minimum audible limit during silence.
- the masking threshold (dashed line in Fig. 7), which is the limit that can be perceived when the original sound is actually heard, is calculated from the curve of the original sound and the minimum audible limit.
- a quantization step for each subband is assigned so that the quantization noise is at a level lower than the masking 'threshold.
- the range to be sampled by the compression processing is indicated by a rectangular area, and the bottom of the rectangle is the quantization noise level.
- the quantization step is reduced. Reduce the level of quantization noise.
- the psychological auditory analysis section 11 of the watermark encoder 1 converts the input digital audio signal DA 1 into a digital audio signal DA 1 based on the characteristics of the psychoacoustic encoding of the MP3 encoder 3. Analyze 1 and Then, based on the analysis result, the frequency band in which the R obust watermark DC 1 and the Fragi 1 e watermark DC 2 are to be embedded and its sound pressure level are calculated, and the calculation result (psychological auditory coding information Dl 1)
- the watermark overlap section 16 is controlled by the control section.
- the watermark functions as a strong R obust water mark DC1 and is compressed.
- the water mark functions as a Fragi 1 e water mark DC 2 that is weak in compression.
- the water mark superimposing unit 16 adds the R obust watermark DC 1 and the Fragi 1 e to the digital audio signal DA 1. Embed water mark DC 2.
- the area in which the watermark is embedded is not limited to the frequency domain, but may be the time domain.
- the MDCT coefficient D 16 output from the water mark superimposing unit 16 is subjected to inverse orthogonal transformation in the I MDCT processing unit 15, so that it has the same format as the original digital audio signal DA 1. It is output as a digital audio signal DA 2 in which the watermark watermark DC 1 and the watermark 1 DC 2 are embedded.
- a digital audio signal DA2 is encoded by a predetermined encoder (not shown) to expose a master disk, and the optical disk 2 is mass-produced from the master disk.
- Fig. 9 shows the processing procedure of the watermark encoder 1 for processing the digital audio signal DA1, and the watermark encoder 1 moves from step SP11 to step SP12 and sequentially samples the digital audio signal DA1. Read for each lock. Subsequently, the watermark encoder 1 proceeds to step SP13, analyzes the digital audio signal DA1, and extracts psychoacoustic encoded information D11 from the digital audio signal DA1.
- the watermark 1 embeds the R obust watermark DC 1 in the digital signal DA 1 based on the psychoacoustic coded information D 11, and furthermore, in step SP 15 the psychological hearing The Fraggi 1 e-watermark DC 2 is embedded in the digital audio signal DA 1 based on the encoding information D 11.
- the watermark encoder 1 has the same format as the input digital audio signal DA 1, and has the R obust watermark DC 1 and the Fragi 1 e watermark DC 2 embedded therein.
- the digital audio signal DA 2 is obtained.
- step SP16 determines whether or not the processing of the digital audio signal DA1 has been completed. If a negative result is obtained here, the process returns to step SP12. As a result, the watermark encoder 1 sequentially processes the digital audio signal DA 1 by repeating this processing procedure for each sample block, and when a positive result is obtained in step SP 16, the steps SP 16 to SP 16 are performed. It moves to 17 and ends the processing procedure.
- FIG. 10 shows the configuration of the water mark decoder 6 of the personal computer 5 described above with reference to FIG. 1, and the water mark decoder 6 converts the digital audio signals DA 2 to R obtained from the reproduction of the optical disc 2 (FIG. 1).
- Obust ⁇ O - in Tamaku DC 1 and F Ragi 1 c ie water one mark decoder 6 detects the e watermark DC 2, R obust ⁇ O one Tama Kudekodo portion 6 1, the digital audio signal DA 2 sequentially input
- the MDCT processing section 60 performs MDCT processing to obtain an MDCT coefficient D60, which is output to the water mark detection section 61.
- the watermark detection unit 61 determines, for the input MDCT coefficient D 60, a plurality of each of which is separately allocated in advance as an embedding frequency band of the radio watermark DC 1 and the radio watermark DC 2.
- the frequency component is shifted by, for example, four frequency components in the direction in which the frequency increases or decreases, and a new MDCT coefficient obtained by the process and an MDCT coefficient D 60 obtained from the MDCT processing unit 60 are performed.
- the polarities of the frequency components are compared based on the above, and based on the deviation of the polarities, the Rustust DC1 and the Fragment ewatermark DC2 are detected.
- the area for detecting the watermark is not limited to the frequency domain, but may be a time domain.
- Fig. 11 shows the processing procedure of the watermark decoder 6 for processing the digital audio signal DA2.
- the watermark 6 enters the processing procedure from step SP21, the watermark 6 moves to step SP22, and the digital audio signal DA2 is processed.
- Read DA 2 sequentially for each sample block.
- the watermark decoder 6 proceeds to step SP23 to detect the watermark water mark DC1 from the digital audio signal DA2, and then detects the Fragi1 ewatermark DC2 in step SP24. .
- step SP 25 determines whether or not the processing of the digital audio signal DA 2 has been completed, and obtains a negative result. And return to step SP22. Thereby, the watermark decoder 6 processes the digital audio signal DA 2 by repeating this processing procedure for each sample block in sequence, and if a positive result is obtained in step SP 25, the process proceeds to step SP 26. Move to end the processing procedure.
- the psychological auditory analysis unit 11 of the watermark encoder 1 determines a compression characteristic (a psychological auditory model such as a minimum audible limit and a masking effect adopted in various compression standards) in the MP3 encoder 3.
- R obust data — Mark DC 1 and Fragi 1 e Water mark DC 2 Decide the embedding position (for example, frequency band) and sound pressure level, and determine the compression that matches the compression characteristics of MP3 encoder 3.
- Fragi 1 e Watermark DC 2 can be embedded in the digital audio signal DA 1.
- the compression rate of the Fragi 1 e-watermark DC 2 which is vulnerable to compression, decreases sufficiently when compressed through the MP3 encoder 3 and is not detected by the watermark decoder 6. Will be.
- the digital watermark signal obtained by reproducing the optical disk 2 is used to obtain the watermark watermark DC 1 and the! 7 Ragi 1 e Watermark DC 2 has been described, but the present invention is not limited to this.
- the present invention is also applicable to the case where a watermark is detected from a stream signal obtained by compressing a digital audio signal. can do. This eliminates the need to expand the compressed stream signal again and convert the compressed stream signal into the same format as the original digital audio signal, thereby increasing the processing speed.
- the Fragi 1 e-watermark DC 2 is not erased by the predetermined compression processing. It may be embedded.
- the Fragi 1 e-watermark DC 2 is embedded in a form that is not deleted by the compression processing at the time of valid distribution, and the digital audio signal with the watermark is subjected to other compression processing such as MP3.
- the watermark DC 2 may be embedded in such a manner that the watermark DC 2 is erased when the operation is performed. Specifically, by comparing the compression characteristics of the compression process used at the time of valid distribution with the compression characteristics of other compression processes, they are not deleted by the compression process used at the time of distribution. The minimum audible limit and masking used. ⁇
- a Fragi 1 e watermark DC 2 is embedded in a signal deleted by a quantization process performed on a digital audio signal based on a threshold.
- both the Rust water mark DC 1 and the Fragi 1 e water mark DC 2 are embedded in the digital audio signal.
- the present invention is not limited to this, but the Rustust mark DC1 is embedded in the digital audio signal, the compression processing is performed, and the compression-processed stream signal is applied to the Fragi1e.
- the water mark DC 2 may be embedded.
- the present invention is not limited to this, and it is not necessary to completely delete the Fragi 1 e watermark DC 2 even after compression processing such as MP3.
- the detection side determines whether or not the detection level of the Fragi 1 e water mark DC 2 is equal to or lower than a predetermined threshold. If the detection level is lower than the threshold, the digital audio signal to be detected is illegally copied. In the case where the digital audio signal to be detected is equal to or greater than the threshold, the digital audio signal to be detected is determined to be a valid copy. Just do it.
- the detection After performing compression processing such as MP3 on the digital audio signal embedded with the R obust watermark DC 1 and the Fraggi 1 e data mark DC 2, the detection If it is possible to detect the fact that the compression processing has been performed based on the degree of decrease in the remaining ratio of the water mark DC 2, the Fragile watermark DC 2 is embedded in the digital audio signal in any configuration. You may do it.
- the digital audio signal is added to the R obust water mark DC 1 and the psychological auditory model, which is the compression characteristic of MP3. It is described for embedding a finely F Ragi 1 e water one mark DC 2, the present invention is not limited to this, for example, MP EG- AAC, ATRAC, AT R AC 2 x A TRA C 3, D olby AC 3, MS (Microsoft) It can be widely applied when using a psychoacoustic model based on various compression standards such as Au dio, Tw in VQ, or a combination of these. In this way, a general-purpose fragile watermark that supports more compression standards can be realized.
- the present invention is not limited to this.
- a connection configuration can also be applied.
- the configuration has been described in which the watermark decoder section and the watermark 1e watermark section are integrated in the watermark decoder 6.
- the R obust watermark decoding unit and the Fragi 1 e watermark decoding unit are connected in series, or the R obust watermark decoding unit and the Fragi 1 e watermark decoding unit are connected in parallel It is also possible to apply a configuration that implements this.
- a CPU Central Processing Unit
- ROM Read Only Memory
- a RAM Random Access Memory
- a hard disk drive 74 storing various data
- the CPU 71 via a network. It has a communication port 75 which is an interface for communicating with the outside, and these are connected to each other via a bus 76.
- a program for realizing each of the above-described functions is stored in the hard disk device 74 or the ROM 72, and at the time of execution, the CPU 71 responds based on the work memory stored in the RAM 73. What is necessary is just to realize each functional part. Further, in the computer 70, programs for realizing the above-described functions may be installed via the communication port 75.
- the present invention relates to a signal processing device and method, and a program storage medium, and can be applied to, for example, a case where copyright of music data or the like is protected.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Health & Medical Sciences (AREA)
- Acoustics & Sound (AREA)
- Computer Security & Cryptography (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
- Reverberation, Karaoke And Other Acoustics (AREA)
Description
Claims
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2001534134A JP4582384B2 (ja) | 1999-10-29 | 2000-10-27 | 信号処理装置及びその方法並びにプログラム格納媒体 |
EP00970176A EP1202250A4 (en) | 1999-10-29 | 2000-10-27 | METHOD, DEVICE AND DATA CARRIER COMPUTER PROGRAM FOR SIGNAL PROCESSING |
US09/869,170 US7272718B1 (en) | 1999-10-29 | 2000-10-27 | Device, method and storage medium for superimposing first and second watermarking information on an audio signal based on psychological auditory sense analysis |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP11/310324 | 1999-10-29 | ||
JP31032499 | 1999-10-29 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2001031629A1 true WO2001031629A1 (fr) | 2001-05-03 |
Family
ID=18003874
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2000/007593 WO2001031629A1 (fr) | 1999-10-29 | 2000-10-27 | Dispositif de traitement de signaux et procede associe et support de stockage de programme |
Country Status (4)
Country | Link |
---|---|
US (1) | US7272718B1 (ja) |
EP (1) | EP1202250A4 (ja) |
JP (1) | JP4582384B2 (ja) |
WO (1) | WO2001031629A1 (ja) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008114432A1 (ja) * | 2007-03-20 | 2008-09-25 | Fujitsu Limited | データ埋め込み装置、データ抽出装置、及び音声通信システム |
US9002487B2 (en) | 2008-08-14 | 2015-04-07 | Sk Telecom Co., Ltd. | System and method for data reception and transmission in audible frequency band |
CN110110574A (zh) * | 2018-01-30 | 2019-08-09 | 普天信息技术有限公司 | 心理压力参数的获取方法和标注方法 |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8144368B2 (en) | 1998-01-20 | 2012-03-27 | Digimarc Coporation | Automated methods for distinguishing copies from original printed objects |
US6744906B2 (en) | 1995-05-08 | 2004-06-01 | Digimarc Corporation | Methods and systems using multiple watermarks |
US7006661B2 (en) | 1995-07-27 | 2006-02-28 | Digimarc Corp | Digital watermarking systems and methods |
US6850626B2 (en) | 1998-01-20 | 2005-02-01 | Digimarc Corporation | Methods employing multiple watermarks |
US6633654B2 (en) | 2000-06-19 | 2003-10-14 | Digimarc Corporation | Perceptual modeling of media signals based on local contrast and directional edges |
US8094869B2 (en) | 2001-07-02 | 2012-01-10 | Digimarc Corporation | Fragile and emerging digital watermarks |
US7728048B2 (en) | 2002-12-20 | 2010-06-01 | L-1 Secure Credentialing, Inc. | Increasing thermal conductivity of host polymer used with laser engraving methods and compositions |
US20060075424A1 (en) * | 2003-02-10 | 2006-04-06 | Koninklijke Philips Electronics N.V. | Import control of content |
US7225991B2 (en) | 2003-04-16 | 2007-06-05 | Digimarc Corporation | Three dimensional data storage |
CN101136202B (zh) * | 2006-08-29 | 2011-05-11 | 华为技术有限公司 | 音频信号处理系统、方法以及音频信号收发装置 |
US9160988B2 (en) * | 2009-03-09 | 2015-10-13 | The Nielsen Company (Us), Llc | System and method for payload encoding and decoding |
US8522032B2 (en) * | 2010-03-30 | 2013-08-27 | Disney Enterprises, Inc. | System and method to prevent audio watermark detection |
JP6007481B2 (ja) * | 2010-11-25 | 2016-10-12 | ヤマハ株式会社 | マスカ音生成装置、マスカ音信号を記憶した記憶媒体、マスカ音再生装置、およびプログラム |
US10573329B2 (en) * | 2017-05-31 | 2020-02-25 | Dell Products L.P. | High frequency injection for improved false acceptance reduction |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH11232779A (ja) * | 1997-11-20 | 1999-08-27 | Toshiba Corp | コピー防止装置 |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6760463B2 (en) * | 1995-05-08 | 2004-07-06 | Digimarc Corporation | Watermarking methods and media |
US6061793A (en) * | 1996-08-30 | 2000-05-09 | Regents Of The University Of Minnesota | Method and apparatus for embedding data, including watermarks, in human perceptible sounds |
US6272634B1 (en) * | 1996-08-30 | 2001-08-07 | Regents Of The University Of Minnesota | Digital watermarking to resolve multiple claims of ownership |
US6031914A (en) * | 1996-08-30 | 2000-02-29 | Regents Of The University Of Minnesota | Method and apparatus for embedding data, including watermarks, in human perceptible images |
US6282299B1 (en) * | 1996-08-30 | 2001-08-28 | Regents Of The University Of Minnesota | Method and apparatus for video watermarking using perceptual masks |
US6330672B1 (en) * | 1997-12-03 | 2001-12-11 | At&T Corp. | Method and apparatus for watermarking digital bitstreams |
US6332194B1 (en) * | 1998-06-05 | 2001-12-18 | Signafy, Inc. | Method for data preparation and watermark insertion |
US6154571A (en) * | 1998-06-24 | 2000-11-28 | Nec Research Institute, Inc. | Robust digital watermarking |
JP2000058928A (ja) * | 1998-08-12 | 2000-02-25 | Shin Aizawa | ブラウン官の三原色発光ダイオードの改良 |
US6591365B1 (en) * | 1999-01-21 | 2003-07-08 | Time Warner Entertainment Co., Lp | Copy protection control system |
US6952774B1 (en) | 1999-05-22 | 2005-10-04 | Microsoft Corporation | Audio watermarking with dual watermarks |
FR2816153B1 (fr) * | 2000-10-27 | 2002-12-20 | Canon Kk | Procede de controle prealable de la detectabilite d'un signal de marquage |
WO2002049363A1 (en) * | 2000-12-15 | 2002-06-20 | Agency For Science, Technology And Research | Method and system of digital watermarking for compressed audio |
US7043019B2 (en) * | 2001-02-28 | 2006-05-09 | Eastman Kodak Company | Copy protection for digital motion picture image data |
-
2000
- 2000-10-27 JP JP2001534134A patent/JP4582384B2/ja not_active Expired - Fee Related
- 2000-10-27 WO PCT/JP2000/007593 patent/WO2001031629A1/ja active Application Filing
- 2000-10-27 US US09/869,170 patent/US7272718B1/en not_active Expired - Fee Related
- 2000-10-27 EP EP00970176A patent/EP1202250A4/en not_active Withdrawn
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH11232779A (ja) * | 1997-11-20 | 1999-08-27 | Toshiba Corp | コピー防止装置 |
Non-Patent Citations (2)
Title |
---|
MITCHELL D. SWANSON ET AL.: "Robust audio watermarking using perceptual masking", SIGNAL PROCESSING, vol. 66, 1998, pages 337 - 355, XP002935138 * |
See also references of EP1202250A4 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008114432A1 (ja) * | 2007-03-20 | 2008-09-25 | Fujitsu Limited | データ埋め込み装置、データ抽出装置、及び音声通信システム |
US9002487B2 (en) | 2008-08-14 | 2015-04-07 | Sk Telecom Co., Ltd. | System and method for data reception and transmission in audible frequency band |
CN110110574A (zh) * | 2018-01-30 | 2019-08-09 | 普天信息技术有限公司 | 心理压力参数的获取方法和标注方法 |
Also Published As
Publication number | Publication date |
---|---|
EP1202250A1 (en) | 2002-05-02 |
EP1202250A4 (en) | 2006-12-06 |
US7272718B1 (en) | 2007-09-18 |
JP4582384B2 (ja) | 2010-11-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2001031629A1 (fr) | Dispositif de traitement de signaux et procede associe et support de stockage de programme | |
AU2005241609B2 (en) | Watermark incorporation | |
US20040059918A1 (en) | Method and system of digital watermarking for compressed audio | |
CA2527011C (en) | Audio encoding/decoding apparatus having watermark insertion/abstraction function and method using the same | |
Qiao et al. | Noninvertible watermarking methods for mpeg-encoded audio | |
Nematollahi et al. | Blind digital speech watermarking based on Eigen-value quantization in DWT | |
JP2001188549A (ja) | 情報処理装置及びその方法並びにプログラム格納媒体 | |
Foo et al. | An adaptive audio watermarking system | |
Yan et al. | Speech authentication by semi-fragile speech watermarking utilizing analysis by synthesis and spectral distortion optimization | |
JP2005530206A (ja) | 合成されたスペクトル成分に適合するようにデコードされた信号の特性を使用するオーディオコーディングシステム | |
US20070052560A1 (en) | Bit-stream watermarking | |
EP1634276B1 (en) | Apparatus and method for embedding a watermark using sub-band filtering | |
Attari et al. | Robust and transparent audio watermarking based on spread spectrum in wavelet domain | |
EP1104969A1 (en) | Method and apparatus for encoding/decoding and watermarking a data stream | |
Xu et al. | Digital audio watermarking and its application in multimedia database | |
Xu et al. | Content-based digital watermarking for compressed audio | |
Wei et al. | Audio watermarking using time-frequency compression expansion | |
Kirbiz et al. | Decode-time forensic watermarking of AAC bitstreams | |
Kalantari et al. | A robust audio watermarking scheme using mean quantization in the wavelet transform domain | |
Gopalan | Robust watermarking of music signals by cepstrum modification | |
Trivedi et al. | An algorithmic digital audio watermarking in perceptual domain using direct sequence spread spectrum | |
Cvejic et al. | Audio watermarking: Requirements, algorithms, and benchmarking | |
Quan et al. | Data hiding in MPEG compressed audio using wet paper codes | |
KR20180058611A (ko) | 오디오 워터마크 삽입 방법 및 장치, 그리고 오디오 워터마크 검출 방법 및 장치 | |
Kui-long et al. | A mp3 watermarking scheme based on the energy relation of two adjacent channels |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): JP US |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): DE FR GB |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2000970176 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref country code: JP Ref document number: 2001 534134 Kind code of ref document: A Format of ref document f/p: F |
|
WWE | Wipo information: entry into national phase |
Ref document number: 09869170 Country of ref document: US |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWP | Wipo information: published in national office |
Ref document number: 2000970176 Country of ref document: EP |