CN105976823B - Adaptive audio water mark method and system based on phase code - Google Patents

Adaptive audio water mark method and system based on phase code Download PDF

Info

Publication number
CN105976823B
CN105976823B CN201610458411.4A CN201610458411A CN105976823B CN 105976823 B CN105976823 B CN 105976823B CN 201610458411 A CN201610458411 A CN 201610458411A CN 105976823 B CN105976823 B CN 105976823B
Authority
CN
China
Prior art keywords
frequency
watermark
signal
audio
phase
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201610458411.4A
Other languages
Chinese (zh)
Other versions
CN105976823A (en
Inventor
陈怡�
高戈
张康
刘影
吕冰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huazhong Normal University
Original Assignee
Huazhong Normal University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huazhong Normal University filed Critical Huazhong Normal University
Priority to CN201610458411.4A priority Critical patent/CN105976823B/en
Publication of CN105976823A publication Critical patent/CN105976823A/en
Application granted granted Critical
Publication of CN105976823B publication Critical patent/CN105976823B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)
  • Editing Of Facsimile Originals (AREA)

Abstract

The present invention provides adaptive audio water mark method and system based on phase code, telescopiny includes doing FFT time-frequency conversion, calculate the range that each frame frequency-region signal can be embedded in watermark, generate the pseudorandom frequency expansion sequence of binary system, the judgement for tonal content is carried out, the phase mask threshold value of phase spectrum is obtained using global masking threshold;Phase spectrum after obtaining insertion watermark, generates the audio file for having watermark;Detection process includes doing the time-frequency conversion of FFT, calculates the range that each frame frequency-region signal can be embedded in watermark, generates the pseudorandom frequency expansion sequence of binary system, carries out ASSOCIATE STATISTICS and examines to obtain watermark bit.The present invention relaxes the judgement that frequency spectrum has tone region, so that the global masking threshold calculated is more accurate, according to the embedment strength of the adaptive adjustment watermark of the masking threshold of phase angle on phase spectrum, ensure audio frequency watermark in non situation, makes the embedment strength maximum of watermark to ensure the robustness of audio frequency watermark.

Description

Adaptive audio water mark method and system based on phase code
Technical field
The present invention relates to digital audio frequency watermark fields, more particularly to the adaptive audio water mark method based on phase code and System.
Background technique
Digital audio frequency watermark is that certain digital informations are added into audio signal to reach the identification of the file true and false, copyright guarantor The signal processing operations of the purpose of shield, Information hiding.Adaptive audio watermaking system based on phase code refers to according to psychology Acoustic model dynamically adjusts the embedment strength of watermark on phase spectrum, it is ensured that audio frequency watermark is in the condition for meeting not sentience Lower robustness is maximum.Traditional audio frequency watermark embedded mobile GIS based on phase code, directly adds fixing intensity on phase spectrum Watermark.If the intensity for being embedded in watermark is excessive, it is easy to generate noise, influence sound quality;If the intensity for being embedded in watermark is too small, Detection is to be not easy to check, and influences robustness.In addition audio signal is dynamic change, even if in the strong of some regions insertion Degree is suitable for, but possible embedment strength is excessive or too small for other regions.Such watermark embedded mode makes audio frequency watermark Not sentience and robustness cannot be met simultaneously.
Summary of the invention
The object of the present invention is to provide the adaptive audio digital watermark schemes based on phase code, make the water in phase spectrum Embedment strength is printed according to the adjustment of audio signal self adaption, to reach the compromise of audio frequency watermark not sentience and robustness.
Technical solution of the present invention provides a kind of adaptive audio water mark method based on phase code, including telescopiny and Detection process,
The telescopiny includes the following steps,
Step A1 reads audio file, the audio signal x and sample rate f s1 of time domain is obtained, to the audio signal x of time domain First framing, frame length indicate with N, xnIt indicates n-th frame time-domain signal, then does time-frequency conversion, obtain the amplitude of the audio signal of frequency domain Compose XnAnd phase spectrum
Step A2, according to sample rate f s1, frame length N and according to the preset insertion of frequency-portions of auditory perceptual sensitivity Start frequency FWMIN, terminate frequency FWMAX, calculate the range that each frame frequency-region signal can be embedded in watermark, obtain this range Maximum value freqmax1 and minimum value freqmin1, chooses the frequency-domain audio signals within the scope of this;
Freqmin1=floor ((FWMIN × 2.0/fs1) × N)
Freqmax1=floor ((FWMAX × 2.0/fs1) × N)
Wherein, floor is downward bracket function;
Step A3, using key key as random number seed, generate that length is freqmax1-freqmin1+1 two into Make pseudorandom frequency expansion sequence u;
Step A4, progress is as follows for the judgement of tonal content,
Wherein, k indicates the frequency local maximum point at place, and j is indicated with a distance from local maximum point k, Pn[k]dBTable Show the signal power at local maximum point k of n-th frame signal, Pn[k-j]dBIndicate the signal function at maximum of points j Rate value;
According to judging result, using global masking threshold Thn, obtain the phase mask threshold θ of phase spectrumn
Step A5, according to pseudo-random sequence u, phase mask threshold θnWith watermark bit b, using following formula in audio Phase spectrumThe upper insertion for carrying out watermark, the phase spectrum after obtaining insertion watermarkIt is as follows,
Wherein, α is constant, the intensity of control watermark insertion;
Utilize the amplitude spectrum X of frequency-region signalnWith the phase spectrum after insertion watermarkThen it is embedded in by Euler's formula Frequency-region signal after watermark is as follows,
Wherein, YnFor the frequency-region signal after insertion watermark, e is natural Exponents;
Step A6, by the frequency-region signal Y after insertion watermarknTransform to time-domain signal yn, generate the audio text with watermark Part;The detection process includes the following steps,
Step B1 reads the time-domain audio file for having watermark, obtains the amplitude of the audio signal with watermark of time domain Data z and sample rate f s2, to the framing of time-domain signal elder generation, frame length N, znFor the n-th frame of signal to be detected;Time-frequency conversion is done again, Obtain the amplitude spectrum Z of the audio signal of frequency domainnWith phase spectrum ξn
Step B2, according to sample rate f s2, frame length N and the preset insertion of frequency-portions according to auditory perceptual sensitivity Start frequency FWMIN, terminate frequency FWMAX, calculate the range that each frame frequency-region signal can be embedded in watermark, obtain this range Maximum value freqmax2 and minimum value freqmin2, chooses the amplitude spectrum of the audio within the scope of this;
Freqmin2=floor ((FWMIN × 2.0/fs2) × N)
Freqmax2=floor ((FWMAX × 2.0/fs2) × N)
Wherein, floor is downward bracket function;
Step B3, using key key as random number seed, generate that length is freqmax2-freqmin2+1 two into Make pseudorandom frequency expansion sequence u;
Step B4 examines formula according to following ASSOCIATE STATISTICS, to the phase of pseudo-random sequence u and signal n-th frame to be detected Compose ξn, relevant calculation is done, the detection sufficient statistic r of suspect signal n-th frame signal is obtainedn
Wherein,<>indicates that the inner product of signal calculates;
If detecting sufficient statistic rn>=0, then the watermark bit b=1 detected;It otherwise is b=0.
Moreover, obtaining the phase mask threshold θ of phase spectrum using triangle relation in step A4n,
The present invention accordingly provides a kind of adaptive audio watermaking system based on phase code, including audio frequency watermark insertion System and adaptive audio watermark detection subsystem,
The audio frequency watermark insertion subsystem comprises the following modules,
First time-frequency convert module obtains the audio signal x and sample rate f s1 of time domain, clock synchronization for reading audio file The audio signal x elder generation framing in domain, frame length indicate with N, xnIt indicates n-th frame time-domain signal, then does time-frequency conversion, obtain the sound of frequency domain The amplitude spectrum X of frequency signalnAnd phase spectrum
First insertion range selection module, for according to sample rate f s1, frame length N and according to auditory perceptual sensitivity The start frequency FWMIN of the preset insertion of frequency-portions, terminate frequency FWMAX, watermark can be embedded in by calculating each frame frequency-region signal Range, obtain the maximum value freqmax1 and minimum value freqmin1 of this range, choose the frequency-domain audio signals within the scope of this;
Freqmin1=floor ((FWMIN × 2.0/fs1) × N)
Freqmax1=floor ((FWMAX × 2.0/fs1) × N)
Wherein, floor is downward bracket function;
First frequency expansion sequence generation module, for using key key as random number seed, generating length to be The pseudorandom frequency expansion sequence u of the binary system of freqmax1-freqmin1+1;
Improved psycho-acoustic module, it is as follows for the judgement of tonal content for carrying out,
Wherein, k indicates the frequency local maximum point at place, and j is indicated with a distance from local maximum point k, Pn[k]dBTable Show the signal power at local maximum point k of n-th frame signal, Pn[k-j]dBIndicate the signal function at maximum of points j Rate value;
According to judging result, using global masking threshold Thn, obtain the phase mask threshold θ of phase spectrumn
Additive insertion module, for according to pseudo-random sequence u, phase mask threshold θnWith watermark bit b, following public affairs are utilized Phase spectrum of the formula in audioThe upper insertion for carrying out watermark, the phase spectrum after obtaining insertion watermarkIt is as follows,
Wherein, α is constant, the intensity of control watermark insertion;
Utilize the amplitude spectrum X of frequency-region signalnWith the phase spectrum after insertion watermarkThen it is embedded in by Euler's formula Frequency-region signal after watermark is as follows,
Wherein, YnFor the frequency-region signal after insertion watermark, e is natural Exponents;
Time-frequency inverse transform module, for the frequency-region signal Y after watermark will to be embedded innTransform to time-domain signal yn, generate and have water The audio file of print;
The adaptive audio watermark detection subsystem comprises the following modules,
Second time-frequency convert module, for reading the time-domain audio file for having watermark, obtain time domain with watermark The amplitude data z and sample rate f s2 of audio signal, to the framing of time-domain signal elder generation, frame length N, znIt is the n-th of signal to be detected Frame;Time-frequency conversion is done again, obtains the amplitude spectrum Z of the audio signal of frequency domainnWith phase spectrum ξn
Second insertion range selection module, for according to sample rate f s2, frame length N and according to auditory perceptual sensitivity The start frequency FWMIN of the preset insertion of frequency-portions, terminate frequency FWMAX, watermark can be embedded in by calculating each frame frequency-region signal Range, obtain the maximum value freqmax2 and minimum value freqmin2 of this range, choose the amplitude spectrum of the audio within the scope of this;
Freqmin2=floor ((FWMIN × 2.0/fs2) × N)
Freqmax2=floor ((FWMAX × 2.0/fs2) × N)
Wherein, floor is downward bracket function;
Second frequency expansion sequence generation module, for using key key as random number seed, generating length to be The pseudorandom frequency expansion sequence u of the binary system of freqmax2-freqmin2+1;
Associated extraction module, for examining formula according to following ASSOCIATE STATISTICS, to pseudo-random sequence u and signal to be detected the The phase spectrum ξ of n framen, relevant calculation is done, the detection sufficient statistic r of suspect signal n-th frame signal is obtainedn
Wherein,<>indicates that the inner product of signal calculates;
If detecting sufficient statistic rn>=0, then the watermark bit b=1 detected;It otherwise is b=0.
Moreover, obtaining the phase mask threshold θ of phase spectrum using triangle relation in improved psycho-acoustic modulen,
Present invention selection is embedded in watermark dependent on human ear to the insensitive of phase modification on the phase spectrum of audio signal.It is logical The judgement relaxed and have tone region to frequency spectrum in psychoacoustic model one is crossed, more to be there is the ingredient of tone, makes to succeed in one's scheme The global masking threshold of calculation is more accurate, using revisable amplitude and can modify the triangle relation between phase angle and obtains phase The masking threshold of angle, so as to the insertion of the adjustment watermark on phase spectrum according to the masking threshold of phase angle adaptively Intensity, it is ensured that audio frequency watermark makes the embedment strength maximum of watermark in non situation to ensure the robust of audio frequency watermark Property.Technical solution of the present invention has important market value.
Detailed description of the invention
Fig. 1 is the insertion subsystem structure block diagram of the embodiment of the present invention.
Fig. 2 is the detection subsystem structure block diagram of the embodiment of the present invention.
Fig. 3 is the telescopiny flow chart of the embodiment of the present invention
Fig. 4 is the detection process flow chart of the embodiment of the present invention.
Specific embodiment
Technical solution of the present invention is described further with specific embodiment combination attached drawing below.
A kind of adaptive audio watermaking system based on phase code provided in an embodiment of the present invention, including audio frequency watermark are embedding Enter subsystem and adaptive audio watermark detection subsystem.
Referring to Fig. 1, the adaptive audio watermark provided in an embodiment of the present invention based on phase code is embedded in subsystem, including First time-frequency convert module 1, first is embedded in range selection module 2, the first frequency expansion sequence generation module 3, improved psychologic acoustics Module 4, additive insertion module 5 and time-frequency inverse transform module 6 can realize each mould using software firming bechnology when specific implementation Block.
The first time-frequency convert module 1, for the time-domain audio signal read to be converted to frequency-region signal, and by when The relevant information and frequency-region signal of domain audio signal are exported to the first insertion range selection module 2;
The first insertion range selection module 2, according to the information (sample rate) of the time-domain audio signal read and frequency The frequency range of domain signal and human ear more sensitivity calculates the range that this frequency-region signal can be embedded in watermark, by the insertion range Maximum value and minimum value export to the first frequency expansion sequence generation module 3;
The first frequency expansion sequence generation module 3, for what is inputted according to random number seed and insertion range selection module 2 It is 1 or -1 equally distributed random sequence that the maximum value and minimum value for being embedded in range, which generate and be embedded in amplitude of the range with length, And this random sequence is exported to additive insertion module 5;
The improved psycho-acoustic module 4 has the judgment condition in tone region by relaxing in psychoacoustic model one, More there is tone area, to provide better amplitude masking threshold, then according to threshold value and original amplitude can be changed Triangle relation obtain adjustable phase angle threshold value, and phase angle threshold value is exported to additive insertion module 5;
The additive insertion module 5, for being exported according to the audio signal with watermark information for generating frequency domain to time-frequency Inverse transform module 6;
The time-frequency inverse transform module 6, the audio with watermark information of the frequency domain for inputting additive insertion module 5 Signal is converted to the audio signal with watermark information of time domain, and generates audio file, the obtained sound with watermark information Frequency file.
Referring to fig. 2, the adaptive audio watermark detection subsystem provided in an embodiment of the present invention based on phase code, including Second time-frequency convert module 7, second is embedded in range selection module 8, the second frequency expansion sequence generation module 9, associated extraction module 10, Each module can be realized using software firming bechnology when specific implementation.
The second time-frequency convert module 7 is essentially identical with the function of module 1, and the result of generation is exported and gives insertion range Selecting module 8;
The second insertion range selection module 8 and the function of module 2 are essentially identical, will be embedded in the maximum value and most of range Small value output will be embedded in the signal in range and export related detection module 10 to frequency expansion sequence generation module 9;
The second frequency expansion sequence generation module 9 is essentially identical with the function of module 3, and the result of generation is exported to correlation Detection module 10;
The coherent detection module 10, signal and frequency expansion sequence for being inputted according to insertion range selection module 8 generate The frequency expansion sequence that module 9 inputs calculates correlation according to the symbol of correlation and judges watermark.
Each module specific implementation is referring to method corresponding steps, and it will not go into details by the present invention.One kind provided in an embodiment of the present invention Adaptive audio water mark method based on phase code, including telescopiny and detection process.
Referring to Fig. 3, the adaptive audio watermark telescopiny provided in an embodiment of the present invention based on phase code can be adopted Process is carried out automatically with computer software technology means, specifically includes the following steps:
Step A1 reads audio file, the audio signal x and sample rate f s1 of time domain is obtained, to the framing of time-domain signal elder generation (frame length indicates with N, xnIndicate n-th frame time-domain signal) time-frequency conversion (such as FFT Fast Fourier Transform (FFT)) is done again, frequency is taken respectively Domain audio signal amplitude composes XnAnd phase spectrum
Step A2, according to sample rate f s1, frequency range (those skilled in the art of frame length N and human ear more sensitivity Can be according to auditory perceptual characteristic sets itself, such as 1000-7000Hz) range that frequency-region signal frame can be embedded in watermark is calculated, The maximum value for obtaining this range is freqmax1, and minimum value freqmin1 chooses the frequency-domain audio signals within the scope of this;
Freqmin1=floor ((FWMIN × 2.0/fs1) × N) (1)
Freqmax1=floor ((FWMAX × 2.0/fs1) × N) (2)
FWMIN, FWMAX respectively indicate the more sensitive low-limit frequency and highest frequency of human ear, i.e., quick according to auditory perceptual The start frequency of the preset insertion of the frequency-portions of sense terminates frequency;Floor is downward bracket function.
Step A3, using key key as random number seed, generate that length is freqmax1-freqmin1+1 two into Make pseudorandom frequency expansion sequence u;
Detailed process is as follows for embodiment in MATLAB:
Firstly, calling RandStream function (random seed function) to rand function, (random number is raw using key key At function) initialized, then call rand function generate random number, due to rand function generate random number be 0~1 it Between number, also need these numbers round up become 0 and 1 binary pseudo-random sequence, then by this unipolar puppet Random sequence switchs to the pseudo-random sequence u that bipolarity contains only+1 He -1.
Step A4 modifies judgement of the ISO-MPEG psychoacoustic model one for tonal content, by obtaining more sounds It is tuned into point to obtain the masking threshold of more accurate range signal, the minimum in subband is not used to cover for last masking threshold Threshold value is covered, but directlys adopt global masking threshold Thn, then the phase mask threshold θ of phase spectrum is obtained using triangle relationn
Detailed process is as follows for embodiment:
By the sound tune region decision condition of frequency spectrum in ISO-MPEG psychoacoustic model one, in power spectrum PnLocal maxima Value point k has to be larger than all Frequency point 7dB nearby, is revised as being greater than neighbouring all sample frequency 1dB, and exist and be greater than 7dB The case where.
Wherein, k indicates the frequency local maximum point at place, and j is indicated with a distance from local maximum point k, Pn[k]dBTable Show the signal power at local maximum point k of n-th frame signal, Pn[k-j]dBIndicate the signal function at maximum of points j Rate value.
Based on Rule of judgment after the above modification, after obtaining the judging result for tonal content, global masking is calculated Threshold value Thn.Global masking threshold is signal amplitude revisable maximum value in undistorted situation.It is formed in real axis and the imaginary axis Two-dimensional surface in, for frequency domain point, the circle constituted using masking threshold as radius is the region that the frequency domain point can be modified, when repairing When the line and tangent circle of frequency domain point and origin after changing, the phase value of variation is maximum, as the variable maximum value of phase angle, As phase mask threshold value, the available phase mask threshold θ of triangle relation is utilizedn
Step A5, according to pseudo-random sequence u, phase mask threshold θnWith watermark bit b, using following formula in audio Phase spectrumThe upper insertion for carrying out watermark, the phase spectrum after obtaining insertion watermark
Wherein, α is constant, and the intensity of control watermark insertion, those skilled in the art can preset value when specific implementation.
Utilize the amplitude spectrum X of frequency-region signalnWith the phase spectrum after insertion watermarkThen it is embedded in by Euler's formula Frequency-region signal after watermark
Wherein, YnFor the frequency-region signal after insertion watermark, e is natural Exponents.
Step A6, by the frequency-region signal Y after insertion watermarknTransform to time-domain signal yn, ultimately produce audio file to get To the audio file for having watermark.
Each module specific implementation is referring to method corresponding steps, and it will not go into details by the present invention.It is provided in an embodiment of the present invention to be based on The adaptive audio method of detecting watermarks of phase code, including telescopiny and detection process.
Referring to fig. 4, the adaptive audio watermark detection mode provided in an embodiment of the present invention based on phase code, can adopt Process is carried out automatically with computer software technology means, specifically includes the following steps:
Step B1 reads the time-domain audio file for having watermark, obtains the amplitude of the audio signal with watermark of time domain Data z and sample rate f s2, to the framing of time-domain signal elder generation, (frame length is similarly N, znFor the n-th frame of signal to be detected) time-frequency is done again It converts (such as FFT Fast Fourier Transform (FFT)), obtains the amplitude spectrum Z of the audio signal of frequency domainnWith phase spectrum ξn
Step B2, according to sample rate f s2, the frequency range of frame length N and human ear more sensitivity calculates this frequency-region signal It can be embedded in the range of watermark, the maximum value for obtaining this range is freqmax2, and minimum value freqmin2 chooses within the scope of this Audio amplitude spectrum;
Freqmin2=floor ((FWMIN × 2.0/fs2) × N) (8)
Freqmax2=floor ((FWMAX × 2.0/fs2) × N) (9)
FWMIN, FWMAX respectively indicate the more sensitive low-limit frequency and highest frequency of human ear, i.e., quick according to auditory perceptual The start frequency of the preset insertion of the frequency-portions of sense terminates frequency;Floor is the downward bracket function inside MATLAB.
Step B3 takes the mode as when watermark insertion to generate bipolarity only has+1 and -1 two using key key It is worth pseudo-random sequence u.I.e. using key key as random number seed, generate that length is freqmax2-freqmin2+1 two into Make pseudorandom frequency expansion sequence u.
Step B4 examines formula (10) according to ASSOCIATE STATISTICS, to the phase of pseudo-random sequence u and signal n-th frame to be detected Compose ξn, relevant calculation is done, the detection sufficient statistic r of suspect signal n-th frame signal is obtainedn
<>indicates that the inner product of signal calculates in formula.
If detecting sufficient statistic rn>=0, then the watermark bit b=1 detected;It otherwise is b=0.
It is described in the present invention that specific embodiments are merely illustrative of the spirit of the present invention.Technology belonging to the present invention The technical staff in field can make various modifications or additions to the described embodiments or by a similar method Substitution, however, it does not deviate from the spirit of the invention or beyond the scope of the appended claims.

Claims (4)

1. a kind of adaptive audio water mark method based on phase code, it is characterised in that: including telescopiny and detection process,
The telescopiny includes the following steps,
Step A1 reads audio file, obtains the audio signal x and sample rate f s1 of time domain, first divide the audio signal x of time domain Frame, frame length indicate with N, xnIt indicates n-th frame time-domain signal, then does time-frequency conversion, obtain the amplitude spectrum X of the audio signal of frequency domainnWith And phase spectrum
Step A2, the beginning of the preset insertion of frequency-portions according to sample rate f s1, frame length N and according to auditory perceptual sensitivity Frequency FWMIN, terminate frequency FWMAX, calculate the range that each frame frequency-region signal can be embedded in watermark, obtain the maximum value of this range Freqmax1 and minimum value freqmin1, chooses the frequency-domain audio signals within the scope of this;
Freqmin1=floor ((FWMIN × 2.0/fs1) × N)
Freqmax1=floor ((FWMAX × 2.0/fs1) × N)
Wherein, floor is downward bracket function;
It is pseudo- to generate the binary system that length is freqmax1-freqmin1+1 using key key as random number seed by step A3 Random frequency expansion sequence u;
Step A4, progress is as follows for the judgement of tonal content,
Pn[k]dB-Pn[k-j]dB≥1
Pn[k]dB-Pn[k-j]dB≥7
Wherein, k indicates the frequency local maximum point at place, and j is indicated with a distance from local maximum point k, Pn[k]dBIndicate n-th The signal power at local maximum point k of frame signal, Pn[k-j]dBIndicate the signal power value at maximum of points j;
According to judging result, using global masking threshold Thn, obtain the phase mask threshold θ of phase spectrumn
Step A5, according to pseudo-random sequence u, phase mask threshold θnWith watermark bit b, using following formula audio phase SpectrumThe upper insertion for carrying out watermark, the phase spectrum after obtaining insertion watermarkIt is as follows,
Wherein, α is constant, the intensity of control watermark insertion;
Utilize the amplitude spectrum X of frequency-region signalnWith the phase spectrum after insertion watermarkThen insertion watermark is obtained by Euler's formula Frequency-region signal afterwards is as follows,
Wherein, YnFor the frequency-region signal after insertion watermark, e is natural Exponents;
Step A6, by the frequency-region signal Y after insertion watermarknTransform to time-domain signal yn, generate the audio file for having watermark;Institute Detection process is stated to include the following steps,
Step B1 reads the time-domain audio file for having watermark, obtains the amplitude data z of the audio signal with watermark of time domain With sample rate f s2, to the framing of time-domain signal elder generation, frame length N, znFor the n-th frame of signal to be detected;Time-frequency conversion is done again, is obtained The amplitude spectrum Z of the audio signal of frequency domainnWith phase spectrum ξn
Step B2, according to sample rate f s2, the beginning of frame length N and the preset insertion of frequency-portions according to auditory perceptual sensitivity Frequency FWMIN, terminate frequency FWMAX, calculate the range that each frame frequency-region signal can be embedded in watermark, obtain the maximum value of this range Freqmax2 and minimum value freqmin2, chooses the amplitude spectrum of the audio within the scope of this;
Freqmin2=floor ((FWMIN × 2.0/fs2) × N)
Freqmax2=floor ((FWMAX × 2.0/fs2) × N)
Wherein, floor is downward bracket function;
It is pseudo- to generate the binary system that length is freqmax2-freqmin2+1 using key key as random number seed by step B3 Random frequency expansion sequence u;
Step B4 examines formula according to following ASSOCIATE STATISTICS, to the phase spectrum ξ of pseudo-random sequence u and signal n-th frame to be detectedn, Relevant calculation is done, the detection sufficient statistic r of suspect signal n-th frame signal is obtainedn
Wherein,<>indicates that the inner product of signal calculates;
If detecting sufficient statistic rn>=0, then the watermark bit b=1 detected;It otherwise is b=0.
2. the adaptive audio water mark method based on phase code according to claim 1, it is characterised in that: in step A4, The phase mask threshold θ of phase spectrum is obtained using triangle relationn,
3. a kind of adaptive audio watermaking system based on phase code, it is characterised in that: be embedded in subsystem including audio frequency watermark With adaptive audio watermark detection subsystem,
The audio frequency watermark insertion subsystem comprises the following modules,
First time-frequency convert module obtains the audio signal x and sample rate f s1 of time domain, to time domain for reading audio file The framing of audio signal x elder generation, frame length indicate with N, xnIt indicates n-th frame time-domain signal, then does time-frequency conversion, obtain the audio letter of frequency domain Number amplitude spectrum XnAnd phase spectrum
First insertion range selection module, for according to sample rate f s1, frame length N and according to the frequency portion of auditory perceptual sensitivity Divide the start frequency FWMIN of preset insertion, terminate frequency FWMAX, calculates the range that each frame frequency-region signal can be embedded in watermark, The maximum value freqmax1 and minimum value freqmin1 of this range are obtained, the frequency-domain audio signals within the scope of this are chosen;
Freqmin1=floor ((FWMIN × 2.0/fs1) × N)
Freqmax1=floor ((FWMAX × 2.0/fs1) × N)
Wherein, floor is downward bracket function;
First frequency expansion sequence generation module, for using key key as random number seed, generation length to be freqmax1- The pseudorandom frequency expansion sequence u of the binary system of freqmin1+1;
Improved psycho-acoustic module, it is as follows for the judgement of tonal content for carrying out,
Pn[k]dB-Pn[k-j]dB≥1
Pn[k]dB-Pn[k-j]dB≥7
Wherein, k indicates the frequency local maximum point at place, and j is indicated with a distance from local maximum point k, Pn[k]dBIndicate n-th The signal power at local maximum point k of frame signal, Pn[k-j]dBIndicate the signal power value at maximum of points j;
According to judging result, using global masking threshold Thn, obtain the phase mask threshold θ of phase spectrumn
Additive insertion module, for according to pseudo-random sequence u, phase mask threshold θnWith watermark bit b, existed using following formula The phase spectrum of audioThe upper insertion for carrying out watermark, the phase spectrum after obtaining insertion watermarkIt is as follows,
Wherein, α is constant, the intensity of control watermark insertion;
Utilize the amplitude spectrum X of frequency-region signalnWith the phase spectrum after insertion watermarkThen insertion watermark is obtained by Euler's formula Frequency-region signal afterwards is as follows,
Wherein, YnFor the frequency-region signal after insertion watermark, e is natural Exponents;
Time-frequency inverse transform module, for the frequency-region signal Y after watermark will to be embedded innTransform to time-domain signal yn, generate with watermark Audio file;
The adaptive audio watermark detection subsystem comprises the following modules,
Second time-frequency convert module obtains the audio with watermark of time domain for reading the time-domain audio file for having watermark The amplitude data z and sample rate f s2 of signal, to the framing of time-domain signal elder generation, frame length N, znFor the n-th frame of signal to be detected;Again Time-frequency conversion is done, the amplitude spectrum Z of the audio signal of frequency domain is obtainednWith phase spectrum ξn
Second insertion range selection module, for according to sample rate f s2, frame length N and according to the frequency portion of auditory perceptual sensitivity Divide the start frequency FWMIN of preset insertion, terminate frequency FWMAX, calculates the range that each frame frequency-region signal can be embedded in watermark, The maximum value freqmax2 and minimum value freqmin2 of this range are obtained, the amplitude spectrum of the audio within the scope of this is chosen;
Freqmin2=floor ((FWMIN × 2.0/fs2) × N)
Freqmax2=floor ((FWMAX × 2.0/fs2) × N)
Wherein, floor is downward bracket function;
Second frequency expansion sequence generation module, for using key key as random number seed, generation length to be freqmax2- The pseudorandom frequency expansion sequence u of the binary system of freqmin2+1;
Associated extraction module, for examining formula according to following ASSOCIATE STATISTICS, to pseudo-random sequence u and signal n-th frame to be detected Phase spectrum ξn, relevant calculation is done, the detection sufficient statistic r of suspect signal n-th frame signal is obtainedn
Wherein,<>indicates that the inner product of signal calculates;
If detecting sufficient statistic rn>=0, then the watermark bit b=1 detected;It otherwise is b=0.
4. the adaptive audio watermaking system based on phase code according to claim 3, it is characterised in that: improved psychology In acoustic module, the phase mask threshold θ of phase spectrum is obtained using triangle relationn,
CN201610458411.4A 2016-06-22 2016-06-22 Adaptive audio water mark method and system based on phase code Expired - Fee Related CN105976823B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610458411.4A CN105976823B (en) 2016-06-22 2016-06-22 Adaptive audio water mark method and system based on phase code

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610458411.4A CN105976823B (en) 2016-06-22 2016-06-22 Adaptive audio water mark method and system based on phase code

Publications (2)

Publication Number Publication Date
CN105976823A CN105976823A (en) 2016-09-28
CN105976823B true CN105976823B (en) 2019-06-25

Family

ID=57021552

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610458411.4A Expired - Fee Related CN105976823B (en) 2016-06-22 2016-06-22 Adaptive audio water mark method and system based on phase code

Country Status (1)

Country Link
CN (1) CN105976823B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220020383A1 (en) * 2020-02-04 2022-01-20 Beijing Dajia Internet Information Technology Co., Ltd. Method for adding watermark information, method for extracting watermark information, and electronic device

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2881941A1 (en) * 2013-12-09 2015-06-10 Thomson Licensing Method and apparatus for watermarking an audio signal
CN109102814B (en) * 2018-09-13 2020-12-01 河海大学 Audio watermarking method for down-phase of DCT (discrete cosine transform)
CN109714284B (en) * 2018-11-27 2020-06-30 华中科技大学 Radio frequency watermark detection method based on K-S detection
US11914090B2 (en) * 2019-08-28 2024-02-27 Pgs Geophysical As Mitigating residual noise in a marine survey with orthogonal coded pseudo-random sweeps
CN113362835B (en) * 2020-03-05 2024-06-07 杭州网易云音乐科技有限公司 Audio watermarking method, device, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6539095B1 (en) * 1993-11-18 2003-03-25 Geoffrey B. Rhoads Audio watermarking to convey auxiliary control information, and media embodying same
CN101101754A (en) * 2007-06-25 2008-01-09 中山大学 Steady audio-frequency water mark method based on Fourier discrete logarithmic coordinate transformation
CN101740033A (en) * 2008-11-24 2010-06-16 华为技术有限公司 Audio coding method and audio coder
CN102074240A (en) * 2010-12-24 2011-05-25 中国科学院声学研究所 Digital audio watermarking algorithm for copyright management
CN102142258A (en) * 2011-03-31 2011-08-03 上海第二工业大学 Wavelet transform and Arnold based adaptive gray-scale watermark embedded method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9401153B2 (en) * 2012-10-15 2016-07-26 Digimarc Corporation Multi-mode audio recognition and auxiliary data encoding and decoding

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6539095B1 (en) * 1993-11-18 2003-03-25 Geoffrey B. Rhoads Audio watermarking to convey auxiliary control information, and media embodying same
CN101101754A (en) * 2007-06-25 2008-01-09 中山大学 Steady audio-frequency water mark method based on Fourier discrete logarithmic coordinate transformation
CN101740033A (en) * 2008-11-24 2010-06-16 华为技术有限公司 Audio coding method and audio coder
CN102074240A (en) * 2010-12-24 2011-05-25 中国科学院声学研究所 Digital audio watermarking algorithm for copyright management
CN102142258A (en) * 2011-03-31 2011-08-03 上海第二工业大学 Wavelet transform and Arnold based adaptive gray-scale watermark embedded method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
一种基于相位的音频数字水印算法;曾满红等;《微处理机》;20060430(第2期);第48-49,52页

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220020383A1 (en) * 2020-02-04 2022-01-20 Beijing Dajia Internet Information Technology Co., Ltd. Method for adding watermark information, method for extracting watermark information, and electronic device

Also Published As

Publication number Publication date
CN105976823A (en) 2016-09-28

Similar Documents

Publication Publication Date Title
CN105976823B (en) Adaptive audio water mark method and system based on phase code
Lei et al. Robust SVD-based audio watermarking scheme with differential evolution optimization
Djebbar et al. A view on latest audio steganography techniques
CN101271690B (en) Audio spread-spectrum watermark processing method for protecting audio data
US9812147B2 (en) System and method for generating an audio signal representing the speech of a user
CN101101754B (en) Steady audio-frequency water mark method based on Fourier discrete logarithmic coordinate transformation
CN104658542B (en) Based on orthogonal additivity spread spectrum audio frequency watermark embedding grammar, detection method and system
US20140119548A1 (en) Device comprising a plurality of audio sensors and a method of operating the same
JP4896455B2 (en) Data embedding device, data embedding method, data extracting device, and data extracting method
CN102074238A (en) Linear interference cancellation-based speech secrete communication method
CN110163787A (en) Digital audio Robust Blind Watermarking Scheme embedding grammar based on dual-tree complex wavelet transform
Baras et al. Controlling the inaudibility and maximizing the robustness in an audio annotation watermarking system
Dutta et al. Audio watermarking using pseudorandom sequences based on biometric templates.
CN1647186A (en) Time domain watermarking of multimedia signals
Datta et al. Robust multi layer audio steganography
KR100814792B1 (en) Digital audio watermarking method using hybrid transform
Singh A survey on audio steganography approaches
KR20020031654A (en) Method and apparatus for embedding watermarks using fast fourier transformed data
Dieu et al. An improved technique for hiding data in audio
Nita et al. Tic-tac, forgery time has run-up! live acoustic watermarking for integrity check in forensic applications
Dutta et al. Blind watermarking in audio signals using biometric features in wavelet domain
Wu et al. Adaptive audio watermarking based on SNR in localized regions
Nishimura Data hiding for audio signals that are robust with respect to air transmission and a speech codec
Hernaez et al. Speech watermarking based on coding of the harmonic phase
US20160379653A1 (en) Method and apparatus for increasing the strength of phase-based watermarking of an audio signal

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20190625