CN105976823B

CN105976823B - Adaptive audio watermarking method and system based on phase coding

Info

Publication number: CN105976823B
Application number: CN201610458411.4A
Authority: CN
Inventors: 陈怡�; 高戈; 张康; 刘影; 吕冰
Original assignee: Huazhong Normal University
Current assignee: Huazhong Normal University
Priority date: 2016-06-22
Filing date: 2016-06-22
Publication date: 2019-06-25
Anticipated expiration: 2036-06-22
Also published as: CN105976823A

Abstract

The present invention provides an adaptive audio watermarking method and system based on phase coding. The embedding process includes performing an FFT time-frequency transform, calculating the range within which each frame of the frequency-domain signal can carry the watermark, generating a binary pseudo-random spreading sequence, making the decision for tonal components, and obtaining the phase masking threshold of the phase spectrum from the global masking threshold; the watermarked phase spectrum is then obtained and a watermarked audio file is generated. The detection process includes performing the FFT time-frequency transform, calculating the range within which each frame of the frequency-domain signal can carry the watermark, generating the binary pseudo-random spreading sequence, and performing a correlation statistical test to obtain the watermark bit. The invention relaxes the decision for tonal regions of the spectrum so that the computed global masking threshold is more accurate, and adaptively adjusts the watermark embedding strength on the phase spectrum according to the masking threshold of the phase angle, so that the embedding strength is maximized while the audio watermark remains imperceptible, thereby ensuring the robustness of the audio watermark.

Description

Adaptive audio watermarking method and system based on phase coding

Technical field

The present invention relates to the field of digital audio watermarking, and more particularly to an adaptive audio watermarking method and system based on phase coding.

Background technique

Digital audio watermarking is a signal processing operation that adds certain digital information to an audio signal for purposes such as file authentication, copyright protection and information hiding. An adaptive audio watermarking system based on phase coding dynamically adjusts the embedding strength of the watermark on the phase spectrum according to a psychoacoustic model, so that the robustness of the audio watermark is maximized while imperceptibility is maintained. Traditional phase-coding audio watermark embedding algorithms add a watermark of fixed strength directly to the phase spectrum. If the embedding strength is too large, noise is easily produced and the sound quality suffers; if it is too small, the watermark is hard to detect and robustness suffers. Moreover, an audio signal changes dynamically: an embedding strength that suits some regions may be too large or too small for others. With such an embedding scheme, the imperceptibility and the robustness of the audio watermark cannot be satisfied at the same time.

Summary of the invention

The object of the present invention is to provide an adaptive audio watermarking scheme based on phase coding in which the watermark embedding strength on the phase spectrum adapts to the audio signal, so as to balance the imperceptibility and the robustness of the audio watermark.

The technical solution of the present invention provides an adaptive audio watermarking method based on phase coding, comprising an embedding process and a detection process,

The embedding process comprises the following steps,

Step A1, read the audio file to obtain the time-domain audio signal x and the sample rate fs1; first divide the time-domain audio signal x into frames, where the frame length is denoted N and x_n denotes the n-th frame of the time-domain signal; then perform a time-frequency transform to obtain the amplitude spectrum X_n and the phase spectrum of the frequency-domain audio signal;

Step A2, according to the sample rate fs1, the frame length N, and the embedding start frequency FWMIN and end frequency FWMAX preset from the frequency region to which hearing is sensitive, calculate the range within which each frame of the frequency-domain signal can carry the watermark, obtain the maximum value freqmax1 and the minimum value freqmin1 of this range, and select the frequency-domain audio signal within this range;

freqmin1=floor ((FWMIN × 2.0/fs1) × N)

freqmax1=floor ((FWMAX × 2.0/fs1) × N)

where floor is the round-down function;

Step A3, using the key key as the random number seed, generate a binary pseudo-random spreading sequence u of length freqmax1-freqmin1+1;

Step A4, the decision for tonal components is made as follows,

P_n[k]_dB-P_n[k-j]_dB≥1

P_n[k]_dB-P_n[k-j]_dB≥7

where k denotes the frequency index of a local maximum point, j denotes the distance from the local maximum point k, P_n[k]_dB denotes the signal power of the n-th frame at the local maximum point k, and P_n[k-j]_dB denotes the signal power at distance j from the local maximum point;

According to the decision result, the global masking threshold Th_n is used to obtain the phase masking threshold θ_n of the phase spectrum;

Step A5, according to the pseudo-random sequence u, the phase masking threshold θ_n and the watermark bit b, embed the watermark on the phase spectrum of the audio with the embedding formula to obtain the watermarked phase spectrum,

where α is a constant that controls the watermark embedding strength;

Using the amplitude spectrum X_n of the frequency-domain signal and the watermarked phase spectrum, the watermarked frequency-domain signal is then obtained by Euler's formula,

where Y_n is the watermarked frequency-domain signal and e is the base of the natural exponential;

Step A6, transform the watermarked frequency-domain signal Y_n into the time-domain signal y_n and generate the watermarked audio file;

The detection process comprises the following steps,

Step B1, read the watermarked time-domain audio file to obtain the amplitude data z of the watermarked time-domain audio signal and the sample rate fs2; first divide the time-domain signal into frames of length N, where z_n is the n-th frame of the signal to be detected; then perform a time-frequency transform to obtain the amplitude spectrum Z_n and the phase spectrum ξ_n of the frequency-domain audio signal;

Step B2, according to the sample rate fs2, the frame length N, and the embedding start frequency FWMIN and end frequency FWMAX preset from the frequency region to which hearing is sensitive, calculate the range within which each frame of the frequency-domain signal can carry the watermark, obtain the maximum value freqmax2 and the minimum value freqmin2 of this range, and select the amplitude spectrum of the audio within this range;

freqmin2=floor ((FWMIN × 2.0/fs2) × N)

freqmax2=floor ((FWMAX × 2.0/fs2) × N)

where floor is the round-down function;

Step B3, using the key key as the random number seed, generate a binary pseudo-random spreading sequence u of length freqmax2-freqmin2+1;

Step B4, according to the correlation statistical test formula, correlate the pseudo-random sequence u with the phase spectrum ξ_n of the n-th frame of the signal to be detected to obtain the detection sufficient statistic r_n of the n-th frame of the suspect signal,

where <> denotes the inner product of signals;

If the detection sufficient statistic r_n≥0, the detected watermark bit is b=1; otherwise b=0.

Moreover, in step A4, the phase masking threshold θ_n of the phase spectrum is obtained using the triangle relation.

The present invention correspondingly provides an adaptive audio watermarking system based on phase coding, comprising an audio watermark embedding subsystem and an adaptive audio watermark detection subsystem,

The audio watermark embedding subsystem comprises the following modules,

A first time-frequency transform module, for reading the audio file to obtain the time-domain audio signal x and the sample rate fs1, first dividing the time-domain audio signal x into frames, where the frame length is denoted N and x_n denotes the n-th frame of the time-domain signal, and then performing a time-frequency transform to obtain the amplitude spectrum X_n and the phase spectrum of the frequency-domain audio signal;

A first embedding range selection module, for calculating, according to the sample rate fs1, the frame length N, and the embedding start frequency FWMIN and end frequency FWMAX preset from the frequency region to which hearing is sensitive, the range within which each frame of the frequency-domain signal can carry the watermark, obtaining the maximum value freqmax1 and the minimum value freqmin1 of this range, and selecting the frequency-domain audio signal within this range;

freqmin1=floor ((FWMIN × 2.0/fs1) × N)

freqmax1=floor ((FWMAX × 2.0/fs1) × N)

where floor is the round-down function;

A first spreading sequence generation module, for generating, with the key key as the random number seed, a binary pseudo-random spreading sequence u of length freqmax1-freqmin1+1;

An improved psychoacoustic module, for making the decision for tonal components as follows,

P_n[k]_dB-P_n[k-j]_dB≥1

P_n[k]_dB-P_n[k-j]_dB≥7

An additive embedding module, for embedding the watermark on the phase spectrum of the audio with the embedding formula, according to the pseudo-random sequence u, the phase masking threshold θ_n and the watermark bit b, to obtain the watermarked phase spectrum,

where α is a constant that controls the watermark embedding strength;

A time-frequency inverse transform module, for transforming the watermarked frequency-domain signal Y_n into the time-domain signal y_n and generating the watermarked audio file;

The adaptive audio watermark detection subsystem comprises the following modules,

A second time-frequency transform module, for reading the watermarked time-domain audio file to obtain the amplitude data z of the watermarked time-domain audio signal and the sample rate fs2, first dividing the time-domain signal into frames of length N, where z_n is the n-th frame of the signal to be detected, and then performing a time-frequency transform to obtain the amplitude spectrum Z_n and the phase spectrum ξ_n of the frequency-domain audio signal;

A second embedding range selection module, for calculating, according to the sample rate fs2, the frame length N, and the embedding start frequency FWMIN and end frequency FWMAX preset from the frequency region to which hearing is sensitive, the range within which each frame of the frequency-domain signal can carry the watermark, obtaining the maximum value freqmax2 and the minimum value freqmin2 of this range, and selecting the amplitude spectrum of the audio within this range;

freqmin2=floor ((FWMIN × 2.0/fs2) × N)

freqmax2=floor ((FWMAX × 2.0/fs2) × N)

where floor is the round-down function;

A second spreading sequence generation module, for generating, with the key key as the random number seed, a binary pseudo-random spreading sequence u of length freqmax2-freqmin2+1;

A correlation extraction module, for correlating, according to the correlation statistical test formula, the pseudo-random sequence u with the phase spectrum ξ_n of the n-th frame of the signal to be detected, to obtain the detection sufficient statistic r_n of the n-th frame of the suspect signal,

where <> denotes the inner product of signals;

Moreover, in the improved psychoacoustic module, the phase masking threshold θ_n of the phase spectrum is obtained using the triangle relation.

The present invention chooses to embed the watermark on the phase spectrum of the audio signal because the human ear is insensitive to phase modification. By relaxing the decision for tonal regions of the spectrum in psychoacoustic model 1, more tonal components are obtained, which makes the computed global masking threshold more accurate. The masking threshold of the phase angle is then obtained from the triangle relation between the modifiable amplitude and the modifiable phase angle, so that the watermark embedding strength on the phase spectrum can be adjusted adaptively according to the masking threshold of the phase angle. This maximizes the embedding strength while the audio watermark remains imperceptible, thereby ensuring the robustness of the audio watermark. The technical solution of the present invention has significant market value.

Brief description of the drawings

Fig. 1 is a structural block diagram of the embedding subsystem of the embodiment of the present invention.

Fig. 2 is a structural block diagram of the detection subsystem of the embodiment of the present invention.

Fig. 3 is a flow chart of the embedding process of the embodiment of the present invention.

Fig. 4 is a flow chart of the detection process of the embodiment of the present invention.

Specific embodiment

The technical solution of the present invention is further described below with reference to the drawings and a specific embodiment.

The embodiment of the present invention provides an adaptive audio watermarking system based on phase coding, comprising an audio watermark embedding subsystem and an adaptive audio watermark detection subsystem.

Referring to Fig. 1, the adaptive audio watermark embedding subsystem based on phase coding provided by the embodiment of the present invention comprises a first time-frequency transform module 1, a first embedding range selection module 2, a first spreading sequence generation module 3, an improved psychoacoustic module 4, an additive embedding module 5 and a time-frequency inverse transform module 6. In a specific implementation, each module can be realized using software solidification (firmware) technology.

The first time-frequency transform module 1 converts the time-domain audio signal that has been read into a frequency-domain signal, and outputs the relevant information of the time-domain audio signal together with the frequency-domain signal to the first embedding range selection module 2;

The first embedding range selection module 2 calculates the range within which this frequency-domain signal can carry the watermark from the information (sample rate) of the time-domain audio signal, the frequency-domain signal, and the frequency range to which the human ear is more sensitive, and outputs the maximum and minimum values of the embedding range to the first spreading sequence generation module 3;

The first spreading sequence generation module 3 generates, from the random number seed and the maximum and minimum values of the embedding range supplied by the embedding range selection module 2, a uniformly distributed random sequence whose length equals that of the embedding range and whose values are 1 or -1, and outputs this random sequence to the additive embedding module 5;

The improved psychoacoustic module 4 relaxes the decision condition for tonal regions in psychoacoustic model 1 so that more tonal regions are found, which gives a better amplitude masking threshold; it then obtains the adjustable phase-angle threshold from the triangle relation between the changeable threshold and the original amplitude, and outputs the phase-angle threshold to the additive embedding module 5;

The additive embedding module 5 generates the frequency-domain audio signal carrying the watermark information and outputs it to the time-frequency inverse transform module 6;

The time-frequency inverse transform module 6 converts the frequency-domain audio signal carrying the watermark information, supplied by the additive embedding module 5, into a time-domain audio signal carrying the watermark information and generates an audio file, which is the audio file carrying the watermark information.

Referring to Fig. 2, the adaptive audio watermark detection subsystem based on phase coding provided by the embodiment of the present invention comprises a second time-frequency transform module 7, a second embedding range selection module 8, a second spreading sequence generation module 9 and a correlation extraction module 10. In a specific implementation, each module can be realized using software solidification (firmware) technology.

The second time-frequency transform module 7 has essentially the same function as module 1 and outputs its result to the embedding range selection module 8;

The second embedding range selection module 8 has essentially the same function as module 2; it outputs the maximum and minimum values of the embedding range to the spreading sequence generation module 9, and outputs the signal within the embedding range to the correlation detection module 10;

The second spreading sequence generation module 9 has essentially the same function as module 3 and outputs its result to the correlation detection module 10;

The correlation detection module 10 calculates the correlation between the signal supplied by the embedding range selection module 8 and the spreading sequence supplied by the spreading sequence generation module 9, and decides the watermark according to the sign of the correlation.

For the specific implementation of each module, refer to the corresponding method steps; the details are not repeated here.

The embodiment of the present invention provides an adaptive audio watermarking method based on phase coding, comprising an embedding process and a detection process.

Referring to Fig. 3, the adaptive audio watermark embedding process based on phase coding provided by the embodiment of the present invention can be run automatically by means of computer software, and specifically comprises the following steps:

Step A1, read the audio file to obtain the time-domain audio signal x and the sample rate fs1; first divide the time-domain signal into frames (the frame length is denoted N, and x_n denotes the n-th frame of the time-domain signal), then perform a time-frequency transform (for example the FFT, fast Fourier transform), and take the amplitude spectrum X_n and the phase spectrum of the frequency-domain audio signal respectively.
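Step A1 can be illustrated with a minimal Python sketch that uses only the standard library. The function names (split_frames, dft, amplitude_and_phase), the non-overlapping framing, and the naive DFT used in place of an FFT routine are assumptions made for illustration; the text only states that the signal is framed with frame length N and then transformed.

```python
import cmath
import math

def split_frames(x, N):
    """Cut x into consecutive frames of N samples.

    Assumption: frames do not overlap and an incomplete last frame is dropped.
    """
    return [x[i:i + N] for i in range(0, len(x) - N + 1, N)]

def dft(frame):
    """Naive O(N^2) discrete Fourier transform, standing in for an FFT routine."""
    N = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / N) for t in range(N))
        for k in range(N)
    ]

def amplitude_and_phase(frame):
    """Return the amplitude spectrum X_n and the phase spectrum of one frame."""
    spectrum = dft(frame)
    return [abs(c) for c in spectrum], [cmath.phase(c) for c in spectrum]

# Illustrative input: a 1000 Hz cosine sampled at 8000 Hz, frame length 8.
fs1 = 8000
N = 8
x = [math.cos(2 * math.pi * 1000 * t / fs1) for t in range(20)]
frames = split_frames(x, N)
X_0, phase_0 = amplitude_and_phase(frames[0])
```

With these illustrative values the cosine falls exactly on bin 1, so the amplitude spectrum of the first frame is concentrated in bins 1 and 7. A real implementation would normally call an FFT library and use a much longer frame.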

Step A2, according to the sample rate fs1, the frame length N and the frequency range to which the human ear is more sensitive (which those skilled in the art can set themselves according to auditory perception characteristics, for example 1000-7000 Hz), calculate the range within which a frame of the frequency-domain signal can carry the watermark; the maximum value of this range is freqmax1 and the minimum value is freqmin1; select the frequency-domain audio signal within this range;

freqmin1=floor ((FWMIN × 2.0/fs1) × N) (1)

freqmax1=floor ((FWMAX × 2.0/fs1) × N) (2)

FWMIN and FWMAX denote the lowest and highest frequencies of the range to which the human ear is more sensitive, that is, the embedding start frequency and end frequency preset from the frequency region to which hearing is sensitive; floor is the round-down function.
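Formulas (1) and (2) can be evaluated directly. The sketch below uses the 1000-7000 Hz example range from the text; the sample rate of 44100 Hz, the frame length of 1024 and the function name embed_bin_range are illustrative assumptions. Note that the factor 2.0/fs1 normalizes each frequency by the Nyquist frequency fs1/2.

```python
import math

def embed_bin_range(fw_min, fw_max, fs, N):
    """Evaluate formulas (1) and (2): lowest and highest embeddable index."""
    freq_min = math.floor((fw_min * 2.0 / fs) * N)
    freq_max = math.floor((fw_max * 2.0 / fs) * N)
    return freq_min, freq_max

# Assumed example values: FWMIN = 1000 Hz, FWMAX = 7000 Hz, fs1 = 44100 Hz, N = 1024.
freqmin1, freqmax1 = embed_bin_range(1000.0, 7000.0, 44100.0, 1024)

# Length of the spreading sequence generated in the next step.
sequence_length = freqmax1 - freqmin1 + 1
```

For these values the range is 46 to 325, so the spreading sequence has 280 elements.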

Step A3, using the key key as the random number seed, generate a binary pseudo-random spreading sequence u of length freqmax1-freqmin1+1. The detailed process of the embodiment in MATLAB is as follows:

First, using the key key, the RandStream function (random seed function) is called to initialize the rand function (random number generation function), and the rand function is then called to generate random numbers. Because rand generates numbers between 0 and 1, these numbers must also be rounded to give a binary pseudo-random sequence of 0s and 1s; this unipolar pseudo-random sequence is then converted into a bipolar pseudo-random sequence u containing only +1 and -1.
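The same three stages (seed with the key, round uniform numbers to 0 or 1, map to +1 and -1) can be sketched in Python. This is an analogue of the MATLAB procedure, not a port: Python's random.Random produces a different stream from MATLAB's RandStream, so the two will not yield the same sequence for the same key. The function name spreading_sequence and the example key 2016 are assumptions.

```python
import random

def spreading_sequence(key, length):
    """Generate a bipolar (+1 / -1) pseudo-random sequence seeded by key."""
    rng = random.Random(key)                  # seed the generator with the key
    unipolar = [round(rng.random()) for _ in range(length)]  # 0 or 1
    return [2 * bit - 1 for bit in unipolar]  # map 0 -> -1 and 1 -> +1

# Assumed example: key 2016 and the 280-element range from a 1024-point frame.
u = spreading_sequence(key=2016, length=280)
```

Because the embedder and the detector both derive u from the key, calling the function again with the same key and length must return the same sequence; that is the only property the detector relies on.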

Step A4, modify the decision for tonal components in ISO-MPEG psychoacoustic model 1 so that more tonal components are obtained and a more accurate masking threshold of the amplitude signal results. The final masking threshold does not use the minimum masking threshold within each subband; instead the global masking threshold Th_n is adopted directly, and the phase masking threshold θ_n of the phase spectrum is then obtained using the triangle relation.

The detailed process of the embodiment is as follows:

The decision condition for tonal regions of the spectrum in ISO-MPEG psychoacoustic model 1, namely that a local maximum point k of the power spectrum P_n must exceed all nearby frequency points by 7 dB, is changed to: the local maximum must exceed all nearby frequency points by 1 dB, and must exceed at least one of them by 7 dB.

P_n[k]_dB-P_n[k-j]_dB≥1

P_n[k]_dB-P_n[k-j]_dB≥7

where k denotes the frequency index of a local maximum point, j denotes the distance from the local maximum point k, P_n[k]_dB denotes the signal power of the n-th frame at the local maximum point k, and P_n[k-j]_dB denotes the signal power at distance j from the local maximum point.
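The relaxed tonal decision can be sketched as a small predicate. The neighbourhood used here (distances 1 and 2 on both sides of k) and the function name is_tonal are assumptions; the text does not state which distances j are examined. Passing low_db=7.0 reproduces the stricter rule in which every neighbour must be exceeded by 7 dB.

```python
def is_tonal(power_db, k, neighbours=(1, 2), low_db=1.0, high_db=7.0):
    """Relaxed tonal test for the local maximum at index k.

    The peak must exceed every examined neighbour by at least low_db
    and at least one examined neighbour by at least high_db.
    """
    diffs = []
    for j in neighbours:
        for idx in (k - j, k + j):
            if idx < 0 or idx >= len(power_db):
                return False  # assumption: peaks too close to the edge are skipped
            diffs.append(power_db[k] - power_db[idx])
    return all(d >= low_db for d in diffs) and any(d >= high_db for d in diffs)

# Illustrative power spectrum in dB with a peak at index 3.
power_db = [40.0, 50.0, 52.0, 60.0, 58.0, 51.0, 40.0]
relaxed = is_tonal(power_db, 3)             # differences are 8, 2, 10 and 9 dB
strict = is_tonal(power_db, 3, low_db=7.0)  # fails: one neighbour is only 2 dB lower
```

The example peak is accepted by the relaxed rule and rejected by the strict one, which is the effect the modification is meant to have: more spectral peaks are classified as tonal.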

Based on the modified decision condition above, once the decision result for tonal components has been obtained, the global masking threshold Th_n is calculated. The global masking threshold is the maximum amount by which the signal amplitude can be modified without audible distortion. In the two-dimensional plane formed by the real and imaginary axes, the circle centred on a frequency-domain point with the masking threshold as its radius is the region within which that point can be modified. When the line joining the modified frequency-domain point and the origin is tangent to this circle, the change in phase is largest; this largest admissible change of the phase angle is the phase masking threshold, and the triangle relation gives the phase masking threshold θ_n.
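The tangent construction can be made concrete. A tangent from the origin meets the circle at a right angle to its radius, so the origin, the original frequency-domain point and the tangent point form a right triangle whose hypotenuse is the original amplitude and whose opposite side is the masking threshold; this gives sin θ = Th / X, that is θ = arcsin(Th / X). The sketch below assumes the threshold and the amplitude are expressed in the same linear units (a threshold given in dB would first have to be converted), and the cap at π/2 when the circle reaches the origin is also an assumption. The function name is hypothetical.

```python
import math

def phase_mask_threshold(amplitude, masking_threshold):
    """Largest phase change that keeps a bin inside its masking circle.

    amplitude and masking_threshold are assumed to be linear magnitudes.
    """
    if amplitude <= 0.0 or masking_threshold >= amplitude:
        # Assumed cap: the circle contains the origin, so no tangent exists.
        return math.pi / 2
    return math.asin(masking_threshold / amplitude)

# A bin of amplitude 2.0 with a masking threshold of 1.0 may move by 30 degrees.
theta_example = phase_mask_threshold(2.0, 1.0)
```

A bin with a large amplitude relative to its threshold therefore receives a small admissible phase change, and a strongly masked bin receives a large one, which is what makes the embedding strength adaptive.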

Step A5, according to the pseudo-random sequence u, the phase masking threshold θ_n and the watermark bit b, embed the watermark on the phase spectrum of the audio with the embedding formula to obtain the watermarked phase spectrum.

Here, α is a constant that controls the watermark embedding strength; those skilled in the art can preset its value in a specific implementation.
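The embedding formula itself is not reproduced in this text, so the sketch below should be read as one plausible additive rule built from the quantities the step names, not as the patented formula. It assumes that bit 1 maps to sign +1 and bit 0 to sign -1, and that each bin in the embedding range is shifted by α times the sign times the spreading chip times that bin's phase masking threshold. The function name embed_bit and all example values are hypothetical.

```python
def embed_bit(phase, u, theta, b, alpha, freq_min):
    """Assumed additive phase embedding over the bins freq_min .. freq_min+len(u)-1.

    phase : phase spectrum of one frame (list of radians)
    u     : bipolar spreading sequence (+1 / -1)
    theta : phase masking threshold for each bin in the embedding range
    b     : watermark bit, 0 or 1
    alpha : embedding strength constant
    """
    sign = 1 if b == 1 else -1          # assumed mapping of the bit to a polarity
    marked = list(phase)                # leave bins outside the range untouched
    for i, chip in enumerate(u):
        k = freq_min + i
        marked[k] = phase[k] + alpha * sign * chip * theta[i]
    return marked

# Illustrative frame of 8 bins with a 3-bin embedding range starting at bin 2.
phase = [0.0] * 8
u = [1, -1, 1]
theta = [0.2, 0.4, 0.1]
marked_one = embed_bit(phase, u, theta, b=1, alpha=0.5, freq_min=2)
marked_zero = embed_bit(phase, u, theta, b=0, alpha=0.5, freq_min=2)
```

Under this assumed rule, choosing α no larger than 1 keeps every phase shift within the masking threshold of its bin, which matches the stated goal of maximizing strength while staying imperceptible.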

Using the amplitude spectrum X_n of the frequency-domain signal and the watermarked phase spectrum, the watermarked frequency-domain signal is then obtained by Euler's formula.

Here, Y_n is the watermarked frequency-domain signal and e is the base of the natural exponential.
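Euler's formula combines an amplitude and a phase into one complex spectral value. Writing the watermarked phase spectrum as \hat{\varphi}_n (a placeholder symbol assumed here, since the symbol is not legible in this text), with k indexing the frequency bins and i the imaginary unit, the relation would read:

```latex
Y_n[k] = X_n[k]\, e^{\,i\,\hat{\varphi}_n[k]}
       = X_n[k]\left(\cos\hat{\varphi}_n[k] + i\,\sin\hat{\varphi}_n[k]\right)
```

The amplitude spectrum X_n is left unchanged, so the watermark is carried entirely by the phase.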

Step A6, transform the watermarked frequency-domain signal Y_n into the time-domain signal y_n and finally generate an audio file, which is the watermarked audio file.
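One practical point when returning to the time domain is that the inverse transform yields a real-valued signal only if the spectrum stays conjugate-symmetric. The sketch below shows this with a naive inverse DFT: a phase change applied to bin k is mirrored with the opposite sign in bin N-k. The text does not say how the embodiment handles the mirrored bins, so this treatment, like the function names, is an assumption.

```python
import cmath
import math

def inverse_dft(spectrum):
    """Naive O(N^2) inverse DFT, standing in for an inverse FFT routine."""
    N = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / N) for k in range(N)) / N
        for t in range(N)
    ]

def mirror_phase_change(spectrum, k, delta):
    """Rotate bin k by +delta and bin N-k by -delta (requires 0 < k < N/2)."""
    N = len(spectrum)
    out = list(spectrum)
    out[k] = spectrum[k] * cmath.exp(1j * delta)
    out[N - k] = spectrum[N - k] * cmath.exp(-1j * delta)
    return out

# Spectrum of an 8-sample cosine on bin 1, then a 0.3 rad phase change.
Y = [0j, 4 + 0j, 0j, 0j, 0j, 0j, 0j, 4 + 0j]
Y_marked = mirror_phase_change(Y, 1, 0.3)
y = inverse_dft(Y_marked)
```

The result is the same cosine advanced in phase by 0.3 rad, with imaginary parts that vanish up to rounding error, so the real parts can be written to the output audio file.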

For the specific implementation of each module, refer to the corresponding method steps; the details are not repeated here. The adaptive audio watermark detection method based on phase coding provided by the embodiment of the present invention comprises an embedding process and a detection process.

Referring to Fig. 4, the adaptive audio watermark detection process based on phase coding provided by the embodiment of the present invention can be run automatically by means of computer software, and specifically comprises the following steps:

Step B1, read the watermarked time-domain audio file to obtain the amplitude data z of the watermarked time-domain audio signal and the sample rate fs2; first divide the time-domain signal into frames (the frame length is likewise N, and z_n is the n-th frame of the signal to be detected), then perform a time-frequency transform (for example the FFT, fast Fourier transform) to obtain the amplitude spectrum Z_n and the phase spectrum ξ_n of the frequency-domain audio signal.

Step B2, according to the sample rate fs2, the frame length N and the frequency range to which the human ear is more sensitive, calculate the range within which this frequency-domain signal can carry the watermark; the maximum value of this range is freqmax2 and the minimum value is freqmin2; select the amplitude spectrum of the audio within this range;

freqmin2=floor ((FWMIN × 2.0/fs2) × N) (8)

freqmax2=floor ((FWMAX × 2.0/fs2) × N) (9)

FWMIN and FWMAX denote the lowest and highest frequencies of the range to which the human ear is more sensitive, that is, the embedding start frequency and end frequency preset from the frequency region to which hearing is sensitive; floor is the round-down function built into MATLAB.

Step B3, using the key key, generate the bipolar two-valued pseudo-random sequence u containing only +1 and -1 in the same way as during watermark embedding. That is, using the key key as the random number seed, generate a binary pseudo-random spreading sequence u of length freqmax2-freqmin2+1.

Step B4, according to the correlation statistical test formula (10), correlate the pseudo-random sequence u with the phase spectrum ξ_n of the n-th frame of the signal to be detected to obtain the detection sufficient statistic r_n of the n-th frame of the suspect signal.

In the formula, <> denotes the inner product of signals.
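The correlation test can be sketched as follows. Formula (10) is not reproduced in this text, so the statistic is assumed here to be the plain inner product of u with the phase values in the embedding range, without normalization. The decision rule (bit 1 when r_n is at least 0, otherwise bit 0) is the one stated in the summary of the invention. The function name detect_bit and the example values are hypothetical.

```python
def detect_bit(phase, u, freq_min):
    """Correlate u with the phase spectrum over the embedding range.

    Returns the detected bit and the statistic r_n (assumed: plain inner product).
    """
    r = sum(chip * phase[freq_min + i] for i, chip in enumerate(u))
    bit = 1 if r >= 0 else 0
    return bit, r

# Illustrative frame: a phase spectrum that is positively correlated with u.
u = [1, -1, 1]
xi = [0.0, 0.0, 0.1, -0.2, 0.05, 0.0, 0.0, 0.0]
bit_one, r_one = detect_bit(xi, u, freq_min=2)
bit_zero, r_zero = detect_bit([-p for p in xi], u, freq_min=2)
```

In real audio the phase of the host signal also contributes to r_n, so reliable detection depends on the spreading sequence being long enough for that contribution to average out; the toy frame above contains the watermark term only.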

The specific embodiments described herein merely illustrate the spirit of the present invention. Those skilled in the art to which the present invention belongs can make various modifications or additions to the described embodiments, or substitute them in a similar manner, without departing from the spirit of the invention or exceeding the scope defined by the appended claims.

Claims

1. An adaptive audio watermarking method based on phase coding, characterized in that it comprises an embedding process and a detection process,

The embedding process comprises the following steps,

Step A1, read the audio file to obtain the time-domain audio signal x and the sample rate fs1; first divide the time-domain audio signal x into frames, where the frame length is denoted N and x_n denotes the n-th frame of the time-domain signal; then perform a time-frequency transform to obtain the amplitude spectrum X_n and the phase spectrum of the frequency-domain audio signal;

Step A2, according to the sample rate fs1, the frame length N, and the embedding start frequency FWMIN and end frequency FWMAX preset from the frequency region to which hearing is sensitive, calculate the range within which each frame of the frequency-domain signal can carry the watermark, obtain the maximum value freqmax1 and the minimum value freqmin1 of this range, and select the frequency-domain audio signal within this range;

freqmin1=floor ((FWMIN × 2.0/fs1) × N)

freqmax1=floor ((FWMAX × 2.0/fs1) × N)

where floor is the round-down function;

Step A3, using the key key as the random number seed, generate a binary pseudo-random spreading sequence u of length freqmax1-freqmin1+1;

Step A4, the decision for tonal components is made as follows,

P_n[k]_dB-P_n[k-j]_dB≥1

P_n[k]_dB-P_n[k-j]_dB≥7

where k denotes the frequency index of a local maximum point, j denotes the distance from the local maximum point k, P_n[k]_dB denotes the signal power of the n-th frame at the local maximum point k, and P_n[k-j]_dB denotes the signal power at distance j from the local maximum point;

Step A5, according to the pseudo-random sequence u, the phase masking threshold θ_n and the watermark bit b, embed the watermark on the phase spectrum of the audio with the embedding formula to obtain the watermarked phase spectrum,

where α is a constant that controls the watermark embedding strength;

Using the amplitude spectrum X_n of the frequency-domain signal and the watermarked phase spectrum, the watermarked frequency-domain signal is then obtained by Euler's formula,

Step A6, transform the watermarked frequency-domain signal Y_n into the time-domain signal y_n and generate the watermarked audio file;

The detection process comprises the following steps,

Step B1, read the watermarked time-domain audio file to obtain the amplitude data z of the watermarked time-domain audio signal and the sample rate fs2; first divide the time-domain signal into frames of length N, where z_n is the n-th frame of the signal to be detected; then perform a time-frequency transform to obtain the amplitude spectrum Z_n and the phase spectrum ξ_n of the frequency-domain audio signal;

Step B2, according to the sample rate fs2, the frame length N, and the embedding start frequency FWMIN and end frequency FWMAX preset from the frequency region to which hearing is sensitive, calculate the range within which each frame of the frequency-domain signal can carry the watermark, obtain the maximum value freqmax2 and the minimum value freqmin2 of this range, and select the amplitude spectrum of the audio within this range;

freqmin2=floor ((FWMIN × 2.0/fs2) × N)

freqmax2=floor ((FWMAX × 2.0/fs2) × N)

where floor is the round-down function;

Step B3, using the key key as the random number seed, generate a binary pseudo-random spreading sequence u of length freqmax2-freqmin2+1;

Step B4, according to the correlation statistical test formula, correlate the pseudo-random sequence u with the phase spectrum ξ_n of the n-th frame of the signal to be detected to obtain the detection sufficient statistic r_n of the n-th frame of the suspect signal,

where <> denotes the inner product of signals;

2. The adaptive audio watermarking method based on phase coding according to claim 1, characterized in that: in step A4, the phase masking threshold θ_n of the phase spectrum is obtained using the triangle relation.

3. An adaptive audio watermarking system based on phase coding, characterized in that it comprises an audio watermark embedding subsystem and an adaptive audio watermark detection subsystem,

A first time-frequency transform module, for reading the audio file to obtain the time-domain audio signal x and the sample rate fs1, first dividing the time-domain audio signal x into frames, where the frame length is denoted N and x_n denotes the n-th frame of the time-domain signal, and then performing a time-frequency transform to obtain the amplitude spectrum X_n and the phase spectrum of the frequency-domain audio signal;

A first embedding range selection module, for calculating, according to the sample rate fs1, the frame length N, and the embedding start frequency FWMIN and end frequency FWMAX preset from the frequency region to which hearing is sensitive, the range within which each frame of the frequency-domain signal can carry the watermark, obtaining the maximum value freqmax1 and the minimum value freqmin1 of this range, and selecting the frequency-domain audio signal within this range;

freqmin1=floor ((FWMIN × 2.0/fs1) × N)

freqmax1=floor ((FWMAX × 2.0/fs1) × N)

where floor is the round-down function;

A first spreading sequence generation module, for generating, with the key key as the random number seed, a binary pseudo-random spreading sequence u of length freqmax1-freqmin1+1;

P_n[k]_dB-P_n[k-j]_dB≥1

P_n[k]_dB-P_n[k-j]_dB≥7

An additive embedding module, for embedding the watermark on the phase spectrum of the audio with the embedding formula, according to the pseudo-random sequence u, the phase masking threshold θ_n and the watermark bit b, to obtain the watermarked phase spectrum,

where α is a constant that controls the watermark embedding strength;

A time-frequency inverse transform module, for transforming the watermarked frequency-domain signal Y_n into the time-domain signal y_n and generating the watermarked audio file;

A second time-frequency transform module, for reading the watermarked time-domain audio file to obtain the amplitude data z of the watermarked time-domain audio signal and the sample rate fs2, first dividing the time-domain signal into frames of length N, where z_n is the n-th frame of the signal to be detected, and then performing a time-frequency transform to obtain the amplitude spectrum Z_n and the phase spectrum ξ_n of the frequency-domain audio signal;

A second embedding range selection module, for calculating, according to the sample rate fs2, the frame length N, and the embedding start frequency FWMIN and end frequency FWMAX preset from the frequency region to which hearing is sensitive, the range within which each frame of the frequency-domain signal can carry the watermark, obtaining the maximum value freqmax2 and the minimum value freqmin2 of this range, and selecting the amplitude spectrum of the audio within this range;

freqmin2=floor ((FWMIN × 2.0/fs2) × N)

freqmax2=floor ((FWMAX × 2.0/fs2) × N)

where floor is the round-down function;

A second spreading sequence generation module, for generating, with the key key as the random number seed, a binary pseudo-random spreading sequence u of length freqmax2-freqmin2+1;

A correlation extraction module, for correlating, according to the correlation statistical test formula, the pseudo-random sequence u with the phase spectrum ξ_n of the n-th frame of the signal to be detected, to obtain the detection sufficient statistic r_n of the n-th frame of the suspect signal,

where <> denotes the inner product of signals;

4. The adaptive audio watermarking system based on phase coding according to claim 3, characterized in that: in the improved psychoacoustic module, the phase masking threshold θ_n of the phase spectrum is obtained using the triangle relation.