CN106409302B - Audio watermarking method and system based on embedding region selection - Google Patents


Info

Publication number
CN106409302B
Authority
CN
China
Prior art keywords
frequency
signal
watermark
frame
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201610458412.9A
Other languages
Chinese (zh)
Other versions
CN106409302A (en)
Inventor
陈怡�
高戈
张康
吕冰
刘影
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huazhong Normal University
Original Assignee
Huazhong Normal University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huazhong Normal University
Priority to CN201610458412.9A
Publication of CN106409302A
Application granted
Publication of CN106409302B
Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018 Audio watermarking, i.e. embedding inaudible data in the audio signal

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Editing Of Facsimile Originals (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The present invention provides an audio watermarking method and system based on embedding region selection. The embedding process reads an audio file, first decides for each frame whether it can serve as an embedding region, and then selects the frequency band in which the watermark is embedded; it applies the discrete Fourier transform, generates a binary pseudorandom spreading sequence, embeds the watermark, and transforms the result back to the time domain. The detection process reads the audio file to be detected, decides for each frame whether it can serve as an embedding region, computes the start point and end point of the detection range in the frequency domain, applies the discrete Fourier transform, generates the binary pseudorandom spreading sequence, and computes the sufficient statistic of the detection to obtain the detected watermark bit. The invention filters out transient signals by means of the ratio of maximum to minimum energy within a frame, which improves the accuracy of watermark detection, and embeds the watermark in the frequency range that is significant to auditory perception, which improves the robustness of the watermark.

Description

Audio watermarking method and system based on embedding region selection
Technical field
The present invention relates to the field of digital audio watermarking, and more particularly to an audio watermarking method and system based on embedding region selection.
Background
Digital audio watermarking is a signal processing operation that adds digital information to an audio signal for purposes such as verifying the authenticity of a file, protecting copyright, and hiding information. Embedding region selection means choosing suitable audio regions before the watermark is embedded into the audio signal. Conventional audio watermarking does not take the characteristics of the audio signal into account and embeds the watermark across the entire audio file. This causes three problems: 1) when a watermark is embedded in a region of low signal amplitude, the resulting amplitude exceeds the masking threshold and produces noise, which destroys perceptual transparency; 2) in regions that contain rapidly changing transient signals, the variance of the audio signal is very large, so the bit error rate at detection is high after a watermark is embedded there; 3) when the watermark is embedded in the frequency domain in a region that is not significant to auditory perception, part of the watermark is lost after signal processing or lossy audio compression, which again raises the detection bit error rate.
Summary of the invention
The object of the present invention is to provide an audio watermarking technique with embedding region selection, so that the watermark is embedded in suitable audio regions, unnecessary noise is avoided, and bit errors are reduced.
To achieve the above object, the present invention provides an audio watermarking method based on embedding region selection, comprising an embedding process and a detection process.
The embedding process includes the following steps.
Step A1: read the audio file and obtain the sample rate fs1 and, after framing, the n-th frame of the time-domain audio signal $x_n$, with frame length N.
First, decide for each frame $x_n$ whether it can serve as an embedding region.
Then, for each frame $x_n$ that can serve as an embedding region, select the frequency band in which the audio watermark is embedded. Let FWMIN be the preset start frequency and FWMAX the preset end frequency of the band to which auditory perception is sensitive. The embedding start point freqmin1 and the embedding end point freqmax1 of a frame are then
$$\mathrm{freqmin1} = \mathrm{floor}((\mathrm{FWMIN} \times 2.0 / \mathrm{fs1}) \times N)$$
$$\mathrm{freqmax1} = \mathrm{floor}((\mathrm{FWMAX} \times 2.0 / \mathrm{fs1}) \times N)$$
where floor is the round-down function.
Step A2: for each frame $x_n$ in which a watermark can be embedded, apply the discrete Fourier transform to obtain the frequency-domain signal $X_n$.
Step A3: using the key as the random number seed, generate a binary pseudorandom spreading sequence u of length freqmax1-freqmin1+1.
Step A4: embed the watermark according to the spreading sequence u, the frequency-domain signal $X_n$, and the watermark bit b, as follows:
$$|X'_n| = |X_n| + b\alpha u$$
where α is a constant that controls the embedding strength of the watermark, and $|X_n|$ and $|X'_n|$ are the frequency-domain magnitudes before and after embedding. The frequency-domain signal after embedding is then obtained by Euler's formula:
$$X'_n = |X'_n|\, e^{j\angle X_n}$$
where $\angle X_n$ is the phase of the frequency-domain signal, $X'_n$ is the frequency-domain signal after embedding, e is the base of the natural exponential, and j is the imaginary unit.
Step A5: transform the watermarked frequency-domain signal $X'_n$ back to the time domain and generate the watermarked audio file.
The detection process includes the following steps.
Step B1: read the audio file to be detected and obtain, after time-domain framing, the n-th frame signal $z_n$ and the sample rate fs2.
First, decide for each frame $z_n$ whether it can serve as an embedding region.
Each frame $z_n$ that can serve as an embedding region is treated as a signal to be detected, and the frequency-domain start point freqmin2 and end point freqmax2 of the detection range are computed:
$$\mathrm{freqmin2} = \mathrm{floor}((\mathrm{FWMIN} \times 2.0 / \mathrm{fs2}) \times N)$$
$$\mathrm{freqmax2} = \mathrm{floor}((\mathrm{FWMAX} \times 2.0 / \mathrm{fs2}) \times N)$$
Step B2: apply the discrete Fourier transform to obtain the frequency-domain signal $Z_n$ of the signal to be detected; the corresponding frequency-domain magnitude is denoted $|Z_n|$.
Step B3: using the key as the random number seed, generate a binary pseudorandom spreading sequence u of length freqmax2-freqmin2+1.
Step B4: from the spreading sequence u and the frequency-domain magnitude $|Z_n|$ of the signal to be detected, compute the sufficient statistic $r_n$ of the detection:
$$r_n = \langle |Z_n|, u \rangle$$
where $\langle \cdot,\cdot \rangle$ denotes the inner product.
If $r_n \ge 0$, the detected watermark bit is b=1; otherwise, the detected watermark bit is b=0.
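Why the sign of $r_n$ carries the watermark bit can be seen from a short derivation. It is only a sketch under assumptions that the text does not state: the statistic is taken as the inner product of $|Z_n|$ and $u$, the watermarked frame reaches the detector unaltered (so that $|Z_n| = |X'_n|$ over the selected band), and $L$ is a symbol introduced here for the band length freqmax1-freqmin1+1.

```latex
\begin{aligned}
r_n &= \langle |Z_n|,\, u \rangle
     = \langle |X_n| + b\alpha u,\; u \rangle \\
    &= \langle |X_n|,\, u \rangle + b\alpha \sum_{k=1}^{L} u_k^{2}
     = \langle |X_n|,\, u \rangle + b\alpha L ,
\qquad u_k \in \{+1, -1\}.
\end{aligned}
```

If the chips $u_k$ are zero-mean and independent of the host signal, the host term $\langle |X_n|, u \rangle$ averages to zero, so the sign of $r_n$ tends to follow the sign of $b\alpha L$. With $b = 0$ the second term vanishes altogether; the text does not say how bit 0 is represented at the embedder, so reading $b$ as a bipolar value ($\pm 1$) during embedding is an assumption rather than something the source confirms.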
Moreover, in step A1 and step B1, the decision whether a frame $x_n$ can serve as an embedding region is made as follows:
1) the average energy $\bar{E}_n$ of the frame $x_n$ is compared with the preset threshold $\tau_1$; a frame whose average energy does not exceed $\tau_1$ is a silent region, in which no watermark may be embedded;
2) if the frame $x_n$ contains a transient signal, no watermark may be embedded.
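The silent-region test in condition 1) can be sketched in Python as follows. The sketch assumes that the average energy is the mean of the squared samples of a frame, as the detailed description defines it; the function names, the sample values, and the threshold `TAU1` are hypothetical and chosen only for illustration.

```python
def average_energy(frame):
    """Mean of the squared samples of one frame."""
    return sum(s * s for s in frame) / len(frame)


def is_silent(frame, tau1):
    """A frame whose average energy does not exceed tau1 is treated as silent."""
    return average_energy(frame) <= tau1


TAU1 = 1e-4  # hypothetical threshold; the text leaves the value to the implementer

quiet = [0.001, -0.002, 0.001, 0.0]   # near-silent frame
loud = [0.5, -0.4, 0.3, -0.6]         # clearly audible frame

print(is_silent(quiet, TAU1))  # True: the frame is skipped
print(is_silent(loud, TAU1))   # False: the frame goes on to the transient test
```

In practice the threshold would be tuned empirically, as the text suggests, and would depend on the sample scaling of the audio.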
Moreover, whether the frame $x_n$ contains a transient signal is decided as follows:
The frame is divided into S blocks and the energy of each of the S blocks is computed. The ratio rate of the energy of the maximum-energy block to the energy of the minimum-energy block is compared with the preset threshold $\tau_2$; if rate is greater than $\tau_2$, the frame is considered to contain a transient signal.
The present invention correspondingly provides an audio watermarking system based on embedding region selection, comprising an audio watermark embedding subsystem and a watermark detection subsystem.
The audio watermark embedding subsystem comprises the following modules.
A suitable-region selection embedding module, which reads the audio file and obtains the sample rate fs1 and, after framing, the n-th frame of the time-domain audio signal $x_n$, with frame length N.
It first decides for each frame $x_n$ whether the frame can serve as an embedding region.
Then, for each frame $x_n$ that can serve as an embedding region, it selects the frequency band in which the audio watermark is embedded. Let FWMIN be the preset start frequency and FWMAX the preset end frequency of the band to which auditory perception is sensitive. The embedding start point freqmin1 and the embedding end point freqmax1 of a frame are then
$$\mathrm{freqmin1} = \mathrm{floor}((\mathrm{FWMIN} \times 2.0 / \mathrm{fs1}) \times N)$$
$$\mathrm{freqmax1} = \mathrm{floor}((\mathrm{FWMAX} \times 2.0 / \mathrm{fs1}) \times N)$$
where floor is the round-down function.
A first time-frequency transform module, which applies the discrete Fourier transform to each frame $x_n$ in which a watermark can be embedded and obtains the frequency-domain signal $X_n$.
A first spreading sequence generation module, which uses the key as the random number seed to generate a binary pseudorandom spreading sequence u of length freqmax1-freqmin1+1.
A watermark embedding module, which embeds the watermark according to the spreading sequence u, the frequency-domain signal $X_n$, and the watermark bit b, as follows:
$$|X'_n| = |X_n| + b\alpha u$$
where α is a constant that controls the embedding strength of the watermark, and $|X_n|$ and $|X'_n|$ are the frequency-domain magnitudes before and after embedding. The frequency-domain signal after embedding is then obtained by Euler's formula:
$$X'_n = |X'_n|\, e^{j\angle X_n}$$
where $\angle X_n$ is the phase of the frequency-domain signal, $X'_n$ is the frequency-domain signal after embedding, e is the base of the natural exponential, and j is the imaginary unit.
An inverse time-frequency transform module, which transforms the watermarked frequency-domain signal $X'_n$ back to the time domain and generates the watermarked audio file.
The watermark detection subsystem comprises the following modules.
A suitable-region selection detection module, which reads the audio file to be detected and obtains, after time-domain framing, the n-th frame signal $z_n$ and the sample rate fs2.
It first decides for each frame $z_n$ whether the frame can serve as an embedding region.
Each frame $z_n$ that can serve as an embedding region is treated as a signal to be detected, and the frequency-domain start point freqmin2 and end point freqmax2 of the detection range are computed:
$$\mathrm{freqmin2} = \mathrm{floor}((\mathrm{FWMIN} \times 2.0 / \mathrm{fs2}) \times N)$$
$$\mathrm{freqmax2} = \mathrm{floor}((\mathrm{FWMAX} \times 2.0 / \mathrm{fs2}) \times N)$$
A second time-frequency transform module, which applies the discrete Fourier transform to obtain the frequency-domain signal $Z_n$ of the signal to be detected; the corresponding frequency-domain magnitude is denoted $|Z_n|$.
A second spreading sequence generation module, which uses the key as the random number seed to generate a binary pseudorandom spreading sequence u of length freqmax2-freqmin2+1.
A correlation detection module, which computes the sufficient statistic $r_n$ of the detection from the spreading sequence u and the frequency-domain magnitude $|Z_n|$ of the signal to be detected:
$$r_n = \langle |Z_n|, u \rangle$$
where $\langle \cdot,\cdot \rangle$ denotes the inner product.
If $r_n \ge 0$, the detected watermark bit is b=1; otherwise, the detected watermark bit is b=0.
Moreover, in the suitable-region selection embedding module and the suitable-region selection detection module, the decision whether a frame $x_n$ can serve as an embedding region is made as follows:
1) the average energy $\bar{E}_n$ of the frame $x_n$ is compared with the preset threshold $\tau_1$; a frame whose average energy does not exceed $\tau_1$ is a silent region, in which no watermark may be embedded;
2) if the frame $x_n$ contains a transient signal, no watermark may be embedded.
Moreover, whether the frame $x_n$ contains a transient signal is decided as follows:
The frame is divided into S blocks and the energy of each of the S blocks is computed. The ratio rate of the energy of the maximum-energy block to the energy of the minimum-energy block is compared with the preset threshold $\tau_2$; if rate is greater than $\tau_2$, the frame is considered to contain a transient signal.
The invention filters out transient signals by means of the ratio of maximum to minimum energy within a frame, which improves the accuracy of watermark detection, and embeds the watermark in the frequency range that is significant to auditory perception, which improves the robustness of the watermark. It further uses the average energy to filter out silent regions, which improves perceptual transparency. The technical solution of the present invention has significant market value.
Brief description of the drawings
Fig. 1 is a structural block diagram of the embedding subsystem of the embodiment of the present invention.
Fig. 2 is a structural block diagram of the detection subsystem of the embodiment of the present invention.
Fig. 3 is a flow chart of the embedding process of the embodiment of the present invention.
Fig. 4 is a flow chart of the detection process of the embodiment of the present invention.
Detailed description of the embodiments
The technical solution of the present invention is further described below with reference to the drawings and a specific embodiment.
The embodiment of the present invention provides an audio watermarking system based on embedding region selection, comprising an audio watermark embedding subsystem and a watermark detection subsystem.
Referring to Fig. 1, the embedding subsystem provided by the embodiment of the present invention includes a suitable-region selection embedding module 1, a first time-frequency transform module 2, a first spreading sequence generation module 3, a watermark embedding module 4, and an inverse time-frequency transform module 5. In a specific implementation, each module can be realised with software solidification technology.
The suitable-region selection embedding module 1 evaluates the time-domain audio frames that are read. In a specific implementation it can decide frame by frame whether the conditions for embedding a watermark are met: if they are not met, the frame is skipped and the next frame is evaluated; if they are met, the signal is output to the first time-frequency transform module 2. From the sample rate of the time-domain audio signal and the frequency range to which the human ear is more sensitive, the module computes the range within which the watermark is embedded in the frequency-domain signal. The frequency-domain signal within the embeddable range is output to the watermark embedding module 4, and the maximum and minimum of the embedding range are output to the first spreading sequence generation module 3.
The first time-frequency transform module 2 converts the time-domain audio signal that was read into a frequency-domain signal and outputs it to the watermark embedding module 4.
The first spreading sequence generation module 3 uses the random number seed and the maximum and minimum of the embedding range supplied by the suitable-region selection embedding module 1 to generate a uniformly distributed random sequence whose length equals that of the embedding range and whose values are 1 or -1, and outputs this random sequence to the watermark embedding module 4.
The watermark embedding module 4 embeds the watermark in the magnitude spectrum of the frequency-domain signal, generates the frequency-domain audio signal carrying the watermark information, and outputs it to the inverse time-frequency transform module 5.
The inverse time-frequency transform module 5 converts the frequency-domain audio signal carrying the watermark information, supplied by the watermark embedding module 4, into a time-domain audio signal carrying the watermark information, and generates an audio file from this time-domain signal, which yields the audio file carrying the watermark information.
Referring to Fig. 2, the watermark detection subsystem provided by the embodiment of the present invention includes a suitable-region selection detection module 6, a second time-frequency transform module 7, a second spreading sequence generation module 8, and a correlation detection module 9. In a specific implementation, each module can be realised with software solidification technology.
The suitable-region selection detection module 6 has essentially the same function as the suitable-region selection embedding module 1. Regions that do not meet the embedding conditions generally contain no watermark and need not be considered during detection. In a specific implementation the decision can be made frame by frame: frames that do not meet the detection conditions are skipped without detection and the next frame is evaluated. Audio signals that meet the detection conditions are output to the second time-frequency transform module 7, and the maximum and minimum of the frequency detection region are likewise output to the second time-frequency transform module 7 and the second spreading sequence generation module 8.
The second time-frequency transform module 7 converts the time-domain audio signal that was read into a frequency-domain signal and outputs it to the correlation detection module 9.
The second spreading sequence generation module 8 has essentially the same function as the first spreading sequence generation module 3 and outputs the generated sequence to the correlation detection module 9.
The correlation detection module 9 computes the correlation between the input frequency-domain magnitude signal to be detected within the detection range and the spreading sequence supplied by the second spreading sequence generation module 8, and decides the watermark bit from the sign of the correlation.
For the specific implementation of each module, refer to the corresponding method steps; they are not repeated here. The audio watermarking method based on embedding region selection provided by the embodiment of the present invention includes an embedding process and a detection process.
Referring to Fig. 3, the audio watermark embedding process based on region selection provided by the embodiment of the present invention can be run automatically by means of computer software, and specifically includes the following steps:
Step A1: read the audio file and frame the time-domain audio signal x, obtaining the sample rate fs1 and the n-th frame of the time-domain audio signal $x_n$ (frame length N). Decide for each frame $x_n$ whether it can serve as an embedding region. The decision has two parts:
1) Decide whether the average energy of $x_n$ exceeds the set threshold, that is, whether the current frame $x_n$ is a silent region. If the average energy does not exceed the threshold, the frame is a silent region and no watermark may be embedded; if it exceeds the threshold, the frame is not a silent region and embedding may proceed. The average energy of the n-th frame is computed by the following formula:
$$\bar{E}_n = \frac{1}{N} \sum_{i=0}^{N-1} x_n^2(i) \qquad (1)$$
where N is the frame length, that is, the number of samples in a frame; i is the sample index within a frame, ranging from 0 to N-1; $x_n^2(i)$ is the energy of the i-th sample of the n-th frame time-domain signal $x_n$; and $\tau_1$ is the decision threshold for the average energy, which those skilled in the art can preset in a specific implementation, for example empirically. If the average energy exceeds the threshold, condition 1) is met and condition 2) below is evaluated.
2) When a frame contains a transient signal, its frequency content changes sharply and the variance of the signal is large. The larger the signal variance, the higher the probability of a watermark detection error, so no watermark should be embedded in this case either. The frame is divided into S blocks, the energy of each of the S blocks is computed, and the ratio rate of the energy of the maximum-energy block to the energy of the minimum-energy block is compared with the threshold $\tau_2$. If rate is greater than $\tau_2$, the frame is considered to contain a transient signal and no watermark is embedded; otherwise a watermark may be embedded. In a specific implementation, those skilled in the art can preset the value of S.
The specific implementation is as follows:
First divide the frame $x_n$ into S blocks; the number of samples M in each block is
$$M = N/S \qquad (2)$$
The energy $E_i$ of each block is computed as
$$E_i = \sum_{j=iM}^{(i+1)M-1} x_n^2(j) \qquad (3)$$
where i is the index of the block within the frame, j is the index of the sample within the frame, and $x_n^2(j)$ is the energy of the j-th sample of the n-th frame time-domain signal $x_n$.
Find the maximum energy $E_{Max}$ and the minimum energy $E_{Min}$ among the block energies:
$$E_{Max} = \mathrm{MAX}\{E_i\}, \quad E_{Min} = \mathrm{MIN}\{E_i\}, \quad i \in [0, S-1] \qquad (4)$$
where MAX and MIN denote the maximum and minimum functions.
The ratio rate of the maximum energy to the minimum energy is computed as
$$\mathrm{rate} = \frac{E_{Max}}{E_{Min}} \qquad (5)$$
If rate > $\tau_2$, the frame $x_n$ is considered to contain a transient signal and no watermark is embedded in it; otherwise a watermark may be embedded. Here $\tau_2$ is the detection threshold for transient signals, which those skilled in the art can preset in a specific implementation, for example empirically.
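The transient test of formulas (2) to (5) can be sketched in Python as follows. The function names, the example frames, and the threshold value 10.0 are hypothetical; the sketch also assumes that S divides N evenly and treats a zero-energy block next to a non-zero one as a transient, a corner case the text does not address.

```python
def block_energies(frame, num_blocks):
    """Split a frame into num_blocks equal blocks and return the energy of each."""
    block_len = len(frame) // num_blocks  # M = N / S, assuming S divides N
    return [
        sum(s * s for s in frame[i * block_len:(i + 1) * block_len])
        for i in range(num_blocks)
    ]


def has_transient(frame, num_blocks, tau2):
    """Return True when E_Max / E_Min exceeds tau2."""
    energies = block_energies(frame, num_blocks)
    e_max, e_min = max(energies), min(energies)
    if e_min == 0.0:
        # Assumption: avoid division by zero; any energy next to a silent block
        # is treated as a transient.
        return e_max > 0.0
    return e_max / e_min > tau2


steady = [0.5, -0.5] * 8                       # 16 samples with constant energy
attack = [0.01, -0.01] * 6 + [0.9, -0.9] * 2   # quiet start, loud final block

print(has_transient(steady, 4, 10.0))  # False: all four blocks have equal energy
print(has_transient(attack, 4, 10.0))  # True: the last block dominates
```

A larger S localises short attacks more precisely but makes each block energy noisier, so S and the threshold would normally be tuned together.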
Then, for each frame $x_n$ that can serve as an embedding region, select the frequency band in which the audio watermark is embedded. The band should be the region that is more significant to auditory perception; those skilled in the art can preset it according to the characteristics of auditory perception, for example 1000-7000 Hz. Signals in this region are not removed by attacks such as filtering or audio compression, so a watermark embedded in the perceptually significant region is not erased by such signal attacks and can still be detected. Let FWMIN be the preset start frequency and FWMAX the preset end frequency of the band to which the human ear is sensitive. The embedding start point freqmin1 and the embedding end point freqmax1 of the corresponding frame are then
$$\mathrm{freqmin1} = \mathrm{floor}((\mathrm{FWMIN} \times 2.0 / \mathrm{fs1}) \times N) \qquad (6)$$
$$\mathrm{freqmax1} = \mathrm{floor}((\mathrm{FWMAX} \times 2.0 / \mathrm{fs1}) \times N) \qquad (7)$$
where floor is the round-down function.
The frequency-domain audio signal within the range from the embedding start point freqmin1 to the embedding end point freqmax1 is then selected.
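As a worked example of formulas (6) and (7), the following Python sketch computes the band limits for the 1000-7000 Hz band mentioned above. The formulas are implemented exactly as written; the function name, the 44.1 kHz sample rate, and the frame length of 1024 are assumptions made only for illustration.

```python
import math


def band_indices(fw_min, fw_max, fs, frame_len):
    """Embedding start and end points of a frame per formulas (6) and (7)."""
    freq_min = math.floor((fw_min * 2.0 / fs) * frame_len)
    freq_max = math.floor((fw_max * 2.0 / fs) * frame_len)
    return freq_min, freq_max


# Hypothetical settings: 44.1 kHz audio, 1024-sample frames, 1000-7000 Hz band.
freq_min, freq_max = band_indices(1000.0, 7000.0, 44100.0, 1024)
sequence_length = freq_max - freq_min + 1  # length of the spreading sequence u

print(freq_min, freq_max, sequence_length)  # 46 325 280
```

Because the limits depend on the sample rate, the detector has to recompute them from the sample rate of the file it actually receives, which is what step B1 does with fs2.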
In a specific implementation the decision can be made frame by frame: frames that do not meet the conditions are skipped and the next frame is evaluated.
Step A2: for each frame $x_n$ in which a watermark can be embedded, apply the FFT (fast Fourier transform) to obtain the frequency-domain signal $X_n$.
Step A3: using the key as the random number seed, generate a binary pseudorandom spreading sequence u of length freqmax1-freqmin1+1.
In the embodiment, the procedure in MATLAB is as follows:
First, the RandStream function (random seed function) is called with the key to initialise the rand function (random number generation function), and rand is then called to generate random numbers. Because rand produces numbers between 0 and 1, these numbers are rounded to obtain a binary pseudorandom sequence of 0s and 1s, and this unipolar pseudorandom sequence is then converted into a bipolar pseudorandom sequence u that contains only +1 and -1.
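The same three stages (seed with the key, round uniform numbers to 0 or 1, map to a bipolar sequence) can be sketched in Python with the standard `random` module. This is an analogue rather than a port: Python's generator does not reproduce MATLAB's random stream, so embedder and detector would both have to use the same implementation. The function name and the key value are hypothetical.

```python
import random


def spreading_sequence(key, length):
    """Key-seeded bipolar pseudorandom sequence with values in {-1, +1}."""
    rng = random.Random(key)  # the key acts as the random number seed
    # Round uniform numbers in [0, 1) to a unipolar 0 / 1 sequence.
    bits = [1 if rng.random() >= 0.5 else 0 for _ in range(length)]
    # Map unipolar {0, 1} to bipolar {-1, +1}.
    return [2 * bit - 1 for bit in bits]


u_embed = spreading_sequence(key=12345, length=280)
u_detect = spreading_sequence(key=12345, length=280)

print(u_embed == u_detect)  # True: the same key regenerates the same sequence
```

Regenerating the sequence from the key is what lets the detector work without access to the original audio or to the sequence itself.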
Step A4: embed the watermark according to the spreading sequence u, the frequency-domain signal $X_n$, and the watermark bit b, using the following formula (8):
$$|X'_n| = |X_n| + b\alpha u \qquad (8)$$
where α is a constant that controls the embedding strength of the watermark, which those skilled in the art can preset in a specific implementation, and $|X_n|$ and $|X'_n|$ are the frequency-domain magnitudes before and after embedding. The frequency-domain signal after embedding is then obtained by Euler's formula:
$$X'_n = |X'_n|\, e^{j\angle X_n} \qquad (9)$$
where $\angle X_n$ is the phase of the frequency-domain signal, $X'_n$ is the frequency-domain signal after embedding, e is the base of the natural exponential, and j is the imaginary unit.
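A minimal Python sketch of this magnitude-domain embedding with preserved phase is shown below. The function name, the example spectrum, and the parameter values are hypothetical, and the sketch assumes that α is small enough for the modified magnitudes to stay non-negative, which the text does not discuss.

```python
import cmath


def embed_band(spectrum, u, b, alpha, freq_min):
    """Apply |X'| = |X| + b*alpha*u to the bins starting at freq_min, keeping each phase."""
    marked = list(spectrum)
    for k, chip in enumerate(u):
        x = spectrum[freq_min + k]
        magnitude = abs(x) + b * alpha * chip
        # Rebuild the complex bin from the new magnitude and the original phase.
        marked[freq_min + k] = cmath.rect(magnitude, cmath.phase(x))
    return marked


spectrum = [1 + 1j, 3 + 4j, -2j, 5 + 0j]   # toy frequency-domain frame
marked = embed_band(spectrum, u=[1, -1], b=1, alpha=0.5, freq_min=1)

print(abs(marked[1]), abs(marked[2]))  # magnitudes move from 5 and 2 to about 5.5 and 1.5
```

Only the bins inside the selected band change; the bins outside it, and the phase of every bin, are left as they were.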
Step A5: transform the watermarked frequency-domain signal $X'_n$ back to the time domain and finally generate an audio file, which is the watermarked audio file.
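Steps A2 to A5 can be traced end to end on a toy frame with the Python sketch below. It uses a naive discrete Fourier transform so that it needs no external library, and it relies on one assumption the text does not spell out: the modified bins are mirrored onto their conjugate-symmetric partners so that the inverse transform yields a real signal. All names, the test frame, and the parameter values are hypothetical.

```python
import cmath
import math


def dft(x):
    """Naive discrete Fourier transform (O(N^2)), adequate for a short demo frame."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spec):
    """Naive inverse discrete Fourier transform."""
    n = len(spec)
    return [sum(spec[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]


def mirror(spec):
    """Assumption: copy bins 1..N/2-1 onto their conjugate partners to keep the frame real."""
    n = len(spec)
    out = list(spec)
    for k in range(1, n // 2):
        out[n - k] = spec[k].conjugate()
    return out


N = 16
frame = [math.sin(2 * math.pi * 3 * t / N)
         + 0.5 * math.cos(2 * math.pi * 4 * t / N)
         + 0.75 * math.sin(2 * math.pi * 5 * t / N) for t in range(N)]
spectrum = dft(frame)                      # step A2

u, alpha, b, start = [1, -1, 1], 0.25, 1, 3   # hypothetical chips, strength, bit, start bin
marked = list(spectrum)
for k, chip in enumerate(u):               # step A4: new magnitude, original phase
    x = spectrum[start + k]
    marked[start + k] = cmath.rect(abs(x) + b * alpha * chip, cmath.phase(x))

time_domain = idft(mirror(marked))         # step A5
max_imag = max(abs(v.imag) for v in time_domain)   # rounding noise only
watermarked = [v.real for v in time_domain]

# Re-analysing the watermarked frame recovers the modified magnitudes.
remeasured = [abs(z) for z in dft(watermarked)[start:start + len(u)]]
print([round(m, 6) for m in remeasured])   # [8.25, 3.75, 6.25]
```

A practical implementation would use an FFT routine instead of the naive transform; the sketch only illustrates that the modified magnitudes survive the round trip through the time domain.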
Referring to Fig. 4, the audio watermark detection process based on region selection provided by the embodiment of the present invention can be run automatically by means of computer software, and specifically includes the following steps:
Step B1: read the audio file to be detected and obtain, after time-domain framing, the n-th frame signal $z_n$ and the sample rate fs2. Apply to each time-domain frame $z_n$ the same decision method as in step A1.
The following two conditions are considered:
1) the average energy of the frame $z_n$ is compared with the preset threshold $\tau_1$; a frame whose average energy does not exceed $\tau_1$ is a silent region, in which no watermark may be embedded;
2) if the frame $z_n$ contains a transient signal, no watermark may be embedded.
A frame that is not a silent region and contains no transient signal may therefore carry a watermark and has to be detected.
In a specific implementation the decision can be made frame by frame: frames that do not meet the conditions are skipped and the next frame is evaluated.
Each frame $z_n$ that can serve as an embedding region is treated as a signal to be detected, and the frequency-domain start point freqmin2 and end point freqmax2 of the detection range are computed:
$$\mathrm{freqmin2} = \mathrm{floor}((\mathrm{FWMIN} \times 2.0 / \mathrm{fs2}) \times N) \qquad (10)$$
$$\mathrm{freqmax2} = \mathrm{floor}((\mathrm{FWMAX} \times 2.0 / \mathrm{fs2}) \times N) \qquad (11)$$
Step B2: for each signal $z_n$ that meets the detection conditions, apply the discrete Fourier transform to obtain the frequency-domain signal $Z_n$ of the signal to be detected; the corresponding frequency-domain magnitude is denoted $|Z_n|$.
Step B3: generate the binary spreading sequence u from the key in the same way as in the embedding method above; that is, using the key as the random number seed, generate a binary pseudorandom spreading sequence u of length freqmax2-freqmin2+1.
Step B4: compute the sufficient statistic $r_n$ of the detection as the correlation between the spreading sequence u and the frequency-domain magnitude $|Z_n|$ of the signal to be detected:
$$r_n = \langle |Z_n|, u \rangle \qquad (12)$$
where $\langle \cdot,\cdot \rangle$ denotes the inner product of the signals.
If $r_n \ge 0$, the detected watermark bit is b=1; otherwise, the detected watermark bit is b=0.
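The decision rule of step B4 can be sketched in Python as follows. The function names and example values are hypothetical. The sketch also makes an assumption the text leaves open: for the demonstration, bit 0 is embedded with negative polarity (the sequence is subtracted), because embedding b=0 literally in formula (8) would leave the frame unchanged and give the detector nothing to distinguish.

```python
def sufficient_statistic(magnitudes, u):
    """Inner product of the band magnitudes |Z_n| with the spreading sequence u."""
    return sum(m * chip for m, chip in zip(magnitudes, u))


def detect_bit(magnitudes, u):
    """Return 1 when the statistic is non-negative, otherwise 0."""
    return 1 if sufficient_statistic(magnitudes, u) >= 0 else 0


u = [1, -1, 1, 1, -1, -1, 1, -1]   # balanced toy spreading sequence
host = [2.0] * len(u)              # flat host magnitudes inside the band
alpha = 0.5

marked_one = [h + alpha * chip for h, chip in zip(host, u)]    # bit 1
marked_zero = [h - alpha * chip for h, chip in zip(host, u)]   # bit 0 (assumed polarity)

print(sufficient_statistic(marked_one, u), detect_bit(marked_one, u))    # 4.0 1
print(sufficient_statistic(marked_zero, u), detect_bit(marked_zero, u))  # -4.0 0
```

With a flat host and a balanced sequence the host contribution cancels exactly; real audio leaves a residual host term, which is one reason the method skips high-variance transient frames.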
The specific embodiments described herein merely illustrate the spirit of the present invention. Those skilled in the art to which the invention belongs may make various modifications or additions to the described embodiments, or substitute them in a similar manner, without departing from the spirit of the invention or exceeding the scope defined by the appended claims.

Claims (6)

1. a kind of audio-frequency water mark method based on insertion regional choice, it is characterised in that: including telescopiny and detection process, institute Telescopiny is stated to include the following steps,
Step A1 reads audio file, obtains the signal x of n-th frame time-domain audio after sample rate f s1 and framingn, frame length N, first To every frame signal xnBe made whether can as insertion region judgement,
Then being directed to can be as each frame signal x in insertion regionn, the selection of the insertion frequency band of audio frequency watermark is carried out, if according to The start frequency of the preset insertion of frequency-portions of auditory perceptual sensitivity is FWMIN, end frequency is FWMAX, the beginning of a frame Insertion point freqmin1 and be embedded in end point freqmax1 seek it is as follows,
Freqmin1=floor ((FWMIN × 2.0/fs1) × N)
Freqmax1=floor ((FWMAX × 2.0/fs1) × N)
Wherein, floor is downward bracket function;
Step A2, to each frame signal x that can be embedded in watermarkn, carry out Discrete Fourier Transform and obtain frequency domain signal Xn
It is pseudo- to generate the binary system that length is freqmax1-freqmin1+1 using key key as random number seed by step A3 Random frequency expansion sequence u;
Step A4, according to frequency expansion sequence u, frequency domain signal XnWith watermark bit b, the insertion of watermark is carried out, after obtaining insertion watermark Frequency-region signal, calculating is as follows,
|X′n|=| Xn|+bαu
Wherein, α is constant, controls the embedment strength of watermark, | Xn| and | X 'n| respectively indicate insertion watermark before frequency domain amplitude and Then frequency domain amplitude after being embedded in watermark obtains the frequency-region signal after insertion watermark by Euler's formula
Wherein, ∠ XnIndicate the phase of frequency-region signal, X 'nFrequency-region signal after indicating insertion watermark, e are mathematics natural Exponents, on Mark j is imaginary unit;
Step A5: transform the frequency-domain signal X′_n after watermark embedding back to the time domain, and generate the watermarked audio file;
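Steps A1 to A5 can be sketched for a single frame as below. This is a minimal illustration under stated assumptions, not the patented implementation: the helper names are hypothetical, the binary sequence is taken to be ±1, watermark bit 0 is mapped to −1 so that both bit values alter the magnitude, negative magnitudes are clamped to zero, the band limits index an N-point DFT, and the mirrored bins are updated so the output stays real. A plain O(N²) DFT keeps the sketch free of third-party dependencies.

```python
import cmath
import math
import random


def band_bins(fw_min, fw_max, fs, n):
    """Start/end bin indices, using the formulas of step A1 verbatim."""
    lo = math.floor((fw_min * 2.0 / fs) * n)
    hi = math.floor((fw_max * 2.0 / fs) * n)
    return lo, hi


def spreading_sequence(key, length):
    """Key-seeded binary sequence (step A3); the +/-1 alphabet is an assumption."""
    rng = random.Random(key)
    return [rng.choice((-1.0, 1.0)) for _ in range(length)]


def dft(x):
    """Naive discrete Fourier transform of a sequence."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Naive inverse discrete Fourier transform."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n)
                for k in range(n)) / n
            for t in range(n)]


def embed_frame(x, bit, key, alpha, fw_min, fw_max, fs):
    """Embed one watermark bit into one eligible frame (steps A1 to A5)."""
    n = len(x)
    lo, hi = band_bins(fw_min, fw_max, fs, n)
    if not (0 < lo <= hi < n / 2):
        raise ValueError("band must lie strictly between DC and Nyquist bins")
    u = spreading_sequence(key, hi - lo + 1)
    b = 1.0 if bit else -1.0  # assumption: bit 0 is mapped to -1
    spectrum = dft(x)
    for k, u_k in zip(range(lo, hi + 1), u):
        # |X'_n| = |X_n| + b*alpha*u, clamped at zero (clamping is an assumption)
        magnitude = max(abs(spectrum[k]) + b * alpha * u_k, 0.0)
        # X'_n = |X'_n| * e^(j * phase(X_n)): the original phase is kept
        spectrum[k] = cmath.rect(magnitude, cmath.phase(spectrum[k]))
        # mirror onto the conjugate bin so the inverse transform is real
        spectrum[n - k] = spectrum[k].conjugate()
    return [value.real for value in idft(spectrum)]
```

Keeping the phase untouched and changing only the magnitude follows the two formulas of step A4; everything else in the sketch is an illustrative choice.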
the detection process comprising the following steps,
Step B1: read the audio file to be detected to obtain, after time-domain framing, the n-th frame signal z_n and the sample rate fs2; first, judge for every frame signal x_n whether it can serve as an embedding region;
for each frame signal x_n that can serve as an embedding region, take it as the signal to be detected and compute the start point freqmin2 and the frequency-domain end point freqmax2 of the detection range,
freqmin2 = floor((FWMIN × 2.0 / fs2) × N)
freqmax2 = floor((FWMAX × 2.0 / fs2) × N)
Step B2: perform a discrete Fourier transform to obtain the frequency-domain signal Z_n of the signal to be detected, the corresponding frequency-domain magnitude being denoted |Z_n|;
Step B3: using the key key as the random-number seed, generate a binary pseudorandom spreading sequence u of length freqmax2-freqmin2+1;
Step B4, according to the frequency domain range value of frequency expansion sequence u and signal to be detected | Zn|, calculate the sufficient statistic r of detectionn It is as follows,
If sufficient statistic rn>=0, then the watermark bit detected is b=1;Otherwise, the watermark bit detected is b= 0。
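The detection steps B1 to B4 can be sketched as follows. The formula for r_n is not reproduced in this text, so the sketch assumes the normalised correlation ⟨|Z_n|, u⟩ / ⟨u, u⟩ that is common in spread-spectrum watermarking; that choice, the ±1 sequence alphabet and all function names are assumptions rather than the claimed statistic.

```python
import cmath
import math
import random


def spreading_sequence(key, length):
    """Regenerate the key-seeded sequence (step B3); +/-1 values are an assumption."""
    rng = random.Random(key)
    return [rng.choice((-1.0, 1.0)) for _ in range(length)]


def sufficient_statistic(magnitudes, u):
    """Assumed form of r_n: normalised correlation <|Z_n|, u> / <u, u>."""
    return sum(m * c for m, c in zip(magnitudes, u)) / sum(c * c for c in u)


def decide_bit(r):
    """Decision rule of step B4: r_n >= 0 gives b = 1, otherwise b = 0."""
    return 1 if r >= 0 else 0


def detect_frame(z, key, fw_min, fw_max, fs):
    """Detect one watermark bit from one eligible frame (steps B1 to B4)."""
    n = len(z)
    lo = math.floor((fw_min * 2.0 / fs) * n)
    hi = math.floor((fw_max * 2.0 / fs) * n)
    u = spreading_sequence(key, hi - lo + 1)
    # |Z_n| on the detection band only, via a naive DFT of the needed bins
    magnitudes = [
        abs(sum(z[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
        for k in range(lo, hi + 1)
    ]
    return decide_bit(sufficient_statistic(magnitudes, u))
```

With this assumed statistic the host magnitudes act as interference on the correlation, so a practical detector would need the embedding strength α or the sequence length to be large enough for the watermark term to dominate.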
2. The audio watermarking method based on embedding-region selection according to claim 1, characterized in that: in step A1 and step B1, the judgment of whether each frame signal x_n can serve as an embedding region is implemented as follows,
1) if the average energy of the signal x_n does not exceed the preset corresponding threshold τ1, the frame is a silent region and watermark embedding is not allowed;
the average energy of the n-th frame is calculated by the following formula: (1/N) Σ_i x_n²(i), summed over the N sample points of the frame,
where N is the frame length, i is the sample index within a frame, and x_n²(i) denotes the energy of the i-th point of the n-th frame time-domain signal x_n; τ1 is the decision threshold of the average energy;
2) if the signal x_n contains a transient signal, watermark embedding is not allowed.
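The two eligibility conditions of claim 2 can be sketched as below. The function names are hypothetical, and the silence test reads condition 1) as "average energy not above τ1 means silent", which is an interpretation of the translated claim; the transient decision is passed in as a flag because it is defined separately.

```python
def average_energy(frame):
    """Mean of the squared samples of one frame: (1/N) * sum of x_n(i)^2."""
    return sum(sample * sample for sample in frame) / len(frame)


def is_silent(frame, tau1):
    """Frame counts as a silent region when its average energy is not above tau1."""
    return average_energy(frame) <= tau1


def can_embed(frame, tau1, contains_transient):
    """A frame is an embedding region only if it is neither silent nor transient."""
    return not is_silent(frame, tau1) and not contains_transient
```

For example, `can_embed(frame, tau1=1e-4, contains_transient=False)` would accept any non-transient frame whose mean squared amplitude is above 1e-4; the threshold value here is purely illustrative.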
3. The audio watermarking method based on embedding-region selection according to claim 2, characterized in that: whether the signal x_n contains a transient signal is judged as follows,
a frame signal is divided into S blocks and the energy of each of the S blocks is calculated; the energy ratio rate of the maximum-energy block to the minimum-energy block is compared with the preset corresponding threshold τ2; if rate is greater than τ2, the frame signal is considered to contain a transient signal.
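The transient test of claim 3 can be sketched as follows. The claim does not say how the blocks are formed or what happens when the weakest block has zero energy, so equal-length contiguous blocks and an infinite ratio for a zero-energy block are assumptions, as are the function names.

```python
import math


def block_energies(frame, s):
    """Energy (sum of squares) of each of s equal-length contiguous blocks."""
    if s <= 0 or len(frame) % s != 0:
        raise ValueError("frame length must be a positive multiple of s")
    size = len(frame) // s
    return [sum(v * v for v in frame[i * size:(i + 1) * size]) for i in range(s)]


def energy_ratio(frame, s):
    """Ratio of the maximum block energy to the minimum block energy."""
    energies = block_energies(frame, s)
    e_max, e_min = max(energies), min(energies)
    if e_min == 0.0:
        # assumption: a zero-energy block next to a non-zero one is a transient
        return math.inf if e_max > 0.0 else 1.0
    return e_max / e_min


def has_transient(frame, s, tau2):
    """Frame is treated as transient when the energy ratio exceeds tau2."""
    return energy_ratio(frame, s) > tau2
```

A steady tone gives a ratio close to 1, while a frame that starts quietly and ends with a sharp onset gives a large ratio, which is what the threshold τ2 is meant to catch.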
4. An audio watermarking system based on embedding-region selection, characterized in that it comprises an audio watermark embedding subsystem and a watermark detection subsystem,
the audio watermark embedding subsystem comprising the following modules,
an appropriate-region selection and embedding module, for reading an audio file to obtain the sample rate fs1 and, after framing, the n-th frame of the time-domain audio signal x_n, with frame length N,
first judging for every frame signal x_n whether it can serve as an embedding region,
then, for each frame signal x_n that can serve as an embedding region, selecting the embedding frequency band of the audio watermark: let the start frequency of the embedding band, preset according to the frequency range of auditory perceptual sensitivity, be FWMIN and the end frequency be FWMAX; the embedding start point freqmin1 and the embedding end point freqmax1 of a frame are then obtained as follows,
freqmin1 = floor((FWMIN × 2.0 / fs1) × N)
freqmax1 = floor((FWMAX × 2.0 / fs1) × N)
where floor is the round-down function;
a first time-frequency transform module, for performing, on each frame signal x_n in which the watermark can be embedded, a discrete Fourier transform to obtain the frequency-domain signal X_n;
a first spreading-sequence generation module, for generating, with the key key as the random-number seed, a binary pseudorandom spreading sequence u of length freqmax1-freqmin1+1;
a watermark embedding module, for embedding the watermark according to the spreading sequence u, the frequency-domain signal X_n and the watermark bit b, obtaining the frequency-domain signal after watermark embedding, computed as follows,
|X′_n| = |X_n| + bαu
where α is a constant that controls the embedding strength of the watermark, and |X_n| and |X′_n| denote the frequency-domain magnitude before and after watermark embedding, respectively; the frequency-domain signal after watermark embedding is then obtained by Euler's formula
X′_n = |X′_n| e^(j∠X_n)
where ∠X_n denotes the phase of the frequency-domain signal, X′_n denotes the frequency-domain signal after watermark embedding, e is the base of the natural exponential, and the superscript j is the imaginary unit;
an inverse time-frequency transform module, for transforming the frequency-domain signal X′_n after watermark embedding back to the time domain and generating the watermarked audio file;
the watermark detection subsystem comprising the following modules,
an appropriate-region selection and detection module, for reading the audio file to be detected to obtain, after time-domain framing, the n-th frame signal z_n and the sample rate fs2,
first judging for every frame signal x_n whether it can serve as an embedding region;
for each frame signal x_n that can serve as an embedding region, taking it as the signal to be detected and computing the start point freqmin2 and the frequency-domain end point freqmax2 of the detection range,
freqmin2 = floor((FWMIN × 2.0 / fs2) × N)
freqmax2 = floor((FWMAX × 2.0 / fs2) × N)
a second time-frequency transform module, for performing a discrete Fourier transform to obtain the frequency-domain signal Z_n of the signal to be detected, the corresponding frequency-domain magnitude being denoted |Z_n|;
a second spreading-sequence generation module, for generating, with the key key as the random-number seed, a binary pseudorandom spreading sequence u of length freqmax2-freqmin2+1;
a correlation detection module, for computing, according to the spreading sequence u and the frequency-domain magnitude |Z_n| of the signal to be detected, the sufficient statistic r_n for detection as follows,
if the sufficient statistic r_n ≥ 0, the detected watermark bit is b = 1; otherwise, the detected watermark bit is b = 0.
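The band-limit formulas recited for both subsystems can be checked numerically. The sketch below applies them verbatim; the values of FWMIN, FWMAX, N and the two sample rates are hypothetical, chosen only to show that the bin range, and therefore the length of the spreading sequence u, changes when fs2 differs from fs1 for the same frame length.

```python
import math


def band_bins(fw_min, fw_max, fs, n):
    """floor((F * 2.0 / fs) * N) for the start and end frequency of the band."""
    lo = math.floor((fw_min * 2.0 / fs) * n)
    hi = math.floor((fw_max * 2.0 / fs) * n)
    return lo, hi


def sequence_length(lo, hi):
    """Length of the spreading sequence u: freqmax - freqmin + 1."""
    return hi - lo + 1


# Hypothetical parameters, not taken from the patent text.
FWMIN, FWMAX, N = 500.0, 1500.0, 1024

embed_lo, embed_hi = band_bins(FWMIN, FWMAX, 44100.0, N)    # embedding side, fs1
detect_lo, detect_hi = band_bins(FWMIN, FWMAX, 48000.0, N)  # detection side, fs2

embed_len = sequence_length(embed_lo, embed_hi)
detect_len = sequence_length(detect_lo, detect_hi)
```

When fs1 equals fs2 the two ranges coincide and the same key reproduces the same sequence u on both sides, which is the case the correlation detector relies on.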
5. The audio watermarking system based on embedding-region selection according to claim 4, characterized in that: in the appropriate-region selection and embedding module and the appropriate-region selection and detection module, the judgment of whether each frame signal x_n can serve as an embedding region is implemented as follows,
1) if the average energy of the signal x_n does not exceed the preset corresponding threshold τ1, the frame is a silent region and watermark embedding is not allowed;
the average energy of the n-th frame is calculated by the following formula: (1/N) Σ_i x_n²(i), summed over the N sample points of the frame,
where N is the frame length, i is the sample index within a frame, and x_n²(i) denotes the energy of the i-th point of the n-th frame time-domain signal x_n; τ1 is the decision threshold of the average energy;
2) if the signal x_n contains a transient signal, watermark embedding is not allowed.
6. The audio watermarking system based on embedding-region selection according to claim 5, characterized in that: whether the signal x_n contains a transient signal is judged as follows,
a frame signal is divided into S blocks and the energy of each of the S blocks is calculated; the energy ratio rate of the maximum-energy block to the minimum-energy block is compared with the preset corresponding threshold τ2; if rate is greater than τ2, the frame signal is considered to contain a transient signal.
CN201610458412.9A 2016-06-22 2016-06-22 Audio-frequency water mark method and system based on insertion regional choice Expired - Fee Related CN106409302B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610458412.9A CN106409302B (en) 2016-06-22 2016-06-22 Audio-frequency water mark method and system based on insertion regional choice

Publications (2)

Publication Number Publication Date
CN106409302A (en) 2017-02-15
CN106409302B (en) 2019-07-09

Family

ID=58005751

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610458412.9A Expired - Fee Related CN106409302B (en) 2016-06-22 2016-06-22 Audio-frequency water mark method and system based on insertion regional choice

Country Status (1)

Country Link
CN (1) CN106409302B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109714284B (en) * 2018-11-27 2020-06-30 华中科技大学 Radio frequency watermark detection method based on K-S detection
CN111292756B (en) * 2020-01-19 2023-05-26 成都潜在人工智能科技有限公司 Compression-resistant audio silent watermark embedding and extracting method and system
CN113362835B (en) * 2020-03-05 2024-06-07 杭州网易云音乐科技有限公司 Audio watermarking method, device, electronic equipment and storage medium
CN111883108A (en) * 2020-07-06 2020-11-03 珠海格力电器股份有限公司 Password embedding method and device, password matching method and device and control system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101185121A (en) * 2005-06-02 2008-05-21 汤姆森许可贸易公司 Method and apparatus for watermarking an audio or video signal with watermark data using a spread spectrum
CN102142255A (en) * 2010-07-08 2011-08-03 北京三信时代信息公司 Method for embedding and extracting digital watermark in audio signal
CN102664013A (en) * 2012-04-18 2012-09-12 南京邮电大学 Audio digital watermark method of discrete cosine transform domain based on energy selection
CN104658542A (en) * 2015-03-16 2015-05-27 武汉大学 Additive spread spectrum audio watermarking embedding method, additive spread spectrum audio watermarking detection method and additive spread spectrum audio watermarking embedding system based on orthogonality
CN104700841A (en) * 2015-02-10 2015-06-10 浙江省广电科技股份有限公司 Watermark embedding and detecting method based on audio content classification
CN105374360A (en) * 2015-11-25 2016-03-02 武汉大学 Interleaved additive spread spectrum audio watermark embedding method and detection method and system

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100611094B1 (en) * 2000-06-15 2006-08-09 주식회사 케이티 Apparatus and method for inserting/detecting watermark based stochastic model

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
A Spread Spectrum watermarking Algorithm based Local Instruction Statistic; Tianwen Feng et al.; 9th International Conference on Intelligent Information Hiding and Multimedia Signal Processing; 2013-10-16; full text
Spread-Spectrum Robust Image Watermarking for Ownership Protection; Ansari, A. et al.; 22nd Iranian Conference on Electrical Engineering; 2014-05-22; full text

Also Published As

Publication number Publication date
CN106409302A (en) 2017-02-15

Similar Documents

Publication Publication Date Title
CN106409302B (en) Audio-frequency water mark method and system based on insertion regional choice
KR102386155B1 (en) How to protect your voice assistant from being controlled by machine learning-based silent commands
Lei et al. Robust SVD-based audio watermarking scheme with differential evolution optimization
CN105976823B (en) Adaptive audio water mark method and system based on phase code
Avcibas et al. Steganalysis of watermarking techniques using image quality metrics
WO2016077760A1 (en) Determining media device activation based on frequency response analysis
CN104658542B (en) Additive spread spectrum audio watermarking embedding method, additive spread spectrum audio watermarking detection method and additive spread spectrum audio watermarking embedding system based on orthogonality
CN101494051A (en) Detection method for time-domain audio LSB hidden write
CN113782041B (en) Method for embedding and positioning watermark based on audio variable frequency domain
CN108682425B (en) Robust digital audio watermark embedding system based on constant watermark
CN110163787A (en) Robust blind digital audio watermark embedding method based on dual-tree complex wavelet transform
CN102074238A (en) Linear interference cancellation-based speech secret communication method
Sundaram et al. Audio scene segmentation using multiple features, models and time scales
CN111613243A (en) Voice detection method and device
CN105374360B (en) Interleaved additive spread spectrum audio watermark embedding method and detection method and system
CN101350198B (en) Method for compressing watermark using voice based on bone conduction
Zeng et al. An algorithm of echo steganalysis based on Bayes classifier
KR20070061285A (en) Digital audio watermarking method using hybrid transform
Khademi et al. Audio watermarking based on quantization index modulation in the frequency domain
Wu et al. Adaptive audio watermarking based on SNR in localized regions
Panda et al. Application of energy efficient watermark on audio signal for authentication
Youssef HFSA-AW: a hybrid fuzzy self-adaptive audio watermarking
Chen et al. Multipurpose audio watermarking algorithm
CN108877819A (en) A kind of voice content evidence collecting method based on coefficient correlation
Wang et al. An audio watermarking scheme with neural network

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20190709
