CN106409302A - Audio frequency watermark method and system based on embedding area selection - Google Patents
- Publication number
- CN106409302A (application number CN201610458412.9A)
- Authority
- CN
- China
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/018—Audio watermarking, i.e. embedding inaudible data in the audio signal
Abstract
The present invention provides an audio watermarking method and system based on embedding-region selection. The embedding process comprises: reading an audio file; determining whether each frame can serve as an embedding region; selecting the frequency band in which the watermark is embedded; performing a discrete Fourier transform; generating a binary pseudo-random spread-spectrum sequence; embedding the watermark; and transforming the watermarked frequency-domain signal back to the time domain. The detection process comprises: reading the audio file to be detected; determining whether each frame can serve as an embedding region; calculating the frequency-domain start point and end point of the detection range; performing a discrete Fourier transform; generating the binary pseudo-random spread-spectrum sequence; calculating the sufficient statistic for detection; and obtaining the detected watermark bit.
Description
Technical field
The present invention relates to the field of digital audio watermarking, and more particularly to an audio watermarking method and system based on embedding-region selection.
Background art
Digital audio watermarking is a signal-processing operation that adds digital information to an audio signal for purposes such as file authentication, copyright protection, and information hiding. Embedding-region selection means choosing suitable regions of the audio signal before the watermark is embedded. Conventional audio watermarking ignores the characteristics of the audio signal and embeds the watermark throughout the entire file, which leads to three problems: 1) when a watermark is embedded in a region of low signal amplitude, the result exceeds the masking threshold and produces audible noise, destroying perceptual transparency; 2) in transient regions, where the audio signal changes abruptly, the signal variance is very large, so the watermark bit error rate at detection is high; 3) when the watermark is embedded in the frequency domain in a region that is not perceptually significant, part of the watermark is lost after signal processing or lossy audio compression, which again raises the detection bit error rate.
Summary of the invention
The object of the present invention is to provide an audio watermarking technique that selects the embedding region, so that the watermark is embedded in suitable audio regions, unnecessary noise is avoided, and bit errors are reduced.
To achieve this object, the technical solution of the present invention provides an audio watermarking method based on embedding-region selection, comprising an embedding process and a detection process.
The embedding process comprises the following steps.
Step A1: read the audio file to obtain the sample rate fs1 and, after framing, the n-th time-domain audio frame x_n, with frame length N.
First, determine for each frame x_n whether it can serve as an embedding region.
Then, for each frame x_n that can serve as an embedding region, select the frequency band in which the watermark is embedded. With a preset embedding start frequency FWMIN and end frequency FWMAX, chosen from the frequency range to which human hearing is sensitive, the embedding start point freqmin1 and embedding end point freqmax1 of a frame are obtained as
freqmin1 = floor((FWMIN × 2.0/fs1) × N)
freqmax1 = floor((FWMAX × 2.0/fs1) × N)
where floor is the round-down function.
Step A2: for each frame x_n in which a watermark can be embedded, perform a discrete Fourier transform (DFT) to obtain the frequency-domain signal X_n.
Step A3: using the key as the random number seed, generate a binary pseudo-random spread-spectrum sequence u of length freqmax1-freqmin1+1.
Step A4: embed the watermark according to the spread-spectrum sequence u, the frequency-domain signal X_n and the watermark bit b. The magnitude after embedding is calculated as
$|X'_n| = |X_n| + b\alpha u$
where α is a constant that controls the embedding strength, and |X_n| and |X'_n| are the frequency-domain magnitudes before and after embedding, respectively. The watermarked frequency-domain signal is then obtained from Euler's formula:
$X'_n = |X'_n|\, e^{j\angle X_n}$
where ∠X_n is the phase of the frequency-domain signal, X'_n is the frequency-domain signal after embedding, e is the base of the natural logarithm, and j is the imaginary unit.
Step A5: transform the watermarked frequency-domain signal X'_n to the time domain and generate the watermarked audio file.
The detection process comprises the following steps.
Step B1: read the audio file to be detected to obtain the sample rate fs2 and, after time-domain framing, the n-th frame z_n.
First, determine for each frame z_n whether it can serve as an embedding region.
Each frame z_n that can serve as an embedding region is treated as a signal to be detected, and the frequency-domain start point freqmin2 and end point freqmax2 of the detection range are calculated as
freqmin2 = floor((FWMIN × 2.0/fs2) × N)
freqmax2 = floor((FWMAX × 2.0/fs2) × N)
Step B2: perform a discrete Fourier transform (DFT) to obtain the frequency-domain signal Z_n of the signal to be detected; the corresponding frequency-domain magnitude is denoted |Z_n|.
Step B3: using the key as the random number seed, generate a binary pseudo-random spread-spectrum sequence u of length freqmax2-freqmin2+1.
Step B4: from the spread-spectrum sequence u and the frequency-domain magnitude |Z_n| of the signal to be detected, calculate the sufficient statistic r_n for detection as
$r_n = \langle |Z_n|, u \rangle$
where ⟨·,·⟩ denotes the inner product.
If r_n >= 0, the detected watermark bit is b=1; otherwise, the detected watermark bit is b=0.
Furthermore, in steps A1 and B1, whether a frame x_n can serve as an embedding region is determined as follows:
1) test whether the average energy of x_n exceeds the preset threshold τ1; if it does not, the frame is a silent region and embedding is not allowed;
2) if x_n contains a transient signal, embedding is not allowed.
Whether x_n contains a transient signal is determined as follows: the frame is divided into S blocks and the energy of each block is calculated; the ratio rate of the energy of the maximum-energy block to that of the minimum-energy block is compared with the preset threshold τ2; if rate is greater than τ2, the frame is considered to contain a transient signal.
The present invention correspondingly provides an audio watermarking system based on embedding-region selection, comprising an audio watermark embedding subsystem and a watermark detection subsystem.
The audio watermark embedding subsystem comprises the following modules.
A suitable-region selection and embedding module, for reading the audio file to obtain the sample rate fs1 and, after framing, the n-th time-domain audio frame x_n, with frame length N.
First, it determines for each frame x_n whether the frame can serve as an embedding region.
Then, for each frame x_n that can serve as an embedding region, it selects the frequency band in which the watermark is embedded. With a preset embedding start frequency FWMIN and end frequency FWMAX, chosen from the frequency range to which human hearing is sensitive, the embedding start point freqmin1 and embedding end point freqmax1 of a frame are obtained as
freqmin1 = floor((FWMIN × 2.0/fs1) × N)
freqmax1 = floor((FWMAX × 2.0/fs1) × N)
where floor is the round-down function.
A first time-frequency conversion module, for performing a discrete Fourier transform (DFT) on each frame x_n in which a watermark can be embedded, to obtain the frequency-domain signal X_n.
A first spread-spectrum sequence generation module, for generating, with the key as the random number seed, a binary pseudo-random spread-spectrum sequence u of length freqmax1-freqmin1+1.
A watermark embedding module, for embedding the watermark according to the spread-spectrum sequence u, the frequency-domain signal X_n and the watermark bit b. The magnitude after embedding is calculated as
$|X'_n| = |X_n| + b\alpha u$
where α is a constant that controls the embedding strength, and |X_n| and |X'_n| are the frequency-domain magnitudes before and after embedding, respectively. The watermarked frequency-domain signal is then obtained from Euler's formula:
$X'_n = |X'_n|\, e^{j\angle X_n}$
where ∠X_n is the phase of the frequency-domain signal, X'_n is the frequency-domain signal after embedding, e is the base of the natural logarithm, and j is the imaginary unit.
A time-frequency inverse transform module, for transforming the watermarked frequency-domain signal X'_n to the time domain and generating the watermarked audio file.
The watermark detection subsystem comprises the following modules.
A suitable-region selection and detection module, for reading the audio file to be detected to obtain the sample rate fs2 and, after time-domain framing, the n-th frame z_n.
First, it determines for each frame z_n whether the frame can serve as an embedding region.
Each frame z_n that can serve as an embedding region is treated as a signal to be detected, and the frequency-domain start point freqmin2 and end point freqmax2 of the detection range are calculated as
freqmin2 = floor((FWMIN × 2.0/fs2) × N)
freqmax2 = floor((FWMAX × 2.0/fs2) × N)
A second time-frequency conversion module, for performing a discrete Fourier transform (DFT) to obtain the frequency-domain signal Z_n of the signal to be detected; the corresponding frequency-domain magnitude is denoted |Z_n|.
A second spread-spectrum sequence generation module, for generating, with the key as the random number seed, a binary pseudo-random spread-spectrum sequence u of length freqmax2-freqmin2+1.
A correlation detection module, for calculating, from the spread-spectrum sequence u and the frequency-domain magnitude |Z_n| of the signal to be detected, the sufficient statistic r_n for detection as
$r_n = \langle |Z_n|, u \rangle$
where ⟨·,·⟩ denotes the inner product.
If r_n >= 0, the detected watermark bit is b=1; otherwise, the detected watermark bit is b=0.
Furthermore, in the suitable-region selection and embedding module and the suitable-region selection and detection module, whether a frame x_n can serve as an embedding region is determined as follows:
1) test whether the average energy of x_n exceeds the preset threshold τ1; if it does not, the frame is a silent region and embedding is not allowed;
2) if x_n contains a transient signal, embedding is not allowed.
Whether x_n contains a transient signal is determined as follows: the frame is divided into S blocks and the energy of each block is calculated; the ratio rate of the energy of the maximum-energy block to that of the minimum-energy block is compared with the preset threshold τ2; if rate is greater than τ2, the frame is considered to contain a transient signal.
The present invention filters out transient signals using the ratio of the maximum to the minimum block energy within a frame, which improves the accuracy of watermark detection; it embeds the watermark in the frequency range that is significant to auditory perception, which improves the robustness of the watermark; and it filters out silent regions using the average energy, which improves perceptual transparency. The technical solution of the present invention has significant market value.
Brief description of the drawings
Fig. 1 is a structural block diagram of the embedding subsystem of the embodiment of the present invention.
Fig. 2 is a structural block diagram of the detection subsystem of the embodiment of the present invention.
Fig. 3 is a flow chart of the embedding process of the embodiment of the present invention.
Fig. 4 is a flow chart of the detection process of the embodiment of the present invention.
Detailed description of the embodiments
The technical solution of the present invention is further described below with reference to the accompanying drawings and a specific embodiment.
The embodiment of the present invention provides an audio watermarking system based on embedding-region selection, comprising an audio watermark embedding subsystem and a watermark detection subsystem.
Referring to Fig. 1, the embedding subsystem provided by the embodiment comprises a suitable-region selection and embedding module 1, a first time-frequency conversion module 2, a first spread-spectrum sequence generation module 3, a watermark embedding module 4 and a time-frequency inverse transform module 5. In a specific implementation, each module can be realized with software hardening (firmware) techniques.
The suitable-region selection and embedding module 1 evaluates the time-domain audio frames that are read. In a specific implementation it can judge frame by frame whether the conditions for embedding a watermark are met: a frame that does not meet them is skipped and the next frame is evaluated; a frame that meets them is output to the first time-frequency conversion module 2. From the sample rate of the time-domain audio signal and the frequency range to which the human ear is more sensitive, the module calculates the range of the frequency-domain signal in which the watermark is embedded, outputs the frequency-domain signal within that range to the watermark embedding module 4, and outputs the maximum and minimum of the range to the first spread-spectrum sequence generation module 3.
The first time-frequency conversion module 2 converts the time-domain audio signal that is read into a frequency-domain signal and outputs it to the watermark embedding module 4.
The first spread-spectrum sequence generation module 3 generates, from the random number seed and the maximum and minimum of the embedding range supplied by module 1, a uniformly distributed random sequence of values 1 or -1 whose length equals that of the embedding range, and outputs this sequence to the watermark embedding module 4.
The watermark embedding module 4 embeds the watermark into the magnitude spectrum of the frequency-domain signal, generates the frequency-domain audio signal carrying the watermark information, and outputs it to the time-frequency inverse transform module 5.
The time-frequency inverse transform module 5 converts the frequency-domain audio signal carrying the watermark information, supplied by the watermark embedding module 4, into a time-domain audio signal carrying the watermark information, and writes this time-domain signal to an audio file, yielding the watermarked audio file.
Referring to Fig. 2, the adaptive audio watermark detection subsystem based on phase coding provided by the embodiment of the present invention comprises a suitable-region selection and detection module 6, a second time-frequency conversion module 7, a second spread-spectrum sequence generation module 8 and a correlation detection module 9. In a specific implementation, each module can be realized with software hardening (firmware) techniques.
The suitable-region selection and detection module 6 has essentially the same function as the suitable-region selection and embedding module 1. Regions that do not meet the embedding conditions generally contain no watermark and need not be considered during detection. In a specific implementation the module can judge frame by frame: a frame that does not meet the detection conditions is skipped without detection and the next frame is evaluated; an audio frame that meets the detection conditions is output to the second time-frequency conversion module 7. The maximum and minimum of the frequency detection range are likewise output to the second time-frequency conversion module 7 and the second spread-spectrum sequence generation module 8.
The second time-frequency conversion module 7 converts the time-domain audio signal that is read into a frequency-domain signal and outputs it to the correlation detection module 9.
The second spread-spectrum sequence generation module 8 has essentially the same function as the first spread-spectrum sequence generation module 3 and outputs its result to the correlation detection module 9.
The correlation detection module 9 takes the frequency-domain magnitude of the signal to be detected within the detection range and the spread-spectrum sequence supplied by the second spread-spectrum sequence generation module 8, calculates their correlation, and decides the watermark bit from the sign of the correlation value.
The specific implementation of each module follows the corresponding method step and is not repeated here. The audio watermarking method based on embedding-region selection provided by the embodiment of the present invention comprises an embedding process and a detection process.
Referring to Fig. 3, the audio watermark embedding process based on region selection provided by the embodiment can be run automatically by means of computer software, and comprises the following steps:
Step A1: read the audio file, divide the time-domain audio signal x into frames, and obtain the sample rate fs1 and the n-th time-domain audio frame x_n (frame length N). Each frame x_n is then tested to determine whether it can serve as an embedding region. The test has two parts:
1) Determine whether the average energy of x_n exceeds a set threshold, and hence whether the current frame x_n is a silent region. A silent frame must not be embedded; a frame whose average energy exceeds the threshold is not silent and may be embedded. The average energy of the n-th frame is calculated as
$\bar{E}_n = \frac{1}{N} \sum_{i=0}^{N-1} x_n^2(i)$ (1)
where N is the frame length, i.e. the number of samples in a frame; i is the sample index within the frame, ranging from 0 to N-1; x_n^2(i) is the energy of the i-th sample of the n-th time-domain frame x_n; and τ1 is the decision threshold for the average energy, which those skilled in the art may preset in a specific implementation, for example empirically. If the average energy exceeds τ1, condition 1) is satisfied and condition 2) is tested next.
2) A transient signal within a frame changes abruptly and therefore produces a large variance. At detection, a larger signal variance raises the probability of a watermark detection error, so such frames should not be embedded either. The frame is divided into S blocks, the energy of each block is calculated, and the ratio rate of the energy of the maximum-energy block to that of the minimum-energy block is compared with a threshold τ2. If rate is greater than τ2, the frame is considered to contain a transient signal and is not embedded; otherwise it may be embedded. In a specific implementation, those skilled in the art may preset the value of S.
The specific implementation is as follows.
First, the frame x_n is divided into S blocks, so that the number of samples M in each block is
M = N/S (2)
The energy E_i of each block is calculated as
$E_i = \sum_{j=iM}^{(i+1)M-1} x_n^2(j)$ (3)
where i is the block index within the frame, j is the sample index within the frame, and x_n^2(j) is the energy of the j-th sample of the n-th time-domain frame x_n.
The maximum energy E_Max and minimum energy E_Min among the block energies are
E_Max = MAX{E_i}, E_Min = MIN{E_i}, i ∈ [0, S-1] (4)
where MAX and MIN denote the maximum and minimum functions, respectively.
The ratio rate of the maximum energy to the minimum energy is calculated as
$rate = E_{Max} / E_{Min}$ (5)
If rate > τ2, the frame x_n is considered to contain a transient signal and is not embedded; otherwise, it may be embedded. Here τ2 is the detection threshold for transient signals, which those skilled in the art may preset in a specific implementation, for example empirically.
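The two frame tests described above (average energy against τ1, then block-energy ratio against τ2) can be sketched as follows. This is a minimal illustration rather than the patented implementation: the function names, the handling of a zero-energy block, and the threshold values in the usage note are assumptions that the source does not specify.

```python
def average_energy(frame):
    """Mean of the squared samples of one frame."""
    return sum(s * s for s in frame) / len(frame)


def block_energy_ratio(frame, num_blocks):
    """Ratio of the largest to the smallest block energy within a frame."""
    block_len = len(frame) // num_blocks  # M = N / S
    energies = [
        sum(s * s for s in frame[i * block_len:(i + 1) * block_len])
        for i in range(num_blocks)
    ]
    e_max, e_min = max(energies), min(energies)
    # Assumption: a block with zero energy yields an infinite ratio,
    # so the frame is treated as containing a transient.
    return float("inf") if e_min == 0 else e_max / e_min


def can_embed(frame, tau1, tau2, num_blocks):
    """True if the frame is neither silent nor transient."""
    if average_energy(frame) <= tau1:
        return False  # silent region: embedding not allowed
    return block_energy_ratio(frame, num_blocks) <= tau2
```

For example, with the hypothetical settings `tau1=0.01`, `tau2=4.0` and `num_blocks=4`, a steady 16-sample frame of alternating ±0.5 passes both tests, while a frame that jumps from 0.1 to 1.0 in its last block is rejected as transient.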
Then, for each frame x_n that can serve as an embedding region, the frequency band in which the watermark is embedded is selected. The band should lie in the region that is most significant to human hearing, and those skilled in the art may preset it according to the characteristics of auditory perception, for example 1000-7000 Hz. Signal components in this region are not removed by attacks such as filtering or audio compression, so a watermark embedded in the perceptually significant region is not erased by such attacks and can still be detected. With a preset embedding start frequency FWMIN and end frequency FWMAX, chosen from the frequency range to which the human ear is sensitive, the corresponding embedding start point freqmin1 and embedding end point freqmax1 of a frame are obtained as
freqmin1 = floor((FWMIN × 2.0/fs1) × N) (6)
freqmax1 = floor((FWMAX × 2.0/fs1) × N) (7)
where floor is the round-down function.
The frequency-domain audio signal between the embedding start point freqmin1 and the embedding end point freqmax1 is then selected.
In a specific implementation the judgment can be made frame by frame: a frame that does not meet the conditions is skipped and the next frame is evaluated.
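As a worked instance of formulas (6) and (7), the sketch below maps the example band of 1000-7000 Hz to embedding points. The function name, the sample rate of 44100 Hz and the frame length of 1024 are illustrative assumptions; only the band limits come from the text.

```python
import math


def band_to_points(f_min_hz, f_max_hz, sample_rate_hz, frame_len):
    """Embedding start and end points per formulas (6) and (7)."""
    start = math.floor((f_min_hz * 2.0 / sample_rate_hz) * frame_len)
    end = math.floor((f_max_hz * 2.0 / sample_rate_hz) * frame_len)
    return start, end


# Assumed values: fs1 = 44100 Hz, N = 1024.
freqmin1, freqmax1 = band_to_points(1000, 7000, 44100, 1024)
sequence_length = freqmax1 - freqmin1 + 1  # length of u in step A3
print(freqmin1, freqmax1, sequence_length)  # 46 325 280
```

Under these assumed values, (1000 × 2.0/44100) × 1024 ≈ 46.44 and (7000 × 2.0/44100) × 1024 ≈ 325.08, so the spread-spectrum sequence generated in the next step would have 280 elements.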
Step A2: for each frame x_n in which a watermark can be embedded, apply an FFT (fast Fourier transform) to obtain the frequency-domain signal X_n.
Step A3: using the key as the random number seed, generate a binary pseudo-random spread-spectrum sequence u of length freqmax1-freqmin1+1.
The detailed procedure of the embodiment in MATLAB is as follows.
First, the RandStream function (the random seed function) is called with the key to initialise the rand function (the random number generation function), and rand is then called to generate random numbers. Because rand returns numbers between 0 and 1, these numbers are rounded to the nearest integer to obtain a binary pseudo-random sequence of 0s and 1s. This unipolar pseudo-random sequence is then converted into a bipolar pseudo-random sequence u containing only +1 and -1.
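The same seed, round, and map-to-bipolar procedure can be sketched in Python. Note the substitution: this uses Python's `random.Random` in place of MATLAB's RandStream and rand, so the sequence it produces will not match a MATLAB-generated one for the same key. The function name is hypothetical.

```python
import random


def spreading_sequence(key, length):
    """Keyed bipolar (+1/-1) pseudo-random sequence of the given length."""
    rng = random.Random(key)  # the key acts as the random number seed
    # Uniform numbers in [0, 1) rounded to the nearest integer give 0 or 1.
    unipolar = [round(rng.random()) for _ in range(length)]
    # Map 0 -> -1 and 1 -> +1.
    return [2 * bit - 1 for bit in unipolar]
```

Because the generator is seeded with the key, the embedder and the detector obtain the same sequence u as long as they share the key and the sequence length.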
Step A4: embed the watermark according to the spread-spectrum sequence u, the frequency-domain signal X_n and the watermark bit b, using formula (8) to obtain the magnitude after embedding:
$|X'_n| = |X_n| + b\alpha u$ (8)
where α is a constant that controls the embedding strength, which those skilled in the art may preset in a specific implementation, and |X_n| and |X'_n| are the frequency-domain magnitudes before and after embedding, respectively. The watermarked frequency-domain signal is then obtained from Euler's formula:
$X'_n = |X'_n|\, e^{j\angle X_n}$ (9)
where ∠X_n is the phase of the frequency-domain signal, X'_n is the frequency-domain signal after embedding, e is the base of the natural logarithm, and j is the imaginary unit.
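The magnitude update followed by recombination with the original phase can be sketched on a list of complex DFT bins. This is an illustrative sketch only: the function name and the argument `start` (the embedding start point) are assumptions, and the code does not guard against the updated magnitude becoming negative when α exceeds the original magnitude.

```python
import cmath


def embed_bit(spectrum, u, b, alpha, start):
    """Add b*alpha*u to the magnitudes of bins start .. start+len(u)-1,
    keeping the phase of every bin unchanged."""
    marked = list(spectrum)
    for i, chip in enumerate(u):
        k = start + i
        magnitude = abs(spectrum[k]) + b * alpha * chip  # |X'| = |X| + b*alpha*u
        phase = cmath.phase(spectrum[k])                  # angle of X
        marked[k] = cmath.rect(magnitude, phase)          # |X'| * e^{j*angle}
    return marked
```

For instance, with every bin equal to 3+4j (magnitude 5), `u = [1, -1, 1]`, `b = 1` and `alpha = 0.5`, the three bins in the band end up with magnitudes 5.5, 4.5 and 5.5 while bins outside the band are untouched.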
Step A5: transform the watermarked frequency-domain signal X'_n to the time domain and write the result to an audio file, which is the watermarked audio file.
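A single-frame round trip through the frequency domain might look like the sketch below, which uses a naive O(N²) DFT so that it needs only the standard library. Mirroring each modified bin onto its conjugate-symmetric partner, so that the inverse transform yields real samples, is an assumption of this sketch; the source does not say how real-valued output is ensured. All names are hypothetical, and the band is assumed to satisfy 0 < k < N/2.

```python
import cmath


def dft(x):
    """Naive discrete Fourier transform."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Naive inverse discrete Fourier transform."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n)
                for k in range(n)) / n
            for t in range(n)]


def embed_frame(frame, u, b, alpha, start):
    """Embed one bit in a time-domain frame and return real samples."""
    n = len(frame)
    spectrum = dft(frame)
    for i, chip in enumerate(u):
        k = start + i  # assumed to satisfy 0 < k < n / 2
        spectrum[k] = cmath.rect(abs(spectrum[k]) + b * alpha * chip,
                                 cmath.phase(spectrum[k]))
        # Assumption: mirror the change so the time-domain signal stays real.
        spectrum[n - k] = spectrum[k].conjugate()
    # The imaginary parts are rounding residue only and are discarded.
    return [s.real for s in idft(spectrum)]
```

A production implementation would use an FFT library instead of the naive transforms; the structure of transform, modify, mirror, and inverse transform would stay the same.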
Referring to Fig. 4, the audio watermark detection process based on region selection provided by the embodiment of the present invention can be run automatically by means of computer software, and comprises the following steps:
Step B1: read the audio file to be detected to obtain the sample rate fs2 and, after time-domain framing, the n-th frame z_n. Each time-domain frame z_n is judged in the same way as in step A1.
The following two conditions are considered:
1) test whether the average energy of the frame exceeds the preset threshold τ1; if it does not, the frame is a silent region and embedding is not allowed;
2) if the frame contains a transient signal, embedding is not allowed.
A frame that is neither a silent region nor contains a transient signal may carry a watermark and needs to be detected.
In a specific implementation the judgment can be made frame by frame: a frame that does not meet the conditions is skipped and the next frame is evaluated.
Each frame z_n that can serve as an embedding region is treated as a signal to be detected, and the frequency-domain start point freqmin2 and end point freqmax2 of the detection range are calculated as
freqmin2 = floor((FWMIN × 2.0/fs2) × N) (10)
freqmax2 = floor((FWMAX × 2.0/fs2) × N) (11)
Step B2: for each frame z_n that meets the detection conditions, perform a discrete Fourier transform (DFT) to obtain the frequency-domain signal Z_n of the signal to be detected; the corresponding frequency-domain magnitude is denoted |Z_n|.
Step B3: using the key, generate the binary spread-spectrum sequence u in the same way as in the embedding method above; that is, with the key as the random number seed, generate a binary pseudo-random spread-spectrum sequence u of length freqmax2-freqmin2+1.
Step B4: from the spread-spectrum sequence u and the frequency-domain magnitude |Z_n| of the signal to be detected, calculate the sufficient statistic r_n for detection as the correlation between u and |Z_n|:
$r_n = \langle |Z_n|, u \rangle$ (12)
where ⟨·,·⟩ denotes the inner product of the signals.
If r_n >= 0, the detected watermark bit is b=1; otherwise, the detected watermark bit is b=0.
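The correlation decision in step B4 can be sketched as below. The function name and the argument `start` are hypothetical. The usage note additionally assumes that the embedder used b = +1 and b = -1 for the two bit values, with -1 read back as 0; the source states only the detection rule, not this mapping.

```python
def detect_bit(magnitudes, u, start):
    """Return (r, bit): the inner product of the in-band magnitudes with u,
    and the bit decided from its sign."""
    r = sum(magnitudes[start + i] * chip for i, chip in enumerate(u))
    return r, (1 if r >= 0 else 0)
```

For example, take in-band magnitudes of 5.0, the balanced sequence `u = [1, -1, 1, -1]` and `alpha = 2`. Embedding b = +1 gives magnitudes 7, 3, 7, 3 and r = 8, so the detected bit is 1; embedding b = -1 gives 3, 7, 3, 7 and r = -8, so the detected bit is 0. When u is not balanced or the host magnitudes vary, the host term ⟨|X_n|, u⟩ does not cancel and acts as interference on r.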
The specific embodiment described herein merely illustrates the spirit of the present invention. Those skilled in the art to which the invention belongs may make various modifications or additions to the described embodiment, or substitute similar approaches, without departing from the spirit of the invention or exceeding the scope defined by the appended claims.
Claims (6)
1. An audio watermarking method based on embedding-region selection, characterized by comprising an embedding process and a detection process,
the embedding process comprising the following steps:
Step A1: read the audio file to obtain the sample rate fs1 and, after framing, the n-th time-domain audio frame x_n, with frame length N;
first, determine for each frame x_n whether it can serve as an embedding region;
then, for each frame x_n that can serve as an embedding region, select the frequency band in which the watermark is embedded: with a preset embedding start frequency FWMIN and end frequency FWMAX, chosen from the frequency range to which human hearing is sensitive, the embedding start point freqmin1 and embedding end point freqmax1 of a frame are obtained as
freqmin1 = floor((FWMIN × 2.0/fs1) × N)
freqmax1 = floor((FWMAX × 2.0/fs1) × N)
where floor is the round-down function;
Step A2: for each frame x_n in which a watermark can be embedded, perform a discrete Fourier transform (DFT) to obtain the frequency-domain signal X_n;
Step A3: using the key as the random number seed, generate a binary pseudo-random spread-spectrum sequence u of length freqmax1-freqmin1+1;
Step A4: embed the watermark according to the spread-spectrum sequence u, the frequency-domain signal X_n and the watermark bit b, the magnitude after embedding being calculated as
$|X'_n| = |X_n| + b\alpha u$
where α is a constant that controls the embedding strength, and |X_n| and |X'_n| are the frequency-domain magnitudes before and after embedding, respectively; the watermarked frequency-domain signal is then obtained from Euler's formula
$X'_n = |X'_n|\, e^{j\angle X_n}$
where ∠X_n is the phase of the frequency-domain signal, X'_n is the frequency-domain signal after embedding, e is the base of the natural logarithm, and j is the imaginary unit;
Step A5: transform the watermarked frequency-domain signal X'_n to the time domain and generate the watermarked audio file;
Described detection process comprises the following steps,
Step B1, reads audio file to be detected, the n-th frame signal z after the time domain framing obtainingnWith sample rate f s2,
First to every frame signal xnBeing made whether can be used as the judgement in embedded region;
For can be used as each frame signal x in embedded regionn, as signal to be detected, calculate the starting point of detection range
Freqmin2 and frequency domain end point freqmax2
Freqmin2=floor ((FWMIN × 2.0/fs2) × N)
Freqmax2=floor ((FWMAX × 2.0/fs2) × N)
Step B2, carries out the frequency-region signal Z that discrete Fourier transform (DFT) obtains signal to be detectedn, corresponding frequency domain range value is designated as
|Zn|;
Step B3, by the use of key key as random number seed, generate the binary system that length is freqmax2-freqmin2+1 pseudo- with
The frequency expansion sequence u of machine;
Step B4, according to the frequency domain range value of frequency expansion sequence u and signal to be detected | Zn|, calculate the sufficient statistic r of detectionnAs
Under,
If sufficient statistic rn>=0, then the watermark bit detecting is b=1;Otherwise, the watermark bit detecting is b=0.
2. The audio watermarking method based on embedding-region selection according to claim 1, characterized in that: in step A1 and step B1, the judgment of whether each frame signal x_n can serve as an embedding region is implemented as follows,
1) check whether the average energy of signal x_n exceeds the corresponding preset threshold τ1; if so, the frame is a silent region and watermark embedding is not allowed;
2) if signal x_n contains a transient signal, watermark embedding is not allowed.
3. The audio watermarking method based on embedding-region selection according to claim 2, characterized in that: whether signal x_n contains a transient signal is judged in the following manner,
Divide the frame signal into S blocks and calculate the energy of each of the S blocks; compare the energy ratio rate between the block with the maximum energy and the block with the minimum energy against the corresponding preset threshold τ2; if rate is greater than τ2, the frame signal is considered to contain a transient signal.
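The transient test recited in claim 3 can be sketched as follows. This is a minimal illustration, not the patentee's implementation: the function name `has_transient`, the use of the sum of squared samples as block energy, the equal-size split, and the handling of a zero-energy block are all assumptions made here.

```python
import numpy as np

def has_transient(frame, num_blocks, tau2):
    """Return True if the max/min block-energy ratio exceeds tau2.

    Assumptions (not stated in the claim): block energy is the sum of
    squared samples, and blocks are near-equal contiguous slices.
    """
    blocks = np.array_split(np.asarray(frame, dtype=float), num_blocks)
    energies = np.array([np.sum(block ** 2) for block in blocks])
    e_min = energies.min()
    if e_min == 0.0:
        # Assumed convention: a zero-energy block next to a non-zero one
        # counts as a transient; an all-zero frame does not.
        return bool(energies.max() > 0.0)
    return bool(energies.max() / e_min > tau2)

# A steady tone has equal block energies; a frame with one loud block does not.
n = np.arange(1024)
steady = np.sin(2 * np.pi * n / 32)
burst = 0.01 * steady
burst[-128:] = steady[-128:]
print(has_transient(steady, 8, 10.0), has_transient(burst, 8, 10.0))
```

Frames flagged by this test would be skipped by both the embedder and the detector, so that the two sides operate on the same set of frames.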
4. An audio watermarking system based on embedding-region selection, characterized in that it comprises an audio watermark embedding subsystem and a watermark detection subsystem,
The audio watermark embedding subsystem comprises the following modules,
A suitable-region selection embedding module, for reading the audio file to obtain the sample rate fs1 and, after framing, the n-th frame of the time-domain audio signal x_n, the frame length being N,
First, for each frame signal x_n, judging whether it can serve as an embedding region,
Then, for each frame signal x_n that can serve as an embedding region, selecting the embedding frequency band of the audio watermark: let the preset embedding start frequency, chosen according to the frequency range to which auditory perception is sensitive, be FWMIN and the end frequency be FWMAX; the embedding start point freqmin1 and embedding end point freqmax1 within a frame are then obtained as follows,
freqmin1 = floor((FWMIN × 2.0/fs1) × N)
freqmax1 = floor((FWMAX × 2.0/fs1) × N)
where floor is the round-down function;
A first time-frequency transform module, for performing a discrete Fourier transform on each frame signal x_n that can carry a watermark to obtain the frequency-domain signal X_n;
A first spreading-sequence generation module, for using the key `key` as the random-number seed to generate a binary pseudorandom spreading sequence u of length freqmax1 - freqmin1 + 1;
A watermark embedding module, for embedding the watermark according to the spreading sequence u, the frequency-domain signal X_n and the watermark bit b, obtaining the frequency-domain signal after embedding, calculated as follows,
|X′_n| = |X_n| + bαu
where α is a constant that controls the embedding strength of the watermark, and |X_n| and |X′_n| denote the frequency-domain magnitude before and after embedding, respectively; the frequency-domain signal after embedding is then obtained by Euler's formula,
X′_n = |X′_n| · e^(j∠X_n)
where ∠X_n denotes the phase of the frequency-domain signal, X′_n denotes the frequency-domain signal after embedding, and e is the base of the natural exponential;
A time-frequency inverse transform module, for transforming the frequency-domain signal X′_n after embedding back to the time domain and generating the watermarked audio file;
The watermark detection subsystem comprises the following modules,
A suitable-region selection detection module, for reading the audio file to be detected to obtain, after time-domain framing, the n-th frame signal z_n and the sample rate fs2,
First, for each frame signal x_n, judging whether it can serve as an embedding region;
For each frame signal x_n that can serve as an embedding region, taking it as the signal to be detected and calculating the start point freqmin2 and the end point freqmax2 of the detection range,
freqmin2 = floor((FWMIN × 2.0/fs2) × N)
freqmax2 = floor((FWMAX × 2.0/fs2) × N)
A second time-frequency transform module, for performing a discrete Fourier transform to obtain the frequency-domain signal Z_n of the signal to be detected, the corresponding frequency-domain magnitude being denoted |Z_n|;
A second spreading-sequence generation module, for using the key `key` as the random-number seed to generate a binary pseudorandom spreading sequence u of length freqmax2 - freqmin2 + 1;
A correlation detection module, for calculating the sufficient statistic r_n for detection from the spreading sequence u and the frequency-domain magnitude |Z_n| of the signal to be detected,
If the sufficient statistic r_n ≥ 0, the detected watermark bit is b = 1; otherwise, the detected watermark bit is b = 0.
5. The audio watermarking system based on embedding-region selection according to claim 4, characterized in that: in the suitable-region selection embedding module and the suitable-region selection detection module, the judgment of whether each frame signal x_n can serve as an embedding region is implemented as follows,
1) check whether the average energy of signal x_n exceeds the corresponding preset threshold τ1; if so, the frame is a silent region and watermark embedding is not allowed;
2) if signal x_n contains a transient signal, watermark embedding is not allowed.
6. The audio watermarking system based on embedding-region selection according to claim 5, characterized in that: whether signal x_n contains a transient signal is judged in the following manner,
Divide the frame signal into S blocks and calculate the energy of each of the S blocks; compare the energy ratio rate between the block with the maximum energy and the block with the minimum energy against the corresponding preset threshold τ2; if rate is greater than τ2, the frame signal is considered to contain a transient signal.
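The per-frame embedding and detection computations recited in claims 1 and 4 can be sketched as below. This is only an illustration under stated assumptions, not the patented implementation: the function names are hypothetical; the spreading sequence is assumed to take values in {-1, +1}; bit 0 is assumed to be embedded as -1; the detection statistic is assumed to be the mean of |Z_n|·u over the band, since the text above does not reproduce its formula; magnitudes are clipped at zero; and a one-sided real FFT stands in for the DFT. The band indices are taken literally from the claimed floor expressions, so how they map to physical frequency depends on the DFT length convention.

```python
import math
import numpy as np

def band_bins(f_lo, f_hi, fs, n):
    """Band indices as written in the claim: floor((F * 2.0 / fs) * N)."""
    return math.floor((f_lo * 2.0 / fs) * n), math.floor((f_hi * 2.0 / fs) * n)

def spreading_sequence(key, length):
    """Key-seeded pseudorandom sequence; the {-1, +1} alphabet is an assumption."""
    rng = np.random.default_rng(key)
    return rng.integers(0, 2, size=length) * 2 - 1

def embed_frame(x, bit, key, fs, f_lo, f_hi, alpha):
    n = len(x)
    lo, hi = band_bins(f_lo, f_hi, fs, n)
    assert 0 < lo <= hi < n // 2, "band must fit inside the one-sided spectrum"
    u = spreading_sequence(key, hi - lo + 1)
    X = np.fft.rfft(x)
    mag, phase = np.abs(X), np.angle(X)
    b = 1 if bit == 1 else -1  # assumption: bit 0 is embedded as -1
    # |X'| = |X| + b*alpha*u inside the band, clipped so magnitudes stay >= 0.
    mag[lo:hi + 1] = np.maximum(mag[lo:hi + 1] + b * alpha * u, 0.0)
    # Recombine the new magnitude with the original phase, then go back to time.
    return np.fft.irfft(mag * np.exp(1j * phase), n=n)

def detect_frame(z, key, fs, f_lo, f_hi):
    n = len(z)
    lo, hi = band_bins(f_lo, f_hi, fs, n)
    u = spreading_sequence(key, hi - lo + 1)
    mag = np.abs(np.fft.rfft(z))
    r = float(np.mean(mag[lo:hi + 1] * u))  # assumed statistic: mean correlation
    return r, (1 if r >= 0 else 0)

# Toy host frame: an impulse, whose spectrum has constant magnitude 10.
host = np.zeros(1024)
host[0] = 10.0
args = dict(key=42, fs=44100, f_lo=2000, f_hi=5000)
r_host, _ = detect_frame(host, **args)
r_one, bit_one = detect_frame(embed_frame(host, 1, alpha=5.0, **args), **args)
r_zero, bit_zero = detect_frame(embed_frame(host, 0, alpha=5.0, **args), **args)
print(bit_one, bit_zero)
```

As long as no magnitude is clipped, the statistic of the watermarked frame differs from that of the host by exactly ±α, which is why the sign test recovers the bit when α dominates the host's own correlation with u.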
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610458412.9A CN106409302B (en) | 2016-06-22 | 2016-06-22 | Audio frequency watermark method and system based on embedding area selection |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106409302A (en) | 2017-02-15 |
CN106409302B (en) | 2019-07-09 |
Family
ID=58005751
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610458412.9A Expired - Fee Related CN106409302B (en) | Audio frequency watermark method and system based on embedding area selection | 2016-06-22 | 2016-06-22 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106409302B (en) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020138730A1 (en) * | 2000-06-15 | 2002-09-26 | Hongseok Kim | Apparatus and method for inserting and detecting watermark based on stochastic model |
CN101185121A (en) * | 2005-06-02 | 2008-05-21 | 汤姆森许可贸易公司 | Method and apparatus for watermarking an audio or video signal with watermark data using a spread spectrum |
CN102142255A (en) * | 2010-07-08 | 2011-08-03 | 北京三信时代信息公司 | Method for embedding and extracting digital watermark in audio signal |
CN102664013A (en) * | 2012-04-18 | 2012-09-12 | 南京邮电大学 | Audio digital watermark method of discrete cosine transform domain based on energy selection |
CN104658542A (en) * | 2015-03-16 | 2015-05-27 | 武汉大学 | Additive spread spectrum audio watermarking embedding method, additive spread spectrum audio watermarking detection method and additive spread spectrum audio watermarking embedding system based on orthogonality |
CN104700841A (en) * | 2015-02-10 | 2015-06-10 | 浙江省广电科技股份有限公司 | Watermark embedding and detecting method based on audio content classification |
CN105374360A (en) * | 2015-11-25 | 2016-03-02 | 武汉大学 | Interleaved additive spread spectrum audio watermark embedding method and detection method and system |
Non-Patent Citations (2)
Title |
---|
ANSARI,A ET AL: "Spread-Spectrum Robust Image Watermarking for Ownership Protection", 《22ND IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING》 * |
TIANWEN FENG ET.AL: "A Spread Spectrum watermarking Algorithm based Local Instruction Statistic", 《9TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING》 * |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109714284A (en) * | 2018-11-27 | 2019-05-03 | 华中科技大学 | A kind of radio frequency method of detecting watermarks based on K-S detection |
CN109714284B (en) * | 2018-11-27 | 2020-06-30 | 华中科技大学 | Radio frequency watermark detection method based on K-S detection |
CN111292756A (en) * | 2020-01-19 | 2020-06-16 | 成都嗨翻屋科技有限公司 | Compression-resistant audio silent watermark embedding and extracting method and system |
CN111292756B (en) * | 2020-01-19 | 2023-05-26 | 成都潜在人工智能科技有限公司 | Compression-resistant audio silent watermark embedding and extracting method and system |
CN113362835A (en) * | 2020-03-05 | 2021-09-07 | 杭州网易云音乐科技有限公司 | Audio watermark processing method and device, electronic equipment and storage medium |
CN113362835B (en) * | 2020-03-05 | 2024-06-07 | 杭州网易云音乐科技有限公司 | Audio watermarking method, device, electronic equipment and storage medium |
CN111883108A (en) * | 2020-07-06 | 2020-11-03 | 珠海格力电器股份有限公司 | Password embedding method and device, password matching method and device and control system |
Also Published As
Publication number | Publication date |
---|---|
CN106409302B (en) | 2019-07-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106409302A (en) | Audio frequency watermark method and system based on embedding area selection | |
Cvejic et al. | Spread spectrum audio watermarking using frequency hopping and attack characterization | |
CN101271690B (en) | Audio spread-spectrum watermark processing method for protecting audio data | |
CN104658542B (en) | Based on orthogonal additivity spread spectrum audio frequency watermark embedding grammar, detection method and system | |
CN105976823A (en) | Adaptive audio watermarking method based on phase coding and system | |
KR20010111057A (en) | Watermark embedding and extracting method for protecting digital audio contents copyright and preventing duplication and apparatus using thereof | |
US20050240768A1 (en) | Re-embedding of watermarks in multimedia signals | |
CN102142255B (en) | Method for embedding and extracting digital watermark in audio signal | |
CN110163787A (en) | Digital audio Robust Blind Watermarking Scheme embedding grammar based on dual-tree complex wavelet transform | |
CN102074238A (en) | Linear interference cancellation-based speech secrete communication method | |
Sheikhan et al. | Improvement of embedding capacity and quality of DWT-based audio steganography systems | |
US20070036357A1 (en) | Watermarking of multimedia signals | |
CN105374360B (en) | Intersect additivity spread spectrum audio frequency watermark embedding grammar, detection method and system | |
JP6316288B2 (en) | Digital watermark embedding device, digital watermark detection device, digital watermark embedding method, digital watermark detection method, digital watermark embedding program, and digital watermark detection program | |
KR100814792B1 (en) | Digital audio watermarking method using hybrid transform | |
US20050147248A1 (en) | Window shaping functions for watermarking of multimedia signals | |
KR20020031654A (en) | Method and apparatus for embedding watermarks using fast fourier transformed data | |
Wu et al. | Adaptive audio watermarking based on SNR in localized regions | |
Wang et al. | A blind audio watermarking algorithm robust against synchronization attack | |
Erçelebi et al. | Robust multi bit and high quality audio watermarking using pseudo-random sequences | |
Esmaili et al. | A novel spread spectrum audio watermarking scheme based on time-frequency characteristics | |
Li et al. | Spread-spectrum audio watermark robust against pitch-scale modification | |
Panda et al. | Application of energy efficient watermark on audio signal for authentication | |
Wang et al. | An audio watermarking scheme with neural network | |
Lihua et al. | A new algorithm for digital audio watermarking based on DWT |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 20190709 |