CN106409302A - Audio frequency watermark method and system based on embedding area selection - Google Patents

Audio frequency watermark method and system based on embedding area selection Download PDF

Info

Publication number
CN106409302A
CN106409302A CN201610458412.9A CN201610458412A CN106409302A CN 106409302 A CN106409302 A CN 106409302A CN 201610458412 A CN201610458412 A CN 201610458412A CN 106409302 A CN106409302 A CN 106409302A
Authority
CN
China
Prior art keywords
frequency
signal
watermark
embedded
frame
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610458412.9A
Other languages
Chinese (zh)
Other versions
CN106409302B (en
Inventor
陈怡�
高戈
张康
吕冰
刘影
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huazhong Normal University
Original Assignee
Huazhong Normal University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huazhong Normal University filed Critical Huazhong Normal University
Priority to CN201610458412.9A priority Critical patent/CN106409302B/en
Publication of CN106409302A publication Critical patent/CN106409302A/en
Application granted granted Critical
Publication of CN106409302B publication Critical patent/CN106409302B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Editing Of Facsimile Originals (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The present invention provides an audio frequency watermark method and system based on embedding area selection. The embedding process comprises: reading an audio frequency file, determining whether each frame signal can be taken as an embedding area or not, and then performing selection of the embedding frequency bands of the audio frequency watermark; and performing the discrete Fourier transform, generating a binary pseudo-random spread spectrum sequence, performing watermark embedding, and converting the binary pseudo-random spread spectrum sequence to a time domain. The detection process includes: reading an audio file to be detected, determining whether each frame signal can be taken as an embedding area or not, calculating the starting point and the frequency domain ending point of a detection range, performing the discrete Fourier transform to generate a binary pseudo-random spread spectrum sequence, calculating the detected sufficient statistics, and obtaining the detected watermark bits.

Description

Based on embedded regioselective audio-frequency water mark method and system
Technical field
The present invention relates to Digital Audio Watermarking Techniques field, more particularly, to it is based on and embeds regioselective audio-frequency water mark method and system.
Background technology
Digital audio frequency watermark is to add some digital informations in audio signal to reach file real and fake discrimination, copyright protection, information The signal processing operations of the purpose such as hiding.The selection technique that audio frequency watermark embeds region refers to before watermark is embedded into audio signal, Appropriate audio region is selected to embed watermark.Conventional audio digital watermark, does not account for the feature of audio signal, to whole audio frequency File all carries out the embedded of watermark, so can lead to 1) after the low region of audio frequency signal amplitude embeds watermark, amplitude is beyond covering Cover threshold value and produce noise, destroy the perception transparency;2) for the transient signal that appearance change in audio signal is violent, this region Audio signal variance very big, lead to after embedded watermark detect watermark when the watermark bit error rate very high;3) embed watermark in frequency domain, If selecting the inapparent region of auditory perceptual to embed watermark, after signal processing or audio frequency lossy compression method, watermark will be lost Lose a part, lead to the watermark detection bit error rate high.
Content of the invention
It is an object of the invention to provide the audio watermarking technique that selection region embeds, watermark is enable to be embedded into suitable audio region In, it is to avoid the generation unnecessary noise occurring and reducing error code.
For reaching above-mentioned purpose, the technical scheme that the present invention provides provides a kind of being based on to embed regioselective audio-frequency water mark method, Including telescopiny and detection process,
Described telescopiny comprises the following steps,
Step A1, reads audio file, obtains the signal x of n-th frame time-domain audio after sample rate f s1 and framingn, frame length is N,
First to every frame signal xnBe made whether can as the judgement in embedded region,
Then being directed to can be used as each frame signal x in embedded regionn, carry out the selection of the embedded frequency band of audio frequency watermark, carry out sound The selection of the embedded frequency band of frequency watermark, if according to the sensitive default embedded starts frequency of frequency-portions of auditory perceptual be FWMIN, end frequency are FWMAX, a frame start embedded point freqmin1 and embedded end point freqmax1 ask for as Under,
Freqmin1=floor ((FWMIN × 2.0/fs1) × N)
Freqmax1=floor ((FWMAX × 2.0/fs1) × N)
Wherein, floor is downward bracket function;
Step A2, to each frame signal x that can embed watermarkn, carry out discrete Fourier transform (DFT) and obtain frequency domain signal Xn
Step A3, by the use of key key as random number seed, generates the binary system that length is freqmax1-freqmin1+1 pseudo- Random frequency expansion sequence u;
Step A4, according to frequency expansion sequence u, frequency domain signal XnWith watermark bit b, carry out the embedded of watermark, obtain embedded watermark Frequency-region signal afterwards, is calculated as follows,
|X′n|=| Xn|+bαu
Wherein, α is constant, controls the embedment strength of watermark, | Xn| and | X 'n| represent respectively the frequency domain amplitude before embedded watermark and Frequency domain amplitude after embedded watermark, the then frequency-region signal after Euler's formula obtains embedded watermark
Wherein, ∠ XnRepresent the phase place of frequency-region signal, X 'nRepresent the frequency-region signal after embedded watermark, e is mathematics natural Exponents;
Step A5, by the frequency domain signal X ' after embedded watermarknTransform to time domain, generate the audio file of embedded watermark;
Described detection process comprises the following steps,
Step B1, reads audio file to be detected, the n-th frame signal z after the time domain framing obtainingnWith sample rate f s2,
First to every frame signal xnBeing made whether can be used as the judgement in embedded region;
For can be used as each frame signal x in embedded regionn, as signal to be detected, calculate the starting point of detection range Freqmin2 and frequency domain end point freqmax2
Freqmin2=floor ((FWMIN × 2.0/fs2) × N)
Freqmax2=floor ((FWMAX × 2.0/fs2) × N)
Step B2, carries out the frequency-region signal Z that discrete Fourier transform (DFT) obtains signal to be detectedn, corresponding frequency domain range value is designated as |Zn|;
Step B3, by the use of key key as random number seed, generate the binary system that length is freqmax2-freqmin2+1 pseudo- with The frequency expansion sequence u of machine;
Step B4, according to the frequency domain range value of frequency expansion sequence u and signal to be detected | Zn|, calculate the sufficient statistic r of detectionnAs Under,
If sufficient statistic rn>=0, then the watermark bit detecting is b=1;Otherwise, the watermark bit detecting is b=0.
And, in step A1 and step B1, to every frame signal xnBeing made whether can be used as the judgement in embedded region, realization side Formula is as follows,
1) signal xnAverage energySize exceed default respective threshold τ1, it is to be then quiet area, do not allow embedded watermark;
2) if signal xnInside comprise transient signal, then do not allow embedded watermark.
And, signal xnInside whether comprise transient signal, judge in the following manner,
If a frame signal is decomposed into S block, calculate the energy of S block respectively, compare block and the least energy of ceiling capacity Energy ratio rate of block and default respective threshold τ2If rate is more than τ2Then think that this frame signal comprises transient signal.
The present invention correspondingly provide a kind of based on embedding regioselective audio frequency watermark system, include audio frequency watermark embed subsystem with Watermark detection subsystem,
Described audio frequency watermark embeds subsystem and includes with lower module,
Select appropriate area to embed module, for reading audio file, obtain n-th frame time-domain audio after sample rate f s1 and framing Signal xn, frame length is N,
First to every frame signal xnBe made whether can as the judgement in embedded region,
Then being directed to can be used as each frame signal x in embedded regionn, carry out the selection of the embedded frequency band of audio frequency watermark, carry out sound The selection of the embedded frequency band of frequency watermark, if according to the sensitive default embedded starts frequency of frequency-portions of auditory perceptual be FWMIN, end frequency are FWMAX, a frame start embedded point freqmin1 and embedded end point freqmax1 ask for as Under,
Freqmin1=floor ((FWMIN × 2.0/fs1) × N)
Freqmax1=floor ((FWMAX × 2.0/fs1) × N)
Wherein, floor is downward bracket function;
First time-frequency convert module, for each frame signal x that can embed watermarkn, carry out discrete Fourier transform (DFT) and obtain frequency domain Signal Xn
First frequency expansion sequence generation module, for by the use of key key as random number seed, generating length be The binary system pseudorandom frequency expansion sequence u of freqmax1-freqmin1+1;
Watermark embedding module, for according to frequency expansion sequence u, frequency domain signal XnWith watermark bit b, carry out the embedded of watermark, obtain Frequency-region signal to after embedded watermark, is calculated as follows,
|X′n|=| Xn|+bαu
Wherein, α is constant, controls the embedment strength of watermark, | Xn| and | X 'n| represent respectively the frequency domain amplitude before embedded watermark and Frequency domain amplitude after embedded watermark, the then frequency-region signal after Euler's formula obtains embedded watermark
Wherein, ∠ XnRepresent the phase place of frequency-region signal, X 'nRepresent the frequency-region signal after embedded watermark, e is mathematics natural Exponents;
Time-frequency inverse transform module, for by the frequency domain signal X ' after embedded watermarknTransform to time domain, generate the audio frequency literary composition of embedded watermark Part;
Described watermark detection subsystem includes with lower module,
Select appropriate area detection module, the n-th frame signal z for reading audio file to be detected, after the time domain framing obtainingn With sample rate f s2,
First to every frame signal xnBeing made whether can be used as the judgement in embedded region;
For can be used as each frame signal x in embedded regionn, as signal to be detected, calculate the starting point of detection range Freqmin2 and frequency domain end point freqmax2
Freqmin2=floor ((FWMIN × 2.0/fs2) × N)
Freqmax2=floor ((FWMAX × 2.0/fs2) × N)
Second time-frequency convert module, obtains the frequency-region signal Z of signal to be detected for carrying out discrete Fourier transform (DFT)n, respective tones Domain range value is designated as | Zn|;
Second frequency expansion sequence generation module, for by the use of key key as random number seed, generating length be The binary system pseudorandom frequency expansion sequence u of freqmax2-freqmin2+1;
Coherent detection module, for the frequency domain range value according to frequency expansion sequence u and signal to be detected | Zn|, calculate filling of detection Divide statistic rnIt is as follows,
If sufficient statistic rn>=0, then the watermark bit detecting is b=1;Otherwise, the watermark bit detecting is b=0.
And, select appropriate area to embed module and select in appropriate area detection module, to every frame signal xnBeing made whether can As the judgement in embedded region, implementation is as follows,
1) signal xnAverage energySize exceed default respective threshold τ1, it is to be then quiet area, do not allow embedded watermark;
2) if signal xnInside comprise transient signal, then do not allow embedded watermark.
And, signal xnInside whether comprise transient signal, judge in the following manner,
If a frame signal is decomposed into S block, calculate the energy of S block respectively, compare block and the least energy of ceiling capacity Energy ratio rate of block and default respective threshold τ2If rate is more than τ2Then think that this frame signal comprises transient signal.
The present invention proposes the accuracy rate lifting watermark detection by frame in ceiling capacity and least energy than to filter transient signal, Lift the robustness of watermark by embedding a watermark in the significant frequency range of auditory perceptual, further, propose to utilize average energy To filter the No Tooting Area lifting perception transparency.Technical solution of the present invention has important market value.
Brief description
Fig. 1 is the embedded subsystem structure block diagram of the embodiment of the present invention.
Fig. 2 is the detection subsystem structure block diagram of the embodiment of the present invention.
Fig. 3 is the telescopiny flow chart of the embodiment of the present invention
Fig. 4 is the detection process flow chart of the embodiment of the present invention.
Specific embodiment
Combine accompanying drawing with specific embodiment below technical scheme is described further.
The embodiment of the present invention provide a kind of based on embedding regioselective audio frequency watermark system, include audio frequency watermark embed subsystem with Watermark detection subsystem.
Referring to Fig. 1, embedded regioselective audio watermarking technique provided in an embodiment of the present invention embeds subsystem, closes including selection Suitable region embeds module 1, the first time-frequency convert module 2, the first frequency expansion sequence generation module 3, watermark embedding module 4 and time-frequency Inverse transform module 5, can realize each module using software firming bechnology when being embodied as.
Described selection appropriate area embeds module 1, and the time-domain audio signal frame reading is judged, can be by when being embodied as Frame judges whether to disclosure satisfy that the condition of embedded watermark:It is unsatisfactory for just skipping this frame, continue the judgement of next frame;If meeting Signal output is given the first time-frequency conversion module 2, the sample rate according to the time-domain audio signal reading and human ear more sensitivity Frequency range calculates the scope that this frequency-region signal embeds watermark, and the frequency-region signal that can embed in scope output feedwater print is embedded mould Block 4, the maximum of this embedded scope and minima are exported to the first frequency expansion sequence generation module 3;
Described first time-frequency convert module 2, for being converted to frequency-region signal, output feedwater print by the time-domain audio signal reading Embedded module 4;
Described first frequency expansion sequence generation module 3, for embedding module 1 input according to random number seed and selection appropriate area The maximum of embedded scope and minima generate and embed the amplitude with length for the scope is 1 or -1 equally distributed random sequences, and This random sequence is exported to watermark embedding module 4;
Described watermark embedding module 4, for the amplitude spectrum in frequency-region signal, generates the audio signal with watermark information of frequency domain Export to time-frequency inverse transform module 5;
Described time-frequency inverse transform module 5, for embedding a watermark into the audio signal with watermark information of the frequency domain of module 4 input Be converted to the audio signal with watermark information of time domain, and the audio signal with watermark information for this time domain is generated audio frequency literary composition Part, just obtains the audio file with watermark information.
Referring to Fig. 2, the adaptive audio watermark detection subsystem based on phase code provided in an embodiment of the present invention, including selection Appropriate area detection module 6, the second time-frequency convert module 7, the second frequency expansion sequence generation module 8, coherent detection module 9, tool Body can realize each module using software firming bechnology when implementing.
Described selection appropriate area detection module 6 is essentially identical with the function of selecting appropriate area to embed module 1, is unsatisfactory for watermark The region of embedded condition, does not typically contain watermark yet, can be without consideration during detection:Can judge frame by frame when being embodied as, right In the frame being unsatisfactory for testing conditions, skip and do not detect, continue the judgement of next frame;Meet testing conditions audio signal export to Second time-frequency conversion module 7, equally exports the maxima and minima in frequency detecting region to the second time-frequency convert module 7 He Second frequency expansion sequence generation module 8;
Described second time-frequency convert module 7, for the time-domain audio signal reading is converted to frequency-region signal, exports to correlation Detection module 9;
Described second frequency expansion sequence generation module 8 is essentially identical with the function of the first frequency expansion sequence generation module 3, the knot that will produce Fruit exports to coherent detection module 9;
Described coherent detection module 9, for giving birth to the frequency domain amplitude signal to be detected of input and frequency expansion sequence according to detection range Become the frequency expansion sequence of module 9 input, calculate correlation, according to the symbol of correlation, judge watermark.
Each module implements referring to method corresponding steps, and it will not go into details for the present invention.Provided in an embodiment of the present invention based on embedded area The audio-frequency water mark method that domain selects, including telescopiny and detection process.
Referring to Fig. 3, the audio frequency watermark telescopiny based on selection region provided in an embodiment of the present invention can adopt computer software Technological means carry out flow process automatically, specifically include following steps:
Step A1, reads audio file, the audio signal x elder generation framing to time domain, obtains n-th after sample rate f s1 and framing Frame time-domain audio signal xn(frame length is N), to every frame signal xnIt is made whether to judge bag as the judgement in embedded region Judge containing both sides:
1) judge xnAverage energy size whether beyond the threshold value setting, to judge present frame xnWhether it is quiet area, such as Fruit is that quiet area does not allow for embedded watermark, is not otherwise just quiet area beyond threshold value, may be embedded.By following public affairs Formula calculates the average energy of n-th frame
Wherein, N is frame length, i.e. the sample points of a frame in;I is the sample point index number of a frame in, and value arrives N-1 0 Between;xn 2I () represents n-th frame time-domain signal xnIn i-th point of energy of frame in;τ1For the decision threshold of average energy, specifically reality When applying, those skilled in the art voluntarily can preset value, for example, be empirically derived;If exceeding threshold value, meet condition 1), Carry out following condition 2) judgement.
When 2) transient signal for a frame in, due to its frequency acute variation, the larger variance that can cause, in inspection During survey, the error probability of the watermark detection that signal variance causes more greatly is higher, and this situation should not embed watermark yet.By by one Frame is decomposed into S block, calculates the energy of S block respectively, by energy ratio rate of the block of ceiling capacity and least energy block and Threshold tau2Comparison, rate be more than τ2Then it is considered that this frame signal comprises the not embedded watermark of transient signal, otherwise can embed water Print.When being embodied as, those skilled in the art can voluntarily preset the value of S.
Specific implementation is as follows:
First by frame signal xnIt is divided into S block, then sample points M in each sub-block are
M=N/S (2)
The ENERGY E of each blockiIt is calculated as follows
Wherein, i represents the index number of intra block, and j represents the index number of frame in sample point, xn 2J () represents n-th frame time domain Signal xnEnergy in frame in jth point.
Find out the ceiling capacity E in block energyMaxWith least energy EMin
EMax=MAX { Ei, EMin=MIN { Ei, i ∈ [0, S-1] (4)
Wherein, MAX, MIN represent maximizing function and minimum value function respectively.
The ratio rate of ceiling capacity and least energy is calculated as follows:
If rate is > τ2, it is considered as signal frame xnInside there is transient signal, this frame does not embed watermark;Otherwise, water can be embedded Print.Wherein τ2For threshold value, when being embodied as, those skilled in the art can voluntarily preset value, such as τ2Detection for transient signal Threshold value, is empirically derived.
Then being directed to can be used as each frame signal x in embedded regionn, for the selection of the embedded frequency band of audio frequency watermark, Ying Weiren The more significant region of ear perception, those skilled in the art voluntarily can preset according to auditory perceptual characteristic, for example 1000-7000Hz.Because the signal in these regions is after filtering, after audio compression etc. attacks, will not be removed.So by water Print is embedded into the obvious region of perception, is standing will not to be erased after some signals are attacked, is being capable of detecting when.If setting according to people The sensitive default embedded starts frequency of frequency-portions of ear perception is FWMIN, end frequency is FWMAX, a corresponding frame Start embedded point freqmin1 and embedded end point freqmax1 ask for as follows,
Freqmin1=floor ((FWMIN × 2.0/fs1) × N) (6)
Freqmax1=floor ((FWMAX × 2.0/fs1) × N) (7)
Wherein, floor is downward bracket function.
According to starting embedded point freqmin1 and embedded end point freqmax1, choose the frequency-domain audio signals in the range of this.
Can judge frame by frame when being embodied as, be unsatisfactory for skipping of condition, carry out the judgement of next frame.
Step A2, to the signal frame x that can embed watermarkn, carrying out FFT (fast discrete Fourier conversion) is frequency domain Signal Xn.
Step A3, by the use of key key as random number seed, generates the binary system that length is freqmax1-freqmin1+1 pseudo- Random frequency expansion sequence u.
Embodiment detailed process in MATLAB is as follows:
First, using key key, call RandStream function (random seed function) to rand function (generating random number Function) initialized, then call rand function to generate random number, because the random number that rand function generates is between 0~1 Number, also need to carry out, to these numbers, the binary pseudo-random sequence becoming 0 and 1 that rounds up, then by this unipolar pseudo- with Machine sequence, switchs to pseudo-random sequence u that bipolarity comprises only+1 and -1.
Step A4, according to frequency expansion sequence u, frequency domain signal XnWith watermark bit b, carry out watermark using equation below (8) Embedded, obtain the frequency-region signal after embedded watermark, calculate realize as follows
|X′n|=| Xn|+bαu (8)
Wherein, α is constant, controls the embedment strength of watermark, those skilled in the art's predeterminable value when being embodied as;|Xn| and |X′n| represent the frequency domain amplitude before embedded watermark and the frequency domain amplitude after embedded watermark respectively, then embedded by Euler's formula Frequency-region signal after watermark.
Wherein, ∠ XnRepresent the phase place of frequency-region signal, X 'nRepresent the frequency-region signal after embedded watermark, e is mathematics natural Exponents.
Step A5, by the frequency domain signal X ' after embedded watermarknTransform to time domain, ultimately produce audio file, that is, obtain embedded water The audio file of print.
Referring to Fig. 4, the audio frequency watermark detection process embedding based on selection region provided in an embodiment of the present invention, computer can be adopted Software engineering means carry out flow process automatically, specifically include following steps:
Step B1, reads audio file to be detected, the n-th frame signal z after the time domain framing obtainingnWith sample rate f s2, right Each time-domain signal znTake steps the same decision method in A1,
Consider following two conditions,
1) signal xnAverage energySize exceed default respective threshold τ1, it is to be then quiet area, do not allow embedded watermark;
2) if signal xnInside comprise transient signal, then do not allow embedded watermark.
It is not then quiet area and the frame signal not comprising transient signal, watermark can be embedded and have to be detected.
Can judge frame by frame when being embodied as, be unsatisfactory for skipping of condition, carry out the judgement of next frame.
For can be used as each frame signal x in embedded regionn, as signal to be detected, calculate the frequency domain starting point of detection range Freqmin2 and frequency domain end point freqmax2
Freqmin2=floor ((FWMIN × 2.0/fs2) × N) (10)
Freqmax2=floor ((FWMAX × 2.0/fs2) × N) (11)
Step B2, for the signal z meeting testing conditionsn, carry out the frequency domain letter that discrete Fourier transform (DFT) obtains signal to be detected Number Zn, corresponding frequency domain range value is designated as | Zn|.
Step B3, using key key, generates binary system frequency expansion sequence u (identical with the u mode that embedding grammar above obtains), I.e. by the use of key key as random number seed, generate the binary system pseudorandom spreading sequence that length is freqmax2-freqmin2+1 u.
Step B4, according to the frequency domain range value of frequency expansion sequence u and signal to be detected | Zn|, by calculating frequency expansion sequence u and to be checked Survey the frequency domain range value of signal | Zn| correlation, calculate the sufficient statistic r of detectionn
Wherein,<·>Represent that the inner product of signal calculates.
If sufficient statistic rn>=0, then the watermark bit detecting is b=1;Otherwise, the watermark bit detecting is b=0.
Specific embodiment described in the present invention is only explanation for example to present invention spirit.The technical field of the invention Technical staff can be made various modifications or supplement or substituted using similar mode to described specific embodiment, but simultaneously Do not deviate by the spirit of the present invention or surmount scope defined in appended claims.

Claims (6)

1. a kind of based on embedding regioselective audio-frequency water mark method it is characterised in that:Including telescopiny and detection process, Described telescopiny comprises the following steps,
Step A1, reads audio file, obtains the signal x of n-th frame time-domain audio after sample rate f s1 and framingn, frame length is N, First to every frame signal xnBe made whether can as the judgement in embedded region,
Then being directed to can be used as each frame signal x in embedded regionn, carry out the selection of the embedded frequency band of audio frequency watermark, carry out sound The selection of the embedded frequency band of frequency watermark, if according to the sensitive default embedded starts frequency of frequency-portions of auditory perceptual be FWMIN, end frequency are FWMAX, a frame start embedded point freqmin1 and embedded end point freqmax1 ask for as Under,
Freqmin1=floor ((FWMIN × 2.0/fs1) × N)
Freqmax1=floor ((FWMAX × 2.0/fs1) × N)
Wherein, floor is downward bracket function;
Step A2, to each frame signal x that can embed watermarkn, carry out discrete Fourier transform (DFT) and obtain frequency domain signal Xn
Step A3, by the use of key key as random number seed, generates the binary system that length is freqmax1-freqmin1+1 pseudo- Random frequency expansion sequence u;
Step A4, according to frequency expansion sequence u, frequency domain signal XnWith watermark bit b, carry out the embedded of watermark, obtain embedded watermark Frequency-region signal afterwards, is calculated as follows,
|X′n|=| Xn|+bαu
Wherein, α is constant, controls the embedment strength of watermark, | Xn| and | X 'n| represent respectively the frequency domain amplitude before embedded watermark and Frequency domain amplitude after embedded watermark, the then frequency-region signal after Euler's formula obtains embedded watermark
X n &prime; = | X n &prime; | e j &angle; X n
Wherein, ∠ XnRepresent the phase place of frequency-region signal, X 'nRepresent the frequency-region signal after embedded watermark, e is mathematics natural Exponents;
Step A5, by the frequency domain signal X ' after embedded watermarknTransform to time domain, generate the audio file of embedded watermark;
Described detection process comprises the following steps,
Step B1, reads audio file to be detected, the n-th frame signal z after the time domain framing obtainingnWith sample rate f s2,
First to every frame signal xnBeing made whether can be used as the judgement in embedded region;
For can be used as each frame signal x in embedded regionn, as signal to be detected, calculate the starting point of detection range Freqmin2 and frequency domain end point freqmax2
Freqmin2=floor ((FWMIN × 2.0/fs2) × N)
Freqmax2=floor ((FWMAX × 2.0/fs2) × N)
Step B2, carries out the frequency-region signal Z that discrete Fourier transform (DFT) obtains signal to be detectedn, corresponding frequency domain range value is designated as |Zn|;
Step B3, by the use of key key as random number seed, generate the binary system that length is freqmax2-freqmin2+1 pseudo- with The frequency expansion sequence u of machine;
Step B4, according to the frequency domain range value of frequency expansion sequence u and signal to be detected | Zn|, calculate the sufficient statistic r of detectionnAs Under,
r n = < u , | Z n | > < u , u >
If sufficient statistic rn>=0, then the watermark bit detecting is b=1;Otherwise, the watermark bit detecting is b=0.
2. according to claim 1 be based on embed regioselective audio-frequency water mark method it is characterised in that:Step A1 and step B1 In, to every frame signal xnBeing made whether can be as follows as the judgement in embedded region, implementation,
1) signal xnAverage energySize exceed default respective threshold τ1, it is to be then quiet area, do not allow embedded watermark;
2) if signal xnInside comprise transient signal, then do not allow embedded watermark.
3. according to claim 2 be based on embed regioselective audio-frequency water mark method it is characterised in that:Signal xnInside whether comprise Transient signal, judges in the following manner,
If a frame signal is decomposed into S block, calculate the energy of S block respectively, compare block and the least energy of ceiling capacity Energy ratio rate of block and default respective threshold τ2If rate is more than τ2Then think that this frame signal comprises transient signal.
4. a kind of based on embedding regioselective audio frequency watermark system it is characterised in that:Embed subsystem and watermark inspection including audio frequency watermark Survey subsystem,
Described audio frequency watermark embeds subsystem and includes with lower module,
Select appropriate area to embed module, for reading audio file, obtain n-th frame time-domain audio after sample rate f s1 and framing Signal xn, frame length is N,
First to every frame signal xnBe made whether can as the judgement in embedded region,
Then being directed to can be used as each frame signal x in embedded regionn, carry out the selection of the embedded frequency band of audio frequency watermark, carry out sound The selection of the embedded frequency band of frequency watermark, if according to the sensitive default embedded starts frequency of frequency-portions of auditory perceptual be FWMIN, end frequency are FWMAX, a frame start embedded point freqmin1 and embedded end point freqmax1 ask for as Under,
Freqmin1=floor ((FWMIN × 2.0/fs1) × N)
Freqmax1=floor ((FWMAX × 2.0/fs1) × N)
Wherein, floor is downward bracket function;
First time-frequency convert module, for each frame signal x that can embed watermarkn, carry out discrete Fourier transform (DFT) and obtain frequency domain Signal Xn
First frequency expansion sequence generation module, for by the use of key key as random number seed, generating length be The binary system pseudorandom frequency expansion sequence u of freqmax1-freqmin1+1;
Watermark embedding module, for according to frequency expansion sequence u, frequency domain signal XnWith watermark bit b, carry out the embedded of watermark, obtain Frequency-region signal to after embedded watermark, is calculated as follows,
|X′n|=| Xn|+bαu
Wherein, α is constant, controls the embedment strength of watermark, | Xn| and | X 'n| represent respectively the frequency domain amplitude before embedded watermark and Frequency domain amplitude after embedded watermark, the then frequency-region signal after Euler's formula obtains embedded watermark
X n &prime; = | X n &prime; | e j &angle; X n
Wherein, ∠ XnRepresent the phase place of frequency-region signal, X 'nRepresent the frequency-region signal after embedded watermark, e is mathematics natural Exponents;
Time-frequency inverse transform module, for by the frequency domain signal X ' after embedded watermarknTransform to time domain, generate the audio frequency literary composition of embedded watermark Part;
Described watermark detection subsystem includes with lower module,
Select appropriate area detection module, the n-th frame signal z for reading audio file to be detected, after the time domain framing obtainingn With sample rate f s2,
First to every frame signal xnBeing made whether can be used as the judgement in embedded region;
For can be used as each frame signal x in embedded regionn, as signal to be detected, calculate the starting point of detection range Freqmin2 and frequency domain end point freqmax2
Freqmin2=floor ((FWMIN × 2.0/fs2) × N)
Freqmax2=floor ((FWMAX × 2.0/fs2) × N)
Second time-frequency convert module, obtains the frequency-region signal Z of signal to be detected for carrying out discrete Fourier transform (DFT)n, respective tones Domain range value is designated as | Zn|;
Second frequency expansion sequence generation module, for by the use of key key as random number seed, generating length be The binary system pseudorandom frequency expansion sequence u of freqmax2-freqmin2+1;
Coherent detection module, for the frequency domain range value according to frequency expansion sequence u and signal to be detected | Zn|, calculate filling of detection Divide statistic rnIt is as follows,
r n = < u , | Z n | > < u , u >
If sufficient statistic rn>=0, then the watermark bit detecting is b=1;Otherwise, the watermark bit detecting is b=0.
5. according to claim 4 be based on embed regioselective audio frequency watermark system it is characterised in that:Appropriate area is selected to embed In module and selection appropriate area detection module, to every frame signal xnBeing made whether can be used as the judgement in embedded region, realization side Formula is as follows,
1) signal xnAverage energySize exceed default respective threshold τ1, it is to be then quiet area, do not allow embedded watermark;
2) if signal xnInside comprise transient signal, then do not allow embedded watermark.
6. according to claim 5 be based on embed regioselective audio frequency watermark system it is characterised in that:Signal xnInside whether comprise Transient signal, judges in the following manner,
If a frame signal is decomposed into S block, calculate the energy of S block respectively, compare block and the least energy of ceiling capacity Energy ratio rate of block and default respective threshold τ2If rate is more than τ2Then think that this frame signal comprises transient signal.
CN201610458412.9A 2016-06-22 2016-06-22 Audio-frequency water mark method and system based on insertion regional choice Active CN106409302B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610458412.9A CN106409302B (en) 2016-06-22 2016-06-22 Audio-frequency water mark method and system based on insertion regional choice

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610458412.9A CN106409302B (en) 2016-06-22 2016-06-22 Audio-frequency water mark method and system based on insertion regional choice

Publications (2)

Publication Number Publication Date
CN106409302A true CN106409302A (en) 2017-02-15
CN106409302B CN106409302B (en) 2019-07-09

Family

ID=58005751

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610458412.9A Active CN106409302B (en) 2016-06-22 2016-06-22 Audio-frequency water mark method and system based on insertion regional choice

Country Status (1)

Country Link
CN (1) CN106409302B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109714284A (en) * 2018-11-27 2019-05-03 华中科技大学 A kind of radio frequency method of detecting watermarks based on K-S detection
CN111292756A (en) * 2020-01-19 2020-06-16 成都嗨翻屋科技有限公司 Compression-resistant audio silent watermark embedding and extracting method and system
CN111883108A (en) * 2020-07-06 2020-11-03 珠海格力电器股份有限公司 Password embedding method and device, password matching method and device and control system

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020138730A1 (en) * 2000-06-15 2002-09-26 Hongseok Kim Apparatus and method for inserting and detecting watermark based on stochastic model
CN101185121A (en) * 2005-06-02 2008-05-21 汤姆森许可贸易公司 Method and apparatus for watermarking an audio or video signal with watermark data using a spread spectrum
CN102142255A (en) * 2010-07-08 2011-08-03 北京三信时代信息公司 Method for embedding and extracting digital watermark in audio signal
CN102664013A (en) * 2012-04-18 2012-09-12 南京邮电大学 Audio digital watermark method of discrete cosine transform domain based on energy selection
CN104658542A (en) * 2015-03-16 2015-05-27 武汉大学 Additive spread spectrum audio watermarking embedding method, additive spread spectrum audio watermarking detection method and additive spread spectrum audio watermarking embedding system based on orthogonality
CN104700841A (en) * 2015-02-10 2015-06-10 浙江省广电科技股份有限公司 Watermark embedding and detecting method based on audio content classification
CN105374360A (en) * 2015-11-25 2016-03-02 武汉大学 Interleaved additive spread spectrum audio watermark embedding method and detection method and system

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020138730A1 (en) * 2000-06-15 2002-09-26 Hongseok Kim Apparatus and method for inserting and detecting watermark based on stochastic model
CN101185121A (en) * 2005-06-02 2008-05-21 汤姆森许可贸易公司 Method and apparatus for watermarking an audio or video signal with watermark data using a spread spectrum
CN102142255A (en) * 2010-07-08 2011-08-03 北京三信时代信息公司 Method for embedding and extracting digital watermark in audio signal
CN102664013A (en) * 2012-04-18 2012-09-12 南京邮电大学 Audio digital watermark method of discrete cosine transform domain based on energy selection
CN104700841A (en) * 2015-02-10 2015-06-10 浙江省广电科技股份有限公司 Watermark embedding and detecting method based on audio content classification
CN104658542A (en) * 2015-03-16 2015-05-27 武汉大学 Additive spread spectrum audio watermarking embedding method, additive spread spectrum audio watermarking detection method and additive spread spectrum audio watermarking embedding system based on orthogonality
CN105374360A (en) * 2015-11-25 2016-03-02 武汉大学 Interleaved additive spread spectrum audio watermark embedding method and detection method and system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
ANSARI,A ET AL: "Spread-Spectrum Robust Image Watermarking for Ownership Protection", 《22ND IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING》 *
TIANWEN FENG ET.AL: "A Spread Spectrum watermarking Algorithm based Local Instruction Statistic", 《9TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109714284A (en) * 2018-11-27 2019-05-03 华中科技大学 A kind of radio frequency method of detecting watermarks based on K-S detection
CN109714284B (en) * 2018-11-27 2020-06-30 华中科技大学 Radio frequency watermark detection method based on K-S detection
CN111292756A (en) * 2020-01-19 2020-06-16 成都嗨翻屋科技有限公司 Compression-resistant audio silent watermark embedding and extracting method and system
CN111292756B (en) * 2020-01-19 2023-05-26 成都潜在人工智能科技有限公司 Compression-resistant audio silent watermark embedding and extracting method and system
CN111883108A (en) * 2020-07-06 2020-11-03 珠海格力电器股份有限公司 Password embedding method and device, password matching method and device and control system

Also Published As

Publication number Publication date
CN106409302B (en) 2019-07-09

Similar Documents

Publication Publication Date Title
CN101271690B (en) Audio spread-spectrum watermark processing method for protecting audio data
CN105976823B (en) Adaptive audio water mark method and system based on phase code
CN104658542B (en) Based on orthogonal additivity spread spectrum audio frequency watermark embedding grammar, detection method and system
KR20010111057A (en) Watermark embedding and extracting method for protecting digital audio contents copyright and preventing duplication and apparatus using thereof
CA2485800A1 (en) Method and apparatus for multi-sensory speech enhancement
CN106409302A (en) Audio frequency watermark method and system based on embedding area selection
US20050240768A1 (en) Re-embedding of watermarks in multimedia signals
CN102142255B (en) Method for embedding and extracting digital watermark in audio signal
CN108682425B (en) Robust digital audio watermark embedding system based on constant watermark
CN110163787A (en) Digital audio Robust Blind Watermarking Scheme embedding grammar based on dual-tree complex wavelet transform
CN103050120B (en) high-capacity digital audio reversible watermark processing method
Sheikhan et al. Improvement of embedding capacity and quality of DWT-based audio steganography systems
CN113782041A (en) Method for embedding and positioning watermark based on audio frequency-to-frequency domain
CN105374360B (en) Intersect additivity spread spectrum audio frequency watermark embedding grammar, detection method and system
US20070036357A1 (en) Watermarking of multimedia signals
US20050147248A1 (en) Window shaping functions for watermarking of multimedia signals
KR20070061285A (en) Digital audio watermarking method using hybrid transform
CN106205627A (en) Based on side information prediction and the DAB reversible water mark algorithm of rectangular histogram translation
CN115760535A (en) Self-adaptive audio blind watermark embedding and extracting method based on local mean decomposition
Wu et al. Adaptive audio watermarking based on SNR in localized regions
WO2014120685A1 (en) Systems and methods for detecting a synchronization code word
Wu et al. BCH Code-Based Robust Audio Watermarking Algorithm in the DWT Domain
Dieu et al. An improved technique for hiding data in audio
Wang et al. A blind audio watermarking algorithm robust against synchronization attack
Erçelebi et al. Robust multi bit and high quality audio watermarking using pseudo-random sequences

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant