CN104021791B - Detecting method based on digital audio waveform sudden changes - Google Patents
- Publication number
- CN104021791B (application number CN201410285152.0A)
- Authority
- CN
- China
- Prior art keywords
- audio
- audio frequency
- logarithm
- spectrum
- frame
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Landscapes
- Signal Processing For Digital Recording And Reproducing (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
The invention discloses a method for detecting sudden changes in a digital audio waveform. It is a statistical decision method based on the ridge that a sudden waveform change produces in the spectrogram, and it belongs to the field of multimedia information security. To detect the sudden waveform change left by a copy-paste operation on digital audio, the method analyzes how the ridge span changes before and after an audio splice point in the log-domain spectrogram. A ridge factor is constructed to describe the ridge bandwidth of one frame of the log spectrogram and thereby characterize the short-time energy change, and a difference operator is used to distinguish ridge factors caused by sudden loud sounds in the audio from those caused by sudden waveform changes. The method comprises the following steps: applying a short-time Fourier transform and a logarithmic transform to the audio signal to obtain the log-domain spectrogram; calculating the ridge factor of each frame of the spectrogram; and applying a difference transform to the ridge factors and making a decision on the result. The method identifies sudden changes in the audio waveform effectively and provides an effective means of detecting the boundaries of digital audio editing operations.
Description
Technical field
The present invention relates to the field of multimedia information security, and in particular to a method for detecting sudden changes in a digital audio waveform.
Background art
Advances in the digitization of multimedia and in transmission technology have led to rapid growth in the use of digital audio. Digital audio is easy to transmit and distribute through physical or electronic systems, but these same advantages bring new problems: digital audio may be tampered with, deliberately or accidentally, while it is recorded and copied. Whether the cause is deliberate tampering that destroys integrity and authenticity, or errors that arise during transmission, the originality of the data is damaged. For information of special significance, such as court evidence, classified departmental documents and backups of historical records, malicious tampering can have very serious consequences.
Copy-paste/delete tampering of digital speech copies one speech segment of a recording into another speech segment, or deletes a segment from a recording. It is a simple and effective way to change the important information in speech, as shown in Figures 1 and 2 of the accompanying drawings. Because the background noise and the speaker's voiceprint are identical or similar throughout one recording, it is very difficult for the human ear to identify a tampered segment, so detecting this form of tampering is of great practical significance for judging whether speech is authentic.
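The effect of such tampering on the waveform can be illustrated with a short sketch. The helper name `delete_segment`, the 200 Hz test tone and the 8 kHz sampling rate are assumptions chosen for illustration only and do not come from the patent. The sketch deletes a few samples from a smooth tone and compares the jump at the splice point with the largest step between neighboring samples in the untouched signal.

```python
import numpy as np

def delete_segment(y, start, stop):
    """Return a copy of y with samples [start, stop) removed (hypothetical helper)."""
    return np.concatenate([y[:start], y[stop:]])

fs = 8000                                  # assumed sampling rate in Hz
t = np.arange(fs) / fs                     # one second of sample times
y = np.sin(2 * np.pi * 200 * t)            # smooth 200 Hz tone, period = 40 samples

# Remove 10 samples, i.e. a quarter of the 40-sample period.
tampered = delete_segment(y, 3000, 3010)

# Step across the splice point versus the largest step in the original tone.
jump = abs(tampered[3000] - tampered[2999])
normal_step = np.max(np.abs(np.diff(y)))
print(f"jump at splice: {jump:.3f}, largest normal step: {normal_step:.3f}")
```

In this toy case the step at the splice is several times larger than any step in the original tone, which is the kind of discontinuity the method sets out to detect.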
Summary of the invention
The technical problem to be solved by the present invention is to provide a method for detecting sudden changes in a digital audio waveform. Given a segment of speech, the method can determine whether it contains a sudden waveform change caused by copying and pasting a speech segment or by deleting a segment, and can also locate the time range of the tampering, thereby establishing whether the speech is authentic and overcoming the deficiencies of the prior art.
The present invention is realized as follows. The method for detecting sudden changes in a digital audio waveform comprises the following steps: 1) transform the audio signal to obtain the audio spectrogram Y, and apply a logarithmic transform to Y to obtain the log spectrogram G; 2) binarize the energy of the log spectrogram G; 3) calculate the audio sudden-change coefficient σ_t of each frame of the log spectrogram G; 4) evaluate the sudden-change coefficient σ_t to detect sudden changes in the audio waveform and locate the regions where they occur.
In step 1), the audio signal is transformed into the audio spectrogram Y and the logarithmic transform is applied to obtain the log spectrogram G as follows. The digital audio signal y of length h is divided into frames, giving a matrix with N_1 frames of length 2*N each. A window function is applied and the short-time Fourier transform is computed, giving an audio spectrogram Y of size N * N_1. The logarithmic transform is applied to Y to obtain the log spectrogram G, also of size N * N_1.
In step 2), the energy of the log spectrogram G is binarized as follows. First the maximum G_max and the minimum G_min of the log spectrogram G are found. With the energy value at each frequency of each frame denoted G_ti (1≤t≤N, 1≤i≤N_1), the energy binarization value δ(t,i) is calculated by formula (1), where λ is a threshold factor.
In step 3), the audio sudden-change coefficient σ_t (1≤t≤N_1) of each frame G_t of the log spectrogram is calculated by formula (2).
In step 4), the sudden-change coefficient σ_t is evaluated to detect and locate sudden changes in the audio waveform as follows. Let σ_{i-1}, σ_i and σ_{i+1} be the sudden-change coefficients obtained in step 3) for the i-th frame G_i of the log spectrogram G and its two adjacent frames. If they satisfy
σ_i > 0.85 and |σ_i - σ_{i-1}| * |σ_i - σ_{i+1}| > σ_i / 16
then the audio is determined to contain a sudden change, and the i-th frame G_i is the detected sudden-change region of the audio waveform.
Compared with the prior art, the present invention exploits the fact that the sudden waveform change produced when speech is tampered with by splicing causes the energy across frequencies in that short time segment to rise sharply. A difference algorithm distinguishes sudden waveform changes caused by editing from sudden changes caused by strong signals. After the sudden-change coefficients are calculated, the method detects traces of copy-paste/delete tampering in the audio and accurately locates the time of the tampering. The method detects and locates tampering well: it identifies sudden waveform changes even when they cannot be heard by the human ear or observed in the spectrogram, and is an effective technique for detecting copy-paste/delete tampering of audio.
Brief description of the drawings
Fig. 1 is a schematic diagram of inserting an audio segment into audio data;
Fig. 2 is a schematic diagram of deleting an audio segment from audio data;
Fig. 3 is a schematic diagram of the sudden waveform change at a splice point in tampered speech;
Fig. 4 is a flowchart of the detection algorithm;
Fig. 5 is the waveform of the speech under test;
Fig. 6 is the log spectrogram of the speech under test;
Fig. 7 shows the sudden-change coefficients of the speech under test.
Detailed description of the embodiments
Embodiment 1 of the present invention: a method for detecting sudden changes in a digital audio waveform.
1) Transform the audio signal y to obtain the audio spectrogram Y and the log spectrogram G. The digital audio signal y of length h is divided into frames of length 2*N (N = 128 is used) with overlap ratio l (l = 0.5 is used); the number of frames N_1 then follows from h, the frame length and the overlap ratio. Let the framed signals be y_i, i = 1...N_1. The discrete short-time Fourier transform of formula (1) is applied to obtain Y_ti, where w(N-m) is the window function (a Hamming window is used). The spectrogram is then calculated by formula (2) and the logarithmic transform is applied to obtain the log spectrum G_i (i = 1...N_1) of each frame. All log-magnitude values together form a matrix, the log spectrogram G of the speech signal, of size N * N_1.
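Step 1) can be sketched in NumPy as follows. This is a minimal sketch under stated assumptions, not the patent's exact formulas: the frame-count expression, the use of the log power `log(|Y|^2 + eps)` and the small constant `eps` are assumptions, and the function name `log_spectrogram` is hypothetical. The frame length 2*N with N = 128, the overlap ratio 0.5, the Hamming window and the N * N_1 output size follow the text.

```python
import numpy as np

def log_spectrogram(y, N=128, overlap=0.5, eps=1e-10):
    """Return an N x N_1 log spectrogram G (rows: frequency bins, columns: frames)."""
    frame_len = 2 * N
    hop = int(round(frame_len * (1 - overlap)))        # samples between frame starts
    if len(y) < frame_len:
        raise ValueError("signal is shorter than one frame")
    n_frames = 1 + (len(y) - frame_len) // hop         # ASSUMPTION: no padding of the tail
    window = np.hamming(frame_len)                     # Hamming window, as in the text
    frames = np.stack([y[i * hop: i * hop + frame_len] * window
                       for i in range(n_frames)])
    Y = np.fft.rfft(frames, axis=1)[:, :N]             # keep N frequency bins per frame
    G = np.log(np.abs(Y) ** 2 + eps)                   # ASSUMPTION: log of the power spectrum
    return G.T                                         # shape (N, N_1)

rng = np.random.default_rng(0)
y = rng.standard_normal(8000)                          # placeholder signal for illustration
G = log_spectrogram(y)
print(G.shape)
```

Storing frames as columns matches the N * N_1 size given in the text, so later per-frame statistics are taken along the first axis.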
2) Binarize the energy of the log spectrum G_i. First the maximum G_max and the minimum G_min of the log spectrum are found. For the energy value G_i(k), k = 1...N, at each frequency of each frame, the energy binarization value δ(i,k) is calculated by formula (3), where λ is a threshold factor (λ = 0.65 is used). The matrix Δ formed by the values δ is defined as the short-time energy binary spectrum: δ(i,k) = 0 means that the energy of frequency component k of frame i is low, and δ(i,k) = 1 means that it is high.
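The binarization of step 2) might look like the sketch below. The formula itself is not reproduced in this text, so the threshold used here, `G_min + λ * (G_max - G_min)`, is an assumption that is merely consistent with the description (a global maximum and minimum plus a threshold factor λ); the function name `binarize_energy` is hypothetical.

```python
import numpy as np

def binarize_energy(G, lam=0.65):
    """Return the short-time energy binary spectrum: 1 = high energy, 0 = low energy."""
    g_min, g_max = G.min(), G.max()
    # ASSUMPTION: threshold placed a fraction lam of the way from G_min to G_max.
    threshold = g_min + lam * (g_max - g_min)
    return (G > threshold).astype(np.uint8)

# Toy 3 x 3 log spectrogram (rows: frequency bins, columns: frames).
G = np.array([[0.0, 10.0, 2.0],
              [1.0,  9.0, 3.0],
              [0.5,  7.0, 8.0]])
delta = binarize_energy(G)   # threshold = 0 + 0.65 * 10 = 6.5
print(delta)
```

With this toy input every bin of the middle frame exceeds the threshold, which mimics the broadband energy rise described for a spliced frame.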
3) Calculate the sudden-change coefficient σ_i of each frame G_i of the log spectrogram. When a speech segment is pasted in by hand, the audio signal shows a sudden waveform change at the edit point. This operation introduces new high-frequency components, so that in the log-magnitude spectrum the energy at all frequencies of the frame containing the sudden change rises abruptly relative to the adjacent frames. The number of nonzero values in that frame of the short-time energy binary spectrum should therefore be markedly greater than in the adjacent frames. The mean of each frame of the short-time energy binary spectrum from step 2) is computed according to formula (4), giving the sudden-change coefficient σ_i (1≤i≤N_1).
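Since the text describes the coefficient as the mean of each frame of the binary spectrum, step 3) can be sketched as a per-column mean. The function name `sudden_change_coefficient` and the toy matrix are illustrative assumptions; the layout (rows are frequency bins, columns are frames) follows the N * N_1 size given for G.

```python
import numpy as np

def sudden_change_coefficient(delta):
    """Return one coefficient per frame: the fraction of high-energy bins in that frame."""
    return np.asarray(delta, dtype=float).mean(axis=0)   # average over frequency bins

# Toy binary spectrum: 3 frequency bins (rows) x 3 frames (columns).
delta = np.array([[0, 1, 0],
                  [0, 1, 0],
                  [0, 1, 1]])
sigma = sudden_change_coefficient(delta)
print(sigma)
```

A value near 1 means that almost every frequency bin of the frame is above the binarization threshold, which is the broadband signature the text attributes to an edit point.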
4) Evaluate the sudden-change coefficient σ_t to detect and locate sudden changes in the audio waveform. Owing to the nature of speech, a strong speech signal also makes the detected sudden-change coefficient large, but such a signal persists over time, whereas the large coefficient caused by manual copy-paste appears in only one frame. A difference algorithm is therefore used to distinguish the sudden waveform change caused by editing from the sudden change caused by a strong signal. Specifically, formula (5) is evaluated on the sudden-change coefficient σ_t (1≤t≤N_1) of each frame of the log spectrogram. Let σ_{i-1}, σ_i and σ_{i+1} be the sudden-change coefficients obtained in step 3) for the i-th frame G_i of the log spectrogram and its two adjacent frames. If they satisfy
σ_i > 0.85 and |σ_i - σ_{i-1}| * |σ_i - σ_{i+1}| > σ_i / 16
then the audio is determined to contain a sudden change, and the i-th frame G_i is the detected sudden-change region of the audio waveform.
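The decision rule can be sketched directly from the condition stated in claim 1 (σ_i > 0.85 and |σ_i - σ_{i-1}| * |σ_i - σ_{i+1}| > σ_i / 16). The function name `detect_sudden_changes`, the 0-based frame indices and the choice to skip the first and last frames (which lack one neighbor) are assumptions of this sketch.

```python
import numpy as np

def detect_sudden_changes(sigma, level=0.85, ratio=16.0):
    """Return 0-based indices of frames that satisfy the decision condition."""
    sigma = np.asarray(sigma, dtype=float)
    hits = []
    for i in range(1, len(sigma) - 1):                 # interior frames only
        product = abs(sigma[i] - sigma[i - 1]) * abs(sigma[i] - sigma[i + 1])
        if sigma[i] > level and product > sigma[i] / ratio:
            hits.append(i)
    return hits

# Frame 2 is an isolated spike; frames 5 to 7 form a sustained strong signal.
sigma = [0.1, 0.1, 0.95, 0.1, 0.1, 0.9, 0.92, 0.9, 0.1]
hits = detect_sudden_changes(sigma)
print(hits)   # [2]
```

The sustained run is rejected because at least one neighbor of each of its frames is almost equally large, so the product of differences stays small; this is the distinction between edits and strong signals that the text describes.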
Fig. 5 shows the waveform of speech that has been tampered with by deleting part of it. The dashed line marks the moment of the cut, and this editing trace cannot be detected by ear. As Fig. 6 shows, the log spectrogram of this audio contains no obvious short-time energy change. Nevertheless, as Fig. 7 shows, once the sudden-change coefficients are calculated the method detects traces of copy-paste/delete tampering in the audio and accurately locates the time of the tampering.
The above embodiment shows that the method detects and locates tampering well. It identifies sudden waveform changes even when they cannot be heard by the human ear or observed in the spectrogram, and is an effective technique for detecting copy-paste/delete tampering of audio.
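Locating the tampering in time amounts to mapping a flagged frame index back to seconds. The sketch below assumes 0-based frame indices, frames that start every `2*N*(1 - overlap)` samples, and a sampling rate `fs` supplied by the caller; none of these conventions, nor the name `frame_time_range`, is stated in the patent.

```python
def frame_time_range(i, fs, N=128, overlap=0.5):
    """Return (start, end) in seconds of the samples covered by frame i."""
    frame_len = 2 * N
    hop = int(round(frame_len * (1 - overlap)))   # samples between frame starts
    start = i * hop
    return start / fs, (start + frame_len) / fs

# Frame 10 of a recording sampled at an assumed 8 kHz.
start_s, end_s = frame_time_range(10, fs=8000)
print(f"{start_s:.3f} s to {end_s:.3f} s")   # 0.160 s to 0.192 s
```

With N = 128 and an overlap ratio of 0.5 each frame spans 256 samples, so at this sampling rate a detection narrows the edit down to a 32 ms window.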
Claims (1)
1. A method for detecting sudden changes in a digital audio waveform, characterized in that it comprises the following steps: 1) transforming the audio signal to obtain the audio spectrogram Y, and applying a logarithmic transform to the obtained spectrogram to obtain the log spectrogram G; 2) binarizing the energy of the log spectrogram G; 3) calculating the audio sudden-change coefficient σ_t of each frame G_t of the log spectrogram; 4) evaluating the sudden-change coefficient σ_t to detect sudden changes in the audio waveform and locate the sudden-change regions;
in step 1), transforming the audio signal to obtain the audio spectrogram Y and applying the logarithmic transform to obtain the log spectrogram G specifically comprises: dividing the digital audio signal y of length h into frames to obtain a matrix with N_1 frames of length 2*N; applying a window function and computing the short-time Fourier transform to obtain an audio spectrogram Y of size N*N_1; and applying the logarithmic transform to the audio spectrogram Y to obtain the log spectrogram G of size N*N_1;
in step 2), binarizing the energy of the log spectrogram G specifically comprises: first finding the maximum G_max and the minimum G_min of the log spectrogram G; with the energy value at each frequency of each frame denoted G_ti (1≤t≤N, 1≤i≤N_1), calculating the energy binarization value δ(t,i) by formula (1), where λ is a threshold factor;
in step 3), calculating the audio sudden-change coefficient of each frame G_t of the log spectrogram specifically comprises: calculating the sudden-change coefficient σ_t (1≤t≤N_1) by formula (2);
in step 4), evaluating the sudden-change coefficient σ_t to detect sudden changes in the audio waveform and locate the sudden-change regions specifically comprises: given the sudden-change coefficients σ_{i-1}, σ_i and σ_{i+1} obtained in step 3) for the i-th frame G_i of the log spectrogram G and its adjacent frames, if they satisfy
σ_i > 0.85 and |σ_i - σ_{i-1}| * |σ_i - σ_{i+1}| > σ_i / 16
then the audio is determined to contain a sudden change, and the i-th frame G_i is the detected sudden-change region of the audio waveform.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410285152.0A CN104021791B (en) | 2014-06-24 | 2014-06-24 | Detecting method based on digital audio waveform sudden changes |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410285152.0A CN104021791B (en) | 2014-06-24 | 2014-06-24 | Detecting method based on digital audio waveform sudden changes |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104021791A CN104021791A (en) | 2014-09-03 |
CN104021791B true CN104021791B (en) | 2017-02-22 |
Family
ID=51438515
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410285152.0A Active CN104021791B (en) | 2014-06-24 | 2014-06-24 | Detecting method based on digital audio waveform sudden changes |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104021791B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108831506A (en) * | 2018-06-25 | 2018-11-16 | 华中师范大学 | Digital audio based on GMM-BIC distorts point detecting method and system |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11217076B1 (en) * | 2018-01-30 | 2022-01-04 | Amazon Technologies, Inc. | Camera tampering detection based on audio and video |
CN111863023B (en) * | 2020-09-22 | 2021-01-08 | 深圳市声扬科技有限公司 | Voice detection method and device, computer equipment and storage medium |
CN113571091B (en) * | 2021-06-30 | 2024-04-19 | 青岛海尔科技有限公司 | Audio mutation detection method and device for monitoring and household appliance |
CN116543796B (en) * | 2023-07-06 | 2023-09-15 | 腾讯科技(深圳)有限公司 | Audio processing method and device, computer equipment and storage medium |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1585020A (en) * | 2004-05-28 | 2005-02-23 | 中山大学 | Digital audio-frequency anti-distorting method |
US7146309B1 (en) * | 2003-09-02 | 2006-12-05 | Mindspeed Technologies, Inc. | Deriving seed values to generate excitation values in a speech coder |
CN101383171A (en) * | 2008-10-16 | 2009-03-11 | 中山大学 | Blind detection method for MP3 audio distortion |
CN101562016A (en) * | 2009-05-26 | 2009-10-21 | 上海大学 | Totally-blind digital speech authentication method |
CN102592588A (en) * | 2012-01-10 | 2012-07-18 | 清华大学 | Digital audio record integrity detection method |
CN103345927A (en) * | 2013-07-11 | 2013-10-09 | 暨南大学 | Processing method for detecting and locating audio time domain tampering |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE19736669C1 (en) * | 1997-08-22 | 1998-10-22 | Fraunhofer Ges Forschung | Beat detection method for time discrete audio signal |
Non-Patent Citations (5)
Title |
---|
《数字音频的真实性鉴定》;邵松年;《中国优秀硕士学位论文全文数据库 信息科技辑》;20101115(第11期);全文 * |
《数字音频真实性鉴定的研究》;柳永娟;《中国优秀硕士学位论文全文数据库 信息科技辑》;20110515(第05期);全文 * |
《数字音频篡改检测与隐写分析技术研究》;丁琦;《中国博士学位论文全文数据库 信息科技辑》;20120715(第07期);全文 * |
《数字音频篡改检测技术的研究》;石倩;《中国优秀硕士学位论文全文数据库 信息科技辑》;20120715(第07期);全文 * |
《音频信息篡改检测算法的研究》;路允祥;《中国优秀硕士学位论文全文数据库 信息科技辑》;20120715(第07期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN104021791A (en) | 2014-09-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104021791B (en) | Detecting method based on digital audio waveform sudden changes | |
Ratnam et al. | Blind estimation of reverberation time | |
KR101269296B1 (en) | Neural network classifier for separating audio sources from a monophonic audio signal | |
Perez-Gonzalez et al. | Automatic gain and fader control for live mixing | |
EP2808867A1 (en) | Transient speech signal encoding method and device, decoding method and device, processing system and computer-readable storage medium | |
US6718301B1 (en) | System for measuring speech content in sound | |
WO2015196760A1 (en) | Microphone array speech detection method and device | |
US20110246205A1 (en) | Method for detecting audio signal transient and time-scale modification based on same | |
CN103886865A (en) | Sound Processing Device, Sound Processing Method, And Program | |
CN113674763B (en) | Method, system, device and storage medium for identifying whistle by utilizing line spectrum characteristics | |
EP3229487B1 (en) | Approach for detecting alert signals in changing environments | |
Prego et al. | A blind algorithm for reverberation-time estimation using subband decomposition of speech signals | |
CN102610232B (en) | Method for adjusting self-adaptive audio sensing loudness | |
EP1492085A2 (en) | Method of reflecting time/language distortion in objective speech quality assessment | |
CN105100508A (en) | Network voice quality estimation method, device and system | |
US10522160B2 (en) | Methods and apparatus to identify a source of speech captured at a wearable electronic device | |
Narkhede et al. | Acoustic scene identification for audio authentication | |
CN110718229A (en) | Detection method for record playback attack and training method corresponding to detection model | |
US20230386492A1 (en) | System and method for suppressing noise from audio signal | |
CN112233693B (en) | Sound quality evaluation method, device and equipment | |
CN112581975B (en) | Ultrasonic voice instruction defense method based on signal aliasing and binaural correlation | |
CN112750458B (en) | Touch screen sound detection method and device | |
CN104240705A (en) | Intelligent voice-recognition locking system for safe box | |
CN107548007B (en) | Detection method and device of audio signal acquisition equipment | |
CN108877816B (en) | QMDCT coefficient-based AAC audio frequency recompression detection method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | Effective date of registration: 20201113. Address after: No.3, pingzhong Road, Matou Town, Pingguo City, Baise City, Guangxi Zhuang Autonomous Region. Patentee after: Guangxi Pingguo Runmin Poverty Alleviation Development Co., Ltd. Address before: 550025 science and Technology Department, north campus, Guizhou University, Huaxi, Guizhou, China. Patentee before: Guizhou University |