CN104021791B - Detecting method based on digital audio waveform sudden changes - Google Patents

Detecting method based on digital audio waveform sudden changes Download PDF

Info

Publication number
CN104021791B
CN104021791B CN201410285152.0A CN201410285152A CN104021791B CN 104021791 B CN104021791 B CN 104021791B CN 201410285152 A CN201410285152 A CN 201410285152A CN 104021791 B CN104021791 B CN 104021791B
Authority
CN
China
Prior art keywords
audio
audio frequency
logarithm
spectrum
frame
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410285152.0A
Other languages
Chinese (zh)
Other versions
CN104021791A (en
Inventor
徐晶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangxi Pingguo Runmin Poverty Alleviation Development Co., Ltd
Original Assignee
Guizhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guizhou University filed Critical Guizhou University
Priority to CN201410285152.0A priority Critical patent/CN104021791B/en
Publication of CN104021791A publication Critical patent/CN104021791A/en
Application granted granted Critical
Publication of CN104021791B publication Critical patent/CN104021791B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Signal Processing For Digital Recording And Reproducing (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention discloses a detecting method based on digital audio waveform sudden changes, which is a statistical judging method disclosed according to the characteristic of a ridge of a speech spectrum caused by audio waveform sudden changes, and belongs to the multimedia information safety field. For waveform sudden change detection due to copying-pasting operation of a digital audio, ridge span changes before and after audio joint points in a speech spectrum log domain are analyzed, and a ridge factor is constructed to describe ridge bandwidth of a one-frame audio log speech spectrum to represent short-time energy change characteristics, and a difference operator is utilized for distinguishing sudden noises of audio, and ridge factor identification is caused by audio waveform sudden changes. The method comprises the following steps of carrying out short-time Fourier transformation and logarithmic transformation on an audio signal to obtain an audio spectrum of the log domain; calculating the ridge factor of each frame in the speech spectrum; and detecting and judging the difference transformation result for the ridge factor. The detecting method disclosed by the invention can identify audio waveform sudden changes effectively, and provides an effective method for detecting a digital audio editing operation boundary.

Description

The detection method of DAB waveform mutation
Technical field
The present invention relates to field of multi-media information safety, especially a kind of detection method of DAB waveform mutation.
Background technology
The progress of the digitized of multimedia technology and transmission technology is so that the application of DAB increases rapidly.Numeral Audio frequency is easy to be transmitted propagating by physics or electronics system, but simultaneously exactly these advantages also bring many new Problem:DAB record and copy procedure in may suffer from distorting consciously or unconsciously.Either premeditate artificial destruction Integrity, verity distort it is also possible to transmission communication process in occur mistake, all can be original to the information data of itself Property damage, the information especially acquiring a special sense at some, such as court's proof, department's classified papers, historical document are standby The important contents such as part, once maliciously being distorted, can cause very serious consequence.
Digital speech replicates that to paste/delete and distort be that one section of sound bite in digital speech copies to another voice sheet Duan Zhong, or delete the fragment in one section of voice, it is a kind of tampering methods simply effectively changing voice important information, such as says Shown in bright book accompanying drawing 1,2.Due to there being consistent or similar noise in same section of voice, speaker's vocal print so that human ear very Hardly possible distinguishes to the sound bite after distorting, and detects that the judgement distorted to voice true or false of this form has important reality Meaning.
Content of the invention
The technical problem to be solved is the detection method providing a kind of mutation of DAB waveform, and it can be to giving One section of fixed voice, judges whether it have passed through sound bite and replicate the audio volume control pasted or delete one section of voice and cause It is mutated moreover it is possible to orient the time range that voice is distorted, thus confirming the true or false of voice, to overcome the deficiencies in the prior art.
The present invention is realized in:The detection method of DAB waveform mutation, comprises the following steps:1)Audio frequency is believed Number conversion obtains the audio frequency language spectrum of log-domainY, the audio frequency language spectrum of acquisition is carried out logarithmic transformation, obtains logarithm language spectrumG;2)Enter Row logarithm language spectrum G energy binaryzation calculates;3)Calculate every frame logarithm language spectrumGAudio frequency change coefficientσ t ;4)To audio frequency mutantion line Numberσ t Judged, carried out audio volume control abrupt climatic change and Sudden change region positioning.
Step 1)The described audio frequency language spectrum that audio signal conversion is obtained log-domainY, the audio frequency language spectrum of acquisition is carried out Logarithmic transformation, obtains logarithm language spectrumG,Specifically, the digital audio and video signals for h for lengthyCarry out framing, obtaining frame number isN l , frame length is 2*NMatrix;Add window function and carry out Short Time Fourier Transform, obtaining size isN * N 1 Audio frequency language SpectrumY;Audio frequency language is composedYCarry out logarithmic transformation, obtain logarithm language spectrumG, its size isN * N 1 .
Step 2)Described enters logarithm language spectrumGEnergy binaryzation calculates, and specifically, is first calculated logarithm language spectrumGIn MaximumG max And minimaG min If the energy value of every frame rate isG ti (1tN, 1iNl), by such as Lower formula(1)Calculate energy binary valueδ(t,i),
WhereinλFor threshold factor.
Step 3)Described calculating every frame logarithm language is composedG t Audio frequency change coefficient, specifically, by formula(2)Calculate sound Frequency change coefficientσ t (1≤t≤N 1
Step 4)Described to audio frequency change coefficientσ t Judged, carry out audio volume control abrupt climatic change and Sudden change region is fixed Position, specifically it is assumed that in step 3) in obtained logarithm language spectrum G i-th frameG i And three audio frequency change coefficients of consecutive frameσ i-1 σ i 、σ i+1 If meeting:
Then determine in audio frequency there is audio frequency mutation, the wherein i-th frameG i It is the audio volume control Sudden change region detecting.
Compared with prior art, the present invention is using distorting in splicing voice process signal waveform mutation so that this time The property that the frequency short-time energy of fragment is uprushed, distinguishes, using difference algorithm, audio volume control mutation and the strong signal that editor causes Audio frequency is mutated, and after calculating audio frequency change coefficient, detects that this section audio has duplication to paste/delete and distorts vestige, and accurately The positioning tampering time.The inventive method has preferable detection and positioning function, cannot observe in human ear None- identified, sound spectrograph The waveform mutation of audio frequency can be identified in the case of coming well, be that tampering detection is pasted/deleted in a kind of effective audio dubbing Technology.
Brief description
Fig. 1 is audio data insertion audio fragment schematic diagram;
Fig. 2 is that audio data deletes audio fragment schematic diagram;
Fig. 3 is that waveform mutation schematic diagram at splice point distorted in voice;
Fig. 4 is detection algorithm flow chart;
Fig. 5 is the oscillogram of detection voice;
Fig. 6 is the logarithm language spectrum of detection voice;
Fig. 7 is the audio frequency change coefficient of detection voice.
Specific embodiment
Embodiments of the invention 1:The detection method of DAB waveform mutation,
1)By audio signalyConversion obtains the audio frequency language spectrum of log-domainY:For the digital audio and video signals for h for the lengthy , to audio signalyCarry out framing, every frame length is2 * N(Set N=128);Duplication isl(Setl= 0.5), then Frame numberN 1For:
If framing signal isy i , i=1...Nl, using formula(1)Carry out Discrete Short Time Fourier Transform, obtainY ti, its Middle w (N-m) is window function(It is set as Hamming window function).
Using formula(2)Calculate spectrogram, carry out logarithmic transformation and obtain logarithm language spectrumG i (i=1...Nl), that is,
All log-magnitude values are formed the logarithm language spectrum that matrix is voice signalG, its size isN * N 1
2)Carry out logarithm language spectrumG i Energy binaryzation calculates:Calculate logarithm language spectrum firstG i In maximumG max And minimum ValueG min , for the energy value of every frame rateG i (k),k = 1 ... N, by formula(3)Calculate the energy two-value of frequency Change valueδ (t,i)
WhereinλFor threshold factor(Setλ =0.65), willδThe matrix Δ of composition is defined as short-time energy two-value spectrum,δ (i,k) Value represents for 0iFramekFrequency content energy is low, and 1 is that energy is high;
3)Calculate every frame logarithm language spectrumG i Audio frequency change coefficientσ t :Because imagineering pastes sound bite, in editor Place's audio signal occurs that waveform is mutated, and this operation introduces new radio-frequency component so that log-magnitude spectrum comprises audio frequency mutation The all frequency energy of frame increase suddenly with respect to consecutive frame, therefore, the nonzero value number of this frame short-time energy two-value spectrum should substantially More than consecutive frame;Statistic procedure 2)Middle short-time energy two-value composes the average of every frameσ i , according to formula(4)It is calculated audio frequency mutation Coefficientσ i (1≤iN 1 );
4)To audio frequency change coefficientσ t Judged, carried out audio volume control abrupt climatic change and Sudden change region positioning:Due to language Sound feature, strong voice signal can make the audio frequency change coefficient that detection obtains become big, but this signal has time duration, manually Duplication is pasted the audio frequency change coefficient causing and only can be present in a frame, therefore, distinguishes, using difference algorithm, the sound that editor causes The audio frequency mutation of the mutation of frequency waveform and strong signal, it comprises the concrete steps that, by formula(5)Calculate every frame logarithm language spectrumG i Audio frequency Change coefficientσ t (1≤t≤N1)
Step 4)Described to audio frequency change coefficientσ t Judged, carry out audio volume control abrupt climatic change and Sudden change region is fixed Position, specifically it is assumed that in step 3) in obtained logarithm language and composed the i-th frameG i And three audio frequency change coefficients of consecutive frameσ i-1 σ i 、σ i+1 If meeting:
Then determine in audio frequency there is audio frequency mutation, the wherein i-th frameG i It is the audio volume control Sudden change region detecting.
As shown in figure 5, Fig. 5 is to distort speech waveform figure after deleting part of speech, the wherein dotted line mark moment is sound Frequency clipped position, and this editing trace cannot be gone out by auditory discrimination..As shown in Figure 6 although in the logarithm language spectrum of this section audio No significantly short-time energy mutation, as shown in fig. 7, detecting that this section audio has duplication viscous after calculating audio frequency change coefficient Patch/delete and distort vestige, and it is accurately positioned the time of distorting.
From above-described embodiment as can be seen that the inventive method has preferable detection and positioning function.Human ear None- identified, Sound spectrograph can identify the waveform mutation of audio frequency well in the case of cannot observing out, is that a kind of effective audio dubbing is glued Patch/delete tampering detection technology.

Claims (1)

1. a kind of mutation of DAB waveform detection method it is characterised in that:Comprise the following steps:1) audio signal is converted Obtain the audio frequency language spectrum Y of log-domain, the audio frequency language spectrum of acquisition is carried out logarithmic transformation, obtain logarithm language spectrum G;2) carry out logarithm language Spectrum G energy binaryzation calculates;3) calculate every frame logarithm language and compose GtAudio frequency change coefficient σt;4) to audio frequency change coefficient σtCarry out Judge, carry out audio volume control abrupt climatic change and Sudden change region positioning;
Step 1) described in the audio frequency language spectrum Y that audio signal conversion is obtained log-domain, the audio frequency language of acquisition spectrum is carried out logarithm Conversion, obtains logarithm language spectrum G, specifically, carries out framing for the digital audio and video signals y for h for the length, and obtaining frame number is Nl, frame The matrix of a length of 2*N;Add window function and carry out Short Time Fourier Transform, obtaining size is N*N1Audio frequency language spectrum Y;To audio frequency Language spectrum Y carries out logarithmic transformation, obtains logarithm language spectrum G, and its size is N*N1
Step 2) described in carry out logarithm language spectrum G energy binaryzation calculate, specifically, be first calculated logarithm language compose G in maximum Value GmaxWith minima GminIf the energy value of every frame rate is Gti(1≤t≤N,1≤i≤Nl), energy is calculated by equation below (1) Amount binary value δ (t, i),
Wherein λ is threshold factor;
Step 3) described in calculating every frame logarithm language spectrum GtAudio frequency change coefficient, specifically, by formula (2) calculate audio frequency dash forward Variable coefficient σt(1≤t≤N1);
σ t = Σ i = 1 n δ ( t , i ) n - - - ( 2 ) ;
Step 4) described in audio frequency change coefficient σtJudged, carried out audio volume control abrupt climatic change and Sudden change region positioning, tool Body is it is assumed that in step 3) in obtained logarithm language spectrum G the i-th frame GiAnd three audio frequency change coefficient σ of consecutive framei-1、σi、σi+1, If meeting:
σi>0.85 and | σii-1|*|σii+1|>σi/16
Then determine in audio frequency there is audio frequency mutation, the wherein i-th frame GiIt is the audio volume control Sudden change region detecting.
CN201410285152.0A 2014-06-24 2014-06-24 Detecting method based on digital audio waveform sudden changes Active CN104021791B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410285152.0A CN104021791B (en) 2014-06-24 2014-06-24 Detecting method based on digital audio waveform sudden changes

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410285152.0A CN104021791B (en) 2014-06-24 2014-06-24 Detecting method based on digital audio waveform sudden changes

Publications (2)

Publication Number Publication Date
CN104021791A CN104021791A (en) 2014-09-03
CN104021791B true CN104021791B (en) 2017-02-22

Family

ID=51438515

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410285152.0A Active CN104021791B (en) 2014-06-24 2014-06-24 Detecting method based on digital audio waveform sudden changes

Country Status (1)

Country Link
CN (1) CN104021791B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108831506A (en) * 2018-06-25 2018-11-16 华中师范大学 Digital audio based on GMM-BIC distorts point detecting method and system

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11217076B1 (en) * 2018-01-30 2022-01-04 Amazon Technologies, Inc. Camera tampering detection based on audio and video
CN111863023B (en) * 2020-09-22 2021-01-08 深圳市声扬科技有限公司 Voice detection method and device, computer equipment and storage medium
CN113571091B (en) * 2021-06-30 2024-04-19 青岛海尔科技有限公司 Audio mutation detection method and device for monitoring and household appliance
CN116543796B (en) * 2023-07-06 2023-09-15 腾讯科技(深圳)有限公司 Audio processing method and device, computer equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1585020A (en) * 2004-05-28 2005-02-23 中山大学 Digital audio-frequency anti-distorting method
US7146309B1 (en) * 2003-09-02 2006-12-05 Mindspeed Technologies, Inc. Deriving seed values to generate excitation values in a speech coder
CN101383171A (en) * 2008-10-16 2009-03-11 中山大学 Blind detection method for MP3 audio distortion
CN101562016A (en) * 2009-05-26 2009-10-21 上海大学 Totally-blind digital speech authentication method
CN102592588A (en) * 2012-01-10 2012-07-18 清华大学 Digital audio record integrity detection method
CN103345927A (en) * 2013-07-11 2013-10-09 暨南大学 Processing method for detecting and locating audio time domain tampering

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19736669C1 (en) * 1997-08-22 1998-10-22 Fraunhofer Ges Forschung Beat detection method for time discrete audio signal

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7146309B1 (en) * 2003-09-02 2006-12-05 Mindspeed Technologies, Inc. Deriving seed values to generate excitation values in a speech coder
CN1585020A (en) * 2004-05-28 2005-02-23 中山大学 Digital audio-frequency anti-distorting method
CN101383171A (en) * 2008-10-16 2009-03-11 中山大学 Blind detection method for MP3 audio distortion
CN101562016A (en) * 2009-05-26 2009-10-21 上海大学 Totally-blind digital speech authentication method
CN102592588A (en) * 2012-01-10 2012-07-18 清华大学 Digital audio record integrity detection method
CN103345927A (en) * 2013-07-11 2013-10-09 暨南大学 Processing method for detecting and locating audio time domain tampering

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
《数字音频的真实性鉴定》;邵松年;《中国优秀硕士学位论文全文数据库 信息科技辑》;20101115(第11期);全文 *
《数字音频真实性鉴定的研究》;柳永娟;《中国优秀硕士学位论文全文数据库 信息科技辑》;20110515(第05期);全文 *
《数字音频篡改检测与隐写分析技术研究》;丁琦;《中国博士学位论文全文数据库 信息科技辑》;20120715(第07期);全文 *
《数字音频篡改检测技术的研究》;石倩;《中国优秀硕士学位论文全文数据库 信息科技辑》;20120715(第07期);全文 *
《音频信息篡改检测算法的研究》;路允祥;《中国优秀硕士学位论文全文数据库 信息科技辑》;20120715(第07期);全文 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108831506A (en) * 2018-06-25 2018-11-16 华中师范大学 Digital audio based on GMM-BIC distorts point detecting method and system

Also Published As

Publication number Publication date
CN104021791A (en) 2014-09-03

Similar Documents

Publication Publication Date Title
CN104021791B (en) Detecting method based on digital audio waveform sudden changes
Ratnam et al. Blind estimation of reverberation time
KR101269296B1 (en) Neural network classifier for separating audio sources from a monophonic audio signal
Perez-Gonzalez et al. Automatic gain and fader control for live mixing
EP2808867A1 (en) Transient speech signal encoding method and device, decoding method and device, processing system and computer-readable storage medium
US6718301B1 (en) System for measuring speech content in sound
WO2015196760A1 (en) Microphone array speech detection method and device
US20110246205A1 (en) Method for detecting audio signal transient and time-scale modification based on same
CN103886865A (en) Sound Processing Device, Sound Processing Method, And Program
CN113674763B (en) Method, system, device and storage medium for identifying whistle by utilizing line spectrum characteristics
EP3229487B1 (en) Approach for detecting alert signals in changing environments
Prego et al. A blind algorithm for reverberation-time estimation using subband decomposition of speech signals
CN102610232B (en) Method for adjusting self-adaptive audio sensing loudness
EP1492085A2 (en) Method of reflecting time/language distortion in objective speech quality assessment
CN105100508A (en) Network voice quality estimation method, device and system
US10522160B2 (en) Methods and apparatus to identify a source of speech captured at a wearable electronic device
Narkhede et al. Acoustic scene identification for audio authentication
CN110718229A (en) Detection method for record playback attack and training method corresponding to detection model
US20230386492A1 (en) System and method for suppressing noise from audio signal
CN112233693B (en) Sound quality evaluation method, device and equipment
CN112581975B (en) Ultrasonic voice instruction defense method based on signal aliasing and binaural correlation
CN112750458B (en) Touch screen sound detection method and device
CN104240705A (en) Intelligent voice-recognition locking system for safe box
CN107548007B (en) Detection method and device of audio signal acquisition equipment
CN108877816B (en) QMDCT coefficient-based AAC audio frequency recompression detection method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20201113

Address after: No.3, pingzhong Road, Matou Town, Pingguo City, Baise City, Guangxi Zhuang Autonomous Region

Patentee after: Guangxi Pingguo Runmin Poverty Alleviation Development Co., Ltd

Address before: 550025 science and Technology Department, north campus, Guizhou University, Huaxi, Guizhou, China

Patentee before: Guizhou University

TR01 Transfer of patent right