CN102280103A - Audio signal transient-state segment detection method based on variance - Google Patents

Audio signal transient-state segment detection method based on variance Download PDF

Info

Publication number
CN102280103A
CN102280103A CN201110219767XA CN201110219767A CN102280103A CN 102280103 A CN102280103 A CN 102280103A CN 201110219767X A CN201110219767X A CN 201110219767XA CN 201110219767 A CN201110219767 A CN 201110219767A CN 102280103 A CN102280103 A CN 102280103A
Authority
CN
China
Prior art keywords
frame
state
transient state
transition segment
transient
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201110219767XA
Other languages
Chinese (zh)
Inventor
张涛
周延献
张瑞生
邢东亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University filed Critical Tianjin University
Priority to CN201110219767XA priority Critical patent/CN102280103A/en
Publication of CN102280103A publication Critical patent/CN102280103A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The invention relates to the audio technology, and provides an audio signal detection method capable of effectively reducing redundant detection of low-energy transient-state segment. The technical scheme is that: the audio signal transient-state segment detection method based on variance comprises the following steps: 1, inputting an audio signal, and framing the audio signal, wherein the frame length is N; calculating the average and standard deviation of each frame according to the sample average and standard deviation formula respectively; comparing the average and standard deviation with a threshold to judge whether the frame is a pre-transient state frame; dividing the pre-transient state frame into short signals with length of M; judging the transient state/stable state of the frame again; judging the transient state/stable state of the processed frame; and if the frame is a transient-state frame, marking the position at which the transient state occurs. The method provided by the invention is mainly used for testing an audio signal.

Description

Sound signal transition segment detection method based on variance
Technical field
The present invention relates to Audiotechnica, relate in particular to sound signal transition segment detection side to, specifically relate to sound signal transition segment detection method based on variance.
Background technology
In the low Bit Rate Audio Coding of present high-quality, encryption algorithm generally adopts the piece transform process method, with the windowing method continuous sound signal is divided into audio data block, and each audio data block is carried out time-frequency conversion, quantization encoding, and then store or transmit.Help removing the redundance of signal based on the audio coding algorithm of piece conversion, improve ratio of compression.Yet in the audio coding technology based on piecemeal, the Pre echoes distortion is the problem that is difficult to solution always.The basic reason that the Pre echoes distortion produces is that the temporal resolution deficiency causes the diffusion of quantizing noise in time domain.Especially transformed to frequency domain when carrying out quantization encoding by piecemeal when the transient state in sound signal part, quantizing noise is diffused on the whole transform block, Pre echoes will occur.When lower or ratio of compression is higher when target bit rate, Pre echoes will be more obvious.。
The method that solves the Pre echoes problem at present mainly contains: bit pond, time domain noise-shaped, the switching of length piece, hybrid filter-bank, gain control etc.These methods all are to be prerequisite with accurate detection sound signal transition segment.Sound signal transition segment detection algorithm can be divided three classes: time domain energy detection method, frequency domain energy measuring method and time and frequency zone complex energy detection method.Yet these methods at algorithm complex, accuracy (omission and flase drop) but, exist on temporal resolution and the practicality not enough.For example, " perceptual entropy " algorithm detected transient signal that adopts among the MPEG-2AAC, but its threshold value difference is big, is not easy to use.The time and frequency zone detection algorithm utilizes interblock capacity volume variance and unpredictable degree to carry out transition segment and detects, but has the higher-energy transition segment and can have influence on next transition segment and detect for previous, can cause flase drop like this.Smoothly estimate the deficiency that detection algorithm has remedied above-mentioned algorithm to a certain extent, have practical potentiality.Yet smoothly estimate detection method and can detect the lower transition segment of energy, cause some unnecessary detections.
Summary of the invention
For overcoming the deficiencies in the prior art, a kind of sound signal transition segment detection method based on variance is provided, this method can effectively reduce the redundancy detection of low-yield transition segment.For reaching above-mentioned purpose, the technical scheme that the present invention takes is that the sound signal transition segment detection method based on variance comprises the following steps:
Step 1: import a section audio signal, divide frame to handle sound signal, frame length is N, according to formula:
Sample mean:
X ‾ = 1 N Σ i = 1 N | X i | - - - ( 1 )
The standard deviation formula:
σ = Σ i = 1 N ( | X i | - X ‾ ) 2 N - 1 - - - ( 2 )
X wherein iI sample value in the expression current block, | X i| expression is to X iAsk absolute value, N is the sample value sum in the current block,
Figure BDA0000080621250000021
Represent the mean value of current block sample value, calculate the average of every frame respectively
Figure BDA0000080621250000022
And standard deviation sigma, compare with threshold value SD_TH then, as σ f SD_TH, judge that this frame is pre-transient state frame; As σ p SD_TH, judge that this frame is the stable state frame;
Step 2: pre-transient state frame is divided into the short signal section that length is M, each short signal section adopt smoothly estimate, a kind of detection method in the perceptual entropy, time domain energy, frequency domain energy detects the state of each short signal section of mark;
Step 3: according to the state of the short signal section that obtains in the step 2, secondary is adjudicated the transient state/lower state of this frame;
Step 4: according to the judged result of step 1 and step 3, judge the transient state/lower state of handled frame, if the transient state frame, and the position of mark transient state generation.
Described method is refined as: divide frame to handle input audio signal, and frame length N=1024, at first basis (1) formula is calculated the mean value of a frame signal sample
Figure BDA0000080621250000023
According to (2) formula basis of calculation difference σ, carry out then comparing with threshold value SD_TH, SD_TH=2.8 as σ f SD_TH, judges that this frame is pre-transient state frame; As σ p SD_TH, judge that this frame is the stable state frame;
When this frame is pre-transient state frame, carry out the frame data piecemeal, whether detect each piece with smooth detection technique is transition segment:
At first, these frame data are carried out piecemeal, block length M=128 presses each small-signal section respectively
Figure BDA0000080621250000024
Ask the smooth TFM of estimating of time domain and the smooth FFM of estimating of frequency domain;
Secondly, carry out that time domain is smooth estimates judgement, TFM f T_TH, T_TH=0.12 judge that this section is transition segment, and the mark transition segment, otherwise carry out that frequency domain is smooth estimates judgement, FFM p F_TH, F_TH=0.38, judge that this section is transition segment, and mark, otherwise carry out time domain-frequency domain synthesis judgement
Figure BDA0000080621250000025
TF_TH=1.4 judges this section for transition segment, and mark otherwise be the stable state section;
Conclude according to whether containing transition segment in this pre-transient state frame whether this frame is the transient state frame at last, and mark the position of each transition segment.
The present invention has following technique effect:
Transition segment based on variance proposed by the invention detects general framework and compares with the existing detection method of independent employing, produces unnecessary detection hardly on the lower transition segment of energy.And can with than the littler signal segment of frame as detected object, improve temporal resolution, the actual conditions that testing result more can approach signal for the follow-up work in the coding has reduced certain complexity, and have improved coding quality.In addition, this general framework has advantages such as accuracy in detection height, algorithm be simple.
Description of drawings
Fig. 1 is based on the transition segment testing process figure of this method;
The transition segment that Fig. 2 is based on this method detects test result figure, and (a) be time domain waveform among the figure, (b) estimates the fast transient state situation of detection signal for smooth, (c) is that standard deviation-smooth is estimated the fast transient state situation of comprehensive detection signal.
Embodiment
At this redundancy detection, the present invention proposes a kind ofly to detect sound signal transition segment general framework based on variance, when the lower but sample value fluctuation of whole frame signal energy is big, utilizes the variance of putting in order frame signal to reduce redundancy detection.
The most information that the transient phenomena of signal and irregular structure (as the saltus step of signal waveform, interruption, quick oscillation etc.) are usually carrying signal, the signal that comprises this transient phenomena and irregular structure, be commonly referred to as transient signal, because signal becomes duality relation at frequency domain with time domain, be that signal suddenlys change on time domain, it is smooth just to become on frequency spectrum.Otherwise signal is smooth on time domain, the sudden change of corresponding frequency spectrum.According to the scope of FM, FM the sudden change and smooth on good judgement can be arranged.Can judge promptly whether signal is transient signal, but general frequency domain transition segment detection algorithm can detect the lower transient signal of energy, cause unnecessary detection, main cause is can be detected as transient signal when the fluctuation of the lower but sample value of whole frame signal energy is big.But for standard deviation, the sample difference is just smaller comparatively speaking, can it be excluded by appropriate threshold value is set.Thereby whole frame is done variance just can suitably solve the problem that low-yield transition segment is detected.
Standard deviation is a kind of tolerance of one group of data mean value degree of scatter.A bigger standard deviation is represented between most of numerical value and its mean value to differ greatly; A less standard deviation represents these numerical value near mean value.
Sample mean:
X ‾ = 1 N Σ i = 1 N | X i | - - - ( 1 )
The standard deviation formula:
σ = Σ i = 1 N ( | X i | - X ‾ ) 2 N - 1 - - - ( 2 )
X wherein iI sample value in the expression current block, | X i| expression is to X iAsk absolute value, N is the sample value sum in the current block,
Figure BDA0000080621250000033
Represent the mean value of current block sample value.
The present invention can have a distinct increment in the little unnecessary context of detection of transition segment of elimination energy to transition segment detection algorithm commonly used.Algorithm is implemented according to Fig. 1 process flow diagram, and operation steps is as follows:
Step 1: import a section audio signal, divide frame to handle (frame length is N) sound signal, according to formula (1), (2) calculate the average of every frame respectively
Figure BDA0000080621250000034
And standard deviation sigma, compare with threshold value SD_TH then, as σ f SD_TH, judge that this frame is pre-transient state frame; As σ p SD_TH, judge that this frame is the stable state frame.
Step 2: pre-transient state frame is divided into the short signal section that length is M, and each short signal section can detect with detection method commonly used, such as: smoothly estimate perceptual entropy, time domain energy, frequency domain energy etc.The state of each short signal section of mark.
Step 3: according to the state of the short signal section that obtains in the step 2, secondary is adjudicated the transient state/lower state of this frame.
Step 4: according to the judged result of step 1 and step 3, judge the transient state/lower state of handled frame, if the transient state frame, and the position of mark transient state generation.
Further describe the present invention below in conjunction with drawings and Examples.
According to Fig. 1 transition segment testing process figure, divide frame to handle (frame length N=1024) input audio signal, at first calculate the mean value of a frame signal sample according to (1) formula
Figure BDA0000080621250000035
According to (2) formula basis of calculation difference σ, carry out then comparing with threshold value SD_TH (SD_TH=2.8).As σ f SD_TH, judge that this frame is pre-transient state frame; As σ p SD_TH, judge that this frame is the stable state frame.
When this frame is pre-transient state frame, carry out the frame data piecemeal, whether detect each piece with smooth detection technique is transition segment.At first, these frame data are carried out piecemeal, block length M=128 presses each small-signal section respectively
Figure BDA0000080621250000041
Ask the smooth TFM of estimating of time domain and the smooth FFM of estimating of frequency domain.
Secondly, carry out that time domain is smooth estimates judgement, TFM f T_TH (T_TH=0.12) judges that this section is transition segment, and the mark transition segment, otherwise carrying out that frequency domain is smooth estimates judgement, FFM p F_TH (F_TH=0.38) judges that this section is transition segment, and mark, otherwise carry out time domain-frequency domain synthesis judgement (TF_TH=1.4) judge that this section is transition segment, and mark.Otherwise be the stable state section.
Conclude according to whether containing transition segment in this pre-transient state frame whether this frame is the transient state frame at last, and mark the position of each transition segment.
Carry sound signal smlsplas.wav as test signal with MATLAB, the time domain of signal is shown in Fig. 2-a, earlier with signal with the smooth method detected transient section of estimating, the result is shown in Fig. 2-b, and then signal estimated comprehensive method detected transient section with variance-smooth, the result is shown in Fig. 2-c.By comparative analysis Fig. 2-b and Fig. 2-c, find that variance-smooth estimates detection algorithm under the smaller situation of transition segment energy, performance is more outstanding.The transient signal lower to those energy, the unnecessary transition segment context of detection effect that does not influence tonequality is obvious eliminating.In application, can reduce the complexity of follow-up work in the scrambler like this, improve code efficiency.
Employing experimentizes to 5 kinds of audio-frequency test signals respectively based on standard deviation-smooth sound signal transition segment detection algorithm of estimating, and the block of detection all is 100 (sample number is 100*128), and testing result is as shown in table 1.
The contrast of table 1 transition segment testing result
Figure BDA0000080621250000043

Claims (2)

1. the sound signal transition segment detection method based on variance is characterized in that, may further comprise the steps:
Step 1: import a section audio signal, divide frame to handle sound signal, frame length is N, according to formula: sample mean:
X ‾ = 1 N Σ i = 1 N | X i | - - - ( 1 )
The standard deviation formula:
σ = Σ i = 1 N ( | X i | - X ‾ ) 2 N - 1 - - - ( 2 )
X wherein iI sample value in the expression current block, | X i| expression is to X iAsk absolute value, N is the sample value sum in the current block,
Figure FDA0000080621240000013
Represent the mean value of current block sample value, calculate the average of every frame respectively
Figure FDA0000080621240000014
And standard deviation sigma, compare with threshold value SD_TH then, as σ f SD_TH, judge that this frame is pre-transient state frame; As σ p SD_TH, judge that this frame is the stable state frame;
Step 2: pre-transient state frame is divided into the short signal section that length is M, each short signal section adopt smoothly estimate, a kind of detection method in the perceptual entropy, time domain energy, frequency domain energy detects the state of each short signal section of mark;
Step 3: according to the state of the short signal section that obtains in the step 2, secondary is adjudicated the transient state/lower state of this frame;
Step 4: according to the judged result of step 1 and step 3, judge the transient state/lower state of handled frame, if the transient state frame, and the position of mark transient state generation.
2. the method for claim 1 is characterized in that, described method is refined as:
Divide frame to handle input audio signal, frame length N=1024, at first basis (1) formula is calculated the mean value of a frame signal sample
Figure FDA0000080621240000015
According to (2) formula basis of calculation difference σ, carry out then comparing with threshold value SD_TH, SD_TH=2.8 as σ f SD_TH, judges that this frame is pre-transient state frame; As σ p SD_TH, judge that this frame is the stable state frame;
When this frame is pre-transient state frame, carry out the frame data piecemeal, whether detect each piece with smooth detection technique is transition segment:
At first, these frame data are carried out piecemeal, block length M=128 presses each small-signal section respectively
Figure FDA0000080621240000016
Ask the smooth TFM of estimating of time domain and the smooth FFM of estimating of frequency domain;
Secondly, carry out that time domain is smooth estimates judgement, TFM f T_TH, T_TH=0.12 judge that this section is transition segment, and the mark transition segment, otherwise carry out that frequency domain is smooth estimates judgement, FFM p F_TH, F_TH=0.38, judge that this section is transition segment, and mark, otherwise carry out time domain-frequency domain synthesis judgement
Figure FDA0000080621240000017
TF_TH=1.4 judges this section for transition segment, and mark otherwise be the stable state section;
Conclude according to whether containing transition segment in this pre-transient state frame whether this frame is the transient state frame at last, and mark the position of each transition segment.
CN201110219767XA 2011-08-02 2011-08-02 Audio signal transient-state segment detection method based on variance Pending CN102280103A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201110219767XA CN102280103A (en) 2011-08-02 2011-08-02 Audio signal transient-state segment detection method based on variance

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110219767XA CN102280103A (en) 2011-08-02 2011-08-02 Audio signal transient-state segment detection method based on variance

Publications (1)

Publication Number Publication Date
CN102280103A true CN102280103A (en) 2011-12-14

Family

ID=45105518

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110219767XA Pending CN102280103A (en) 2011-08-02 2011-08-02 Audio signal transient-state segment detection method based on variance

Country Status (1)

Country Link
CN (1) CN102280103A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104143341A (en) * 2013-05-23 2014-11-12 腾讯科技(深圳)有限公司 Sonic boom detection method and device
CN107068155A (en) * 2017-01-23 2017-08-18 天津大学 A kind of temporary stable state decision method of multistage audio based on variance and time domain peak
CN110503973A (en) * 2019-08-28 2019-11-26 浙江大华技术股份有限公司 Audio signal Transient Noise suppressing method, system and storage medium
CN115185011A (en) * 2022-08-15 2022-10-14 河北雄安太芯电子科技有限公司 Target detection method applicable to terahertz detection

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080004873A1 (en) * 2006-06-28 2008-01-03 Chi-Min Liu Perceptual coding of audio signals by spectrum uncertainty
CN101521010A (en) * 2008-02-29 2009-09-02 华为技术有限公司 Coding and decoding method for voice frequency signals and coding and decoding device
CN101751928A (en) * 2008-12-08 2010-06-23 扬智科技股份有限公司 Method for simplifying acoustic model analysis through applying audio frame frequency spectrum flatness and device thereof
CN101847413A (en) * 2010-04-09 2010-09-29 北京航空航天大学 Method for realizing digital audio encoding by using new psychoacoustic model and quick bit allocation

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080004873A1 (en) * 2006-06-28 2008-01-03 Chi-Min Liu Perceptual coding of audio signals by spectrum uncertainty
CN101521010A (en) * 2008-02-29 2009-09-02 华为技术有限公司 Coding and decoding method for voice frequency signals and coding and decoding device
CN101751928A (en) * 2008-12-08 2010-06-23 扬智科技股份有限公司 Method for simplifying acoustic model analysis through applying audio frame frequency spectrum flatness and device thereof
CN101847413A (en) * 2010-04-09 2010-09-29 北京航空航天大学 Method for realizing digital audio encoding by using new psychoacoustic model and quick bit allocation

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104143341A (en) * 2013-05-23 2014-11-12 腾讯科技(深圳)有限公司 Sonic boom detection method and device
WO2014187095A1 (en) * 2013-05-23 2014-11-27 Tencent Technology (Shenzhen) Company Limited Method and device for detecting noise bursts in speech signals
CN104143341B (en) * 2013-05-23 2015-10-21 腾讯科技(深圳)有限公司 Sonic boom detection method and device
CN107068155A (en) * 2017-01-23 2017-08-18 天津大学 A kind of temporary stable state decision method of multistage audio based on variance and time domain peak
CN110503973A (en) * 2019-08-28 2019-11-26 浙江大华技术股份有限公司 Audio signal Transient Noise suppressing method, system and storage medium
CN110503973B (en) * 2019-08-28 2022-03-22 浙江大华技术股份有限公司 Audio signal transient noise suppression method, system and storage medium
CN115185011A (en) * 2022-08-15 2022-10-14 河北雄安太芯电子科技有限公司 Target detection method applicable to terahertz detection
CN115185011B (en) * 2022-08-15 2024-06-18 河北雄安太芯电子科技有限公司 Target detection method applicable to terahertz detection

Similar Documents

Publication Publication Date Title
Song et al. Study on the decoupling relationship between CO2 emissions and economic development based on two-dimensional decoupling theory: A case between China and the United States
CN101770779B (en) Noise spectrum tracking in noisy acoustical signals
CN102323577B (en) High-resolution radar dual-threshold detection method based on energy accumulation
EP3389048B1 (en) Method of and device for encoding a high frequency signal relating to bandwidth expansion in speech and audio coding
CN108847217A (en) A kind of phonetic segmentation method, apparatus, computer equipment and storage medium
RU2011101617A (en) DEVICE AND METHOD FOR CALCULATING THE NUMBER OF EXTENDING SPECTRA
EP2159788A1 (en) A voice activity detecting device and method
US20070027681A1 (en) Method and apparatus for extracting voiced/unvoiced classification information using harmonic component of voice signal
CN102280103A (en) Audio signal transient-state segment detection method based on variance
CN101689369A (en) Method and apparatus for coding and decoding amplitude of partial
CN101521010B (en) Coding and decoding method for voice frequency signals and coding and decoding device
CN101996640B (en) Frequency band expansion method and device
CN101727906A (en) Method and device for coding and decoding of high-frequency band signals
CN103632682A (en) Audio feature detection method
CN103262158A (en) Device and method for postprocessing decoded multi-hannel audio signal or decoded stereo signal
CN103578479A (en) Speech intelligibility measuring method based on auditory masking effect
CN104299614A (en) Decoding method and decoding device
CN101004916B (en) Anti channel error code method of line spectrum pair parameter of voice coder
CN101009098B (en) Sound coder gain parameter division-mode anti-channel error code method
CN101308651B (en) Detection method of audio transient signal
EP2793228A1 (en) Stereo encoding method, stereo encoding device, and encoder
CN102982807A (en) Method and system for multi-stage vector quantization of speech signal LPC coefficients
CN111861121A (en) Abnormal electricity utilization judgment method based on user electricity fluctuation and line loss fluctuation analysis
CN105070297B (en) A kind of MP3 audio compressions history detection method
CN101894557B (en) Method for discriminating window type of AAC codes

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20111214