CN109658955A - Sonic boom detection method and device - Google Patents

Sonic boom detection method and device Download PDF

Info

Publication number
CN109658955A
CN109658955A CN201910012237.4A CN201910012237A CN109658955A CN 109658955 A CN109658955 A CN 109658955A CN 201910012237 A CN201910012237 A CN 201910012237A CN 109658955 A CN109658955 A CN 109658955A
Authority
CN
China
Prior art keywords
audio frame
adjacent
sonic boom
white noise
class
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910012237.4A
Other languages
Chinese (zh)
Other versions
CN109658955B (en
Inventor
曾亮凯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
HUANHONG ELECTRONIC (KUNSHAN) Co Ltd
Universal Global Technology Kunshan Co Ltd
Original Assignee
HUANHONG ELECTRONIC (KUNSHAN) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by HUANHONG ELECTRONIC (KUNSHAN) Co Ltd filed Critical HUANHONG ELECTRONIC (KUNSHAN) Co Ltd
Priority to CN201910012237.4A priority Critical patent/CN109658955B/en
Publication of CN109658955A publication Critical patent/CN109658955A/en
Application granted granted Critical
Publication of CN109658955B publication Critical patent/CN109658955B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Soundproofing, Sound Blocking, And Sound Damping (AREA)

Abstract

The invention proposes a kind of sonic boom detection methods, comprising: is directed to a sound source signal, acquirement is formed by multiple audio frames sequentially in time in a time domain;It is sampled for each audio frame;A sampling amplitude of each sampling result in each audio frame is compared with a default sampling amplitude respectively;According to the comparison result of each audio frame, to confirm that each audio frame belongs to white noise position or class sonic boom position;And whether judge in those audio frames according to the adjacent white noise position of time sequencing appearance, adjacent class sonic boom position and adjacent white noise position, for confirming whether the sound source signal sonic boom occurs, wherein when an audio frame at least one audio frame adjacent thereto is white noise position, then its corresponding audio frame belongs to adjacent white noise position, when an audio frame at least one audio frame adjacent thereto is class sonic boom position, then its corresponding audio frame belongs to adjacent class sonic boom position.

Description

Sonic boom detection method and device
Technical field
This application involves multi-media processing technical field more particularly to a kind of sonic boom detection method and device.
Background technique
Due to audio generation, processing, transmission, storage and etc. complexity, some music occur in audio " to be lost ?.Sonic boom is a kind of relatively common distortion phenomenon, and so-called sonic boom refers to In musical waveform there is of short duration pulse, sound like be " clatter " a sound, this sound can bring poor to hearer Experience.
Thus in order to smoothly play audio file on electronic product, it is necessary to detect in audio file that there are sonic booms Part, and then eliminated.In general, detection sonic boom is dependent on human ear mostly to judge, it is therefore possible to many problems, example occurs Such as: hearer can not do the test of long-time wide scope, and hearing sensitivity is bad, or the subjective ingredient too inconveniences such as again.
Summary of the invention
In view of this, the invention proposes a kind of sonic boom detection devices, it is characterised in that the device includes: a control Module, a sampling module and comparison module, equal electrical communication between the control module and the sampling module and the comparison module, In the sampling module be used for be directed to a sound source signal, acquirement be formed by sequentially in time in a time domain multiple audio frames with And it is sampled for each audio frame;The comparison module is used for respectively by each sampling result in each audio frame One sampling amplitude is compared with a default sampling amplitude, and is confirmed often according to the comparison result of each audio frame One audio frame belongs to white noise position or class sonic boom position;And the control module for judge in those audio frames whether according to There is adjacent white noise position, adjacent class sonic boom position and adjacent white noise position in time sequencing, for confirming the sound source signal Whether sonic boom is occurred, wherein then it is corresponding when an audio frame at least one audio frame adjacent thereto is white noise position Audio frame belongs to adjacent white noise position, when an audio frame at least one audio frame adjacent thereto is class sonic boom position, then Its corresponding audio frame belongs to adjacent class sonic boom position.
The application also proposed a kind of sonic boom detection method, it is characterised in that the described method includes: it is directed to a sound source signal, Acquirement is formed by multiple audio frames sequentially in time in a time domain;It is sampled for each audio frame;Respectively will One sampling amplitude of each sampling result in each audio frame is compared with a default sampling amplitude;According to each The comparison result of audio frame, to confirm that each audio frame belongs to white noise position or class sonic boom position;And judge those sounds Whether according to time sequencing there is adjacent white noise position, adjacent class sonic boom position and adjacent white noise position in frequency frame, for Confirm whether the sound source signal sonic boom occurs, wherein when an audio frame at least one audio frame adjacent thereto is white noise position When, then its corresponding audio frame belongs to adjacent white noise position, when an audio frame at least one audio frame adjacent thereto is class When sonic boom position, then its corresponding audio frame belongs to adjacent class sonic boom position.
Therefore, the sonic boom detection method and device proposed through the invention, it is expected to free hearer and be difficult to long-time, wide model It encloses, be absorbed in and listen to the predicament whether sonic boom generates.
In order to be further understood that technology, method and effect of the invention taken to reach set purpose, please refer to Below in connection with detailed description of the invention and attached drawing, it is believed that the purpose of the present invention, feature and feature, when can thus be able to deeply and It is specific to understand, however attached drawing only provides reference and description and is used, and is not intended to limit the present invention.
Detailed description of the invention
For the clearer technical solution illustrated in the embodiment of the present invention, will make below to required in embodiment description Attached drawing is briefly described, it should be apparent that, drawings in the following description are only some embodiments of the invention, for For those of ordinary skill in the art, under the premise of not paying excessive make great efforts, it can also be obtained according to these attached drawings others Attached drawing.Wherein,
The waveform of an audio file in the embodiment of the present invention is presented in Fig. 1, wherein there is the part of sonic boom;
Fig. 2 is presented according to the sound exports coding in Fig. 1 into audio file;
The schematic diagram of a typical sonic boom waveform is presented in Fig. 3;
The flow chart of sonic boom detection method depicted in one embodiment of the invention is presented in Fig. 4;
The block diagram of sonic boom detection device depicted in one embodiment of the invention is presented in Fig. 5.
Specific embodiment
Various exemplary embodiments will be more fully described referring to attached drawing below, show some illustrative realities in the accompanying drawings Apply example.However, concept of the present invention may embody in many different forms, and it should not be construed as limited by example set forth herein The property shown embodiment.Specifically, these exemplary embodiments are provided in order that making the present invention will be to be detailed and complete, and it will be to Those skilled in the art sufficiently convey the scope of concept of the present invention.In the various figures, similar number indicates similar assembly always.
Will be appreciated that " although first ", " second ", the terms such as " third " can be used to describe in this article various assemblies or Component, but these components or component should not be limited by these terms.These terms only to distinguish a component or component with Another component or component.Therefore, the first assembly or component of following the discussion are referred to alternatively as in the case where not departing from teachings of the present invention Second component or the second component.
Electronic product may generate sonic boom during certain operations, these modes of operation for example including booting, shutdown, System enters standby, system from standby middle wake-up and power supply hit etc..
Referring to Figure 1, the waveform that an audio file in the embodiment of the present invention is presented, wherein 0.035 second in the time domain (sec) occurs sonic boom at.Fig. 1 longitudinal axis is to represent amplitude, is indicated with dB (i.e. multiple), conversion formula YdB=20log (X), it is analyzed with 16-bit pulse code modulation data pcm data, amplitude maximum value 0x7fff is 0dB, when definition is pre- If when amplitude 0x40, about 512 times that 0x7fff is 0x40 can be calculated.When knowing 0x40 is 1/512 times of 0x7fff, It is represented by -54dB.
The file original of this sound source signal is recorded with the file format of wav, in the middle section of this figure, should be originally Sonic boom has occurred in the sound output of white noise (white noise), and spike shown in the middle section Fig. 1 is presented in audio file.It can be with Observe, at 0.045 second (sec) at 0.035 second (sec), occur continuing 10 milliseconds (ms) of sonic boom (pop noise)。
Fig. 2 is referred to, is presented according to the audio coding in Fig. 1 into wav file;Then, this wav file is compiled with 16 After volume device ultraedit is opened, it can be observed wherein to have one piece of data this sound source signal to be presented with 44.1kHz sampling frequency, Pulse code modulation data (pcm data) after left and right sound channels sample, the numerical value of intensity of sound is with 16 2 complement code tables Show, wherein 0x7fff represents the boundary value of positive value, and 0x8000 represents the boundary value of negative value.Later, to pulse code modulation data It is analyzed, the sound of obvious nonwhite noise occurs from the 1st data of the 1st data to the line number 1f80h of line number 18a0h Intensity.There is this segment data length L/R (0x1f7-0x18a+1) * 8+2=882 to sample number.This just with sonic boom shown in Fig. 1 Consistent (44.1k*0.01*2) is presented for 10 milliseconds (ms) when part accounts for.
A typical sonic boom waveform diagram is presented in Fig. 3, and horizontal axis representative contains the temporal characteristics of sonic boom, and the longitudinal axis represents sonic boom Strength characteristic.Deduced according to common sonic boom data analysis: the sonic boom waveform time length T2 often occurred is typically in the range of 2.5 millis Between second and 0.4 second, the present invention, which focuses on, detects more serious sonic boom, i.e., should not have sound, the only shape of white noise originally The case where producing sonic boom under condition.So respectively defining the white noise (white for accounting for time span T1 and T3 before and after T2 noise).In this way, which the physics that waveform shown in Fig. 3 integrally represents is contained to anticipate can be quiet should only have white noise originally Under T1+T2+T3 time span, the sonic boom of duration T 2 is produced.It therefore, can be to adjacent class sonic boom position (Consecutive Candidate pop noise section, CCPS) and adjacent white noise position (Consecutive white noise Section, CWS) following judgement is carried out, representing if meeting the following conditions Formulas I herein has sonic boom generation:
(0.4s>(T2 for CCPS)>2.5ms)&(T1 for(CWS before CCPS)>2s)&(T3 for(CWS After CCPS) > 2s) it is true (conditional I)
In one embodiment, CCPS can be defined as: being formed by multiple positions sequentially in time and occurs more than 1 class Sonic boom position;CWS can be defined as: being formed by multiple positions sequentially in time and occurs more than 1 white noise position.
However, in another embodiment, CCPS can be defined as: formed continuous 2 positions are equal sequentially in time There is class sonic boom position;CWS can be defined as: there is white noise position in formed continuous 2 positions sequentially in time.? In other embodiments, CCPS be can be defined as: forming continuous N number of position sequentially in time and occurs n class sonic boom portion altogether Position, CWS can be defined as: continuous N number of position formed sequentially in time occurs n white noise position altogether, wherein n can be 3, 4,5...N-1。
In one embodiment, white noise position (White noise section, WS) can be defined as: for each Audio frame is sampled (quantity of sampling quantity is such as 50), by the sampling absolute value of amplitude of this 50 sampling results and a default amplitude Value (for example it is -54dB that pcm data, which is 0x40 or db value) compares, to sample the number that absolute value of amplitude is greater than default amplitude Amount forms one first quantity, and the position is white noise line if first quantity is less than or equal to a default fiducial value (such as 10) Position;On the other hand, class noise position (Candidate pop noise section (CCPS)) can be defined as: for each Audio frame is sampled (quantity of sampling quantity is such as 50), by the sampling absolute value of amplitude of this 50 sampling results and a default amplitude Value (for example it is -54dB that pcm data, which is 0x40 or db value) compares, and statistical sampling amplitude is greater than the sampling of default amplitude Quantity, to form one first quantity, the position is class noise position if the first quantity is greater than a default fiducial value (such as 10).
[first embodiment detected about sonic boom]
Fig. 4 is referred to, the flow chart of sonic boom detection method depicted in one embodiment of the invention is presented.As shown in figure 4, quick-fried Sound detection method can comprise the following steps that
One section of sound source signal to be detected is divided into multiple audio frames in chronological order in a time domain by step S401. As an embodiment of the present invention, in the present invention, one section of sound source signal to be detected is divided into multiple audio frames can include: Sound source signal to be detected is divided by multiple audio frames using sliding window mode, wherein when dividing audio frame according to One setting translational length of every movement divides an audio frame, and the length of the audio frame divided in one section of sound source signal of detection Window width is slided for setting.If setting sliding window width as 40 milliseconds (ms) to set translational length as 5 milliseconds (ms), The then sound source signal of one section 100 millisecond (ms) durations to be detected for one, marks off 13 audio frame f, is respectively as follows: the 1st A audio frame f01:0_40ms, the 2nd audio frame f02:5ms_45ms;3rd audio frame f01:10ms_50ms ... the last one Audio frame f13:60ms_100ms.
Step S402 is sampled for each audio frame;Number of sampling in the present invention, in each audio frame Are as follows: fl*fs, fl are audio frame length, and fs is default sampling frequency.As an embodiment of the present invention, fl can be between 1 milli Between second to 0.4 second, fs can be between 16kHz and 44.1kHz.When being directed to each audio frame, fl is 40 milliseconds, fs When for 16kHz, number of sampling is then 0.04*16000=640.
Among step S403, each sampling in each audio frame, all number of sampling has a corresponding sampling Amplitude.In the present embodiment, a sampling amplitude of each sampling result in each audio frame is preset with one respectively Sampling amplitude is compared;Since sonic boom belongs to of short duration pulse.Default sampling amplitude can be set as between human ear hearing Near the boundary value of intensity of sound in one's power.
In one embodiment, presetting sampling amplitude can be determined by the sampling amplitude of multiple audio frames;Another In a embodiment, default sampling amplitude can also be determined by the short-time energy difference between two audio frames;In another implementation In example, default sampling amplitude can be determined by low-frequency range the average energy value of all audio frames, application of the invention not office It is limited to this.It therefore, can be according to the comparison result of each audio frame, to confirm that each audio frame belongs to white noise position (White noise section) or class sonic boom position (candidate pop noise section).
In one embodiment, if a sampling amplitude of sampling result is greater than the default sampling amplitude in an audio frame When one first quantity N1 is less than a default fiducial value, it can confirm that the audio frame is white noise position, wherein first quantity N1 Can be between 5-fs, this is depending on different applications.It is somebody's turn to do if a sampling amplitude of sampling result is greater than in an audio frame When one first quantity N1 of default sampling amplitude is not less than (being greater than or equal to) default fiducial value, the audio frame can be confirmed For class sonic boom position.It is with individual audio frame is sampled among sound source signal strength information above to determine single a audio frame Whether in white noise position or class sonic boom position.
Among step S404, it is also contemplated that the strength information that sampling is implied from the multiple audio frames of sound source signal is to sentence Break in those audio frames and whether according to time sequencing adjacent white noise position (consecutive white noise occurs Section), adjacent class sonic boom position (consecutive candidate pop noise section) and adjacent white noise Position (consecutive white noise section), for confirming whether the sound source signal sonic boom occurs.
In one embodiment, when an audio frame and at least one audio frame adjacent thereto are white noise position, than If the sampling frequency of the 2nd audio frame f02 and the 3rd audio frame f03 is 50, and in the 2nd audio frame f02 and the 3rd audio The sampling amplitude of 50 sampling results in frame f03, after counting, the first quantity N1 greater than default sampling amplitude is respectively 8 and 9, when respectively less than one default fiducial value (for example being 10), then the 2nd audio frame f02 and the 3rd audio frame f03 is determined As white noise position.Since the 2nd audio frame f02 and the 3rd audio frame f03 is white noise position, then audio frame f02 and F03 is attributed to the same adjacent white noise position, 45 milliseconds (ms) when accounting for altogether.
In one embodiment, when an audio frame at least one audio frame adjacent thereto is class sonic boom position, such as the The sampling frequency of 11 audio frame f11 and the 12nd audio frame f12 is 50, and in the 11st audio frame f11 and the 12nd audio The sampling amplitude of 50 sampling results in frame f12, the first quantity N1 that default sampling amplitude is greater than after counting are respectively 11 and 12, when being all larger than equal to default fiducial value (for example being 10), then the 11st audio frame f11 and the 12nd audio frame f12 It is all determined as class sonic boom position.Since the 11st audio frame f11 and the 12nd audio frame f12 is class sonic boom position, then Audio frame f11 and f12 are attributed to the same adjacent class sonic boom position, 45 milliseconds (ms) when accounting for altogether.
Further, since the waveform of sonic boom is similar to an of short duration pulse, present invention contemplates that multiple audio frames are implied with Temporal characteristics about sonic boom.When there is adjacent white noise position, adjacent class sonic boom portion according to time sequencing in those audio frames It is further comprising the steps of: to judge the previous adjacent white noise position at adjacent class sonic boom position when position and adjacent white noise position One first total time length T1 whether it was greater than one first preset time shared by the corresponding audio frame of institute;Judge adjacent class sonic boom position Whether fallen in a preset time section one second total time length T2 shared by the corresponding audio frame of institute;And judge that adjacent class is quick-fried Whether be greater than third total time length T3 shared by the corresponding audio frame of the latter of line position adjacent white noise position institute this One preset time.
In one embodiment, when first total time length T1 and third total time length T3 to be all larger than this first pre- If the time, and when second total time length T2 is fallen in the preset time section, confirm that sonic boom occurs in the sound source signal.
In one embodiment, first preset time be 2 seconds, the preset time section between 2.5 milliseconds to 0.4 second it Between.In other examples, first preset time and the preset time section can be designated as other values, so that phase The combined waveform of the threes such as adjacent white noise position, adjacent class sonic boom position and adjacent white noise position is more similar to an of short duration arteries and veins Punching.
[second embodiment detected about sonic boom]
Fig. 5 is referred to, the block diagram of sonic boom detection device 500 depicted in one embodiment of the invention is presented.Such as Fig. 5 institute Show, sonic boom detection device 500 includes: a sampling module 501, comparison module 502 and a control module 503, above three module Equal electrical communication each other.Sonic boom detection device 500 can run among a server apparatus.
The server apparatus includes: processor, storage device, input and output controller, network communication interface.Said modules Permeable bus is communicated.
Multiple program modules are stored in the present embodiment, in storage device, for example, operating system, network communication module and Application program.
Processor can carry out the various of execute server equipment with the various modules in the application program in read storage device Functional application and data processing.Processor in the present embodiment can be one, or it is multiple, it can be CPU, place Manage units/modules and ASIC etc..
Input and output controller has display and input equipment, for completing the input, output and display of related data Deng.
Operating system includes but is not limited to: Android operation system, Symbian operating system, Windows operating system, (SuSE) Linux OS etc..
Application program for realizing disclosed herein sonic boom detection device 500 and all modules that should have.
Sampling module 501 is used to be directed to a sound source signal, and acquirement is formed by multiple sequentially in time in a time domain It audio frame and is sampled for each audio frame f.
Comparison module 502, for sampling amplitude and one for one of each sampling result in each audio frame f respectively Default sampling amplitude is compared, and it is white according to the comparison result of each audio frame f to confirm that each audio frame f belongs to Noise position or class sonic boom position.
Control module 503, for whether judging in those audio frames according to the adjacent white noise position of time sequencing appearance, phase Adjacent class sonic boom position and adjacent white noise position, for confirming whether the sound source signal sonic boom occurs, wherein each audio frame f When at least one adjacent audio frame is white noise position, then its corresponding audio frame belongs to adjacent white noise position, Mei Yiyin When at least one adjacent audio frame of frequency frame f is class sonic boom position, then its corresponding audio frame belongs to adjacent class sonic boom position.
In one embodiment, which is the sampling amplitude that white noise position refers to sampling result in the audio frame Greater than default one first quantity for sampling amplitude less than a default fiducial value.
In one embodiment, which judges the previous adjacent white noise position institute at adjacent class sonic boom position One first total time length T1 whether it was greater than one first preset time shared by corresponding audio frame;Judge adjacent class sonic boom position institute Whether fallen in a preset time section one second total time length T2 shared by corresponding audio frame;And judge adjacent class sonic boom Whether be greater than third total time length T3 shared by the corresponding audio frame of the latter at position adjacent white noise position institute this first Preset time.
In the present embodiment, network communication interface is matched with network communication module completes the various network signals of server apparatus Transmitting-receiving and downloading network on audio file etc..
In other examples, sonic boom detection device includes: a sampling module, comparison module 502 and a control module, Equal electrical communication between above three module.Sonic boom detection device can run method of the invention, and not need collocation another One server apparatus.
In addition, each embodiment of the invention can pass through the data processing by data processing equipment such as calculator execution Program is realized.Obviously, data processor constitutes the present invention.In addition, being commonly stored in the data in a storage device Handling routine is by directly reading out storage device for program or by installing or copying to data processing equipment for program It stores in equipment (such as hard disk and/or memory) and executes.Therefore, such storage device also constitutes the present invention.Storage device can be with Use any kind of recording mode, such as paper storage device (such as paper tape), magnetic memory apparatus (such as floppy disk, hard disk, flash memory Deng), light storage device, magneto optical storage devices etc..
(the possibility technical effect of embodiment)
Pass through the sonic boom detection method and device of the embodiment of the present application, it is expected to free manual testing and be difficult to long-time, wide model It encloses, carefully listen to the predicament whether sonic boom generates.Moreover, artificial judgment sonic boom mode is often excessively subjective, it need to rely on and listen individually The susceptibility of the sense of hearing of person.The present invention can establish more objective testing standard.Furthermore it is subtle when carrying out manual testing Sonic boom be difficult to discover sometimes, sonic boom detection device disclosed in this invention can still confirm subtle sonic boom.
Finally it should be noted that, in preceding description, although by the concept of the technology of the present invention with multiple exemplary realities Example is applied to be specifically illustrated in and illustrate, however in the field of technique ordinarily skilled artisan will understand that, without departing substantially from by following Under conditions of the range of the concept of the technology of the present invention that claim is defined, the various changes in form and details can be made to it Change.

Claims (10)

1. a kind of sonic boom detection method, which is characterized in that the described method includes:
For a sound source signal, acquirement is formed by multiple audio frames sequentially in time in a time domain;
It is sampled for each audio frame;
A sampling amplitude of each sampling result in each audio frame and a default sampling amplitude are compared respectively Compared with;
According to the comparison result of each audio frame, to confirm that each audio frame belongs to white noise position or class sonic boom position;With And
Judge in multiple audio frame whether according to time sequencing adjacent white noise position, adjacent class sonic boom position and adjacent occur White noise position, for confirming whether the sound source signal sonic boom occurs, wherein when an audio frame at least one audio adjacent thereto When frame is white noise position, then its corresponding audio frame belongs to adjacent white noise position, when an audio frame is adjacent thereto at least When one audio frame is class sonic boom position, then its corresponding audio frame belongs to adjacent class sonic boom position.
2. the method according to claim 1, wherein each of multiple audio frame sampling frequency is one Default sampling frequency.
3. according to the method described in claim 2, it is characterized in that, the audio frame is that white noise position refers to and takes in the audio frame The sampling amplitude of sample result is greater than one first quantity of the default sampling amplitude less than a default fiducial value.
4. according to the method described in claim 3, it is characterized in that, the audio frame belongs to class sonic boom position refers in the audio frame The sampling amplitude of sampling result is greater than one first quantity of the default sampling amplitude not less than the default fiducial value.
5. the method according to claim 1, wherein adjacent when occurring in multiple audio frame according to time sequencing It is further comprising the steps of when white noise position, adjacent class sonic boom position and adjacent white noise position:
Judge that one first total time shared by the previous adjacent corresponding audio frame of white noise position institute at adjacent class sonic boom position is long Whether degree is greater than one first preset time;
Judge shared by the corresponding audio frame of adjacent class sonic boom position institute one second total time length whether fall in a preset time area In;And
Judge that a third total time shared by the adjacent corresponding audio frame of white noise position institute of the latter at adjacent class sonic boom position is long Whether degree is greater than first preset time.
6. according to the method described in claim 5, it is characterized in that, when first total time length and the third total time it is long Degree is greater than first preset time, and when second total time length is fallen in the preset time section, confirm the sound source signal There is sonic boom.
7. according to the method described in claim 6, the preset time section is situated between it is characterized in that, first preset time is 2 seconds Between 2.5 milliseconds to 0.4 second.
8. a kind of sonic boom detection device, which is characterized in that described device includes:
One sampling module, for being directed to a sound source signal, acquirement is formed by multiple audios sequentially in time in a time domain It frame and is sampled for each audio frame;
One comparison module takes for respectively presetting a sampling amplitude of each sampling result in each audio frame with one Sample amplitude is compared, and confirmed according to the comparison result of each audio frame each audio frame belong to white noise position or Class sonic boom position;And
One control module, for judging in multiple audio frame whether according to time sequencing adjacent white noise position, adjacent occur Class sonic boom position and adjacent white noise position, for confirming whether the sound source signal sonic boom occurs, wherein when an audio frame and its When at least one adjacent audio frame is white noise position, then its corresponding audio frame belongs to adjacent white noise position, when a sound When frequency frame at least one audio frame adjacent thereto is class sonic boom position, then its corresponding audio frame belongs to adjacent class sonic boom portion Position, wherein equal electrical communication between the control module and the sampling module and the comparison module.
9. device according to claim 8, which is characterized in that the audio frame refers to for white noise position to be taken in the audio frame The sampling amplitude of sample result is greater than one first quantity of the default sampling amplitude less than a default fiducial value.
10. device according to claim 8, which is characterized in that the control module judges the previous of adjacent class sonic boom position Shared by a corresponding audio frame of adjacent white noise position institute one first total time length whether be greater than one first preset time;Judgement Shared by the corresponding audio frame of adjacent class sonic boom position institute one second total time length whether fall in a preset time section;And Judge shared by the corresponding audio frame of the latter at adjacent class sonic boom position adjacent white noise position institute one third total time length be It is no to be greater than first preset time.
CN201910012237.4A 2019-01-07 2019-01-07 Plosive detection method and device Active CN109658955B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910012237.4A CN109658955B (en) 2019-01-07 2019-01-07 Plosive detection method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910012237.4A CN109658955B (en) 2019-01-07 2019-01-07 Plosive detection method and device

Publications (2)

Publication Number Publication Date
CN109658955A true CN109658955A (en) 2019-04-19
CN109658955B CN109658955B (en) 2021-03-09

Family

ID=66119062

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910012237.4A Active CN109658955B (en) 2019-01-07 2019-01-07 Plosive detection method and device

Country Status (1)

Country Link
CN (1) CN109658955B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110265064A (en) * 2019-06-12 2019-09-20 腾讯音乐娱乐科技(深圳)有限公司 Audio sonic boom detection method, device and storage medium
CN112735481A (en) * 2020-12-18 2021-04-30 Oppo(重庆)智能科技有限公司 POP sound detection method and device, terminal equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7986179B2 (en) * 2009-06-12 2011-07-26 Elite Semiconductor Memory Technology Inc. Circuit and method for reducing popping sound
CN103886868A (en) * 2014-04-16 2014-06-25 上海其高电子科技有限公司 Impact sound detection method and detection system
CN104143341A (en) * 2013-05-23 2014-11-12 腾讯科技(深圳)有限公司 Sonic boom detection method and device
CN105118520A (en) * 2015-07-13 2015-12-02 腾讯科技(深圳)有限公司 Elimination method and device of audio beginning sonic boom

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7986179B2 (en) * 2009-06-12 2011-07-26 Elite Semiconductor Memory Technology Inc. Circuit and method for reducing popping sound
CN104143341A (en) * 2013-05-23 2014-11-12 腾讯科技(深圳)有限公司 Sonic boom detection method and device
CN103886868A (en) * 2014-04-16 2014-06-25 上海其高电子科技有限公司 Impact sound detection method and detection system
CN105118520A (en) * 2015-07-13 2015-12-02 腾讯科技(深圳)有限公司 Elimination method and device of audio beginning sonic boom

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110265064A (en) * 2019-06-12 2019-09-20 腾讯音乐娱乐科技(深圳)有限公司 Audio sonic boom detection method, device and storage medium
CN110265064B (en) * 2019-06-12 2021-10-08 腾讯音乐娱乐科技(深圳)有限公司 Audio frequency crackle detection method, device and storage medium
CN112735481A (en) * 2020-12-18 2021-04-30 Oppo(重庆)智能科技有限公司 POP sound detection method and device, terminal equipment and storage medium
CN112735481B (en) * 2020-12-18 2022-08-05 Oppo(重庆)智能科技有限公司 POP sound detection method and device, terminal equipment and storage medium

Also Published As

Publication number Publication date
CN109658955B (en) 2021-03-09

Similar Documents

Publication Publication Date Title
AU774088B2 (en) Apparatus and methods for detecting emotions in the human voice
JP4308278B2 (en) Method and apparatus for objective voice quality measurement of telecommunications equipment
Klein-Hennig et al. The influence of different segments of the ongoing envelope on sensitivity to interaural time delays
CN108010512B (en) Sound effect acquisition method and recording terminal
CN101426168B (en) Sounding body abnormal sound detection method and system
US7050978B2 (en) System and method of providing evaluation feedback to a speaker while giving a real-time oral presentation
EP2529370B1 (en) Systems and methods for speech extraction
US10381025B2 (en) Multiple pitch extraction by strength calculation from extrema
US20070299663A1 (en) Speech recognition optimization tool
US8929561B2 (en) System and method for automated audio mix equalization and mix visualization
US20060100866A1 (en) Influencing automatic speech recognition signal-to-noise levels
CN102214464A (en) Transient state detecting method of audio signals and duration adjusting method based on same
CN105118522A (en) Noise detection method and device
CN113259832B (en) Microphone array detection method and device, electronic equipment and storage medium
CN109658955A (en) Sonic boom detection method and device
CN103165138A (en) Visual feedback of audio input levels
CA2701439C (en) Measuring double talk performance
CN109994129B (en) Speech processing system, method and device
CN110390954B (en) Method and device for evaluating quality of voice product
Vanitha et al. Automation of noise detection using internet of things
Bäckström et al. Voice activity detection
Dai et al. An improved model of masking effects for robust speech recognition system
Dai et al. 2D Psychoacoustic modeling of equivalent masking for automatic speech recognition
CN109788399A (en) A kind of echo cancel method and system of speaker
Voran Estimation of speech intelligibility and quality

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant