CN101458944B - Sound recording control method and sound recording device - Google Patents

Sound recording control method and sound recording device Download PDF

Info

Publication number
CN101458944B
CN101458944B CN 200810247079 CN200810247079A CN101458944B CN 101458944 B CN101458944 B CN 101458944B CN 200810247079 CN200810247079 CN 200810247079 CN 200810247079 A CN200810247079 A CN 200810247079A CN 101458944 B CN101458944 B CN 101458944B
Authority
CN
China
Prior art keywords
recording
sound
collection unit
data
sound collection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN 200810247079
Other languages
Chinese (zh)
Other versions
CN101458944A (en
Inventor
张晨
冯宇红
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuxi Zhonggan Microelectronics Co Ltd
Original Assignee
Wuxi Vimicro Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuxi Vimicro Corp filed Critical Wuxi Vimicro Corp
Priority to CN 200810247079 priority Critical patent/CN101458944B/en
Publication of CN101458944A publication Critical patent/CN101458944A/en
Application granted granted Critical
Publication of CN101458944B publication Critical patent/CN101458944B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention provides a record control method and a record device therefore, for improving the reliability of sound control recording. The record device is provided with at least two audio collection units. The record control method for the record device comprises: obtaining target audio data of prior frame collected by each audio collection unit, and determining the target audio signal strength corresponding to the target audio data of each prior frame; when the ratio of two audio signal strengths is higher than a first judgment threshold value as a first setting, storing the target audio data of prior frame collected by each audio collection unit as a record data, while the first judgment threshold value is determined by the recording distance of the record device and the distances among the audio collection units; when the continuous time from the initial frame not satisfying the first setting to prior frame reaches a default time length, stopping storing the target audio data of prior frame collected by each audio collection unit as record data.

Description

A kind of recording control method and sound pick-up outfit
Technical field
The present invention relates to signalling technique, particularly a kind of recording control technology.
Background technology
Along with popularizing of digital product, the application of various sound pick-up outfits is more and more wider, such as recording pen or with other digital product of sound-recording function etc., is more and more used by people.For saving storage space, sound pick-up outfit with the sound control recording function arises at the historic moment, purpose is whether there is to control recording by monitoring objective sound, begin recording when having target sound to exist, suspend recording when not having target sound, can avoid like this waste of storage space, also can so that recording material is compacter, save playback duration in addition.
The solution of existing sound control recording is, sets a fixing threshold value, then detects frame by frame the energy of collection signal, if energy greater than this threshold value, then starts recording, if energy less than this threshold value, then suspends recording.The present application people finds that according to the distance of recording distance, the characteristic of target sound is different, and prior art does not provide corresponding acoustic control mechanism for the target sound of different distance, thereby causes the reliability of Sound control function lower.
Summary of the invention
The embodiment of the invention provides a kind of recording control method and sound pick-up outfit, in order to improve the reliability of sound control recording.
A kind of recording control method of sound pick-up outfit, be provided with at least two sound collection unit on the described sound pick-up outfit, described sound pick-up outfit has the pattern of closely saying, the described pattern of closely saying is recording distance at the farthest recording distance of setting with interior recording mode, saying that closely under the pattern, the recording control method of described sound pick-up outfit comprises:
Obtain the present frame target sound data that each sound collection unit collects, and determine target sound signal intensity corresponding to each present frame target sound data;
When the ratio of two voice signal intensity wherein greater than for saying that closely first of the first decision threshold that pattern is set imposes a condition when satisfying, the present frame target sound data of each sound collection unit collection are stored as recording data, and described the first decision threshold is determined according to recording distance and the spacing between each sound collection unit of sound pick-up outfit;
Impose a condition when not satisfying when first, judge first imposes a condition whether arrive the reticent duration of setting from beginning ungratified start frame to duration of present frame: impose a condition and arrive when setting reticent duration from beginning ungratified start frame to duration of present frame when described first, suspend recording, stop the present frame target sound data of each sound collection unit collection are stored as recording data; Impose a condition and do not arrive when setting reticent duration from beginning ungratified start frame to duration of present frame when first, continue the present frame target sound data of each sound collection unit collection are stored as recording data.
Better, the described pattern of closely saying is set with farthest recording distance, determines that according to recording distance and the spacing between each sound collection unit of described sound pick-up outfit the concrete grammar of the first decision threshold comprises:
Determine that each sound collection unit makes up the I group sound collection unit group of rear formation in twos;
To wherein each organizes sound collection unit group, determine: Z i=(R+d i) 2/ R 2, wherein, 1≤i≤I, wherein: Z iBe minimum acoustic ratio threshold value corresponding to i group sound collection unit group, R is the farthest recording distance of sound pick-up outfit, d iIt is the spacing between two sound collection unit in the i group sound collection unit group;
Determine that described the first decision threshold is: more than or equal to Z 1~Z IMiddle minimum value and less than or equal to peaked arbitrary value wherein.
Wherein, described more than or equal to Z 1~Z IMiddle minimum value also less than or equal to peaked arbitrary value wherein is: Z 1~Z IMean value.
Better, determine that according to recording distance and the spacing between each sound collection unit of described sound pick-up outfit the concrete grammar of the first decision threshold comprises:
Determine that each sound collection unit makes up the I group sound collection unit group of rear formation in twos;
To wherein each organizes sound collection unit group, determine: B i=(r+d i) 2/ r 2, wherein, 1≤i≤I, wherein: B iBe conventional acoustic ratio threshold value corresponding to i group sound collection unit group, r is the conventional recording distance of sound pick-up outfit, d iIt is the spacing between two sound collection unit in the i group sound collection unit group;
Determine that B is described B 1~B IMean value;
Determine that described the first decision threshold is greater than 1 value less than B.
Wherein, describedly greater than 1 value less than B be: 1 and the mean value of B.
Describedly determine also to comprise before target sound signal intensity corresponding to each present frame target sound data:
Determine the current recording pattern of described sound pick-up outfit, and after the current recording pattern is that recording distance is the first mode of R farthest, continues affirmation described first and whether satisfiedly impose a condition.
Further, after definite sound pick-up outfit is activated recording or suspends recording, also comprise: start and record preliminary data, the target sound data of described preliminary data for collecting in the setting of each sound collection unit before the present frame duration for subsequent use; And
During described startup recording, the preliminary data that also will record before is stored as the recording data before the present frame.
A kind of sound pick-up outfit comprises at least two sound collection unit, and described sound pick-up outfit has the pattern of closely saying, the described pattern of closely saying is recording distance at the farthest recording distance of setting with interior recording mode, and described sound pick-up outfit also comprises:
The first threshold storage unit is used for being stored as the first decision threshold of saying that closely pattern is set, and described the first decision threshold is determined according to recording distance and the spacing between each sound collection unit of sound pick-up outfit;
The recording data storage unit is used for the storage recording data;
The recording control module, be used under the pattern of closely saying of described sound pick-up outfit, obtaining described the first decision threshold from described first threshold storage unit, and after receiving the present frame target sound data that each sound collection unit collects, determine the target sound signal intensity that each present frame target sound data is corresponding, and impose a condition when satisfying greater than first of the first decision threshold when the ratio of two voice signal intensity wherein, store the present frame target sound data of each sound collection unit collection into described recording data storage unit; Impose a condition when not satisfying when first, judge first imposes a condition whether arrive the reticent duration of setting from beginning ungratified start frame to duration of present frame: impose a condition and arrive when setting reticent duration from beginning ungratified start frame to duration of present frame when described first, suspend recording, stop the present frame target sound data of each sound collection unit collection are stored as recording data; Impose a condition and do not arrive when setting reticent duration from beginning ungratified start frame to duration of present frame when first, continue the present frame target sound data of each sound collection unit collection are stored as recording data.
Further, described sound pick-up outfit also comprises:
The first decision threshold determining unit is used for determining that described the first decision threshold is more than or equal to Z 1~Z IMiddle minimum value or less than or equal to peaked arbitrary value wherein, wherein, I makes up the quantity of the sound collection unit group of rear formation in twos for each sound collection unit, and the first decision threshold that will determine stores in the first threshold storage unit, wherein Z iAfter any two sound collection unit combination, minimum acoustic ratio threshold value corresponding to i group sound collection unit group, Z i=(R+d i) 2/ R 2, wherein, 1≤i≤I, R are the farthest recording distance of sound pick-up outfit, d iIt is the spacing between two sound collection unit in the i group sound collection unit group; Perhaps
Be used for determining that described the first decision threshold B ' is greater than 1 value less than B, B is B 1~B IBetween arbitrary value, B iAfter any two sound collection unit combination, conventional acoustic ratio threshold value corresponding to i group sound collection unit group, B i=(r+d i) 2/ r 2, wherein, 1≤i≤I, r are the conventional recording distance of sound pick-up outfit, d iIt is the spacing between two sound collection unit in the i group sound collection unit group.
And then described sound pick-up outfit also comprises:
Buffer unit, be used for the preliminary data that buffer memory is recorded, described recording control module is after definite sound pick-up outfit is activated recording or suspends recording, also be used for according to the duration for subsequent use of setting, the target sound data that collect in the setting of each sound collection unit before the present frame duration for subsequent use are stored in the described buffer unit as preliminary data, and when starting recording, the preliminary data of storing in the buffer unit is dumped in the described recording data storage unit as the recording data before starting.
The recording control method that the embodiment of the invention proposes, improved the method that only adopts energy threshold in the present sound control recording, but during the recording of in setpoint distance, carrying out, adopt acoustic ratio between two sound collection unit as the judgement foundation of whether recording.Further also propose to record the technology of a period of time preliminary data, guarantee the not data of lose objects sound incipient stage, further improved the recording accuracy.
Description of drawings
The pattern of closely the saying acoustic ratio decision threshold preparation method synoptic diagram that Fig. 1 provides for the embodiment of the invention;
The signal-noise ratio estimation method schematic flow sheet that Fig. 2 provides for the embodiment of the invention;
The pattern of closely the saying recording control method schematic flow sheet that Fig. 3 provides for the embodiment of the invention;
The pattern of far the saying recording control method schematic flow sheet that Fig. 4 provides for the embodiment of the invention;
The sound pick-up outfit structural representation that Fig. 5 provides for the embodiment of the invention.
Embodiment
The embodiment of the invention fully takes into account the impact that recording distance causes the target sound characteristic, according to two kinds of different recording control technologys of the far and near proposition of recording distance, is elaborated below in conjunction with accompanying drawing.
One, closely says pattern
The pattern of closely saying also can be described as dictation mode, and namely recording distance is closer, and it is lower that the sensitivity of sound collection unit can arrange, for example interview or the recording carried out during readme.At this moment, sound wave is spherical wave when arriving the sound collection unit of recording device, square being inversely proportional to of the intensity of acoustic wave of spherical wave and distance, generally speaking, recording device possesses two or more sound collection unit, if the intensity rate of the sound that the alternative sounds collecting unit collects satisfies the spherical wave characteristic, can judge accordingly that then target sound exists, and should start recording.And the characteristic harmony spacing of spherical wave from and collecting unit between spacing relevant, therefore when setting the acoustic ratio decision threshold, need according to maximum recording distance or conventional recording distance for saying that closely pattern is set, and the distance between each sound collection unit is definite.
As shown in Figure 1, two sound collection unit that possess take sound pick-up outfit are as example, and the spacing of two sound collection unit is 3cm, the conventional recording distance of closely saying is 10cm, sound pick-up outfit is illustrated recording pen for example, and two sound collection unit are illustrated microphone Mic1 and Mic2 for example, then:
Sound source apart from Mic1 apart from r1=10cm;
Sound source apart from Mic2 apart from r2=13cm;
Suppose that the intensity of sound that two sound collection unit are recorded is P1 and P2, then as shown in Equation 1:
Pr = P 1 P 2 = r 2 * r 2 r 1 * r 1 = 1.69 - - - ( 1 )
Consider if the maximum distance of sound source recording distance pen greater than 10cm, then this ratio Pr can reduce, otherwise can raise.The placing direction that further contemplates recording pen can not be as shown in Figure 1, be in a straight line with sound source, if put tiltedly, the range difference of sound source to two sound collection unit will be less than 3cm so, and the coverage of then closely saying will shorten, therefore, the decision threshold of intensity of sound is less than 1.69, rule of thumb decision threshold can be set as 1.3, closely say the target sound sound source to determine whether to exist, thereby realize closely saying that target sound detects.
The accuracy that detects for increasing the sound intensity, the embodiment of the invention can also adopt the single order low-pass filtering that Pr is done smoothing processing, and concrete grammar is:
Pr n’=Pr n-1’*alfa+Pr n*(1-alfa)
Pr wherein nBe the sound intensity value of the n time collection, Pr N-1' sound intensity average of carrying out obtaining after the single order low-pass filtering according to the sound intensity value that gathers for the n-1 time, Pr n' being the sound intensity value sound intensity average of carrying out obtaining after the single order low-pass filtering according to the n time collection, alfa is the weighting coefficient between 0~1, can be 0.9,0.8 or other value, the embodiment of the invention does not add restriction.Those skilled in the art can also adopt other filtering method to carry out smoothing processing, describe in detail no longer one by one here.
Based on above-mentioned principle, when the given conventional recording distance of pattern or the recording distance farthest closely said, and in the situation of the known spacing between the unit of respectively recording, can determine acoustic ratio threshold value Threshold, the acoustic ratio Pr of the same frame voice signal that gathers when any two sound collection unit that detect is during greater than Threshold, just can be judged to be and say that closely target sound occurs, the present frame target sound data of each sound collection unit collection need to be stored as recording data, when above-mentioned condition arrives when setting reticent duration from beginning ungratified start frame to duration of present frame, can be judged to driftlessness sound, stop the present frame target sound data of each sound collection unit collection are stored as recording data.
Those skilled in the art rule of thumb are worth the decision threshold that can set acoustic ratio, also can be according to the conventional recording distance of closely saying pattern or recording distance farthest, and the spacing of respectively recording between the unit is rationally calculated, the below provides two kinds of circulars, establishes sound pick-up outfit and comprises a plurality of sound collection unit.
The first is calculated according to the farthest recording distance of setting and is determined, specifically comprises the steps:
Determine that each sound collection unit makes up the I group sound collection unit group of rear formation in twos;
To wherein each organizes sound collection unit, determine according to formula 1: Z i=(R+d i) 2/ R 2, wherein: Z iBe minimum acoustic ratio threshold value corresponding to i group sound collection unit, R is the farthest recording distance of sound pick-up outfit, d iIt is the spacing between two sound collection unit in the i group sound collection unit;
Determine that described the first decision threshold is: more than or equal to Z 1~Z IMiddle minimum value or less than or equal to peaked arbitrary value wherein.
In the said method, utilize formula 1, can calculate according to recording distance farthest the minimum acoustic ratio of each group sound collection equipment, then the acoustic ratio decision threshold is set as: more than or equal to Z 1~Z IMiddle minimum value or less than or equal to peaked arbitrary value wherein.Better, the acoustic ratio decision threshold is set as: Z 1~Z IMean value.Those skilled in the art can also be according to Z 1~Z I, determine the concrete value of B ' by test method, describe in detail no longer one by one here.
According to the first computing method, can also determine further that the farthest recording distance R of device is formula B '=(R+d i) 2/ R 2Greater than zero solution.
The second calculates according to the conventional recording distance of setting to be determined, specifically comprises the steps:
Determine that each sound collection unit makes up the I group sound collection unit group of rear formation in twos;
To wherein each organizes sound collection unit, determine according to formula 1: B i=(r+d i) 2/ r 2, wherein: B iBe conventional acoustic ratio threshold value corresponding to i group sound collection unit, r is the conventional recording distance of sound pick-up outfit, d iIt is the spacing between two sound collection unit in the i group sound collection unit;
Determine that B is described B 1~B IMean value;
Determine that described the first decision threshold B ' is greater than 1 value less than B.
According to the principle of embodiment of the invention technical scheme, the decision threshold of recording distance should be less than the decision threshold of conventional recording distance farthest, and therefore getting B ' is greater than 1 value less than B, and certainly, B also can be B 1~B IIn maximal value or minimum value, better B ' is: 1 and the mean value of B.Those skilled in the art can also be according to B 1~B I, determine the concrete value of B ' by test method, describe in detail no longer one by one here.
According to above-mentioned principle, in sound pick-up outfit, set the pattern of closely saying, when user selection is closely said the pattern recording, whether record according to the acoustic ratio threshold determination of setting, because the setting of acoustic ratio threshold value has taken into full account spherical wave characteristic and the recording distance of closely saying sound source, thereby improved the reliability of recording control.
According to above-mentioned principle, after the farthest recording distance of closely saying pattern was determined, if record closely saying outside the farthest recording distance of pattern, then the embodiment of the invention was referred to as far to say the pattern recording.
For the detection of far saying the target sound under the pattern, the embodiment of the invention has also proposed corresponding detection method, and the below is elaborated.
Two, far say pattern
When the embodiment of the invention is considered sound pick-up outfit away from sound source, the arrival of acoustic signals becomes plane wave basically when respectively recording collecting device, the voice signal intensity of each sound collection unit collection and the correlativity of distance can be ignored, and whether the signal-to-noise characteristic of sound can exist for judgement sound, therefore the embodiment of the invention proposes a kind of far saying under the pattern, utilize noise recently estimating target sound have the recording control technology of probability.
As shown in Figure 2, detect principle schematic for far saying target sound, be averaged for the multiple signals of each sound collection unit collection, utilize signal averaging analysis can reduce operand.Wherein:
The Avg module is exactly the operation that two paths of signals is averaged, and obtains average signal S f, then signal by analysis window level and smooth after, utilize Fourier transform FFT, signal is transformed from the time domain to frequency domain, suppose frequency-region signal Y[k] represent, to Y[k] carry out SNR estimation, at first want the variance of estimating background noise comprising.Ground unrest normally unstable and the time become, this just requires the variation that noise Estimation Algorithm can the real-time follow-up ground unrest, suppose that at first the energy of signal is greater than the energy of noise, this hypothesis can both satisfy in general application scenario, so the ultimate principle that noise is estimated is exactly when target sound exists probability less, by continuous search least energy, come the estimating noise variance, concrete grammar comprises the steps:
1, at first obtains S fThe spectrum energy of every spectral line, and carry out smoothing processing;
Smoothing processing comprises with Hanning window to be made segment smoothing and further does temporal smoothing processing with single order recurrence average disposal route, wherein, with Hanning window as segment smoothing is:
S f [ i ] = Σ k = - W W b [ k ] | Y [ k - i ] | 2
Wherein b represents Hanning window, and the width of Hanning window is 2W, and W can get 1.
Further doing temporal smoothing processing with single order recurrence average disposal route is:
S[i]=α sS[i]+(1-α s)S f[i]
α wherein sSatisfy 0<α s<1
2, the signal S[i of search after the smoothing processing] the local least energy S of every spectral line Min[i];
The search of this local minimum can be with falling the recursion shortcut calculation realization that rises slowly, that is: soon
If S[i]>Smin[i], Smin[i then]=Smin[i] * alfa+S[i] * (1-alfa)
If S[i]<=Smin[i], Smin[i then]=Smin[i] * beta+S[i] * (1-beta)
Wherein alfa and beta are the numbers between 0~1, fall soon the characteristics that rise slowly in order to embody, general alfa>beta;
3, to each bar spectral line, respectively with Smin[i] as noise variance, and S[i] add the variance of target sound for noise.Be that the target sound variance is:
Sv[i]=S[i]-Smin[i]
Then the signal to noise ratio (S/N ratio) on i spectral line is:
SNR[i]=Sv[i]/Smin[i]
The SNR[i of all spectral lines] average signal-to-noise ratio be:
SNR=Average(SNR[i]),i=0-fftsize/2
Above-mentioned SNR estimation technology is well known to those skilled in the art, those skilled in the art can also adopt other SNR estimation technology to obtain the average signal-to-noise ratio of a plurality of signals, in the embodiment of the invention, when the average signal-to-noise ratio of the signal acquisition that collects according to a plurality of signal gathering unit greater than 1, perhaps than 1 slightly large number, for example 1.1 or 1.2 o'clock, perhaps signal quality is very good, and signal to noise ratio (S/N ratio) is very large, reaches tens or during hundreds of, can adjudicate the existence of far saying sound source, begin recording.Far saying under the pattern, according to different recording quality requirements, the decision threshold of signal to noise ratio (S/N ratio) can be set as the number greater than 1, generally be no more than 1.5 and get final product.
Three, record preliminary data
The embodiment of the invention might be missed some useful voice datas before also further contemplating and starting recording, therefore after sound pick-up outfit is activated or suspends recording each time, one section preliminary data of rear loop recording, the target sound data of preliminary data for collecting in the setting of each sound collection unit before the present frame duration for subsequent use; And start each time when recording, the preliminary data that also will record before is stored as the recording data between the present frame.For realizing recording of preliminary data, the embodiment of the invention provides a kind of specific implementation:
At first, according to the duration for subsequent use of setting, the rollback internal memory of application respective stored amount, wherein:
In the rollback internal memory, each frame voice data that each sound collection unit gathers can be stored as a circular linked list structure, and each node of this circular linked list structure can represent with a following structure:
Figure GDA0000111250100000101
Node represents the structure title of this node, Data1[L] be a certain frame signal that Mic1 gathers, Data2 is a certain frame signal that Mic2 gathers.NextNode is for pointing to next frame signal, the i.e. pointer of next node.Wherein L is frame length.
Suppose that the sampling rate of signal is 8k, frame length L is 128, then, if wish the data of temporary 0.5s in the rollback internal memory, then probably needs temporary 32 frames.Namely can arrange has 32 nodes in the circular linked list, and is defined as: Node1, and Node2 ..., then Node32 is together in series 32 nodes during initialization, forms circular linked list, that is:
Node?1->NextNode=Node2;
Node2->NextNode=Node3;
Node31->NextNode=Node32;
Node32->NextNode=Node1;
Suppose that NodeCurrent is present node, every frame signal then, need to do:
The signal that Mic1 is gathered is assigned to NodeCurrent.Data1
The signal that Mic2 is gathered is assigned to NodeCurrent.Data2
NodeCurrent=NodeCurrent->NextNode
By this method, for each signal gathering unit, can in the rollback internal memory, keep all the time the data of up-to-date 0.5s.Prepare against when needing and use.
If previous frame is in the time-out recording state, and the present frame court verdict then starts recording for target sound is arranged, and connects the rollback memory modules, recording start point is rolled back to the data reference position of rollback internal memory.Suppose the node position NodeHead of reference position, then can according to the present node NodeCurrent of rollback internal memory, obtain start node by NodeHead=NodeCurrent->NextNode.Then the data in the node in the whole circular list are all recorded.The data that so just the part of target sound the initial segment will be able to be lost are originally retrieved by the mode of rollback.The standby time length of rollback can be controlled by the node number is set.
Based on above-mentioned principle, can arrange in sound pick-up outfit closely makes peace far says two kinds of patterns, according to user's selection, adopt the control method of correspondence to judge whether the startup recording, also can closely saying sound pick-up outfit or saying that far adopting wherein corresponding control method to control in the sound pick-up outfit records in special use.
As shown in Figure 3, the pattern of closely the saying recording control method that provides of the embodiment of the invention comprises the steps:
S300, sound pick-up outfit start;
S301, record preliminary data and be kept in the buffer memory;
S302, obtain the present frame target sound data that each sound collection unit collects, and determine target sound signal intensity corresponding to each present frame target sound data;
Whether S303, judgement first impose a condition satisfies;
S304, impose a condition and satisfy time recording when first;
When the ratio of two voice signal intensity wherein greater than first of the first decision threshold recording when satisfying that imposes a condition, comprise: the present frame target sound data of each sound collection unit collection are stored as recording data, if recorded preliminary data in the buffer memory, then also preliminary data is stored as recording data, and stops step S301;
S305, impose a condition when not satisfying when first, judge that first imposes a condition and whether arrive and set reticent duration from beginning ungratified start frame to duration of present frame;
Impose a condition and do not arrive when setting reticent duration from beginning ungratified start frame to duration of present frame when first, continue step S304 recording, otherwise execution in step S306 suspends recording, comprise: stop the present frame target sound data of each sound collection unit collection are stored as recording data, and trigger step S301 and carry out, be preliminary data with data recording, be kept in the buffer memory.
First imposes a condition is and closely says the mode decision condition, namely two voice signal intensity whether ratio is specifically determined method such as front greater than for the acoustic ratio decision threshold of saying that closely pattern is set, no longer be repeated in this description here.
Setting reticent duration is the maximum duration that the driftlessness sound status continues, and can utilize a counter to detect, and only has after the driftlessness sound status continues for some time, and just suspends recording.The reason of doing like this is, the people generally has the target sound intermittent phase in a minute, and therefore, the short target sound intermittent phase should be given and reservation.Therefore, the duration of setting driftlessness sound status is 3s for example, behind the 3s, if still be judged to be driftlessness sound, then suspends recording.Driftlessness sound status counter starts recording and all returns 0 being judged to be at every turn.
As shown in Figure 4, the pattern of far the saying recording control method that provides of the embodiment of the invention comprises the steps:
S400, sound pick-up outfit start;
S401, record preliminary data and be kept in the buffer memory;
S402, obtain the present frame target sound data that each sound collection unit collects, and determine the average signal-to-noise ratio of current frame signal according to each present frame target sound data;
Whether S403, judgement average signal-to-noise ratio impose a condition greater than second of the second decision threshold satisfies;
S404, impose a condition and satisfy time recording when second;
When the ratio of two voice signal intensity wherein greater than second of the second decision threshold recording when satisfying that imposes a condition, comprise: the present frame target sound data of each sound collection unit collection are stored as recording data, if recorded preliminary data in the buffer memory, then also preliminary data is stored as recording data, and stops step S401;
S405, impose a condition when not satisfying when second, judge that second imposes a condition and whether arrive and set reticent duration from beginning ungratified start frame to duration of present frame;
Impose a condition and do not arrive when setting reticent duration from beginning ungratified start frame to duration of present frame when second, continue step S404 recording, otherwise execution in step S406 suspends recording, comprise: stop the present frame target sound data of each sound collection unit collection are stored as recording data, and trigger step S401 and carry out, be preliminary data with data recording, be kept in the buffer memory.
Second imposes a condition is and far says the mode decision condition, and namely whether the average signal-to-noise ratio of current frame signal specifically determines method such as front greater than for the signal to noise ratio (S/N ratio) decision threshold of saying that far pattern is set, no longer is repeated in this description here.
If sound pick-up outfit is provided with simultaneously to be selected in and closely says pattern and far say pattern, then according to user's selection, after start, judge first recording mode, then according to the recording mode of user selection, enter Fig. 3 or control flow shown in Figure 4.
As shown in Figure 5, the embodiment of the invention also provide a kind of can be according to the sound pick-up outfit of closely saying pattern control recording, comprise at least two sound collection unit 501 (5011,5012...501n), also comprise:
First threshold storage unit 502 is used for storage the first decision threshold, and the first decision threshold is determined according to recording distance and the spacing between each sound collection unit of sound pick-up outfit;
Recording data storage unit 503 is used for the storage recording data;
Recording control module 504, be used for obtaining the first decision threshold from the first threshold storage unit, and the present frame target sound data that receiving each sound collection unit and collect, determine the target sound signal intensity that each present frame target sound data is corresponding, and impose a condition when satisfying greater than first of the first decision threshold when the ratio of two voice signal intensity wherein, store the present frame target sound data of each sound collection unit collection into storage unit, impose a condition and arrive when setting reticent duration from beginning ungratified start frame to duration of present frame when first, stop the present frame target sound data of each sound collection unit collection are stored as recording data.
This sound pick-up outfit can also be according to gain of parameter the first decision threshold of setting, and then this sound pick-up outfit further can also comprise:
The first decision threshold determining unit 505 is used for determining that the first decision threshold is more than or equal to Z 1~Z IMiddle minimum value or less than or equal to peaked arbitrary value wherein, and the first decision threshold that will determine stores in the first threshold storage unit, wherein Z iAfter any two sound collection unit combination, minimum acoustic ratio threshold value corresponding to i group sound collection unit, Z i=(R+d i) 2/ R 2, R is the farthest recording distance of sound pick-up outfit, d iIt is the spacing between two sound collection unit in the i group sound collection unit; Perhaps be used for determining that the first decision threshold B ' is greater than 1 value less than B, B is B 1~B IMean value, B iAfter any two sound collection unit combination, conventional acoustic ratio threshold value corresponding to i group sound collection unit, B i=(r+d i) 2/ r 2, r is the conventional recording distance of sound pick-up outfit.
If this sound pick-up outfit further can also according to far saying pattern control recording, then also comprise:
The second decision threshold storage unit 506 is used for storage the second decision threshold, and the second decision threshold is greater than 1;
Mode setting unit 507, the recording mode that is used for the reception user arranges indicator signal, and export to the recording control module, the recording control module arranges indicator signal according to the recording mode that receives and confirms the current recording pattern when recording distance is the first mode of R farthest, continues to confirm that first imposes a condition and whether satisfy after receiving the present frame target sound data that each sound collection unit collects; Otherwise obtain the second decision threshold from the second decision threshold storage unit, determine the average signal-to-noise ratio of current frame signal according to each present frame target sound data, and impose a condition when satisfying greater than second of the second decision threshold when average signal-to-noise ratio, the present frame target sound data of each sound collection unit collection are stored as recording data, impose a condition and arrive when setting reticent duration from beginning ungratified start frame to duration of present frame when second, stop the present frame target sound data of each sound collection unit collection are stored as recording data.
If this sound pick-up outfit also further can be recorded preliminary data, then also comprise:
Buffer unit 508, be used for the preliminary data that buffer memory is recorded, the recording control module is after definite sound pick-up outfit is activated or suspends recording, also be used for according to the duration for subsequent use of setting, the target sound data that collect in the setting of each sound collection unit before the present frame duration for subsequent use are stored in the buffer unit as preliminary data, and when starting recording, the preliminary data of storing in the buffer unit is dumped in the recording data storage unit as the recording data before starting.
Certainly, if the pattern sound pick-up outfit is far said in special use, then can include only: at least two sound collection unit 501 (5011,5012...501n), recording data storage unit 503, the second decision threshold storage unit 506 and recording control module 504, recording control module 504 is according to far saying pattern recording control method control recording.Special use is far said when the pattern sound pick-up outfit also further can be recorded preliminary data, then be may further include buffer unit 508.
The recording control method that the embodiment of the invention proposes has improved the method that only adopts energy threshold in the present sound control recording, but has far said two kinds according to closely making peace, and uses respectively the target sound detection algorithm that is fit to.Closely saying under the mode judgement foundation that adopts the acoustic ratio conduct between two sound collection unit whether to record; And far saying under the mode, adopt the probability that exists of SNR estimation target sound, so that the recording control technology still has preferably judgment accuracy under low signal-to-noise ratio.Further also propose to record the technology of a period of time preliminary data, guarantee the not data of lose objects sound incipient stage, further improved the recording accuracy.
Obviously, those skilled in the art can carry out various changes and modification to the embodiment of the invention and not break away from the spirit and scope of the present invention.Like this, if of the present invention these are revised and modification belongs within the scope of claim of the present invention and equivalent technologies thereof, then the present invention also is intended to comprise these changes and modification interior.

Claims (10)

1. the recording control method of a sound pick-up outfit, be provided with at least two sound collection unit on the described sound pick-up outfit, it is characterized in that, described sound pick-up outfit has the pattern of closely saying, the described pattern of closely saying is recording distance at the farthest recording distance of setting with interior recording mode, saying that closely under the pattern, the recording control method of described sound pick-up outfit comprises:
Obtain the present frame target sound data that each sound collection unit collects, and determine target sound signal intensity corresponding to each present frame target sound data;
When the ratio of two voice signal intensity wherein greater than for saying that closely first of the first decision threshold that pattern is set imposes a condition when satisfying, the present frame target sound data of each sound collection unit collection are stored as recording data, and described the first decision threshold is determined according to recording distance and the spacing between each sound collection unit of sound pick-up outfit;
Impose a condition when not satisfying when first, judge first imposes a condition whether arrive the reticent duration of setting from beginning ungratified start frame to duration of present frame: impose a condition and arrive when setting reticent duration from beginning ungratified start frame to duration of present frame when described first, suspend recording, stop the present frame target sound data of each sound collection unit collection are stored as recording data; Impose a condition and do not arrive when setting reticent duration from beginning ungratified start frame to duration of present frame when first, continue the present frame target sound data of each sound collection unit collection are stored as recording data.
2. recording control method as claimed in claim 1 is characterized in that, determines that according to recording distance and the spacing between each sound collection unit of described sound pick-up outfit the concrete grammar of the first decision threshold comprises:
Determine that each sound collection unit makes up the I group sound collection unit group of rear formation in twos;
To wherein each organizes sound collection unit group, determine: Z i=(R+d i) 2/ R 2, wherein, 1≤i≤I, wherein: Z iBe minimum acoustic ratio threshold value corresponding to i group sound collection unit group, R is the farthest recording distance of sound pick-up outfit, d iIt is the spacing between two sound collection unit in the i group sound collection unit group;
Determine that described the first decision threshold is: more than or equal to Z 1~Z IMiddle minimum value and less than or equal to peaked arbitrary value wherein.
3. recording control method as claimed in claim 2 is characterized in that, and is described more than or equal to Z 1~Z IMiddle minimum value also less than or equal to peaked arbitrary value wherein is: Z 1~Z IMean value.
4. recording control method as claimed in claim 1 is characterized in that, determines that according to recording distance and the spacing between each sound collection unit of described sound pick-up outfit the concrete grammar of the first decision threshold comprises:
Determine that each sound collection unit makes up the I group sound collection unit group of rear formation in twos;
To wherein each organizes sound collection unit group, determine: B i=(r+d i) 2/ r 2, wherein, 1≤i≤I, wherein:
B iBe conventional acoustic ratio threshold value corresponding to i group sound collection unit group, r is the conventional recording distance of sound pick-up outfit, d iIt is the spacing between two sound collection unit in the i group sound collection unit group;
Determine that B is described B 1~B IMean value;
Determine that described the first decision threshold is greater than 1 value less than B.
5. recording control method as claimed in claim 4 is characterized in that, describedly greater than 1 value less than B is: 1 and the mean value of B.
6. such as claim 2 or 4 described recording control methods, it is characterized in that, describedly determine also to comprise before target sound signal intensity corresponding to each present frame target sound data:
Determine the current recording pattern of described sound pick-up outfit, and when the current recording pattern for recording distance farthest be R closely say pattern after, continue to confirm that described first imposes a condition and whether satisfy.
7. recording control method as claimed in claim 1 is characterized in that:
After definite sound pick-up outfit is activated recording or suspends recording, also comprise: start and record preliminary data, the target sound data of described preliminary data for collecting in the setting of each sound collection unit before the present frame duration for subsequent use; And
During described startup recording, the preliminary data that also will record before is stored as the recording data before the present frame.
8. a sound pick-up outfit comprises at least two sound collection unit, it is characterized in that, described sound pick-up outfit has the pattern of closely saying, the described pattern of closely saying is recording distance at the farthest recording distance of setting with interior recording mode, and described sound pick-up outfit also comprises:
The first threshold storage unit is used for being stored as the first decision threshold of saying that closely pattern is set, and described the first decision threshold is determined according to recording distance and the spacing between each sound collection unit of sound pick-up outfit;
The recording data storage unit is used for the storage recording data;
The recording control module, be used under the pattern of closely saying of described sound pick-up outfit, obtaining described the first decision threshold from described first threshold storage unit, and after receiving the present frame target sound data that each sound collection unit collects, determine the target sound signal intensity that each present frame target sound data is corresponding, and impose a condition when satisfying greater than first of the first decision threshold when the ratio of two voice signal intensity wherein, store the present frame target sound data of each sound collection unit collection into described recording data storage unit; Impose a condition when not satisfying when first, judge first imposes a condition whether arrive the reticent duration of setting from beginning ungratified start frame to duration of present frame: impose a condition and arrive when setting reticent duration from beginning ungratified start frame to duration of present frame when described first, suspend recording, stop the present frame target sound data of each sound collection unit collection are stored as recording data; Impose a condition and do not arrive when setting reticent duration from beginning ungratified start frame to duration of present frame when first, continue the present frame target sound data of each sound collection unit collection are stored as recording data.
9. sound pick-up outfit as claimed in claim 8 is characterized in that, also comprises:
The first decision threshold determining unit is used for determining that described the first decision threshold is more than or equal to Z 1~Z IMiddle minimum value and less than or equal to peaked arbitrary value wherein, wherein, I makes up the quantity of the sound collection unit group of rear formation in twos for each sound collection unit, and the first decision threshold that will determine stores in the first threshold storage unit, wherein Z iAfter any two sound collection unit combination, minimum acoustic ratio threshold value corresponding to i group sound collection unit group, Z i=(R+d i) 2/ R 2, wherein, 1≤i≤I, R are the farthest recording distance of sound pick-up outfit, d iIt is the spacing between two sound collection unit in the i group sound collection unit group; Perhaps
Be used for determining that described the first decision threshold B ' is greater than 1 value less than B, B is B 1~B IBetween arbitrary value, B iAfter any two sound collection unit combination, conventional acoustic ratio threshold value corresponding to i group sound collection unit group, B i=(r+d i) 2/ r 2, wherein, 1≤i≤I, r are the conventional recording distance of sound pick-up outfit, d iIt is the spacing between two sound collection unit in the i group sound collection unit group.
10. sound pick-up outfit as claimed in claim 8 or 9 is characterized in that, also comprises:
Buffer unit, be used for the preliminary data that buffer memory is recorded, described recording control module is after definite sound pick-up outfit is activated recording or suspends recording, also be used for according to the duration for subsequent use of setting, the target sound data that collect in the setting of each sound collection unit before the present frame duration for subsequent use are stored in the described buffer unit as preliminary data, and when starting recording, the preliminary data of storing in the buffer unit is dumped in the described recording data storage unit as the recording data before starting.
CN 200810247079 2008-12-31 2008-12-31 Sound recording control method and sound recording device Expired - Fee Related CN101458944B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200810247079 CN101458944B (en) 2008-12-31 2008-12-31 Sound recording control method and sound recording device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200810247079 CN101458944B (en) 2008-12-31 2008-12-31 Sound recording control method and sound recording device

Related Child Applications (2)

Application Number Title Priority Date Filing Date
CN201110351355.1A Division CN102655009B (en) 2008-12-31 2008-12-31 Voice record controlling method and voice recording device
CN201110351455.4A Division CN102655010B (en) 2008-12-31 2008-12-31 Voice record controlling method and voice recording device

Publications (2)

Publication Number Publication Date
CN101458944A CN101458944A (en) 2009-06-17
CN101458944B true CN101458944B (en) 2013-01-09

Family

ID=40769748

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200810247079 Expired - Fee Related CN101458944B (en) 2008-12-31 2008-12-31 Sound recording control method and sound recording device

Country Status (1)

Country Link
CN (1) CN101458944B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107147972A (en) * 2016-03-01 2017-09-08 卡讯电子股份有限公司 Audio signal output control method and system
CN108962284B (en) * 2018-07-04 2021-06-08 科大讯飞股份有限公司 Voice recording method and device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1140859A (en) * 1995-07-13 1997-01-22 三星电子株式会社 Data recording apparatus and method for semiconductor memory card
CN101025981A (en) * 2007-01-23 2007-08-29 无敌科技(西安)有限公司 Digital recording system and method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1140859A (en) * 1995-07-13 1997-01-22 三星电子株式会社 Data recording apparatus and method for semiconductor memory card
CN101025981A (en) * 2007-01-23 2007-08-29 无敌科技(西安)有限公司 Digital recording system and method

Also Published As

Publication number Publication date
CN101458944A (en) 2009-06-17

Similar Documents

Publication Publication Date Title
CN101458943B (en) Sound recording control method and sound recording device
EP3703052B1 (en) Echo cancellation method and apparatus based on time delay estimation
CN110556103B (en) Audio signal processing method, device, system, equipment and storage medium
CN109767769B (en) Voice recognition method and device, storage medium and air conditioner
US20110099010A1 (en) Multi-channel noise suppression system
US8239194B1 (en) System and method for multi-channel multi-feature speech/noise classification for noise suppression
US20080312918A1 (en) Voice performance evaluation system and method for long-distance voice recognition
CN102819009B (en) Driver sound localization system and method for automobile
CN103165137B (en) Speech enhancement method of microphone array under non-stationary noise environment
CN105872156A (en) Echo time delay tracking method and device
EP2907121B1 (en) Real-time traffic detection
WO2021093808A1 (en) Detection method and apparatus for effective voice signal, and device
US20110099007A1 (en) Noise estimation using an adaptive smoothing factor based on a teager energy ratio in a multi-channel noise suppression system
WO2014177084A1 (en) Voice activation detection method and device
US9026437B2 (en) Location determination system and mobile terminal
US10009477B2 (en) Pure delay estimation
US20130006150A1 (en) Bruxism detection device and bruxism detection method
CN102655010B (en) Voice record controlling method and voice recording device
CN102655009B (en) Voice record controlling method and voice recording device
CN101458944B (en) Sound recording control method and sound recording device
CN107635082A (en) A kind of both-end sounding end detecting system
US20220122592A1 (en) Energy efficient custom deep learning circuits for always-on embedded applications
US20190394578A1 (en) Music classifier and related methods
CN103578478A (en) Method and system for obtaining musical beat information in real time
CN102739286B (en) Echo cancellation method used in communication system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
ASS Succession or assignment of patent right

Owner name: WUXI VIMICRO CORPORATION

Free format text: FORMER OWNER: BEIJING ZHONGXING MICROELECTRONICS CO., LTD.

Effective date: 20110328

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 100083 15/F, SHINING BUILDING, NO. 35, XUEYUAN ROAD, HAIDIAN DISTRICT, BEIJING TO: 214028 (CHUANGYUAN BUILDING), NATIONAL INTEGRATED CIRCUIT DESIGN PARK, NO. 21-1, YANGTES RIVER ROAD, WUXI NEW DISTRICT, JIANGSU PROVINCE

TA01 Transfer of patent application right

Effective date of registration: 20110328

Address after: 214028 national integrated circuit design Park, Changjiang Road, New District, Jiangsu,, Wuxi

Applicant after: Wuxi Vimicro Co., Ltd.

Address before: 100083, Haidian District, Xueyuan Road, Beijing No. 35, Nanjing Ning building, 15 Floor

Applicant before: Beijing Vimicro Corporation

C14 Grant of patent or utility model
GR01 Patent grant
CP03 Change of name, title or address

Address after: 214135 Taihu International Science Park Sensor Network University Science Park 530 Building A1001, No. 18 Qingyuan Road, Wuxi City, Jiangsu Province

Patentee after: WUXI ZHONGGAN MICROELECTRONIC CO., LTD.

Address before: 214028 National Integrated Circuit Design Park 21-1 Changjiang Road, New District, Wuxi City, Jiangsu Province (Chuangyuan Building)

Patentee before: Wuxi Vimicro Co., Ltd.

CP03 Change of name, title or address
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20130109

Termination date: 20191231