CN102568473A

CN102568473A - Method and device for recording voice signals

Info

Publication number: CN102568473A
Application number: CN2011104540838A
Authority: CN
Inventors: 邵颖; 张然; 刘湘洲
Original assignee: SHENZHEN VCYBER TECHNOLOGY Co Ltd
Current assignee: SHENZHEN VCYBER TECHNOLOGY Co Ltd
Priority date: 2011-12-30
Filing date: 2011-12-30
Publication date: 2012-07-11

Abstract

The invention discloses a method and a device for recording voice signals and relates to the field of speech recognition. The method and the device address the problem of noise interference during voice signal recording. The method comprises: when a user starts the voice recording function, acquiring the currently input voice signal according to a preset first time window length; calculating the average energy value of the voice signal; comparing the average energy value of the voice signal with a preset average energy value of environmental noise to obtain a first comparison result; and determining, according to the first comparison result, whether to record the voice signal. The technical solution provided by the embodiments can be applied in a speech recognition system.

Description

Method and device for recording voice signals

Technical field

The present invention relates to the field of speech recognition, and in particular to a method and a device for recording voice signals.

Background art

With the development of intelligent technology, people are no longer satisfied with interacting with devices through mice, buttons and similar means; they want to interact with devices by voice and thereby control them by voice. Speech recognition, one of the core technologies of voice interaction, is steadily maturing and is gradually being applied in fields such as information processing, education, business applications and consumer electronics.

An important step in speech recognition is recording the voice signal uttered by the user; the speech recognition system then applies a speech recognition algorithm to the recorded audio signal. In the prior art, the microphone starts recording automatically as soon as the user starts the speech recognition function. In practice, however, the user does not necessarily speak immediately: there is usually an idle interval between starting the speech recognition function and uttering the voice signal. During this interval the microphone records the noise of the surrounding environment, and this noise reduces the recognition accuracy of the speech recognition system.

Summary of the invention

Embodiments of the present invention provide a method and a device for recording voice signals that can improve the accuracy of speech recognition.

In one aspect, a method for recording a voice signal is provided, comprising: after a user starts the voice recording function, acquiring the currently input voice signal according to a preset first time window length; calculating the average energy value of the voice signal; comparing the average energy value of the voice signal with a preset average energy value of environmental noise to obtain a first comparison result; and determining, according to the first comparison result, whether to record the voice signal.

In another aspect, a device for recording a voice signal is provided, comprising:

a first acquiring unit, configured to acquire, after a user starts the voice recording function, the currently input voice signal according to a preset first time window length;

a first computing unit, configured to calculate the average energy value of the voice signal acquired by the first acquiring unit;

a first comparing unit, configured to compare the average energy value of the voice signal obtained by the first computing unit with a preset average energy value of environmental noise to obtain a first comparison result;

a determining unit, configured to determine, according to the first comparison result obtained by the first comparing unit, whether to record the voice signal.

The method and device for recording voice signals provided by the embodiments of the present invention determine whether to record a voice signal according to the result of comparing the average energy value of the environmental noise with the average energy value of the voice signal, which avoids recording pure environmental noise. Because the technical solution takes into account the influence of environmental noise on voice signal recording, the recorded voice signal is more accurate, which improves the accuracy of speech recognition performed on it. In addition, the storage and communication resources consumed by voice signal recording are reduced.

Brief description of the drawings

To describe the technical solutions of the embodiments of the present invention or of the prior art more clearly, the accompanying drawings used in describing the embodiments or the prior art are briefly introduced below. Evidently, the drawings described below show only some embodiments of the present invention, and a person of ordinary skill in the art can derive other drawings from them without creative effort.

Fig. 1 is a first flowchart of the method for recording a voice signal provided by an embodiment of the present invention;

Fig. 2 is a second flowchart of the method for recording a voice signal provided by an embodiment of the present invention;

Fig. 3 is a first schematic structural diagram of the device for recording a voice signal provided by an embodiment of the present invention;

Fig. 4 is a second schematic structural diagram of the device for recording a voice signal provided by an embodiment of the present invention;

Fig. 5 is a third schematic structural diagram of the device for recording a voice signal provided by an embodiment of the present invention;

Fig. 6 is a fourth schematic structural diagram of the device for recording a voice signal provided by an embodiment of the present invention.

Detailed description of the embodiments

The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. Evidently, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on these embodiments without creative effort fall within the protection scope of the present invention.

To solve the problem of noise during voice signal recording, embodiments of the present invention provide a method and a device for recording voice signals.

As shown in Fig. 1, the method for recording a voice signal provided by an embodiment of the present invention comprises:

Step 101: after the user starts the voice recording function, acquire the currently input voice signal according to a preset first time window length.

This embodiment does not limit the first time window length; in practice, the user can set it as required, and details are not described here.

Step 102: calculate the average energy value of the voice signal.

Step 103: compare the average energy value of the voice signal with the preset average energy value of environmental noise to obtain a first comparison result.

In this embodiment, the average energy value of the environmental noise may be stored in advance. For example, the environment may be classified into different states, such as quiet, normal or noisy, and a different average noise energy value set for each state: 20 dB for the quiet state, 30 dB for the normal state, 50 dB for the noisy state, and so on.
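The pre-stored option described above can be pictured as a simple lookup table. The following is only an illustrative sketch: the names `PRESET_NOISE_DB` and `preset_noise_energy_db` are hypothetical, and only the three example values (20 dB, 30 dB, 50 dB) come from the text.

```python
# Hypothetical lookup table; the dB values are the examples given in the embodiment.
PRESET_NOISE_DB = {
    "quiet": 20.0,
    "normal": 30.0,
    "noisy": 50.0,
}


def preset_noise_energy_db(state: str) -> float:
    """Return the pre-stored average noise energy (in dB) for an environment state."""
    try:
        return PRESET_NOISE_DB[state]
    except KeyError:
        raise ValueError(f"unknown environment state: {state!r}") from None
```

A caller would select the state that matches the current environment, for example `preset_noise_energy_db("noisy")`, and pass the result to the comparison in step 103.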

The average energy value of the environmental noise may also be obtained by real-time detection as the environment changes. In that case, before step 103, the method may further comprise a step of acquiring the average energy value of the current environmental noise according to a preset second time window length. It should be noted that, as with the first time window length, the embodiments of the present invention do not limit the second time window length; the user can set it as required, and details are not described here.

Optionally, if the average energy value of the environmental noise is obtained by real-time detection, the method for recording a voice signal provided by the embodiments of the present invention may further comprise a step of storing the average energy value of the current environmental noise, for convenient later use and to save the resources consumed by detection.
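The real-time option, together with the optional storing step, might look like the sketch below. The embodiment does not say how the average energy value is computed, so this sketch assumes the mean squared amplitude of the samples in the window, expressed in dB relative to unit amplitude; values on this scale are not directly comparable with the 20/30/50 dB examples unless the input is calibrated. The names `average_energy_db`, `NoiseEstimator`, `second_window_len` and `stored_db` are hypothetical, and the window length is assumed to be given in samples.

```python
import math
from typing import Optional, Sequence


def average_energy_db(samples: Sequence[float], floor: float = 1e-12) -> float:
    """Mean squared amplitude of a window in dB (assumed definition of average energy)."""
    if len(samples) == 0:
        raise ValueError("window must contain at least one sample")
    mean_square = sum(s * s for s in samples) / len(samples)
    # The floor avoids log10(0) for an all-zero window.
    return 10.0 * math.log10(max(mean_square, floor))


class NoiseEstimator:
    """Measures noise over the second time window and stores the result for reuse."""

    def __init__(self, second_window_len: int) -> None:
        if second_window_len <= 0:
            raise ValueError("second_window_len must be positive")
        self.second_window_len = second_window_len  # in samples (assumption)
        self.stored_db: Optional[float] = None  # last stored noise level, if any

    def measure(self, samples: Sequence[float]) -> float:
        """Estimate noise from the first `second_window_len` samples and store it."""
        window = samples[: self.second_window_len]
        self.stored_db = average_energy_db(window)
        return self.stored_db
```

Storing the measured level in `stored_db` lets later comparisons reuse it instead of measuring the noise again, which is the resource saving the optional step describes.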

Of course, these are only two specific examples; in practice, the average energy value of the environmental noise can also be obtained in other ways, which are not enumerated here.

Step 104: determine, according to the first comparison result, whether to record the voice signal.

Specifically, if the first comparison result is that the average energy value of the voice signal is greater than the preset average energy value of the environmental noise, the voice signal is recorded; otherwise, the voice signal is not recorded.
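Steps 102 to 104 can be sketched for a single window as follows. This is a minimal illustration, not the patented implementation: the average energy is assumed to be the mean squared amplitude in dB relative to unit amplitude (the text gives no formula), and the function names `average_energy_db` and `should_record` are hypothetical.

```python
import math
from typing import Sequence


def average_energy_db(samples: Sequence[float], floor: float = 1e-12) -> float:
    """Step 102: mean squared amplitude of the window in dB (assumed definition)."""
    if len(samples) == 0:
        raise ValueError("window must contain at least one sample")
    mean_square = sum(s * s for s in samples) / len(samples)
    return 10.0 * math.log10(max(mean_square, floor))  # floor avoids log10(0)


def should_record(window: Sequence[float], noise_energy_db: float) -> bool:
    """Steps 103 and 104: record only if the window is strictly louder than the noise."""
    return average_energy_db(window) > noise_energy_db
```

The comparison is strict, matching the wording "greater than": a window whose average energy merely equals the noise level is not recorded.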

The method for recording a voice signal provided by the embodiments of the present invention determines whether to record a voice signal according to the result of comparing the average energy value of the environmental noise with the average energy value of the voice signal, which avoids recording pure environmental noise. Because the technical solution takes into account the influence of environmental noise on voice signal recording, the recorded voice signal is more accurate, which improves the accuracy of speech recognition performed on it. In addition, the storage and communication resources consumed by voice signal recording are reduced.

To avoid the influence of transient noise on voice signal recording, another embodiment of the present invention, shown in Fig. 2, provides a further method for recording a voice signal. This method is essentially the same as the one shown in Fig. 1, except that after step 101 it further comprises:

Step 105: buffer the voice signal.

In this embodiment, step 105 is located after step 101 and before step 102; in practice, step 105 may also be placed elsewhere, and details are not described here.

Step 106: if the first comparison result is that the average energy value of the voice signal is greater than the preset average energy value of the environmental noise, acquire, according to the first time window length, the next segment of voice signal adjacent to the voice signal.

Step 107: calculate the average energy value of the next segment of voice signal.

Step 108: compare the average energy value of the next segment of voice signal with the preset average energy value of the environmental noise to obtain a second comparison result.

In this case, step 104 is replaced with: determine, according to the first comparison result and the second comparison result, whether to record the voice signal and the next segment of voice signal.
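The two-window variant of Fig. 2 can be sketched as below. The text does not state how the two comparison results are combined, so this sketch assumes that both windows are recorded only when both exceed the noise level, and that a loud first window followed by a quiet one is treated as transient noise. The average energy is again assumed to be the mean squared amplitude in dB, and the names `average_energy_db` and `gate_two_windows` are hypothetical.

```python
import math
from typing import List, Sequence


def average_energy_db(samples: Sequence[float], floor: float = 1e-12) -> float:
    """Mean squared amplitude of a window in dB (assumed definition of average energy)."""
    if len(samples) == 0:
        raise ValueError("window must contain at least one sample")
    mean_square = sum(s * s for s in samples) / len(samples)
    return 10.0 * math.log10(max(mean_square, floor))  # floor avoids log10(0)


def gate_two_windows(
    first: Sequence[float], second: Sequence[float], noise_db: float
) -> List[float]:
    """Return the samples to record: both windows, or nothing."""
    buffered = list(first)  # step 105: buffer the first window
    if average_energy_db(buffered) <= noise_db:  # first comparison result
        return []
    # Steps 106 to 108: evaluate the adjacent next window against the same noise level.
    if average_energy_db(second) <= noise_db:  # second comparison result
        return []  # assumption: a single loud window is transient noise
    return buffered + list(second)
```

Buffering the first window is what makes this work: the decision is deferred until the second comparison is available, yet the start of the utterance is not lost when both windows turn out to contain speech.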

As shown in Fig. 3, an embodiment of the present invention further provides a device for recording a voice signal, comprising:

a first acquiring unit 301, configured to acquire, after the user starts the voice recording function, the currently input voice signal according to a preset first time window length;

a first computing unit 302, configured to calculate the average energy value of the voice signal acquired by the first acquiring unit 301;

a first comparing unit 303, configured to compare the average energy value of the voice signal obtained by the first computing unit 302 with the preset average energy value of environmental noise to obtain a first comparison result;

a determining unit 304, configured to determine, according to the first comparison result obtained by the first comparing unit 303, whether to record the voice signal.

Further, as shown in Fig. 4, the device for recording a voice signal provided by the embodiments of the present invention may further comprise:

a second acquiring unit 305, configured to acquire the average energy value of the current environmental noise according to a preset second time window length;

in which case the first comparing unit 303 may further be configured to compare the average energy value of the voice signal obtained by the first computing unit 302 with the average energy value of the current environmental noise acquired by the second acquiring unit 305 to obtain the first comparison result.

Further, as shown in Fig. 5, the device for recording a voice signal provided by the embodiments of the present invention may further comprise:

a storage unit 306, configured to store the average energy value of the current environmental noise acquired by the second acquiring unit 305;

in which case the first comparing unit 303 may further be configured to compare the average energy value of the voice signal obtained by the first computing unit 302 with the average energy value of the current environmental noise stored by the storage unit 306 to obtain the first comparison result.

Further, as shown in Fig. 6, the device for recording a voice signal provided by the embodiments of the present invention may further comprise:

a buffer unit 307, configured to buffer the voice signal acquired by the first acquiring unit 301;

a third acquiring unit 308, configured to acquire, according to the first time window length, the next segment of voice signal adjacent to the voice signal if the comparison result obtained by the first comparing unit 303 is that the average energy value of the voice signal is greater than the preset average energy value of the environmental noise;

a second computing unit 309, configured to calculate the average energy value of the next segment of voice signal acquired by the third acquiring unit 308;

a second comparing unit 310, configured to compare the average energy value obtained by the second computing unit 309 with the preset average energy value of the environmental noise to obtain a second comparison result;

in which case the determining unit 304 may further be configured to determine, according to the first comparison result obtained by the first comparing unit 303 and the second comparison result obtained by the second comparing unit 310, whether to record the voice signal buffered by the buffer unit 307 and the voice signal acquired by the third acquiring unit 308.

It should be noted that, for the specific implementation of the device for recording a voice signal provided by the embodiments of the present invention, reference may be made to the method for recording a voice signal provided by the embodiments of the present invention; details are not repeated here.

The device for recording a voice signal provided by the embodiments of the present invention determines whether to record a voice signal according to the result of comparing the average energy value of the environmental noise with the average energy value of the voice signal, which avoids recording pure environmental noise. Because the technical solution takes into account the influence of environmental noise on voice signal recording, the recorded voice signal is more accurate, which improves the accuracy of speech recognition performed on it. In addition, the storage and communication resources consumed by voice signal recording are reduced.

The method and device for recording a voice signal provided by the embodiments of the present invention can be applied in a speech recognition system.

The foregoing is merely a specific embodiment of the present invention, but the protection scope of the present invention is not limited thereto. Any variation or replacement readily conceivable by a person skilled in the art within the technical scope disclosed by the present invention shall fall within the protection scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1. A method for recording a voice signal, characterized by comprising:

after a user starts the voice recording function, acquiring the currently input voice signal according to a preset first time window length;

calculating the average energy value of the voice signal;

comparing the average energy value of the voice signal with a preset average energy value of environmental noise to obtain a first comparison result;

determining, according to the first comparison result, whether to record the voice signal.

2. The method according to claim 1, characterized by further comprising:

acquiring the average energy value of the current environmental noise according to a preset second time window length;

wherein comparing the average energy value of the voice signal with the preset average energy value of environmental noise to obtain the first comparison result is:

comparing the average energy value of the voice signal with the average energy value of the current environmental noise to obtain the first comparison result.

3. The method according to claim 2, characterized by further comprising:

storing the average energy value of the current environmental noise.

4. The method according to claim 1, characterized in that determining, according to the first comparison result, whether to record the voice signal comprises:

if the first comparison result is that the average energy value of the voice signal is greater than the preset average energy value of the environmental noise, recording the voice signal;

otherwise, not recording the voice signal.

5. The method according to claim 1, characterized by further comprising:

buffering the voice signal;

if the first comparison result is that the average energy value of the voice signal is greater than the preset average energy value of the environmental noise, acquiring, according to the first time window length, the next segment of voice signal adjacent to the voice signal;

calculating the average energy value of the next segment of voice signal;

comparing the average energy value of the next segment of voice signal with the preset average energy value of the environmental noise to obtain a second comparison result;

wherein determining, according to the first comparison result, whether to record the voice signal is replaced with:

determining, according to the first comparison result and the second comparison result, whether to record the voice signal and the next segment of voice signal.

6. A device for recording a voice signal, characterized by comprising:

7. The device according to claim 6, characterized by further comprising:

a second acquiring unit, configured to acquire the average energy value of the current environmental noise according to a preset second time window length;

wherein the first comparing unit is further configured to compare the average energy value of the voice signal obtained by the first computing unit with the average energy value of the current environmental noise acquired by the second acquiring unit to obtain the first comparison result.

8. The device according to claim 7, characterized by further comprising:

a storage unit, configured to store the average energy value of the current environmental noise acquired by the second acquiring unit;

wherein the first comparing unit is further configured to compare the average energy value of the voice signal obtained by the first computing unit with the average energy value of the current environmental noise stored by the storage unit to obtain the first comparison result.

9. The device according to claim 6, characterized by further comprising:

a buffer unit, configured to buffer the voice signal acquired by the first acquiring unit;

a third acquiring unit, configured to acquire, according to the first time window length, the next segment of voice signal adjacent to the voice signal if the comparison result obtained by the first comparing unit is that the average energy value of the voice signal is greater than the preset average energy value of the environmental noise;

a second computing unit, configured to calculate the average energy value of the next segment of voice signal acquired by the third acquiring unit;

a second comparing unit, configured to compare the average energy value obtained by the second computing unit with the preset average energy value of the environmental noise to obtain a second comparison result;

wherein the determining unit is further configured to determine, according to the first comparison result obtained by the first comparing unit and the second comparison result obtained by the second comparing unit, whether to record the voice signal buffered by the buffer unit and the voice signal acquired by the third acquiring unit.