CN104811559B

CN104811559B - Noise-reduction method, communication means and mobile terminal

Info

Publication number: CN104811559B
Application number: CN201510223926.1A
Authority: CN
Inventors: 戴佑俊; 蒋伟鹏
Original assignee: Shanghai Green Orange Industry Co Ltd
Current assignee: Shanghai Green Orange Industry Co Ltd
Priority date: 2015-05-05
Filing date: 2015-05-05
Publication date: 2018-11-20
Anticipated expiration: 2035-05-05
Also published as: CN104811559A

Abstract

The invention discloses a kind of noise-reduction method, communication means and mobile terminal, the noise-reduction method is used for mobile terminal, and the mobile terminal includes an at least microphone, the voiceprint of the mobile terminal prestored user, and the noise-reduction method includes：Voice data is received by the microphone, wherein the voice data includes voice data and background sound；The voice data of the user is obtained from the voice data by the voiceprint.Noise-reduction method, communication means and mobile terminal of the invention can be realized noise when reducing mobile phone communication using single microphone, and effectively save cost simultaneously simplifies mobile phone hardware structure.

Description

Noise-reduction method, communication means and mobile terminal

Technical field

The present invention relates to a kind of noise-reduction method, communication means and mobile terminals.

Background technique

As social development and the progress of economy, mobile phone have become tool indispensable in people's daily life, hand Machine guarantees that other side can not be by noise effect when call under noisy environment, it will usually carry out noise reduction process.

In the prior art, noise-reduction method is double mic (microphone) noise-reduction methods, and this method can use two microphones, Extra microphone not only will increase hardware cost but also the design of meeting image complete machine mechanism, layout, keep handset structure relatively multiple It is miscellaneous.And the voice data handled using double mic noise-reduction methods would generally be by signal enhanced processing, so that voice signal It is easy distortion, voice result of broadcast is poor.

Summary of the invention

The technical problem to be solved by the present invention is in order to overcome in the prior art mobile phone noise-reduction method keep handset structure complicated, The defect of hardware cost height and effect difference, providing a kind of single microphone of utilization can be realized as noise reduction and the better noise reduction of effect Method, communication means and mobile terminal.

The present invention is to solve above-mentioned technical problem by following technical proposals：

A kind of noise-reduction method is used for mobile terminal, and the mobile terminal includes an at least microphone, it is characterized in that, institute The voiceprint of mobile terminal prestored user is stated, the noise-reduction method includes：

Voice data is received by the microphone, wherein the voice data includes voice data and background sound；

The voice data of the user is obtained from the voice data by the voiceprint.

Vocal print (voiceprint) is the sound wave spectrum for the carrying verbal information that electricity consumption acoustic instrument is shown.Human language Generation is a complicated physiology physical process between Body Languages maincenter and vocal organs, the acoustical generator that people uses in speech Official：Tongue, tooth, larynx, lung, nasal cavity everyone widely different in terms of size and form, so the vocal print of any two people Map is all variant.

Application on Voiceprint Recognition (Voiceprint Recognition) has been applied in every field, including identity validation, Criminal investigation work etc., Application on Voiceprint Recognition include that speaker recognizes (Speaker Identification), and in criminal investigation field, speaker is distinguished Recognizing can accomplish to judge in more people's conversational speech that certain section of dialogue is described in which.

The present invention is extracted from one section of voice data comprising voice data and background sound using sound groove recognition technology in e Voice data extracts, and excludes background sound, and the voice data of acquisition is clear, and distortion level is low.

Noise-reduction method of the invention has a wide range of application, including voice data is used to make a phone call, sends voice SMS, society The language of software is handed over to chat and voice memo etc..

Preferably, the mobile terminal includes a display screen, include before receiving voice data：

A text information is shown on the display screen, and receives the voice data that user reads aloud the text information；

From identification set of the extraction sound characteristic to be formed in the voiceprint in the voice data of the text information；

Wherein, the mobile terminal passes through the voice data in the identification set identification voice data, the text envelope Breath includes dialogue everyday expressions.

The task of feature extraction is to extract and select have the characteristics such as separability is strong, stability is high to the vocal print of speaker Acoustics or language feature.Different from speech recognition, the feature of Application on Voiceprint Recognition must be " personalization " feature, and Speaker Identification Feature must be " common feature " for speaker.Although major part Voiceprint Recognition System is all acoustics level at present Feature, but the feature for characterizing a personal touch should be multifaceted, including：With the anatomical structure of the pronunciation mechanism of the mankind Related acoustic feature such as frequency spectrum, cepstrum, formant, fundamental tone, reflection coefficient etc., nasal sound, band deep breathing sound, hoarse sound, Laugh etc..From the angle that can be modeled using mathematical method, feature packet that vocal print automatic identification model can be used at present It includes：Acoustic feature, lexical characteristics, prosodic features, languages, dialect and accent information, channel information etc..

Dialogue everyday expressions may include number, greet the every-day languages frequencies of occurrences such as term, doubt statement, case statement Higher word or sentence.

Preferably, the voiceprint further includes a background set,

The sound characteristic of the background sound of common call scene is preset in the background set；

And/or

The mobile terminal receives voice data, and extracts the sound characteristic of background sound from the voice data with shape At the background set；

The mobile terminal identifies the voice data in voice data by the identification set and background set.

When identification, which is integrated into, can not identify voice data in identification, background set identification background sound can use simultaneously To obtain voice data after background sound is eliminated.Common call scene includes the ambient sound of the ambient sound at station, office Sound and the ambient sound of school etc..

Using the present invention, user can acquire sound in operating position, interchange of position with self-setting background set, user Sound characteristic in the voice data is set as using after background set is used for by data.

Preferably, the voice data is divided into several sound sections according to preset duration, the noise-reduction method includes：

Judge whether voice data is obtained by identification set in current sound section, if then utilizing the drop The next sound section of method for de-noising processing, if otherwise by the background sound in background set removal current sound section with It obtains the voice data and handles next sound section using the noise-reduction method.

Next sound section refers to that processing obtains next sound section of the sound section of voice data recently, When handling " next sound section ", " next sound section " is considered as current sound section, at the noise-reduction method Reason, which refers to, to be judged whether to obtain voice data by identification set in next sound section and be executed according to judging result Follow-up process.The present invention is handled as unit of single sound section, the sound section handled for one, first with Identification set removes background sound using background set if it can not identify to obtain voice data to obtain voice number According to according to the next sound section of timing sequence process until voice data all complete by processing after the completion of a sound section processing.

Preferably, statistics obtains the number of the sound section of voice data by the background set, if the number is big Then pass through double mic noise-reduction methods in one first preset value and handles the voice data in remaining sound section to obtain voice number According to.

If the number is greater than, the first preset value is likely to occur user's flu sound change or other people use the mobile phone, At this moment it will receive influence using the effect of Application on Voiceprint Recognition, thus it is possible to vary improve noise reduction effect at using double mic noise-reduction methods.

Judge whether voice data is obtained by identification set in current sound section, if then utilizing the drop The next sound section of method for de-noising processing, if otherwise handling the voice data in current sound section by double mic noise-reduction methods To obtain voice data and handle next sound section using the noise-reduction method.

If made a phone call in strange background environment, using background set, the effect is unsatisfactory, thus it is possible to vary Cheng Li Noise reduction effect is improved with double mic noise-reduction methods.

Preferably, the noise-reduction method includes：

The number that the unused identification set obtains the sound section of voice data is counted, if the number is greater than one second Preset value then verifies the identity of the user after the completion of receiving voice data, if the non-rule starting of the identity is described mobile whole The anti-theft modes at end.

Application on Voiceprint Recognition itself can be used as the means of verifying identity, and the application judges that mobile terminal is using Application on Voiceprint Recognition It is no to fall into his manpower the safety for improving mobile phone.Anti-theft modes can mobile phone can not unlock, anti-brush machine mode opens for locking Dynamic, transmission location information etc..

The present invention also provides a kind of communication means, are used for mobile terminal, it is characterized in that, the communication means includes：

In telephone relation or when sending voice messaging, judge in the voice data received whether include legitimate user language Then sound sends the voice data if being then voice data by voice data translation using noise-reduction method as described above.

The mode for sending the voice data include make a phone call, send voice SMS, social software language chat and language Sound memorandum etc..

The present invention provides a kind of mobile terminal again, and the mobile terminal includes an at least microphone and a display screen, Feature is that the voiceprint of the mobile terminal prestored user, the mobile terminal further includes a receiving module, an acquisition mould Block and a processing module,

The display screen is for showing a text information；

The receiving module is used to receive voice data by microphone, and the voice data includes that user reads aloud the text The voice data of this information；

The processing module is used to extract sound characteristic from the voice data of the text information to form the vocal print Identification set in information, and the sound characteristic for extracting background sound from voice data is to form the background set；

The module that obtains is used to obtain the voice data in voice data by the identification set and background set；

Wherein, the text information includes dialogue everyday expressions, and the back of common call scene is preset in the background set The sound characteristic of scape sound.

Preferably, the mobile terminal further includes a judgment module and a statistical module,

The processing module is also used to the voice data being divided into several sound sections according to preset duration；

The judgment module gathers whether obtain voice data by the identification in current sound section for judging, If then handling next sound section using the noise-reduction method, if otherwise removing current sound area by the background set Interior background sound is to obtain the voice data；

The statistical module is used to count the number for the sound section that voice data is obtained by the background set, if institute It states number and passes through the voice data in double mic noise-reduction method processing current sound section then greater than one first preset value to obtain language Sound data.

On the basis of common knowledge of the art, above-mentioned each optimum condition, can any combination to get each preferable reality of the present invention Example.

The positive effect of the present invention is that：Noise-reduction method, communication means and mobile terminal of the invention can be realized Noise when mobile phone communication is reduced using single microphone, effectively save cost simultaneously simplifies mobile phone hardware structure.

Detailed description of the invention

Fig. 1 is the structural schematic diagram of the mobile phone of the embodiment of the present invention 1.

Fig. 2 is the flow chart of the noise-reduction method of the embodiment of the present invention 1.

Fig. 3 is the flow chart of the noise-reduction method of the embodiment of the present invention 3.

Specific embodiment

The present invention is further illustrated below by the mode of embodiment, but does not therefore limit the present invention to the reality It applies among a range.

Embodiment 1

Referring to Fig. 1, the present embodiment provides a mobile phone 1, the mobile phone includes a microphone 11 and a display screen 12, The voiceprint of the mobile phone prestored user, the mobile phone further include that a receiving module 13, one obtains module 14, a judgment module 15, a statistical module 16 and a processing module 17.

The display screen is for showing a text information, wherein the text information includes dialogue everyday expressions.

The receiving module is used to receive voice data by microphone, and the voice data includes that user reads aloud the text The voice data of this information.

The processing module is used to extract sound characteristic from the voice data of the text information to form the vocal print Identification set in information, and the sound characteristic for extracting background sound from voice data is to form the background set.

The module that obtains is used to obtain the voice data in voice data by the identification set and background set.

The sound characteristic of the background sound of common call scene is preset in the background set.

The sound characteristic of common call scene is preset in the background set of the present embodiment, in addition user can also be voluntarily Acquire the sound characteristic of background sound.

The processing module is also used to the voice data being divided into several sound sections according to preset duration (two seconds).

The statistical module is used to count the number for the sound section that voice data is obtained by the background set, if institute The identity that number verifies the user greater than 50 after the completion of receiving voice data is stated, if described in the non-rule starting of the identity The anti-theft modes of mobile phone.

Referring to fig. 2, using above-mentioned mobile phone, the present embodiment also provides a kind of noise-reduction method, and the noise-reduction method includes：

Step 100 shows a text information on the display screen, and by the microphone receive user read aloud it is described The voice data of text information.

The voice data includes voice data and background sound.

Step 101, from the voice data of the text information extract sound characteristic to be formed in the voiceprint Identification set, and the sound characteristic of background sound is extracted from the voice data to form the background set of voiceprint.

The mobile phone is first carrying out Initialize installation, that is, the voiceprint and background sound of acquisition user using preceding Sound characteristic, the acquisition of sound characteristic in the acquisition and background set of the sound characteristic in identification set can be separated It carries out, for example, when reading aloud the text information, only acquisition identifies the sound characteristic in set to user, user will after the completion of acquisition Mobile phone takes job site to acquire the sound characteristic in background set, and the present embodiment to acquire identification set and background collection simultaneously For the sound characteristic of conjunction, achieve the purpose that quickly to form voiceprint.

The sound characteristic of the background sound of common call scene is preset in the background set of the present embodiment.

Step 102 dials and carries out mobile phone communication, then receives voice data of the user for call.

The voice data is divided into several sound sections according to preset duration (two seconds) by step 103, utilizes the knowledge The voice data in set identification voice data Ji He not obtained.

Step 104 judges whether obtain voice data by identification set in current sound section, if then holding Row step 106, thens follow the steps 105 if not.

Step 105 removes the background sound in current sound section by the background set to obtain the voice number According to.

Step 106, judge it is nearest one acquisition voice data sound section whether be the voice data last A sound section, if then terminating process, if otherwise obtaining nearest one next sound of the sound section of voice data Section is as current sound section and then return step 104.

The noise-reduction method further includes：

Count it is unused it is described identification set obtain voice data sound section number, if the number be greater than 50 if The identity that voice data verifies the user after the completion is received, if the non-rule of the identity starts the anti-theft modes of the mobile phone.

Using above-mentioned noise-reduction method, the present embodiment also provides a kind of communication means, and the communication means includes：

Step 102 include in the voice data that receives of judgement whether include legitimate user voice, if then by voice number According to the receiving end for being sent to call.Concrete mode is that every processing acquisition voice data for completing a sound section just sends one It is a.

The noise-reduction method, communication means and mobile phone of the present embodiment can be realized when reducing mobile phone communication using single microphone Noise, effectively save cost simultaneously simplify mobile phone hardware structure.

Embodiment 2

The present embodiment is substantially the same manner as Example 1, the difference is that only：

The mobile phone includes two microphones.

The noise-reduction method includes：

Statistics obtains the number of the sound section of voice data by the background set, leads to if the number is greater than 10 It crosses double mic noise-reduction methods and handles the voice data in remaining sound section to obtain voice data.

The noise-reduction method of the present embodiment realizes noise reduction using two ways, further ensures that speech quality.

Embodiment 3

Referring to Fig. 3, the present embodiment is substantially the same manner as Example 1, the difference is that only that step 105 replaces with：

Step 105 handles the voice data in current sound section by double mic noise-reduction methods to obtain voice data.

Although specific embodiments of the present invention have been described above, it will be appreciated by those of skill in the art that these It is merely illustrative of, protection scope of the present invention is defined by the appended claims.Those skilled in the art is not carrying on the back Under the premise of from the principle and substance of the present invention, many changes and modifications may be made, but these are changed Protection scope of the present invention is each fallen with modification.

Claims

1. a kind of noise-reduction method, is used for mobile terminal, the mobile terminal includes an at least microphone, which is characterized in that described The voiceprint of mobile terminal prestored user, the voiceprint include identification set and background set, the noise-reduction method packet It includes：

The voice data of the user is obtained from the voice data by the voiceprint,

The voice data is divided into several sound sections according to preset duration, the noise-reduction method further includes：

Judge whether voice data is obtained by identification set in current sound section, if then utilizing the noise reduction side The next sound section of method processing, if otherwise removing the background sound in current sound section by the background set to obtain The voice data simultaneously handles next sound section using the noise-reduction method, wherein passes through the sound of the preparatory typing of user The sound characteristic that voice is extracted in data obtains the identification set in the voiceprint, passes through the background extracted in voice data The sound characteristic of sound is to form the background set.

2. noise-reduction method as described in claim 1, which is characterized in that the mobile terminal includes a display screen, receives sound Include before data：

Wherein, the mobile terminal passes through the voice data in the identification set identification voice data, the text information packet Include dialogue everyday expressions.

3. noise-reduction method as claimed in claim 2, which is characterized in that the voiceprint further includes a background set,

And/or

The mobile terminal receives voice data, and extracts the sound characteristic of background sound from the voice data to be formed State background set；

4. noise-reduction method as described in claim 1, which is characterized in that statistics obtains voice data by the background set The number of sound section passes through dual microphone noise-reduction method if the number is greater than one first preset value and handles remaining sound Voice data in section is to obtain voice data.

5. noise-reduction method as claimed in claim 2, which is characterized in that if the voice data is divided into according to preset duration Dry sound section, the noise-reduction method include：

Judge whether voice data is obtained by identification set in current sound section, if then utilizing the noise reduction side The next sound section of method processing, if otherwise by dual microphone noise-reduction method handle the voice data in current sound section with It obtains voice data and handles next sound section using the noise-reduction method.

6. noise-reduction method as described in claim 4 or 5, which is characterized in that the noise-reduction method includes：

The number that the unused identification set obtains the sound section of voice data is counted, is preset if the number is greater than one second Value then verifies the identity of the user after the completion of receiving voice data, if the non-rule of the identity starts the mobile terminal Anti-theft modes.

7. a kind of communication means, it to be used for mobile terminal, which is characterized in that the communication means includes：

In telephone relation or when sending voice messaging, judge in the voice data received whether include legitimate user voice, if Be then using as described in any one of claim 1 to 6 noise-reduction method by voice data translation be voice data, then send out Send the voice data.

8. a kind of mobile terminal, the mobile terminal includes an at least microphone and a display screen, which is characterized in that the shifting The voiceprint of dynamic terminal prestored user, the mobile terminal further include a receiving module, an acquisition module and a processing mould Block,

The display screen is for showing a text information；

The receiving module is used to receive voice data by microphone, and the voice data includes that user reads aloud the text envelope The voice data of breath；

The processing module is used to extract sound characteristic from the voice data of the text information to form the voiceprint In identification set, and for from voice data extract background sound sound characteristic to form background set；

Wherein, the text information includes dialogue everyday expressions, and the background sound of common call scene is preset in the background set The sound characteristic of sound.