CN104575509A - Voice enhancement processing method and device - Google Patents

Voice enhancement processing method and device Download PDF

Info

Publication number
CN104575509A
CN104575509A CN201410834628.1A CN201410834628A CN104575509A CN 104575509 A CN104575509 A CN 104575509A CN 201410834628 A CN201410834628 A CN 201410834628A CN 104575509 A CN104575509 A CN 104575509A
Authority
CN
China
Prior art keywords
cement
voice
speech enhan
terminal
voice enhancement
Prior art date
Application number
CN201410834628.1A
Other languages
Chinese (zh)
Inventor
赵恒艺
Original Assignee
乐视致新电子科技(天津)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 乐视致新电子科技(天津)有限公司 filed Critical 乐视致新电子科技(天津)有限公司
Priority to CN201410834628.1A priority Critical patent/CN104575509A/en
Publication of CN104575509A publication Critical patent/CN104575509A/en

Links

Abstract

The invention provides a voice enhancement processing method and a device. The method comprises the steps that voice information from terminal equipment is acquired, wherein voice enhancement auxiliary information is carried in the voice information; if voice enhancement processing is required to be performed on the voice information by judging according to equipment identification of the terminal equipment, a corresponding voice enhancement algorithm is acquired from a plurality of local voice enhancement algorithms according to the voice enhancement auxiliary information; and the voice enhancement processing is performed on the voice information according to the acquired voice enhancement algorithm. With the adoption of the technical scheme, a voice enhancement processing procedure can be more pertinent, and an unnecessary computation burden of a server is reduced under the condition that the voice enhancement quality is ensured.

Description

Speech enhan-cement disposal route and device

Technical field

The present invention relates to Internet technical field, particularly relate to a kind of speech enhan-cement disposal route and device.

Background technology

Along with the acoustic enviroment of Intelligent hardware becomes increasingly complex, speech recognition for Intelligent hardware also more has challenge, time distant from microphone when a user speaks, Intelligent hardware likely can not identify the phonetic entry of user, therefore needs to carry out noise reduction and speech enhan-cement process to the voice of input.Prior art by arranging speech enhan-cement chip or carrying out speech enhan-cement by the voice of central processing unit (CPU) to input of Intelligent hardware in Intelligent hardware, if adopt the voice of speech enhan-cement chip to input to carry out speech enhan-cement process, when to speech enhan-cement quality requirements height, need to choose can be suitable for the high speech enhan-cement chip of computation complexity to promote speech enhan-cement quality, thus the hardware cost of terminal device can be improved, if adopt the voice of CPU to input to carry out speech enhan-cement, then can take and consume the local a large amount of computational resource of terminal device.

Summary of the invention

In view of this, the invention provides a kind of speech enhan-cement process disposal route and device, save hardware cost and the computational resource of terminal device further.

According to the first aspect of this method embodiment, provide a kind of speech enhan-cement disposal route, application on the server, comprising:

Obtain the voice messaging from terminal device, in described voice messaging, carry speech enhan-cement supplementary;

If judge to know that described voice messaging needs to carry out speech enhan-cement process according to the device identification of described terminal device, then according to described speech enhan-cement supplementary in multiple voice enhancement algorithms of this locality, obtain corresponding voice enhancement algorithm;

Described voice enhancement algorithm according to obtaining carries out speech enhan-cement process to described voice messaging.

According to the second aspect of this method embodiment, provide a kind of speech enhan-cement treating apparatus, application on the server, comprising:

First acquisition module, for obtaining the voice messaging from terminal device, carries speech enhan-cement supplementary in described voice messaging;

Second acquisition module, if know that the described voice messaging that described first acquisition module obtains needs to carry out speech enhan-cement process for judging according to the device identification of described terminal device, then according to described speech enhan-cement supplementary in multiple voice enhancement algorithms of this locality, obtain corresponding voice enhancement algorithm;

Speech enhan-cement module, carries out speech enhan-cement process for the described voice enhancement algorithm obtained according to described second acquisition module to described voice messaging.

From above technical scheme, the present invention judges to know that voice messaging needs to carry out speech enhan-cement process to according to the device identification of terminal device, according to speech enhan-cement supplementary in multiple voice enhancement algorithms of this locality, obtain corresponding voice enhancement algorithm, owing to can be suitable on the server, therefore avoid and carry out speech enhan-cement on the terminal device, thus alleviate the computation burden of terminal device; Due to corresponding voice enhancement algorithm can be adopted to the voice messaging of terminal device, avoid unnecessary voice enhancement algorithm and enhancing process is carried out to voice messaging, make that speech enhan-cement is carried out to voice messaging and have more specific aim, thus the computation complexity of server when carrying out speech enhan-cement can be reduced substantially, improve the quality of speech enhan-cement, and then make follow-up speech recognition more accurate.

Should be understood that, it is only exemplary and explanatory that above general description and details hereinafter describe, and can not limit the embodiment of the present invention.

Accompanying drawing explanation

Fig. 1 is the process flow diagram of speech enhan-cement disposal route in one embodiment of the present invention;

Fig. 2 is the process flow diagram of speech enhan-cement disposal route in another embodiment of the present invention;

Fig. 3 is the process flow diagram of speech enhan-cement disposal route in another way of example of the present invention;

Fig. 4 is the structural drawing of speech enhan-cement server in one embodiment of the present invention;

Fig. 5 is the system construction drawing of speech enhan-cement process in one embodiment of the present invention;

Fig. 6 is the building-block of logic of speech enhan-cement treating apparatus in one embodiment of the present invention;

Fig. 7 is the building-block of logic of speech enhan-cement treating apparatus in another embodiment of the present invention.

Embodiment

Here will be described exemplary embodiment in detail, its sample table shows in the accompanying drawings.When description below relates to accompanying drawing, unless otherwise indicated, the same numbers in different accompanying drawing represents same or analogous key element.Embodiment described in following exemplary embodiment does not represent all embodiments consistent with the application.On the contrary, they only with as in appended claims describe in detail, the example of apparatus and method that some aspects of the application are consistent.

Only for describing the object of specific embodiment at term used in this application, and not intended to be limiting the application." one ", " described " and " being somebody's turn to do " of the singulative used in the application and appended claims is also intended to comprise most form, unless context clearly represents other implications.It is also understood that term "and/or" used herein refer to and comprise one or more project of listing be associated any or all may combine.

Term first, second, third, etc. may be adopted although should be appreciated that to describe various information in the application, these information should not be limited to these terms.These terms are only used for the information of same type to be distinguished from each other out.Such as, when not departing from the application's scope, the first information also can be called as the second information, and similarly, the second information also can be called as the first information.Depend on linguistic context, word as used in this " if " can be construed as into " ... time " or " when ... time " or " in response to determining ".

The application by server according to the voice enhancement algorithm of speech enhan-cement supplementary to the voice messaging determination speech enhan-cement of the terminal device got, and by corresponding voice enhancement algorithm, speech enhan-cement process is carried out to voice messaging, therefore avoid and carry out speech enhan-cement on the terminal device, thus alleviate the computation burden of terminal device; Due to corresponding voice enhancement algorithm can be adopted to the voice messaging of terminal device, thus can adopt and have more voice enhancement algorithm targetedly speech enhan-cement process is carried out to the voice messaging of terminal device, server is avoided to adopt the high voice enhancement algorithm of computation complexity to carry out unnecessary speech enhan-cement process to the voice messaging of terminal device, reduce server computation complexity when carrying out speech enhan-cement process substantially, improve the quality of speech enhan-cement, and then make follow-up speech recognition more accurate.For being further described the application, provide the following example.

Please refer to Fig. 1, Fig. 1 is the process flow diagram of speech enhan-cement disposal route in one embodiment of the present invention, can apply on the server, terminal device in the embodiment of the present invention can comprise: the various equipment with speech voice input function such as in-car TV, intelligent remote controller, smart mobile phone, panel computer, comprise the steps:

Step 101, obtains the voice messaging from terminal device, wherein, carries speech enhan-cement supplementary in voice messaging.

In one embodiment, by the microphones capture of terminal device to analog voice, after terminal device carries out analog to digital conversion and compress speech to analog voice, the voice messaging described in the embodiment of the present invention can be formed.

Step 102, if judge to know that voice messaging needs to carry out speech enhan-cement process according to the device identification of terminal device, then according to speech enhan-cement supplementary in multiple voice enhancement algorithms of this locality, obtain corresponding voice enhancement algorithm.

Because speech enhan-cement not only relates to voice signal digital processing, also relate to Auditory Perception and the phonetics category of people, add the difference of environment residing for terminal device, noise source also can be different, thus voice enhancement algorithm and the environmental correclation residing for terminal device, in addition, due to the difference of the current duty of terminal device, terminal device also can be different by the analog voice that microphones capture arrives, such as, when terminal device is in hands-free mode and map mode, microphone can be easier to capture extraneous noise, therefore work state information and ambient parameter information can be sent to server in the mode of speech enhan-cement supplementary by the embodiment of the present invention, server is determined in multiple voice enhancement algorithms of this locality, obtain corresponding voice enhancement algorithm by speech enhan-cement supplementary, thus can get and have more voice enhancement algorithm thus speech enhan-cement is carried out to voice messaging targetedly.

Step 103, the voice enhancement algorithm according to obtaining carries out speech enhan-cement process to voice messaging.

In one embodiment, such as, terminal device is in hands-free mode or map mode, for the voice messaging of terminal device being in hands-free mode and map mode, the voice enhancement algorithm that computation complexity is higher can be adopted to carry out speech enhan-cement, and for the terminal device under normal mode, the voice enhancement algorithm that computation complexity is lower can be adopted to carry out speech enhan-cement, making speech enhan-cement implementation procedure have more specific aim thus, the unnecessary computation burden of server can be reduced when guaranteeing speech enhan-cement quality.In another embodiment, terminal device is in (noise source is based on the noisy sound of people) in market, or, terminal device is in (noise source is based on the sound of blowing a whistle of vehicle) on road, or, terminal device is in classroom (essentially no noise), in such a case, if terminal device is in market, the voice enhancement algorithm that can adopt the noisy sound (can be identified by frequency) eliminating people carries out speech enhan-cement to the voice messaging of terminal device, if terminal device is positioned on road, the voice enhancement algorithm that can adopt the sound of blowing a whistle eliminating vehicle carries out speech enhan-cement to the voice messaging of terminal device, if terminal device is in classroom, better simply common voice enhancement algorithm can be adopted to carry out speech enhan-cement to the voice messaging of terminal device, make speech enhan-cement process adopt thus and have more voice enhancement algorithm targetedly.

As can be seen from step 101-step 103, the present invention judges to know that voice messaging needs to carry out speech enhan-cement process according to the device identification of terminal device, according to speech enhan-cement supplementary in multiple voice enhancement algorithms of this locality, obtain corresponding voice enhancement algorithm, owing to can be suitable on the server, therefore avoid and carry out speech enhan-cement on the terminal device, thus alleviate the computation burden of terminal device; Due to corresponding voice enhancement algorithm can be adopted to the voice messaging of terminal device, avoid unnecessary voice enhancement algorithm and enhancing process is carried out to the voice messaging of terminal device, thus the computation complexity of server when carrying out speech enhan-cement can be reduced substantially, improve the quality of speech enhan-cement, and then make follow-up speech recognition more accurate.

Refer to Fig. 2, Fig. 2 is the process flow diagram of speech enhan-cement disposal route in another embodiment of the present invention, the present embodiment can be applied on server, and the work state information that the present embodiment is terminal device for speech enhan-cement supplementary carries out exemplary illustration, comprises the steps:

Step 201, obtains the voice messaging from terminal device, carries the work state information of terminal device in voice messaging.

Step 202, if judge to know that voice messaging needs to carry out speech enhan-cement process according to the device identification of terminal device, then current according to work state information determination terminal device duty, duty comprises normal operating conditions, hands-free mode duty and map mode duty.

Step 203, in multiple voice enhancement algorithms of this locality, obtains the voice enhancement algorithm corresponding with terminal device current operating state.

Step 204, the voice enhancement algorithm according to obtaining carries out speech enhan-cement process to voice messaging.

The detailed description of above-mentioned steps 201 with reference to above-mentioned steps 101, can be not described in detail in this.

In above-mentioned steps 202, such as, can have in existing all kinds gets in the terminal device of voice messaging, the voice messaging that intelligent remote controller and in-car TV receive does not need to carry out speech enhan-cement process, the voice messaging of intelligent television and panel computer needs to carry out speech enhan-cement process, therefore can carry out identification terminal equipment by the device identification of terminal device and belong to the terminal device that the terminal device needing to carry out speech enhan-cement still needs to carry out speech enhan-cement.

In one embodiment, the duty of terminal device comprises: hands-free mode duty, map mode duty, normal mode duty.Such as, under hands-free mode duty, because the microphone on terminal device can receive the voice of Correspondent Node, therefore the voice of Correspondent Node can form noise to the phonetic entry of the microphone of terminal device, under map mode duty, due to the voice message in the navigational system of map, form noise also can to the phonetic entry of microphone, and under normal mode duty, the user of terminal device is by closely sounding near microphone, and the noise in the external world can not form too large noise to the phonetic entry of microphone.

In step 203, in one embodiment, local multiple voice enhancement algorithms can comprise: the voice enhancement algorithm based on spectrum subtraction, the voice enhancement algorithm based on wavelet analysis, the voice enhancement algorithm based on Kalman filtering, the enhancing algorithm based on signal subspace, the voice enhancement algorithm based on auditory masking effect, the voice enhancement algorithm based on independent component analysis, the voice enhancement algorithm based on neural network, the voice enhancement algorithm based on deep neural network (Deep Neural Networks, DNN) etc.Correspondingly, grade classification can be carried out according to complexity to above-mentioned voice enhancement algorithm, such as, the voice enhancement algorithm of base DNN is divided into the voice enhancement algorithm of the first computation complexity, by the voice enhancement algorithm based on auditory masking effect, based on the voice enhancement algorithm of independent component analysis, voice enhancement algorithm based on neural network is divided into the voice enhancement algorithm of the second computation complexity, by the voice enhancement algorithm based on spectrum subtraction, based on the voice enhancement algorithm of Kalman filtering, Enhancement Method based on signal subspace is divided into the voice enhancement algorithm of the 3rd computation complexity.It will be understood by those skilled in the art that, more grade classification can be carried out to different voice enhancement algorithms according to computation complexity, above-mentioned first computation complexity, second computation complexity, 3rd computation complexity is only the exemplary illustration of the embodiment of the present invention, it can not form the restriction to the embodiment of the present invention, can be found out by the above-mentioned voice enhancement algorithm exemplified, the embodiment of the present invention is by performing the very high DNN voice enhancement algorithm of computation complexity on the server, can also realize carrying out a large amount of training on the server, thus obtain better speech enhan-cement model.

In step 204, such as, if terminal device is in map mode duty or hands-free mode duty, the voice enhancement algorithm of first computation complexity that computation complexity can be adopted higher carries out speech enhan-cement, if terminal device is in normal mode duty, the voice enhancement algorithm of second computation complexity that computation complexity can be adopted lower and the 3rd computation complexity carries out speech enhan-cement, make the speech enhan-cement processing procedure of terminal device to adopt thus and have more voice enhancement algorithm targetedly, the computation burden that server is unnecessary is reduced when guaranteeing speech enhan-cement quality.

As can be seen from step 201-step 204, in the present embodiment, the duty current by terminal device adopts the voice enhancement algorithm of different computation complexities to carry out speech enhan-cement to voice messaging, making speech enhan-cement to adopt thus and have more voice enhancement algorithm targetedly, reducing the computation burden that server is unnecessary when guaranteeing speech enhan-cement quality.

Refer to Fig. 3, Fig. 3 is the process flow diagram of speech enhan-cement disposal route in another way of example of the present invention, the present embodiment can be applied on server, the present embodiment carries out exemplary illustration for the ambient parameter information of speech enhan-cement supplementary environment residing for terminal device, comprises the steps:

Step 301, obtains the voice messaging from terminal device, carries the ambient parameter information of environment residing for terminal device in described voice messaging.

Step 302, if judge to know that voice messaging needs to carry out speech enhan-cement process according to the device identification of terminal device, then environmentally parameter information determination noise type.

Step 303, in multiple voice enhancement algorithms of this locality, obtains the voice enhancement algorithm corresponding with noise type.

Step 304, the voice enhancement algorithm according to obtaining carries out speech enhan-cement process to voice messaging.

The detailed description of step 301 with reference to the detailed description of above-mentioned steps 101, can be not described in detail in this.

In step 302, in one embodiment, because noise is larger by the impact of environment, therefore can be classified by environment residing for terminal device, thus can realize adopting corresponding voice enhancement algorithm to different noises, thus make to have more the process of noise reduction enhancing targetedly to the voice of terminal device.If terminal device is in (noise source is based on the noisy sound of people) in market, the voice enhancement algorithm that can adopt the sound (can be identified by frequency) eliminating people carries out speech enhan-cement, as terminal device is on road (noise source is based on the sound of blowing a whistle of vehicle), the voice enhancement algorithm that can adopt to eliminate vehicle sounds carries out speech enhan-cement, if terminal device is in classroom, better simply common voice enhancement algorithm can be adopted to carry out speech enhan-cement, make speech enhan-cement process adopt thus and have more voice enhancement algorithm targetedly.

In step 303, corresponding with the description of above-mentioned steps 302, if the ambient parameter information detected from voice messaging represents that terminal device is on road, the noise then can blown a whistle for vehicle carries out the voice enhancement algorithm of speech enhan-cement process, if the ambient parameter information detected from voice messaging represents that terminal device is in market, the voice enhancement algorithm of speech enhan-cement process then can be carried out for the excess noise of people, if the ambient parameter information detected from voice messaging represents that terminal device is in classroom, then can adopt the voice enhancement algorithm that computation complexity is lower, owing to now having more speech enhan-cement process targetedly to the voice messaging of terminal device.

As can be seen from step 301-step 304, the present embodiment is by classifying to noise source, thus voice enhancement algorithm targetedly can be had more to the voice messaging employing of terminal device, thus speech enhan-cement process targetedly can be carried out to the voice of terminal device, the voice less to noise are avoided to carry out the high voice enhancement algorithm of complexity, thus the computation complexity of speech enhan-cement can be reduced, the situation larger voice of noise being caused to speech enhan-cement poor effect owing to having carried out the lower voice enhancement algorithm of complexity can also be avoided, thus guarantee normally carrying out of later stage speech recognition.

Corresponding to above-mentioned speech enhan-cement disposal route, the application also proposed the structural drawing of the speech enhan-cement server shown in Fig. 4.Please refer to Fig. 4, at hardware view, this speech enhan-cement server comprises processor, external interface, internal memory and nonvolatile memory, certainly also may comprise the hardware required for other business.Processor reads corresponding computer program and then runs in internal memory from nonvolatile memory, and logic level is formed speech enhan-cement treating apparatus.Certainly, except software realization mode, the application does not get rid of other implementations, mode of such as logical device or software and hardware combining etc., that is the executive agent of following treatment scheme is not limited to each logical block, also can be hardware or logical device.

In order to more clearly understand the technical scheme of the embodiment of the present invention, refer to Fig. 5, Fig. 5 is the system construction drawing of speech enhan-cement process in one embodiment of the present invention, the analog voice that user is inputted by microphone is converted to digital signal after being processed by voice input module 51 by terminal device 50, after the voice compression module 56 in host CPU 52 carries out compress speech, be sent to speech enhan-cement server 53, after speech enhan-cement server 53 adopts the speech enhan-cement disposal route described in the embodiment of the present invention, voice letter after strengthening is sent to speech recognition server 54, speech recognition is carried out for speech recognition server 54, semantic understanding, the process such as phonetic synthesis, after the voice messaging of speech recognition server 54 couples of users identifies, return to speech recognition server 54 to terminal device 50 and carry out mutual voice with user, such as, after speech recognition server 54 identifies according to the voice of user, make corresponding reply, and carry out interactive voice by voice interaction module 55 with user.Wherein, voice interaction module 55 can be arranged on terminal device 50 by the mode of speech application (app), user by the current ambient parameter information of the setting options determination terminal device of app, thus can be determined at voice enhancement algorithm corresponding to current residing environment facies.Can be found out by this system construction drawing, the embodiment of the present invention can adopt the voice enhancement algorithm that complexity is higher to carry out speech enhan-cement to the voice of terminal device 50 by speech enhan-cement server 54, saves the computational resource that terminal device 50 pairs of voice carry out speech enhan-cement; In addition, can be known by above-described embodiment, directly can upgrade at speech enhan-cement server 54 pairs of voice enhancement algorithms, avoid and upgrade at terminal device 50 pairs of voice enhancement algorithms, thus improve experience when user carries out speech enhan-cement.

Please refer to Fig. 6, Fig. 6 is the building-block of logic of speech enhan-cement treating apparatus in one embodiment of the present invention, can apply on the server, and this speech enhan-cement treating apparatus can comprise:

First acquisition module 61, for obtaining the voice messaging from terminal device, carries speech enhan-cement supplementary in voice messaging;

Second acquisition module 62, if for judging to know that the voice messaging that the first acquisition module 61 obtains needs to carry out speech enhan-cement process according to the device identification of terminal device, then according to speech enhan-cement supplementary in multiple voice enhancement algorithms of this locality, obtain corresponding voice enhancement algorithm;

Speech enhan-cement module 63, carries out speech enhan-cement process for the voice enhancement algorithm obtained according to the second acquisition module 62 to voice messaging.

The present invention is by the voice enhancement algorithm of the second acquisition module 62 to the voice messaging determination speech enhan-cement of the terminal device that the first acquisition module 61 gets, speech enhan-cement module 63 carries out speech enhan-cement process by corresponding voice enhancement algorithm to voice messaging, therefore avoid and carry out speech enhan-cement on the terminal device, thus alleviate the computation burden of terminal device; Because speech enhan-cement module 63 can adopt corresponding voice enhancement algorithm to the voice messaging of terminal device, thus can adopt and have more voice enhancement algorithm targetedly speech enhan-cement process is carried out to the voice messaging of terminal device, server is avoided to adopt the high voice enhancement algorithm of computation complexity to carry out unnecessary speech enhan-cement process to the voice messaging of terminal device, reduce server computation complexity when carrying out speech enhan-cement process substantially, improve the quality of speech enhan-cement, and then make follow-up speech recognition more accurate.

Please refer to Fig. 7, Fig. 7 is the building-block of logic of speech enhan-cement treating apparatus in another embodiment of the present invention, and the present embodiment is described on the basis of above-mentioned Fig. 6 embodiment.

In one embodiment, speech enhan-cement supplementary can be the work state information of terminal device, and the second acquisition module 62 can comprise:

First determining unit 621, for the duty current according to work state information determination terminal device, duty comprises normal operating conditions, hands-free mode duty and map mode duty;

First acquiring unit 622, in multiple voice enhancement algorithms of this locality, obtains the voice enhancement algorithm corresponding with the terminal device current operating state that the first determining unit 621 is determined.

First acquiring unit 622 obtains by the duty that terminal device is current the voice enhancement algorithm adopted voice messaging, making speech enhan-cement to adopt thus and have more voice enhancement algorithm targetedly, reducing the computation burden that server is unnecessary when guaranteeing speech enhan-cement quality.

In another embodiment, the ambient parameter information of speech enhan-cement supplementary environment residing for terminal device, the second acquisition module 62 can comprise:

Second determining unit 623, for environmentally parameter information determination noise type;

Second acquisition unit 624, in multiple voice enhancement algorithms of this locality, obtains the voice enhancement algorithm corresponding with the noise type that the second determining unit 623 is determined.

Second acquisition unit 624 obtains by the ambient parameter information that terminal device is current the voice enhancement algorithm adopted voice messaging, making speech enhan-cement to adopt thus and have more voice enhancement algorithm targetedly, reducing the computation burden that server is unnecessary when guaranteeing speech enhan-cement quality.

By describing above and can finding out, speech enhan-cement disposal route provided by the invention and device, according to speech enhan-cement supplementary in multiple voice enhancement algorithms of this locality, obtain corresponding voice enhancement algorithm, owing to being suitable on the server, therefore avoid and carry out speech enhan-cement on the terminal device, thus alleviate the computation burden of terminal device; Due to corresponding voice enhancement algorithm can be adopted to the voice messaging of terminal device, avoid unnecessary voice enhancement algorithm and enhancing process is carried out to voice messaging, make that speech enhan-cement is carried out to voice messaging and have more specific aim, thus server computation complexity when carrying out speech enhan-cement process can be reduced substantially, improve the quality of speech enhan-cement, and then make follow-up speech recognition more accurate.

The foregoing is only preferred embodiment of the present invention, not in order to limit the present invention, within the spirit and principles in the present invention all, any amendment made, equivalent replacement, improvement etc., all should be included within the scope of protection of the invention.

Claims (10)

1. a speech enhan-cement disposal route, application on the server, is characterized in that, comprising:
Obtain the voice messaging from terminal device, in described voice messaging, carry speech enhan-cement supplementary;
If judge to know that described voice messaging needs to carry out speech enhan-cement process according to the device identification of described terminal device, then according to described speech enhan-cement supplementary in multiple voice enhancement algorithms of this locality, obtain corresponding voice enhancement algorithm;
Described voice enhancement algorithm according to obtaining carries out speech enhan-cement process to described voice messaging.
2. method according to claim 1, is characterized in that, described speech enhan-cement supplementary is the work state information of described terminal device.
3. method according to claim 2, is characterized in that, described according to described speech enhan-cement supplementary in multiple voice enhancement algorithms of this locality, obtain corresponding voice enhancement algorithm, comprising:
Determine according to described work state information the duty that described terminal device is current, described duty comprises normal operating conditions, hands-free mode duty and map mode duty;
In multiple voice enhancement algorithms of this locality, obtain the voice enhancement algorithm corresponding with described terminal device current operating state.
4. method according to claim 1, is characterized in that, the ambient parameter information of described speech enhan-cement supplementary environment residing for described terminal device.
5. method according to claim 4, is characterized in that, described according to described speech enhan-cement supplementary in multiple voice enhancement algorithms of this locality, obtain corresponding voice enhancement algorithm, comprising:
According to described ambient parameter information determination noise type;
In multiple voice enhancement algorithms of this locality, obtain the voice enhancement algorithm corresponding with described noise type.
6. a speech enhan-cement treating apparatus, application on the server, is characterized in that, comprising:
First acquisition module, for obtaining the voice messaging from terminal device, carries speech enhan-cement supplementary in described voice messaging;
Second acquisition module, if know that the described voice messaging that described first acquisition module obtains needs to carry out speech enhan-cement process for judging according to the device identification of described terminal device, then according to described speech enhan-cement supplementary in multiple voice enhancement algorithms of this locality, obtain corresponding voice enhancement algorithm;
Speech enhan-cement module, carries out speech enhan-cement process for the described voice enhancement algorithm obtained according to described second acquisition module to described voice messaging.
7. device according to claim 6, is characterized in that, described speech enhan-cement supplementary is the work state information of described terminal device.
8. device according to claim 7, is characterized in that, described second acquisition module comprises:
First determining unit, for determining according to described work state information the duty that described terminal device is current, described duty comprises normal operating conditions, hands-free mode duty and map mode duty;
First acquiring unit, in multiple voice enhancement algorithms of this locality, obtains the voice enhancement algorithm corresponding with the described terminal device current operating state that described first determining unit is determined.
9. device according to claim 6, is characterized in that, the ambient parameter information of described speech enhan-cement supplementary environment residing for described terminal device.
10. device according to claim 9, is characterized in that, described second acquisition module comprises:
Second determining unit, for according to described ambient parameter information determination noise type;
Second acquisition unit, in multiple voice enhancement algorithms of this locality, obtains the voice enhancement algorithm corresponding with the described noise type that described second determining unit is determined.
CN201410834628.1A 2014-12-29 2014-12-29 Voice enhancement processing method and device CN104575509A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410834628.1A CN104575509A (en) 2014-12-29 2014-12-29 Voice enhancement processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410834628.1A CN104575509A (en) 2014-12-29 2014-12-29 Voice enhancement processing method and device

Publications (1)

Publication Number Publication Date
CN104575509A true CN104575509A (en) 2015-04-29

Family

ID=53091409

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410834628.1A CN104575509A (en) 2014-12-29 2014-12-29 Voice enhancement processing method and device

Country Status (1)

Country Link
CN (1) CN104575509A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104966517A (en) * 2015-06-02 2015-10-07 华为技术有限公司 Voice frequency signal enhancement method and device
CN108231086A (en) * 2017-12-24 2018-06-29 航天恒星科技有限公司 A kind of deep learning voice enhancer and method based on FPGA
CN108873987A (en) * 2018-06-02 2018-11-23 熊冠 A kind of intelligence control system and method for stereo of stage
CN109087659A (en) * 2018-08-03 2018-12-25 三星电子(中国)研发中心 Audio optimization method and apparatus
CN110085223A (en) * 2019-04-02 2019-08-02 北京云知声信息技术有限公司 A kind of voice interactive method of cloud interaction

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5570423A (en) * 1994-08-25 1996-10-29 Alcatel N.V. Method of providing adaptive echo cancellation
CN1329453A (en) * 2000-06-21 2002-01-02 阿尔卡塔尔公司 Telephony and hand-free speed of wireless terminal equipment with echo compensation
US20050182624A1 (en) * 2004-02-16 2005-08-18 Microsoft Corporation Method and apparatus for constructing a speech filter using estimates of clean speech and noise
CN1875611A (en) * 2003-11-20 2006-12-06 摩托罗拉公司(在特拉华州注册的公司) Method and apparatus for adaptive echo and noise control
CN101583996A (en) * 2006-12-30 2009-11-18 摩托罗拉公司 A method and noise suppression circuit incorporating a plurality of noise suppression techniques
CN102014205A (en) * 2010-11-19 2011-04-13 中兴通讯股份有限公司 Method and device for treating voice call quality
CN102801861A (en) * 2012-08-07 2012-11-28 歌尔声学股份有限公司 Voice enhancing method and device applied to cell phone
CN103456305A (en) * 2013-09-16 2013-12-18 东莞宇龙通信科技有限公司 Terminal and speech processing method based on multiple sound collecting units
CN103489452A (en) * 2013-09-24 2014-01-01 小米科技有限责任公司 Method and device for eliminating call noise and terminal device
CN104036786A (en) * 2014-06-25 2014-09-10 青岛海信信芯科技有限公司 Method and device for denoising voice
CN104052886A (en) * 2014-06-27 2014-09-17 联想(北京)有限公司 Information processing method and electronic device
CN104092801A (en) * 2014-05-22 2014-10-08 中兴通讯股份有限公司 Intelligent terminal call noise reduction method and intelligent terminal

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5570423A (en) * 1994-08-25 1996-10-29 Alcatel N.V. Method of providing adaptive echo cancellation
CN1329453A (en) * 2000-06-21 2002-01-02 阿尔卡塔尔公司 Telephony and hand-free speed of wireless terminal equipment with echo compensation
CN1875611A (en) * 2003-11-20 2006-12-06 摩托罗拉公司(在特拉华州注册的公司) Method and apparatus for adaptive echo and noise control
US20050182624A1 (en) * 2004-02-16 2005-08-18 Microsoft Corporation Method and apparatus for constructing a speech filter using estimates of clean speech and noise
CN101583996A (en) * 2006-12-30 2009-11-18 摩托罗拉公司 A method and noise suppression circuit incorporating a plurality of noise suppression techniques
CN102014205A (en) * 2010-11-19 2011-04-13 中兴通讯股份有限公司 Method and device for treating voice call quality
CN102801861A (en) * 2012-08-07 2012-11-28 歌尔声学股份有限公司 Voice enhancing method and device applied to cell phone
CN103456305A (en) * 2013-09-16 2013-12-18 东莞宇龙通信科技有限公司 Terminal and speech processing method based on multiple sound collecting units
CN103489452A (en) * 2013-09-24 2014-01-01 小米科技有限责任公司 Method and device for eliminating call noise and terminal device
CN104092801A (en) * 2014-05-22 2014-10-08 中兴通讯股份有限公司 Intelligent terminal call noise reduction method and intelligent terminal
CN104036786A (en) * 2014-06-25 2014-09-10 青岛海信信芯科技有限公司 Method and device for denoising voice
CN104052886A (en) * 2014-06-27 2014-09-17 联想(北京)有限公司 Information processing method and electronic device

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104966517A (en) * 2015-06-02 2015-10-07 华为技术有限公司 Voice frequency signal enhancement method and device
CN104966517B (en) * 2015-06-02 2019-02-01 华为技术有限公司 A kind of audio signal Enhancement Method and device
CN108231086A (en) * 2017-12-24 2018-06-29 航天恒星科技有限公司 A kind of deep learning voice enhancer and method based on FPGA
CN108873987A (en) * 2018-06-02 2018-11-23 熊冠 A kind of intelligence control system and method for stereo of stage
CN109087659A (en) * 2018-08-03 2018-12-25 三星电子(中国)研发中心 Audio optimization method and apparatus
CN110085223A (en) * 2019-04-02 2019-08-02 北京云知声信息技术有限公司 A kind of voice interactive method of cloud interaction

Similar Documents

Publication Publication Date Title
US9940935B2 (en) Method and device for voiceprint recognition
TWI582753B (en) Method, system, and computer-readable storage medium for operating a virtual assistant
US9685161B2 (en) Method for updating voiceprint feature model and terminal
JP6393730B2 (en) Voice identification method and apparatus
US10204626B2 (en) Method and apparatus for recognizing speech by lip reading
US10217463B2 (en) Hybridized client-server speech recognition
KR102103057B1 (en) Voice trigger for a digital assistant
CN103634472B (en) User mood and the method for personality, system and mobile phone is judged according to call voice
TWI684148B (en) Grouping processing method and device of contact person
KR20190100334A (en) Contextual Hotwords
CN103065631B (en) A kind of method of speech recognition, device
US10117032B2 (en) Hearing aid system, method, and recording medium
CN107799126B (en) Voice endpoint detection method and device based on supervised machine learning
CN103650035B (en) Via social graph, speech model and the user context identification people close to mobile device users
CN105940407B (en) System and method for assessing the intensity of audio password
US20160162469A1 (en) Dynamic Local ASR Vocabulary
US20180336888A1 (en) Method and Apparatus of Training Acoustic Feature Extracting Model, Device and Computer Storage Medium
WO2016209444A1 (en) Language model modification for local speech recognition systems using remote sources
JP5644013B2 (en) Speech processing
CN107147618B (en) User registration method and device and electronic equipment
EP2987312B1 (en) System and method for acoustic echo cancellation
JP5928606B2 (en) Vehicle-based determination of passenger's audiovisual input
US20150120291A1 (en) Scene Recognition Method, Device and Mobile Terminal Based on Ambient Sound
US20150095027A1 (en) Key phrase detection
US20130195285A1 (en) Zone based presence determination via voiceprint location awareness

Legal Events

Date Code Title Description
PB01 Publication
C06 Publication
SE01 Entry into force of request for substantive examination
C10 Entry into substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20150429

RJ01 Rejection of invention patent application after publication