CN107332984A

CN107332984A - A kind of method of speech processing and mobile terminal

Info

Publication number: CN107332984A
Application number: CN201710474944.6A
Authority: CN
Inventors: 史建兴
Original assignee: Vivo Mobile Communication Co Ltd
Current assignee: Vivo Mobile Communication Co Ltd
Priority date: 2017-06-21
Filing date: 2017-06-21
Publication date: 2017-11-07
Anticipated expiration: 2037-06-21
Also published as: CN107332984B

Abstract

The invention provides a kind of method of speech processing and mobile terminal, this method includes：Gather the speech data of user's input；Detect and whether there is reverb signal in the speech data；When there is reverb signal in the speech data, using mode parameter set in advance, the speech data collected is handled.Therefore, the present invention can be handled the voice of input according to practical application scene, improve the call experience of user.

Description

A kind of method of speech processing and mobile terminal

Technical field

The present invention relates to communication technical field, more particularly to a kind of method of speech processing and mobile terminal.

Background technology

With the popularization of the development mobile device of society, the basic function conversed as mobile terminal, consumer is to call Some specific demand more and more highers, such as when having a meeting, for important phone, user has to answer, but user Sound is excessive when receiving calls, and can influence the progress of meeting, sound is too small to cause other side to hear content of speaking.

In addition, in some cases, user is when receiving calls, it is desirable to can not be heard by third party, so spoken sounds It can lower.But, the call of handle in current mobile terminal is that reference man normally speaks the parameter of debugging, so that in user's communication When Shi Shengyin very littles, it is impossible to other side is not just heard content of speaking.

It follows that in the prior art, the call experience of mobile terminal is poor.

The content of the invention

The embodiment provides a kind of method of speech processing and mobile terminal, to solve movement of the prior art The problem of call experience of terminal is poor.

The embodiment provides a kind of method of speech processing, including：

Gather the speech data of user's input；

Detect and whether there is reverb signal in the speech data；

When there is the reverb signal in the speech data, using mode parameter set in advance, to what is collected The speech data is handled.

Embodiments of the invention additionally provide a kind of mobile terminal, including：

Voice acquisition module, the speech data for gathering user's input；

Reverberation detection module, reverb signal is whether there is for detecting in the speech data；

Speech processing module, for when there is the reverb signal in the speech data, using mould set in advance Formula parameter, is handled the speech data collected.

Embodiments of the invention additionally provide a kind of mobile terminal, including memory, processor and are stored in the storage On device and the voice processing program that can run on the processor, the voice processing program is by real during the computing device The step of showing method of speech processing as described above.

Embodiments of the invention additionally provide a kind of computer-readable recording medium, are stored thereon with computer program, should The step in method of speech processing as described above is realized when program is executed by processor.

And in embodiments of the invention, after the speech data of user's input is collected, the voice number collected can be detected It whether there is reverb signal in, so that when there is reverb signal, using mode parameter set in advance, to the language collected Sound data are handled.It follows that embodiments of the invention, can according to practical application scene to the voice of input at Reason, so that the call demand of user is met, the call experience of lifting user.

Brief description of the drawings

In order to illustrate the technical solution of the embodiments of the present invention more clearly, below by institute in the description to the embodiment of the present invention The accompanying drawing needed to use is briefly described, it should be apparent that, drawings in the following description are only some implementations of the present invention Example, for those of ordinary skill in the art, without having to pay creative labor, can also be according to these accompanying drawings Obtain other accompanying drawings.

Fig. 1 represents a kind of flow chart of method of speech processing provided in an embodiment of the present invention；

Fig. 2 represents the flow chart of another method of speech processing provided in an embodiment of the present invention；

Fig. 3 represents a kind of one of structured flowchart of mobile terminal provided in an embodiment of the present invention；

Fig. 4 represents the two of the structured flowchart of a kind of mobile terminal provided in an embodiment of the present invention；

Fig. 5 represents the structured flowchart of another mobile terminal provided in an embodiment of the present invention；

Fig. 6 represents the structured flowchart of another mobile terminal provided in an embodiment of the present invention.

Embodiment

Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete Site preparation is described, it is clear that described embodiment is a part of embodiment of the invention, rather than whole embodiments.Based on this hair Embodiment in bright, the every other implementation that those of ordinary skill in the art are obtained under the premise of creative work is not made Example, belongs to the scope of protection of the invention.

The embodiment provides a kind of method of speech processing, as shown in figure 1, this method includes：

Step 101：Gather the speech data of user's input.

Wherein, when the speech data is conversed including user using mobile terminal, collected by mobile terminal The voice signal that user sends, and user using mobile terminal send speech message when, the use collected by mobile terminal The voice signal that family is sent.

Step 102：Detect and whether there is reverb signal in the speech data.

Wherein, the voice signal entered after the reverb signal is reflected for the voice signal that user sends in microphone.

In addition, when user is conversed or sent speech message using mobile terminal, and lower one's voice when speaking, in order to be able to Partner is allowed not hear sound of speaking, user can be with stick shift near the microphone of mobile terminal, so that what user sent Voice signal is blocked the hand reflection of microphone, hence into microphone, produces reverb signal.And when user will block Mike When the hand of wind is removed, the voice signal that user sends will not then be blocked microphone hand reflection so that will not be in Mike's elegance Reverb signal is detected in the speech data of collection.

, can be according to whether there is reverb signal, area in the speech data collected it follows that embodiments of the invention Mobile terminal is divided to gather different application scene residing during the speech data of user's input, so that voice number of point situation to collection According to being handled.

Step 103：It is right using mode parameter set in advance when there is the reverb signal in the speech data The speech data collected is handled.

Wherein, the speech processes used when mode parameter set in advance carries out secret words pattern by mobile terminal are joined Number., can be by the voice signal after handling the voice signal collected using the mode parameter set in advance Amplitude is amplified, so that the user of opposite equip. can not clearly hear the content that user speaks.

In addition, when the reverb signal is not present in the speech data, keeping logical under mobile terminal normal condition Words pattern, i.e., not to collection to the amplitude of speech data be amplified processing.

When detected by step 102 there is reverb signal in the speech data when, in embodiments of the invention, will move Dynamic terminal is switched to secret words pattern, loads mode parameter set in advance, the speech data collected is handled so that Even if user sends the sound of very little, partner can be also allowed to listen and must will be apparent that；When detecting the voice by step 102 When reverb signal is not present in data, embodiments of the invention switch mobile terminal into the call mode of normal condition.

It follows that embodiments of the invention, after the speech data of user's input is collected, can detect the language collected It whether there is reverb signal in sound data, so that when there is reverb signal, using mode parameter set in advance, to collecting Speech data handled, realize the call of secret words pattern, prevent user's sound of speaking leak, protect privacy.Therefore, originally The embodiment of invention, is handled the voice of input according to practical application scene, so as to meet the call demand of user, is lifted The call experience of user.

Embodiments of the invention additionally provide another method of speech processing, as shown in Fig. 2 this method includes：

Step 201：Gather the speech data of user's input.

Wherein, the speech data includes user's hair that mobile terminal when user is conversed using mobile terminal is collected The voice that the user that mobile terminal is collected when the voice signal and user gone out sends speech message using mobile terminal sends is believed Number.

Step 202：Detect in the speech data with the presence or absence of two frequency spectrum identical voice signals.

Step 203：When there is two frequency spectrum identical voice signals in the speech data, the speech data is determined In there is the reverb signal, otherwise, it determines in the speech data be not present the reverb signal.

Wherein, acquisition time later voice signal relatively is described mixed in the two frequency spectrum identical voice signals existed Ring signal.

In addition, the voice signal that the reverb signal enters in microphone after being reflected for the voice signal that user sends. So, the frequency spectrum for the voice signal that frequency spectrum and the user of reverb signal send is identical, so the voice signal gathered when microphone During two frequency spectrum identical voice signals of middle presence, it is possible to determine that there is reverb signal in the voice signal collected.

Further, in the above-mentioned detection speech data whether there is two frequency spectrum identical voice signals the step of it Afterwards, methods described also includes：

When there is the reverb signal in the speech data, between the time for judging two frequency spectrum identical voice signals Every whether be less than predetermined threshold value, and the time interval be less than the predetermined threshold value when, perform use pattern set in advance Parameter, the step of handling the speech data collected.

Wherein, during the speech data that user's input is gathered in mobile terminal, hinder if there are other in local environment Hinder thing, then the voice signal that user sends can also be reflected by barrier, and the reflected signal is gathered by the microphone of mobile terminal Then, there is frequency spectrum identical voice signal in the speech data that can equally detect microphone collection, still, now user is simultaneously The secret words pattern of mobile terminal need not be used.

Therefore, there is reverb signal in embodiments of the invention, i.e., in the speech data for detect collection in the presence of two frequencies When composing identical voice signal, further detect whether the time interval of the two frequency spectrum identical voice signals is less than default threshold Value, and when less than predetermined threshold value, be just switched to the secret words pattern of mobile terminal, that is, load mode parameter set in advance, The speech data collected is handled.

The voice signal sent due to user reflected by extraneous barrier after the voice signal that is sent with user of signal it Between time interval, the language that signal and user after the hand reflection of microphone that is blocked more than the voice signal that user sends send Time interval between message number, so, by setting predetermined threshold value, it can effectively exclude the voice signal that user sends outer The situation that boundary's barrier reflects and entered in microphone, so as to avoid the mistake processing of the speech data to collecting.

In addition, the step of whether the above-mentioned time interval for judging two frequency spectrum identical voice signals is less than predetermined threshold value Afterwards, in addition to：When the time interval is more than or equal to the predetermined threshold value, keep logical under mobile terminal normal condition Words pattern, i.e., not to collection to the amplitude of speech data be amplified processing.

In the embodiment of the present invention, when between two voice signals of frequency spectrum identical present in the speech data collected When time interval is more than or equal to predetermined threshold value, represent that reverb signal present in the speech data collected is not by user The hand reflection of microphone is blocked, i.e., now user does not need to use the secret words pattern of mobile terminal, i.e., need not load Mode parameter set in advance is handled the speech data collected, then keeps the call mould under mobile terminal normal condition Formula.

Step 204：It is right using mode parameter set in advance when there is the reverb signal in the speech data The speech data collected is handled.

Wherein, the speech processes used when mode parameter set in advance carries out secret words pattern by mobile terminal are joined Number., can be by the speech data after handling the speech data collected using the mode parameter set in advance Amplitude is amplified, so that the user of opposite equip. can not clearly hear the content that user speaks.

Specifically, when there is reverb signal in the speech data collected, secret words pattern is switched mobile terminal into, Mode parameter set in advance is loaded, the speech data collected is handled so that even if user sends the sound of very little, Also the counterpart device user for receiving speech data can be allowed not hear the content of speaking of the user；When the speech data collected In be not present reverb signal when, switch mobile terminal into the call mode of normal condition so that mobile terminal can be realized just Normal voice call function.

Preferably, the mode parameter includes noise reduction parameters and voice amplifying parameters；Step 204 includes：According to the drop Make an uproar parameter, remove the reverb signal in the speech data collected, obtain the first voice signal, the reverb signal is institute State the later voice signal relatively of acquisition time in two frequency spectrum identical voice signals；It is right according to the voice amplifying parameters First voice signal is amplified processing, obtains the second voice signal.

In the embodiment of the present invention, when mobile terminal is in secret words pattern, it will be mixed present in the speech data collected Ring signal to remove, and processing is amplified to removing the voice signal after the reverb signal.

Wherein, can be by reverberation during the reverb signal in the speech data collected according to noise reduction parameters, removal Signal is together removed with other noises, so that the voice signal after noise reduction becomes apparent from.Further, since in secret words pattern Under collect user input speech data when, the sound of speaking of user is smaller, so, can by speech data enhanced processing To allow the counterpart device user for receiving the speech data more clearly to hear the content of speaking of the user.

Further, the voice amplifying parameters include overall amplitude increase and the corresponding amplitude increase of different frequency range Amount；It is described that processing is amplified to first voice signal according to the voice amplifying parameters, obtain the second voice signal Step, including：According to the overall amplitude increase, increase the amplitude of all frequency ranges of first voice signal, obtain the Three voice signals；According to the corresponding amplitude increase of different frequency range, increase the 3rd voice signal the shaking at different frequency range Width, obtains the second voice signal.

In embodiments of the invention, during the first voice signal is amplified processing, first, the first voice is believed Number overall amplification, then, according to the corresponding amplitude increase of predetermined different frequency range, lifts shaking for different frequency range one by one Width so that the second voice signal of acquisition can meet more preferable frequency response curve, improves the transmission quality of speech data, So as to further improve the sense of hearing of speech data.

In summary, in embodiments of the present invention, when there are two frequency spectrum phases in the speech data of the collection of mobile terminal With voice signal, and the time intervals of the two voice signals is when being less than predetermined threshold value, then loads pattern set in advance ginseng Number, handles the speech data collected, realizes the call of secret words pattern, prevents user's sound of speaking from leaking, protection Privacy；When identical in the absence of two frequency spectrum identical voice signals, or two frequency spectrums of presence in the speech data collected Voice signal time interval be more than or equal to predetermined threshold value when, then keep mobile terminal normal condition under call.Thus Understand, embodiments of the invention can be handled the speech data of input according to practical application scene, so as to meet user Call demand, lifting user call experience.

Embodiments of the invention additionally provide a kind of mobile terminal, as shown in figure 3, the mobile terminal 300 includes：

Voice acquisition module 301, the speech data for gathering user's input；

Reverberation detection module 302, reverb signal is whether there is for detecting in the speech data；

Speech processing module 304, for when there is the reverb signal in the speech data, using set in advance Mode parameter, is handled the speech data collected.

Preferably, as shown in figure 4, the reverberation detection module 302 includes：

Detecting signal unit 3021, for detecting in the speech data with the presence or absence of two frequency spectrum identical voice letters Number；

Determining unit 3022, for when there is two frequency spectrum identical voice signals in the speech data, determining institute State in speech data and there is the reverb signal, otherwise, it determines the reverb signal is not present in the speech data.

Preferably, as shown in figure 4, the mobile terminal 300 also includes：

Judge module 303 is spaced, for when there is the reverb signal in the speech data, judging two frequency spectrum phases Whether the time interval of same voice signal is less than predetermined threshold value, and when the time interval is less than the predetermined threshold value, touches Send out speech processing module 304 described and use mode parameter set in advance, the speech data collected is handled Step.

Preferably, the mode parameter includes noise reduction parameters and voice amplifying parameters；As shown in figure 4, the speech processes Module 304 includes：

First processing units 3041, for according to the noise reduction parameters, removing mixed in the speech data collected Signal is rung, the first voice signal is obtained, the reverb signal is acquisition time phase in described two frequency spectrum identical voice signals To later voice signal；

Second processing unit 3042, for according to the voice amplifying parameters, being amplified to first voice signal Processing, obtains the second voice signal.

Preferably, the voice amplifying parameters include overall amplitude increase and the corresponding amplitude increase of different frequency range； As shown in figure 4, the second processing unit 3042 includes：

First amplification subelement 30421, for according to the overall amplitude increase, increasing first voice signal The amplitude of all frequency ranges, obtains the 3rd voice signal；

Second amplification subelement 30422, for according to the corresponding amplitude increase of different frequency range, increasing the 3rd voice Amplitude of the signal at different frequency range, obtains the second voice signal.

Embodiments of the invention, gather the speech data that user inputs, so as to trigger reverberation by voice acquisition module 301 Detection module 302, which is detected, whether there is reverb signal in the speech data, so that when there is reverb signal, at triggering voice Manage module 304 and use mode parameter set in advance, the speech data collected is handled.

Wherein, when mobile terminal gathers the speech data of user's input, speak, connect in order to be able to allow if user lowers one's voice The user for receiving the counterpart device of voice signal does not hear the sound of speaking of the user, and the user can use stick shift in the Mike of mobile terminal Near wind, so that there is reverb signal in the voice signal of microphone collection.And in embodiments of the invention, collecting After the speech data of user's input, it can detect and whether there is reverb signal in the speech data collected, so as to there is reverberation During signal, using mode parameter set in advance, the speech data collected is handled, the logical of secret words pattern is realized Words, prevent user's sound of speaking from leaking, and protect privacy.It follows that embodiments of the invention, can be according to practical application scene The voice of input is handled, so that the call demand of user is met, the call experience of lifting user.

Embodiments of the invention additionally provide a kind of computer-readable recording medium, are stored thereon with computer program, should The step in method of speech processing described above is realized when program is executed by processor.

Embodiments of the invention additionally provide a kind of mobile terminal, and the mobile terminal can be mobile phone, tablet personal computer, individual Digital assistants (Personal Digital Assistant, PDA) or vehicle-mounted computer etc..

As shown in figure 5, mobile terminal 500 include radio frequency (Radio Frequency, RF) circuit 510, it is memory 520, defeated Enter unit 530, display unit 540, processor 560, voicefrequency circuit 570, the He of Wi-Fi (Wireless Fidelity) module 580 Power supply 590.

Wherein, input block 530 can be used for the numeral or character information for receiving user's input, and produce and mobile terminal The signal input that 500 user is set and function control is relevant.Specifically, in the embodiment of the present invention, the input block 530 can With including contact panel 531.Contact panel 531, also referred to as touch-screen, collect touch operation of the user on or near it (such as user uses the operations of any suitable object or annex on contact panel 531 such as finger, stylus), and according to advance The formula of setting drives corresponding attachment means.

Optionally, contact panel 531 may include both touch detecting apparatus and touch controller.Wherein, inspection is touched Survey device and detect the touch orientation of user, and detect the signal that touch operation is brought, transmit a signal to touch controller；Touch Controller receives touch information from touch detecting apparatus, and is converted into contact coordinate, then gives the processor 560, and The order sent of reception processing device 560 and it can be performed.Furthermore, it is possible to using resistance-type, condenser type, infrared ray and surface The polytypes such as sound wave realize contact panel 531.Except contact panel 531, input block 530 can also be set including other inputs Standby 532, other input equipments 532 can include but is not limited to physical keyboard, function key, and (such as volume control button, switch are pressed Key etc.), trace ball, mouse, the one or more in action bars etc..

Wherein, display unit 540 can be used for information and the movement for showing the information inputted by user or being supplied to user The various menu interfaces of terminal 500.Display unit 540 may include display panel 541, optionally, can use LCD or organic hairs The forms such as optical diode (Organic Light-Emitting Diode, OLED) configure display panel 541.

It should be noted that contact panel 531 can cover display panel 541, touch display screen is formed, when touch display screen inspection Measure after the touch operation on or near it, processor 860 is sent to determine the type of touch event, with preprocessor 560 provide corresponding visual output according to the type of touch event in touch display screen.

Touch display screen includes Application Program Interface viewing area and conventional control viewing area.The Application Program Interface viewing area And arrangement mode of the conventional control viewing area is not limited, can be arranged above and below, left-right situs etc. can distinguish two and show Show the arrangement mode in area.The Application Program Interface viewing area is displayed for the interface of application program.Each interface can be with The interface element such as the icon comprising at least one application program and/or widget desktop controls.The Application Program Interface viewing area It can also be the empty interface not comprising any content.The conventional control viewing area is used to show the higher control of utilization rate, for example, Application icons such as settings button, interface numbering, scroll bar, phone directory icon etc..

Wherein processor 560 is the control centre of mobile terminal 500, utilizes various interfaces and connection whole mobile phone Various pieces, software program and/or module in first memory 521 are stored in by operation or execution, and call storage Data in second memory 522, perform the various functions and processing data of mobile terminal 500, so as to mobile terminal 500 Carry out integral monitoring.Optionally, processor 560 may include one or more processing units.

In embodiments of the present invention, processor 560 can gather the speech data of user's input, and detect the voice number It whether there is reverb signal in, so as to when there is the reverb signal in the speech data, use mould set in advance Formula parameter, is handled the speech data collected.

Preferably, when processor 560 whether there is reverb signal in the detection speech data, specifically for：Detection With the presence or absence of two frequency spectrum identical voice signals in the speech data；It is identical when there are two frequency spectrums in the speech data Voice signal when, determine there is the reverb signal in the speech data, otherwise, it determines being not present in the speech data The reverb signal.

Preferably, processor 560 in the speech data is detected with the presence or absence of two frequency spectrum identical voice signals it Afterwards, it is additionally operable to：When there is the reverb signal in the speech data, the time of two frequency spectrum identical voice signals is judged Whether interval is less than predetermined threshold value, and when the time interval is less than the predetermined threshold value, performs and use mould set in advance Formula parameter, the step of handling the speech data collected.

Preferably, the mode parameter includes noise reduction parameters and voice amplifying parameters；Processor 560 is using presetting Mode parameter, when handling the speech data collected, specifically for：According to the noise reduction parameters, removal is adopted Reverb signal in the speech data collected, obtains the first voice signal, and the reverb signal is described two frequency spectrum phases With voice signal in acquisition time later voice signal relatively；According to the voice amplifying parameters, to first voice Signal is amplified processing, obtains the second voice signal.

Preferably, the voice amplifying parameters include overall amplitude increase and the corresponding amplitude increase of different frequency range； Processor 560 according to the voice amplifying parameters, is being amplified processing to first voice signal, obtains the second voice letter Number when, specifically for：According to the overall amplitude increase, increase the amplitude of all frequency ranges of first voice signal, obtain Obtain the 3rd voice signal；According to the corresponding amplitude increase of different frequency range, increase the 3rd voice signal at different frequency range Amplitude, obtain the second voice signal.

Mobile terminal 500 can realize each process that mobile terminal is realized in previous embodiment, to avoid repeating, here Repeat no more.

In embodiments of the present invention, when mobile terminal gathers the speech data of user's input, if user lowers one's voice Words, in order to be able to allow the user for the counterpart device for receiving the speech data not hear the sound of speaking of the user, the user can use stick shift Near the microphone of mobile terminal, so that there is reverb signal in the speech data of microphone collection.And the present invention In embodiment, after the speech data of user's input is collected, it can detect in the speech data collected with the presence or absence of reverberation letter Number, so that when there is reverb signal, using mode parameter set in advance, the speech data collected is handled, it is real The call of existing secret words pattern, prevents user's sound of speaking from leaking, and protects privacy.It follows that embodiments of the invention, can The voice of input is handled according to practical application scene, so as to meet the call demand of user, the call body of user is lifted Test.

Specifically, the mobile terminal 600 shown in Fig. 6 includes：At least one processor 601, memory 602, at least one Network interface 604, other users interface 603.Each component in mobile terminal 600 is coupled by bus system 605. It is understood that bus system 605 is used to realize the connection communication between these components.Bus system 605 except include data/address bus it Outside, in addition to power bus, controlling bus and status signal bus in addition.But for the sake of clear explanation, in figure 6 will be various total Line is all designated as bus system 605.

Wherein, user interface 603 can include display, keyboard or pointing device (for example, mouse, trace ball (trackball), touch-sensitive plate or touch-screen etc..

It is appreciated that the memory 602 in the embodiment of the present invention can be volatile memory or nonvolatile memory, Or may include both volatibility and nonvolatile memory.Wherein, nonvolatile memory can be read-only storage (Read- Only Memory, ROM), programmable read only memory (Programmable ROM, PROM), the read-only storage of erasable programmable Device (Erasable PROM, EPROM), Electrically Erasable Read Only Memory (Electrically EPROM, EEPROM) or Flash memory.Volatile memory can be random access memory (Random Access Memory, RAM), and it is used as outside high Speed caching.By exemplary but be not restricted explanation, the RAM of many forms can use, such as static RAM (Static RAM, SRAM), dynamic random access memory (Dynamic RAM, DRAM), Synchronous Dynamic Random Access Memory (Synchronous DRAM, SDRAM), double data speed synchronous dynamic RAM (Double Data Rate SDRAM, DDRSDRAM), enhanced Synchronous Dynamic Random Access Memory (Enhanced SDRAM, ESDRAM), synchronized links Dynamic random access memory (Synchlink DRAM, SLDRAM) and direct rambus random access memory (Direct Rambus RAM, DRRAM).The embodiment of the present invention description system and method memory 602 be intended to including but not limited to these With the memory of any other suitable type.

In some embodiments, memory 602 stores following element, can perform module or data structure, or Their subset of person, or their superset：Operating system 6021 and application program 6022.

Wherein, operating system 6021, comprising various system programs, such as ccf layer, core library layer, driving layer, are used for Realize various basic businesses and handle hardware based task.Application program 6022, includes various application programs, such as media Player (Media Player), browser (Browser) etc., for realizing various applied business.Realize the embodiment of the present invention The program of method may be embodied in application program 6022.

In embodiments of the present invention, by calling program or the instruction of the storage of memory 602, specifically, can be application The program stored in program 6022 or instruction.

In embodiments of the present invention, processor 601 can gather the speech data of user's input, and detect the voice number It whether there is reverb signal in, so as to when there is the reverb signal in the speech data, use mould set in advance Formula parameter, is handled the speech data collected.

The method that the embodiments of the present invention are disclosed can apply in processor 601, or be realized by processor 601. Processor 601 is probably a kind of IC chip, the disposal ability with signal.In implementation process, the above method it is each Step can be completed by the integrated logic circuit of the hardware in processor 601 or the instruction of software form.Above-mentioned processing Device 601 can be general processor, digital signal processor (Digital Signal Processor, DSP), special integrated electricity Road (Application Specific Integrated Circuit, ASIC), field programmable gate array (Field Programmable Gate Array, FPGA) or other PLDs, discrete gate or transistor logic, Discrete hardware components.It can realize or perform disclosed each method, step and the logic diagram in the embodiment of the present invention.It is general Processor can be microprocessor or the processor can also be any conventional processor etc..With reference to institute of the embodiment of the present invention The step of disclosed method, can be embodied directly in hardware decoding processor and perform completion, or with the hardware in decoding processor And software module combination performs completion.Software module can be located at random access memory, and flash memory, read-only storage may be programmed read-only In the ripe storage medium in this area such as memory or electrically erasable programmable memory, register.The storage medium is located at Memory 602, processor 601 reads the information in memory 602, the step of completing the above method with reference to its hardware.

It is understood that the embodiment of the present invention description these embodiments can with hardware, software, firmware, middleware, Microcode or its combination are realized.Realized for hardware, processing unit can be realized in one or more application specific integrated circuits (Application Specific Integrated Circuit, ASIC), digital signal processor (Digital Signal Processor, DSP), digital signal processing appts (DSP Device, DSPD), programmable logic device (Programmable Logic Device, PLD), field programmable gate array (Field Programmable Gate Array, FPGA), general place Manage in device, controller, microcontroller, microprocessor, other electronic units for performing the application function or its combination.

Realize, can be realized by performing the module (such as process, function) of function of the embodiment of the present invention for software The technology of the embodiment of the present invention.Software code is storable in memory and by computing device.Memory can be in processing Realized in device or outside processor.

Preferably, when processor 601 whether there is reverb signal in the detection speech data, specifically for：Detection With the presence or absence of two frequency spectrum identical voice signals in the speech data；It is identical when there are two frequency spectrums in the speech data Voice signal when, determine there is the reverb signal in the speech data, otherwise, it determines being not present in the speech data The reverb signal.

Preferably, processor 601 in the speech data is detected with the presence or absence of two frequency spectrum identical voice signals it Afterwards, it is additionally operable to：When there is the reverb signal in the speech data, the time of two frequency spectrum identical voice signals is judged Whether interval is less than predetermined threshold value, and when the time interval is less than the predetermined threshold value, performs and use mould set in advance Formula parameter, the step of handling the speech data collected.

Preferably, the mode parameter includes noise reduction parameters and voice amplifying parameters；Processor 601 is using presetting Mode parameter, when handling the speech data collected, specifically for：According to the noise reduction parameters, removal is adopted Reverb signal in the speech data collected, obtains the first voice signal, and the reverb signal is described two frequency spectrum phases With voice signal in acquisition time later voice signal relatively；According to the voice amplifying parameters, to first voice Signal is amplified processing, obtains the second voice signal.

Preferably, the voice amplifying parameters include overall amplitude increase and the corresponding amplitude increase of different frequency range； Processor 601 according to the voice amplifying parameters, is being amplified processing to first voice signal, obtains the second voice letter Number when, specifically for：According to the overall amplitude increase, increase the amplitude of all frequency ranges of first voice signal, obtain Obtain the 3rd voice signal；According to the corresponding amplitude increase of different frequency range, increase the 3rd voice signal at different frequency range Amplitude, obtain the second voice signal.

Mobile terminal 600 can realize each process that mobile terminal is realized in previous embodiment, to avoid repeating, here Repeat no more.

Those of ordinary skill in the art it is to be appreciated that with reference to disclosed in the embodiment of the present invention embodiment description it is each The unit and algorithm steps of example, can be realized with the combination of electronic hardware or computer software and electronic hardware.These Function is performed with hardware or software mode actually, depending on the application-specific and design constraint of technical scheme.Specialty Technical staff can realize described function to each specific application using distinct methods, but this realization should not Think beyond the scope of this invention.

It is apparent to those skilled in the art that, for convenience and simplicity of description, the system of foregoing description, The specific work process of device and unit, may be referred to the corresponding process in preceding method embodiment, will not be repeated here.

In embodiment provided herein, it should be understood that disclosed apparatus and method, others can be passed through Mode is realized.For example, device embodiment described above is only schematical, for example, the division of the unit, is only A kind of division of logic function, can there is other dividing mode when actually realizing, such as multiple units or component can combine or Person is desirably integrated into another system, or some features can be ignored, or does not perform.Another, shown or discussed is mutual Between coupling or direct-coupling or communication connection can be the INDIRECT COUPLING or communication link of device or unit by some interfaces Connect, can be electrical, machinery or other forms.

The unit illustrated as separating component can be or may not be it is physically separate, it is aobvious as unit The part shown can be or may not be physical location, you can with positioned at a place, or can also be distributed to multiple On NE.Some or all of unit therein can be selected to realize the mesh of this embodiment scheme according to the actual needs 's.

In addition, each functional unit in each embodiment of the invention can be integrated in a processing unit, can also That unit is individually physically present, can also two or more units it is integrated in a unit.

If the function is realized using in the form of SFU software functional unit and is used as independent production marketing or in use, can be with It is stored in a computer read/write memory medium.Understood based on such, technical scheme is substantially in other words The part contributed to prior art or the part of the technical scheme can be embodied in the form of software product, the meter Calculation machine software product is stored in a storage medium, including some instructions are to cause a computer equipment (can be individual People's computer, server, or network equipment etc.) perform all or part of step of each of the invention embodiment methods described. And foregoing storage medium includes：USB flash disk, mobile hard disk, ROM, RAM, magnetic disc or CD etc. are various can be with store program codes Medium.

The foregoing is only a specific embodiment of the invention, but protection scope of the present invention is not limited thereto, any Those familiar with the art the invention discloses technical scope in, change or replacement can be readily occurred in, should all be contained Cover within protection scope of the present invention.Therefore, protection scope of the present invention should be defined by scope of the claims.

Claims

1. a kind of method of speech processing, it is characterised in that including：

Gather the speech data of user's input；

Detect and whether there is reverb signal in the speech data；

When there is reverb signal in the speech data, using mode parameter set in advance, to the voice collected Data are handled.

2. according to the method described in claim 1, it is characterised in that with the presence or absence of reverberation letter in the detection speech data Number the step of, including：

Detect in the speech data with the presence or absence of two frequency spectrum identical voice signals；

When there is two frequency spectrum identical voice signals in the speech data, determine there is reverberation letter in the speech data Number, otherwise, it determines reverb signal is not present in the speech data.

3. method according to claim 2, it is characterised in that with the presence or absence of two frequencies in the detection speech data After the step of composing identical voice signal, methods described also includes：

When there is reverb signal in the speech data, judge whether the time interval of two frequency spectrum identical voice signals is small In predetermined threshold value, and when the time interval is less than the predetermined threshold value, performs and use mode parameter set in advance, to adopting The step of speech data collected is handled.

4. method according to claim 2, it is characterised in that the mode parameter includes noise reduction parameters and voice amplification ginseng Number；

Use mode parameter set in advance, the step of handling the speech data collected, including：

According to the noise reduction parameters, the reverb signal in the speech data collected is removed, the first voice signal, institute is obtained It is the later voice signal relatively of acquisition time in described two frequency spectrum identical voice signals to state reverb signal；

According to the voice amplifying parameters, processing is amplified to first voice signal, the second voice signal is obtained.

5. method according to claim 4, it is characterised in that the voice amplifying parameters include overall amplitude increase and The corresponding amplitude increase of different frequency range；

It is described that processing is amplified to first voice signal according to the voice amplifying parameters, obtain the second voice signal The step of, including：

According to the overall amplitude increase, increase the amplitude of all frequency ranges of first voice signal, obtain the 3rd voice Signal；

According to the corresponding amplitude increase of different frequency range, increase amplitude of the 3rd voice signal at different frequency range, obtain Second voice signal.

6. a kind of mobile terminal, it is characterised in that including：

Voice acquisition module, the speech data for gathering user's input；

Speech processing module is right using mode parameter set in advance for when there is reverb signal in the speech data The speech data collected is handled.

7. mobile terminal according to claim 6, it is characterised in that the reverberation detection module includes：

Detecting signal unit, for detecting in the speech data with the presence or absence of two frequency spectrum identical voice signals；

Determining unit, for when there is two frequency spectrum identical voice signals in the speech data, determining the voice number There is reverb signal in, otherwise, it determines reverb signal is not present in the speech data.

8. mobile terminal according to claim 7, it is characterised in that also include：

Judge module is spaced, for when there is reverb signal in the speech data, judging two frequency spectrum identical voice letters Number time interval whether be less than predetermined threshold value, and when the time interval is less than the predetermined threshold value, trigger the voice Processing module, which is performed, uses mode parameter set in advance, the step of handling the speech data collected.

9. mobile terminal according to claim 7, it is characterised in that the mode parameter includes noise reduction parameters and voice is put Big parameter；

The speech processing module includes：

First processing units, for according to the noise reduction parameters, removing the reverb signal in the speech data collected, obtaining The first voice signal, the reverb signal is acquisition time later language relatively in described two frequency spectrum identical voice signals Message number；

Second processing unit, for according to the voice amplifying parameters, processing to be amplified to first voice signal, is obtained Second voice signal.

10. mobile terminal according to claim 9, it is characterised in that the voice amplifying parameters include overall amplitude and increased The a large amount of and corresponding amplitude increase of different frequency range；

The second processing unit includes：

First amplification subelement, for according to the overall amplitude increase, increasing all frequency ranges of first voice signal Amplitude, obtain the 3rd voice signal；

Second amplification subelement, for according to the corresponding amplitude increase of different frequency range, increasing the 3rd voice signal not With the amplitude at frequency range, the second voice signal is obtained.

11. a kind of mobile terminal, it is characterised in that including memory, processor and be stored on the memory and can be in institute The voice processing program run on processor is stated, the voice processing program is realized such as claim during the computing device The step of method of speech processing any one of 1 to 5.

12. a kind of computer-readable recording medium, is stored thereon with computer program, it is characterised in that the program is by processor The step in the method for speech processing as any one of claim 1 to 5 is realized during execution.