CN107332984A - A kind of method of speech processing and mobile terminal - Google Patents
A kind of method of speech processing and mobile terminal Download PDFInfo
- Publication number
- CN107332984A CN107332984A CN201710474944.6A CN201710474944A CN107332984A CN 107332984 A CN107332984 A CN 107332984A CN 201710474944 A CN201710474944 A CN 201710474944A CN 107332984 A CN107332984 A CN 107332984A
- Authority
- CN
- China
- Prior art keywords
- voice
- speech data
- signal
- mobile terminal
- voice signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
- H04M1/72448—User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
- H04M1/72454—User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions according to context-related or environment-related conditions
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/60—Substation equipment, e.g. for use by subscribers including speech amplifiers
- H04M1/6008—Substation equipment, e.g. for use by subscribers including speech amplifiers in the transmitter circuit
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
- H04M1/72403—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
- H04M1/72484—User interfaces specially adapted for cordless or mobile telephones wherein functions are triggered by incoming communication events
Abstract
The invention provides a kind of method of speech processing and mobile terminal, this method includes:Gather the speech data of user's input;Detect and whether there is reverb signal in the speech data;When there is reverb signal in the speech data, using mode parameter set in advance, the speech data collected is handled.Therefore, the present invention can be handled the voice of input according to practical application scene, improve the call experience of user.
Description
Technical field
The present invention relates to communication technical field, more particularly to a kind of method of speech processing and mobile terminal.
Background technology
With the popularization of the development mobile device of society, the basic function conversed as mobile terminal, consumer is to call
Some specific demand more and more highers, such as when having a meeting, for important phone, user has to answer, but user
Sound is excessive when receiving calls, and can influence the progress of meeting, sound is too small to cause other side to hear content of speaking.
In addition, in some cases, user is when receiving calls, it is desirable to can not be heard by third party, so spoken sounds
It can lower.But, the call of handle in current mobile terminal is that reference man normally speaks the parameter of debugging, so that in user's communication
When Shi Shengyin very littles, it is impossible to other side is not just heard content of speaking.
It follows that in the prior art, the call experience of mobile terminal is poor.
The content of the invention
The embodiment provides a kind of method of speech processing and mobile terminal, to solve movement of the prior art
The problem of call experience of terminal is poor.
The embodiment provides a kind of method of speech processing, including:
Gather the speech data of user's input;
Detect and whether there is reverb signal in the speech data;
When there is the reverb signal in the speech data, using mode parameter set in advance, to what is collected
The speech data is handled.
Embodiments of the invention additionally provide a kind of mobile terminal, including:
Voice acquisition module, the speech data for gathering user's input;
Reverberation detection module, reverb signal is whether there is for detecting in the speech data;
Speech processing module, for when there is the reverb signal in the speech data, using mould set in advance
Formula parameter, is handled the speech data collected.
Embodiments of the invention additionally provide a kind of mobile terminal, including memory, processor and are stored in the storage
On device and the voice processing program that can run on the processor, the voice processing program is by real during the computing device
The step of showing method of speech processing as described above.
Embodiments of the invention additionally provide a kind of computer-readable recording medium, are stored thereon with computer program, should
The step in method of speech processing as described above is realized when program is executed by processor.
And in embodiments of the invention, after the speech data of user's input is collected, the voice number collected can be detected
It whether there is reverb signal in, so that when there is reverb signal, using mode parameter set in advance, to the language collected
Sound data are handled.It follows that embodiments of the invention, can according to practical application scene to the voice of input at
Reason, so that the call demand of user is met, the call experience of lifting user.
Brief description of the drawings
In order to illustrate the technical solution of the embodiments of the present invention more clearly, below by institute in the description to the embodiment of the present invention
The accompanying drawing needed to use is briefly described, it should be apparent that, drawings in the following description are only some implementations of the present invention
Example, for those of ordinary skill in the art, without having to pay creative labor, can also be according to these accompanying drawings
Obtain other accompanying drawings.
Fig. 1 represents a kind of flow chart of method of speech processing provided in an embodiment of the present invention;
Fig. 2 represents the flow chart of another method of speech processing provided in an embodiment of the present invention;
Fig. 3 represents a kind of one of structured flowchart of mobile terminal provided in an embodiment of the present invention;
Fig. 4 represents the two of the structured flowchart of a kind of mobile terminal provided in an embodiment of the present invention;
Fig. 5 represents the structured flowchart of another mobile terminal provided in an embodiment of the present invention;
Fig. 6 represents the structured flowchart of another mobile terminal provided in an embodiment of the present invention.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete
Site preparation is described, it is clear that described embodiment is a part of embodiment of the invention, rather than whole embodiments.Based on this hair
Embodiment in bright, the every other implementation that those of ordinary skill in the art are obtained under the premise of creative work is not made
Example, belongs to the scope of protection of the invention.
The embodiment provides a kind of method of speech processing, as shown in figure 1, this method includes:
Step 101:Gather the speech data of user's input.
Wherein, when the speech data is conversed including user using mobile terminal, collected by mobile terminal
The voice signal that user sends, and user using mobile terminal send speech message when, the use collected by mobile terminal
The voice signal that family is sent.
Step 102:Detect and whether there is reverb signal in the speech data.
Wherein, the voice signal entered after the reverb signal is reflected for the voice signal that user sends in microphone.
In addition, when user is conversed or sent speech message using mobile terminal, and lower one's voice when speaking, in order to be able to
Partner is allowed not hear sound of speaking, user can be with stick shift near the microphone of mobile terminal, so that what user sent
Voice signal is blocked the hand reflection of microphone, hence into microphone, produces reverb signal.And when user will block Mike
When the hand of wind is removed, the voice signal that user sends will not then be blocked microphone hand reflection so that will not be in Mike's elegance
Reverb signal is detected in the speech data of collection.
, can be according to whether there is reverb signal, area in the speech data collected it follows that embodiments of the invention
Mobile terminal is divided to gather different application scene residing during the speech data of user's input, so that voice number of point situation to collection
According to being handled.
Step 103:It is right using mode parameter set in advance when there is the reverb signal in the speech data
The speech data collected is handled.
Wherein, the speech processes used when mode parameter set in advance carries out secret words pattern by mobile terminal are joined
Number., can be by the voice signal after handling the voice signal collected using the mode parameter set in advance
Amplitude is amplified, so that the user of opposite equip. can not clearly hear the content that user speaks.
In addition, when the reverb signal is not present in the speech data, keeping logical under mobile terminal normal condition
Words pattern, i.e., not to collection to the amplitude of speech data be amplified processing.
When detected by step 102 there is reverb signal in the speech data when, in embodiments of the invention, will move
Dynamic terminal is switched to secret words pattern, loads mode parameter set in advance, the speech data collected is handled so that
Even if user sends the sound of very little, partner can be also allowed to listen and must will be apparent that;When detecting the voice by step 102
When reverb signal is not present in data, embodiments of the invention switch mobile terminal into the call mode of normal condition.
It follows that embodiments of the invention, after the speech data of user's input is collected, can detect the language collected
It whether there is reverb signal in sound data, so that when there is reverb signal, using mode parameter set in advance, to collecting
Speech data handled, realize the call of secret words pattern, prevent user's sound of speaking leak, protect privacy.Therefore, originally
The embodiment of invention, is handled the voice of input according to practical application scene, so as to meet the call demand of user, is lifted
The call experience of user.
Embodiments of the invention additionally provide another method of speech processing, as shown in Fig. 2 this method includes:
Step 201:Gather the speech data of user's input.
Wherein, the speech data includes user's hair that mobile terminal when user is conversed using mobile terminal is collected
The voice that the user that mobile terminal is collected when the voice signal and user gone out sends speech message using mobile terminal sends is believed
Number.
Step 202:Detect in the speech data with the presence or absence of two frequency spectrum identical voice signals.
Step 203:When there is two frequency spectrum identical voice signals in the speech data, the speech data is determined
In there is the reverb signal, otherwise, it determines in the speech data be not present the reverb signal.
Wherein, acquisition time later voice signal relatively is described mixed in the two frequency spectrum identical voice signals existed
Ring signal.
In addition, the voice signal that the reverb signal enters in microphone after being reflected for the voice signal that user sends.
So, the frequency spectrum for the voice signal that frequency spectrum and the user of reverb signal send is identical, so the voice signal gathered when microphone
During two frequency spectrum identical voice signals of middle presence, it is possible to determine that there is reverb signal in the voice signal collected.
Further, in the above-mentioned detection speech data whether there is two frequency spectrum identical voice signals the step of it
Afterwards, methods described also includes:
When there is the reverb signal in the speech data, between the time for judging two frequency spectrum identical voice signals
Every whether be less than predetermined threshold value, and the time interval be less than the predetermined threshold value when, perform use pattern set in advance
Parameter, the step of handling the speech data collected.
Wherein, during the speech data that user's input is gathered in mobile terminal, hinder if there are other in local environment
Hinder thing, then the voice signal that user sends can also be reflected by barrier, and the reflected signal is gathered by the microphone of mobile terminal
Then, there is frequency spectrum identical voice signal in the speech data that can equally detect microphone collection, still, now user is simultaneously
The secret words pattern of mobile terminal need not be used.
Therefore, there is reverb signal in embodiments of the invention, i.e., in the speech data for detect collection in the presence of two frequencies
When composing identical voice signal, further detect whether the time interval of the two frequency spectrum identical voice signals is less than default threshold
Value, and when less than predetermined threshold value, be just switched to the secret words pattern of mobile terminal, that is, load mode parameter set in advance,
The speech data collected is handled.
The voice signal sent due to user reflected by extraneous barrier after the voice signal that is sent with user of signal it
Between time interval, the language that signal and user after the hand reflection of microphone that is blocked more than the voice signal that user sends send
Time interval between message number, so, by setting predetermined threshold value, it can effectively exclude the voice signal that user sends outer
The situation that boundary's barrier reflects and entered in microphone, so as to avoid the mistake processing of the speech data to collecting.
In addition, the step of whether the above-mentioned time interval for judging two frequency spectrum identical voice signals is less than predetermined threshold value
Afterwards, in addition to:When the time interval is more than or equal to the predetermined threshold value, keep logical under mobile terminal normal condition
Words pattern, i.e., not to collection to the amplitude of speech data be amplified processing.
In the embodiment of the present invention, when between two voice signals of frequency spectrum identical present in the speech data collected
When time interval is more than or equal to predetermined threshold value, represent that reverb signal present in the speech data collected is not by user
The hand reflection of microphone is blocked, i.e., now user does not need to use the secret words pattern of mobile terminal, i.e., need not load
Mode parameter set in advance is handled the speech data collected, then keeps the call mould under mobile terminal normal condition
Formula.
Step 204:It is right using mode parameter set in advance when there is the reverb signal in the speech data
The speech data collected is handled.
Wherein, the speech processes used when mode parameter set in advance carries out secret words pattern by mobile terminal are joined
Number., can be by the speech data after handling the speech data collected using the mode parameter set in advance
Amplitude is amplified, so that the user of opposite equip. can not clearly hear the content that user speaks.
In addition, when user is conversed or sent speech message using mobile terminal, and lower one's voice when speaking, in order to be able to
Partner is allowed not hear sound of speaking, user can be with stick shift near the microphone of mobile terminal, so that what user sent
Voice signal is blocked the hand reflection of microphone, hence into microphone, produces reverb signal.And when user will block Mike
When the hand of wind is removed, the voice signal that user sends will not then be blocked microphone hand reflection so that will not be in Mike's elegance
Reverb signal is detected in the speech data of collection.
, can be according to whether there is reverb signal, area in the speech data collected it follows that embodiments of the invention
Mobile terminal is divided to gather different application scene residing during the speech data of user's input, so that voice number of point situation to collection
According to being handled.
Specifically, when there is reverb signal in the speech data collected, secret words pattern is switched mobile terminal into,
Mode parameter set in advance is loaded, the speech data collected is handled so that even if user sends the sound of very little,
Also the counterpart device user for receiving speech data can be allowed not hear the content of speaking of the user;When the speech data collected
In be not present reverb signal when, switch mobile terminal into the call mode of normal condition so that mobile terminal can be realized just
Normal voice call function.
Preferably, the mode parameter includes noise reduction parameters and voice amplifying parameters;Step 204 includes:According to the drop
Make an uproar parameter, remove the reverb signal in the speech data collected, obtain the first voice signal, the reverb signal is institute
State the later voice signal relatively of acquisition time in two frequency spectrum identical voice signals;It is right according to the voice amplifying parameters
First voice signal is amplified processing, obtains the second voice signal.
In the embodiment of the present invention, when mobile terminal is in secret words pattern, it will be mixed present in the speech data collected
Ring signal to remove, and processing is amplified to removing the voice signal after the reverb signal.
Wherein, can be by reverberation during the reverb signal in the speech data collected according to noise reduction parameters, removal
Signal is together removed with other noises, so that the voice signal after noise reduction becomes apparent from.Further, since in secret words pattern
Under collect user input speech data when, the sound of speaking of user is smaller, so, can by speech data enhanced processing
To allow the counterpart device user for receiving the speech data more clearly to hear the content of speaking of the user.
Further, the voice amplifying parameters include overall amplitude increase and the corresponding amplitude increase of different frequency range
Amount;It is described that processing is amplified to first voice signal according to the voice amplifying parameters, obtain the second voice signal
Step, including:According to the overall amplitude increase, increase the amplitude of all frequency ranges of first voice signal, obtain the
Three voice signals;According to the corresponding amplitude increase of different frequency range, increase the 3rd voice signal the shaking at different frequency range
Width, obtains the second voice signal.
In embodiments of the invention, during the first voice signal is amplified processing, first, the first voice is believed
Number overall amplification, then, according to the corresponding amplitude increase of predetermined different frequency range, lifts shaking for different frequency range one by one
Width so that the second voice signal of acquisition can meet more preferable frequency response curve, improves the transmission quality of speech data,
So as to further improve the sense of hearing of speech data.
In summary, in embodiments of the present invention, when there are two frequency spectrum phases in the speech data of the collection of mobile terminal
With voice signal, and the time intervals of the two voice signals is when being less than predetermined threshold value, then loads pattern set in advance ginseng
Number, handles the speech data collected, realizes the call of secret words pattern, prevents user's sound of speaking from leaking, protection
Privacy;When identical in the absence of two frequency spectrum identical voice signals, or two frequency spectrums of presence in the speech data collected
Voice signal time interval be more than or equal to predetermined threshold value when, then keep mobile terminal normal condition under call.Thus
Understand, embodiments of the invention can be handled the speech data of input according to practical application scene, so as to meet user
Call demand, lifting user call experience.
Embodiments of the invention additionally provide a kind of mobile terminal, as shown in figure 3, the mobile terminal 300 includes:
Voice acquisition module 301, the speech data for gathering user's input;
Reverberation detection module 302, reverb signal is whether there is for detecting in the speech data;
Speech processing module 304, for when there is the reverb signal in the speech data, using set in advance
Mode parameter, is handled the speech data collected.
Preferably, as shown in figure 4, the reverberation detection module 302 includes:
Detecting signal unit 3021, for detecting in the speech data with the presence or absence of two frequency spectrum identical voice letters
Number;
Determining unit 3022, for when there is two frequency spectrum identical voice signals in the speech data, determining institute
State in speech data and there is the reverb signal, otherwise, it determines the reverb signal is not present in the speech data.
Preferably, as shown in figure 4, the mobile terminal 300 also includes:
Judge module 303 is spaced, for when there is the reverb signal in the speech data, judging two frequency spectrum phases
Whether the time interval of same voice signal is less than predetermined threshold value, and when the time interval is less than the predetermined threshold value, touches
Send out speech processing module 304 described and use mode parameter set in advance, the speech data collected is handled
Step.
Preferably, the mode parameter includes noise reduction parameters and voice amplifying parameters;As shown in figure 4, the speech processes
Module 304 includes:
First processing units 3041, for according to the noise reduction parameters, removing mixed in the speech data collected
Signal is rung, the first voice signal is obtained, the reverb signal is acquisition time phase in described two frequency spectrum identical voice signals
To later voice signal;
Second processing unit 3042, for according to the voice amplifying parameters, being amplified to first voice signal
Processing, obtains the second voice signal.
Preferably, the voice amplifying parameters include overall amplitude increase and the corresponding amplitude increase of different frequency range;
As shown in figure 4, the second processing unit 3042 includes:
First amplification subelement 30421, for according to the overall amplitude increase, increasing first voice signal
The amplitude of all frequency ranges, obtains the 3rd voice signal;
Second amplification subelement 30422, for according to the corresponding amplitude increase of different frequency range, increasing the 3rd voice
Amplitude of the signal at different frequency range, obtains the second voice signal.
Embodiments of the invention, gather the speech data that user inputs, so as to trigger reverberation by voice acquisition module 301
Detection module 302, which is detected, whether there is reverb signal in the speech data, so that when there is reverb signal, at triggering voice
Manage module 304 and use mode parameter set in advance, the speech data collected is handled.
Wherein, when mobile terminal gathers the speech data of user's input, speak, connect in order to be able to allow if user lowers one's voice
The user for receiving the counterpart device of voice signal does not hear the sound of speaking of the user, and the user can use stick shift in the Mike of mobile terminal
Near wind, so that there is reverb signal in the voice signal of microphone collection.And in embodiments of the invention, collecting
After the speech data of user's input, it can detect and whether there is reverb signal in the speech data collected, so as to there is reverberation
During signal, using mode parameter set in advance, the speech data collected is handled, the logical of secret words pattern is realized
Words, prevent user's sound of speaking from leaking, and protect privacy.It follows that embodiments of the invention, can be according to practical application scene
The voice of input is handled, so that the call demand of user is met, the call experience of lifting user.
Embodiments of the invention additionally provide a kind of mobile terminal, including memory, processor and are stored in the storage
On device and the voice processing program that can run on the processor, the voice processing program is by real during the computing device
The step of showing method of speech processing as described above.
Embodiments of the invention additionally provide a kind of computer-readable recording medium, are stored thereon with computer program, should
The step in method of speech processing described above is realized when program is executed by processor.
Embodiments of the invention additionally provide a kind of mobile terminal, and the mobile terminal can be mobile phone, tablet personal computer, individual
Digital assistants (Personal Digital Assistant, PDA) or vehicle-mounted computer etc..
As shown in figure 5, mobile terminal 500 include radio frequency (Radio Frequency, RF) circuit 510, it is memory 520, defeated
Enter unit 530, display unit 540, processor 560, voicefrequency circuit 570, the He of Wi-Fi (Wireless Fidelity) module 580
Power supply 590.
Wherein, input block 530 can be used for the numeral or character information for receiving user's input, and produce and mobile terminal
The signal input that 500 user is set and function control is relevant.Specifically, in the embodiment of the present invention, the input block 530 can
With including contact panel 531.Contact panel 531, also referred to as touch-screen, collect touch operation of the user on or near it
(such as user uses the operations of any suitable object or annex on contact panel 531 such as finger, stylus), and according to advance
The formula of setting drives corresponding attachment means.
Optionally, contact panel 531 may include both touch detecting apparatus and touch controller.Wherein, inspection is touched
Survey device and detect the touch orientation of user, and detect the signal that touch operation is brought, transmit a signal to touch controller;Touch
Controller receives touch information from touch detecting apparatus, and is converted into contact coordinate, then gives the processor 560, and
The order sent of reception processing device 560 and it can be performed.Furthermore, it is possible to using resistance-type, condenser type, infrared ray and surface
The polytypes such as sound wave realize contact panel 531.Except contact panel 531, input block 530 can also be set including other inputs
Standby 532, other input equipments 532 can include but is not limited to physical keyboard, function key, and (such as volume control button, switch are pressed
Key etc.), trace ball, mouse, the one or more in action bars etc..
Wherein, display unit 540 can be used for information and the movement for showing the information inputted by user or being supplied to user
The various menu interfaces of terminal 500.Display unit 540 may include display panel 541, optionally, can use LCD or organic hairs
The forms such as optical diode (Organic Light-Emitting Diode, OLED) configure display panel 541.
It should be noted that contact panel 531 can cover display panel 541, touch display screen is formed, when touch display screen inspection
Measure after the touch operation on or near it, processor 860 is sent to determine the type of touch event, with preprocessor
560 provide corresponding visual output according to the type of touch event in touch display screen.
Touch display screen includes Application Program Interface viewing area and conventional control viewing area.The Application Program Interface viewing area
And arrangement mode of the conventional control viewing area is not limited, can be arranged above and below, left-right situs etc. can distinguish two and show
Show the arrangement mode in area.The Application Program Interface viewing area is displayed for the interface of application program.Each interface can be with
The interface element such as the icon comprising at least one application program and/or widget desktop controls.The Application Program Interface viewing area
It can also be the empty interface not comprising any content.The conventional control viewing area is used to show the higher control of utilization rate, for example,
Application icons such as settings button, interface numbering, scroll bar, phone directory icon etc..
Wherein processor 560 is the control centre of mobile terminal 500, utilizes various interfaces and connection whole mobile phone
Various pieces, software program and/or module in first memory 521 are stored in by operation or execution, and call storage
Data in second memory 522, perform the various functions and processing data of mobile terminal 500, so as to mobile terminal 500
Carry out integral monitoring.Optionally, processor 560 may include one or more processing units.
In embodiments of the present invention, processor 560 can gather the speech data of user's input, and detect the voice number
It whether there is reverb signal in, so as to when there is the reverb signal in the speech data, use mould set in advance
Formula parameter, is handled the speech data collected.
Preferably, when processor 560 whether there is reverb signal in the detection speech data, specifically for:Detection
With the presence or absence of two frequency spectrum identical voice signals in the speech data;It is identical when there are two frequency spectrums in the speech data
Voice signal when, determine there is the reverb signal in the speech data, otherwise, it determines being not present in the speech data
The reverb signal.
Preferably, processor 560 in the speech data is detected with the presence or absence of two frequency spectrum identical voice signals it
Afterwards, it is additionally operable to:When there is the reverb signal in the speech data, the time of two frequency spectrum identical voice signals is judged
Whether interval is less than predetermined threshold value, and when the time interval is less than the predetermined threshold value, performs and use mould set in advance
Formula parameter, the step of handling the speech data collected.
Preferably, the mode parameter includes noise reduction parameters and voice amplifying parameters;Processor 560 is using presetting
Mode parameter, when handling the speech data collected, specifically for:According to the noise reduction parameters, removal is adopted
Reverb signal in the speech data collected, obtains the first voice signal, and the reverb signal is described two frequency spectrum phases
With voice signal in acquisition time later voice signal relatively;According to the voice amplifying parameters, to first voice
Signal is amplified processing, obtains the second voice signal.
Preferably, the voice amplifying parameters include overall amplitude increase and the corresponding amplitude increase of different frequency range;
Processor 560 according to the voice amplifying parameters, is being amplified processing to first voice signal, obtains the second voice letter
Number when, specifically for:According to the overall amplitude increase, increase the amplitude of all frequency ranges of first voice signal, obtain
Obtain the 3rd voice signal;According to the corresponding amplitude increase of different frequency range, increase the 3rd voice signal at different frequency range
Amplitude, obtain the second voice signal.
Mobile terminal 500 can realize each process that mobile terminal is realized in previous embodiment, to avoid repeating, here
Repeat no more.
In embodiments of the present invention, when mobile terminal gathers the speech data of user's input, if user lowers one's voice
Words, in order to be able to allow the user for the counterpart device for receiving the speech data not hear the sound of speaking of the user, the user can use stick shift
Near the microphone of mobile terminal, so that there is reverb signal in the speech data of microphone collection.And the present invention
In embodiment, after the speech data of user's input is collected, it can detect in the speech data collected with the presence or absence of reverberation letter
Number, so that when there is reverb signal, using mode parameter set in advance, the speech data collected is handled, it is real
The call of existing secret words pattern, prevents user's sound of speaking from leaking, and protects privacy.It follows that embodiments of the invention, can
The voice of input is handled according to practical application scene, so as to meet the call demand of user, the call body of user is lifted
Test.
Embodiments of the invention additionally provide a kind of mobile terminal, and the mobile terminal can be mobile phone, tablet personal computer, individual
Digital assistants (Personal Digital Assistant, PDA) or vehicle-mounted computer etc..
Specifically, the mobile terminal 600 shown in Fig. 6 includes:At least one processor 601, memory 602, at least one
Network interface 604, other users interface 603.Each component in mobile terminal 600 is coupled by bus system 605.
It is understood that bus system 605 is used to realize the connection communication between these components.Bus system 605 except include data/address bus it
Outside, in addition to power bus, controlling bus and status signal bus in addition.But for the sake of clear explanation, in figure 6 will be various total
Line is all designated as bus system 605.
Wherein, user interface 603 can include display, keyboard or pointing device (for example, mouse, trace ball
(trackball), touch-sensitive plate or touch-screen etc..
It is appreciated that the memory 602 in the embodiment of the present invention can be volatile memory or nonvolatile memory,
Or may include both volatibility and nonvolatile memory.Wherein, nonvolatile memory can be read-only storage (Read-
Only Memory, ROM), programmable read only memory (Programmable ROM, PROM), the read-only storage of erasable programmable
Device (Erasable PROM, EPROM), Electrically Erasable Read Only Memory (Electrically EPROM, EEPROM) or
Flash memory.Volatile memory can be random access memory (Random Access Memory, RAM), and it is used as outside high
Speed caching.By exemplary but be not restricted explanation, the RAM of many forms can use, such as static RAM
(Static RAM, SRAM), dynamic random access memory (Dynamic RAM, DRAM), Synchronous Dynamic Random Access Memory
(Synchronous DRAM, SDRAM), double data speed synchronous dynamic RAM (Double Data Rate
SDRAM, DDRSDRAM), enhanced Synchronous Dynamic Random Access Memory (Enhanced SDRAM, ESDRAM), synchronized links
Dynamic random access memory (Synchlink DRAM, SLDRAM) and direct rambus random access memory (Direct
Rambus RAM, DRRAM).The embodiment of the present invention description system and method memory 602 be intended to including but not limited to these
With the memory of any other suitable type.
In some embodiments, memory 602 stores following element, can perform module or data structure, or
Their subset of person, or their superset:Operating system 6021 and application program 6022.
Wherein, operating system 6021, comprising various system programs, such as ccf layer, core library layer, driving layer, are used for
Realize various basic businesses and handle hardware based task.Application program 6022, includes various application programs, such as media
Player (Media Player), browser (Browser) etc., for realizing various applied business.Realize the embodiment of the present invention
The program of method may be embodied in application program 6022.
In embodiments of the present invention, by calling program or the instruction of the storage of memory 602, specifically, can be application
The program stored in program 6022 or instruction.
In embodiments of the present invention, processor 601 can gather the speech data of user's input, and detect the voice number
It whether there is reverb signal in, so as to when there is the reverb signal in the speech data, use mould set in advance
Formula parameter, is handled the speech data collected.
The method that the embodiments of the present invention are disclosed can apply in processor 601, or be realized by processor 601.
Processor 601 is probably a kind of IC chip, the disposal ability with signal.In implementation process, the above method it is each
Step can be completed by the integrated logic circuit of the hardware in processor 601 or the instruction of software form.Above-mentioned processing
Device 601 can be general processor, digital signal processor (Digital Signal Processor, DSP), special integrated electricity
Road (Application Specific Integrated Circuit, ASIC), field programmable gate array (Field
Programmable Gate Array, FPGA) or other PLDs, discrete gate or transistor logic,
Discrete hardware components.It can realize or perform disclosed each method, step and the logic diagram in the embodiment of the present invention.It is general
Processor can be microprocessor or the processor can also be any conventional processor etc..With reference to institute of the embodiment of the present invention
The step of disclosed method, can be embodied directly in hardware decoding processor and perform completion, or with the hardware in decoding processor
And software module combination performs completion.Software module can be located at random access memory, and flash memory, read-only storage may be programmed read-only
In the ripe storage medium in this area such as memory or electrically erasable programmable memory, register.The storage medium is located at
Memory 602, processor 601 reads the information in memory 602, the step of completing the above method with reference to its hardware.
It is understood that the embodiment of the present invention description these embodiments can with hardware, software, firmware, middleware,
Microcode or its combination are realized.Realized for hardware, processing unit can be realized in one or more application specific integrated circuits
(Application Specific Integrated Circuit, ASIC), digital signal processor (Digital Signal
Processor, DSP), digital signal processing appts (DSP Device, DSPD), programmable logic device (Programmable
Logic Device, PLD), field programmable gate array (Field Programmable Gate Array, FPGA), general place
Manage in device, controller, microcontroller, microprocessor, other electronic units for performing the application function or its combination.
Realize, can be realized by performing the module (such as process, function) of function of the embodiment of the present invention for software
The technology of the embodiment of the present invention.Software code is storable in memory and by computing device.Memory can be in processing
Realized in device or outside processor.
Preferably, when processor 601 whether there is reverb signal in the detection speech data, specifically for:Detection
With the presence or absence of two frequency spectrum identical voice signals in the speech data;It is identical when there are two frequency spectrums in the speech data
Voice signal when, determine there is the reverb signal in the speech data, otherwise, it determines being not present in the speech data
The reverb signal.
Preferably, processor 601 in the speech data is detected with the presence or absence of two frequency spectrum identical voice signals it
Afterwards, it is additionally operable to:When there is the reverb signal in the speech data, the time of two frequency spectrum identical voice signals is judged
Whether interval is less than predetermined threshold value, and when the time interval is less than the predetermined threshold value, performs and use mould set in advance
Formula parameter, the step of handling the speech data collected.
Preferably, the mode parameter includes noise reduction parameters and voice amplifying parameters;Processor 601 is using presetting
Mode parameter, when handling the speech data collected, specifically for:According to the noise reduction parameters, removal is adopted
Reverb signal in the speech data collected, obtains the first voice signal, and the reverb signal is described two frequency spectrum phases
With voice signal in acquisition time later voice signal relatively;According to the voice amplifying parameters, to first voice
Signal is amplified processing, obtains the second voice signal.
Preferably, the voice amplifying parameters include overall amplitude increase and the corresponding amplitude increase of different frequency range;
Processor 601 according to the voice amplifying parameters, is being amplified processing to first voice signal, obtains the second voice letter
Number when, specifically for:According to the overall amplitude increase, increase the amplitude of all frequency ranges of first voice signal, obtain
Obtain the 3rd voice signal;According to the corresponding amplitude increase of different frequency range, increase the 3rd voice signal at different frequency range
Amplitude, obtain the second voice signal.
Mobile terminal 600 can realize each process that mobile terminal is realized in previous embodiment, to avoid repeating, here
Repeat no more.
In embodiments of the present invention, when mobile terminal gathers the speech data of user's input, if user lowers one's voice
Words, in order to be able to allow the user for the counterpart device for receiving the speech data not hear the sound of speaking of the user, the user can use stick shift
Near the microphone of mobile terminal, so that there is reverb signal in the speech data of microphone collection.And the present invention
In embodiment, after the speech data of user's input is collected, it can detect in the speech data collected with the presence or absence of reverberation letter
Number, so that when there is reverb signal, using mode parameter set in advance, the speech data collected is handled, it is real
The call of existing secret words pattern, prevents user's sound of speaking from leaking, and protects privacy.It follows that embodiments of the invention, can
The voice of input is handled according to practical application scene, so as to meet the call demand of user, the call body of user is lifted
Test.
Those of ordinary skill in the art it is to be appreciated that with reference to disclosed in the embodiment of the present invention embodiment description it is each
The unit and algorithm steps of example, can be realized with the combination of electronic hardware or computer software and electronic hardware.These
Function is performed with hardware or software mode actually, depending on the application-specific and design constraint of technical scheme.Specialty
Technical staff can realize described function to each specific application using distinct methods, but this realization should not
Think beyond the scope of this invention.
It is apparent to those skilled in the art that, for convenience and simplicity of description, the system of foregoing description,
The specific work process of device and unit, may be referred to the corresponding process in preceding method embodiment, will not be repeated here.
In embodiment provided herein, it should be understood that disclosed apparatus and method, others can be passed through
Mode is realized.For example, device embodiment described above is only schematical, for example, the division of the unit, is only
A kind of division of logic function, can there is other dividing mode when actually realizing, such as multiple units or component can combine or
Person is desirably integrated into another system, or some features can be ignored, or does not perform.Another, shown or discussed is mutual
Between coupling or direct-coupling or communication connection can be the INDIRECT COUPLING or communication link of device or unit by some interfaces
Connect, can be electrical, machinery or other forms.
The unit illustrated as separating component can be or may not be it is physically separate, it is aobvious as unit
The part shown can be or may not be physical location, you can with positioned at a place, or can also be distributed to multiple
On NE.Some or all of unit therein can be selected to realize the mesh of this embodiment scheme according to the actual needs
's.
In addition, each functional unit in each embodiment of the invention can be integrated in a processing unit, can also
That unit is individually physically present, can also two or more units it is integrated in a unit.
If the function is realized using in the form of SFU software functional unit and is used as independent production marketing or in use, can be with
It is stored in a computer read/write memory medium.Understood based on such, technical scheme is substantially in other words
The part contributed to prior art or the part of the technical scheme can be embodied in the form of software product, the meter
Calculation machine software product is stored in a storage medium, including some instructions are to cause a computer equipment (can be individual
People's computer, server, or network equipment etc.) perform all or part of step of each of the invention embodiment methods described.
And foregoing storage medium includes:USB flash disk, mobile hard disk, ROM, RAM, magnetic disc or CD etc. are various can be with store program codes
Medium.
The foregoing is only a specific embodiment of the invention, but protection scope of the present invention is not limited thereto, any
Those familiar with the art the invention discloses technical scope in, change or replacement can be readily occurred in, should all be contained
Cover within protection scope of the present invention.Therefore, protection scope of the present invention should be defined by scope of the claims.
Claims (12)
1. a kind of method of speech processing, it is characterised in that including:
Gather the speech data of user's input;
Detect and whether there is reverb signal in the speech data;
When there is reverb signal in the speech data, using mode parameter set in advance, to the voice collected
Data are handled.
2. according to the method described in claim 1, it is characterised in that with the presence or absence of reverberation letter in the detection speech data
Number the step of, including:
Detect in the speech data with the presence or absence of two frequency spectrum identical voice signals;
When there is two frequency spectrum identical voice signals in the speech data, determine there is reverberation letter in the speech data
Number, otherwise, it determines reverb signal is not present in the speech data.
3. method according to claim 2, it is characterised in that with the presence or absence of two frequencies in the detection speech data
After the step of composing identical voice signal, methods described also includes:
When there is reverb signal in the speech data, judge whether the time interval of two frequency spectrum identical voice signals is small
In predetermined threshold value, and when the time interval is less than the predetermined threshold value, performs and use mode parameter set in advance, to adopting
The step of speech data collected is handled.
4. method according to claim 2, it is characterised in that the mode parameter includes noise reduction parameters and voice amplification ginseng
Number;
Use mode parameter set in advance, the step of handling the speech data collected, including:
According to the noise reduction parameters, the reverb signal in the speech data collected is removed, the first voice signal, institute is obtained
It is the later voice signal relatively of acquisition time in described two frequency spectrum identical voice signals to state reverb signal;
According to the voice amplifying parameters, processing is amplified to first voice signal, the second voice signal is obtained.
5. method according to claim 4, it is characterised in that the voice amplifying parameters include overall amplitude increase and
The corresponding amplitude increase of different frequency range;
It is described that processing is amplified to first voice signal according to the voice amplifying parameters, obtain the second voice signal
The step of, including:
According to the overall amplitude increase, increase the amplitude of all frequency ranges of first voice signal, obtain the 3rd voice
Signal;
According to the corresponding amplitude increase of different frequency range, increase amplitude of the 3rd voice signal at different frequency range, obtain
Second voice signal.
6. a kind of mobile terminal, it is characterised in that including:
Voice acquisition module, the speech data for gathering user's input;
Reverberation detection module, reverb signal is whether there is for detecting in the speech data;
Speech processing module is right using mode parameter set in advance for when there is reverb signal in the speech data
The speech data collected is handled.
7. mobile terminal according to claim 6, it is characterised in that the reverberation detection module includes:
Detecting signal unit, for detecting in the speech data with the presence or absence of two frequency spectrum identical voice signals;
Determining unit, for when there is two frequency spectrum identical voice signals in the speech data, determining the voice number
There is reverb signal in, otherwise, it determines reverb signal is not present in the speech data.
8. mobile terminal according to claim 7, it is characterised in that also include:
Judge module is spaced, for when there is reverb signal in the speech data, judging two frequency spectrum identical voice letters
Number time interval whether be less than predetermined threshold value, and when the time interval is less than the predetermined threshold value, trigger the voice
Processing module, which is performed, uses mode parameter set in advance, the step of handling the speech data collected.
9. mobile terminal according to claim 7, it is characterised in that the mode parameter includes noise reduction parameters and voice is put
Big parameter;
The speech processing module includes:
First processing units, for according to the noise reduction parameters, removing the reverb signal in the speech data collected, obtaining
The first voice signal, the reverb signal is acquisition time later language relatively in described two frequency spectrum identical voice signals
Message number;
Second processing unit, for according to the voice amplifying parameters, processing to be amplified to first voice signal, is obtained
Second voice signal.
10. mobile terminal according to claim 9, it is characterised in that the voice amplifying parameters include overall amplitude and increased
The a large amount of and corresponding amplitude increase of different frequency range;
The second processing unit includes:
First amplification subelement, for according to the overall amplitude increase, increasing all frequency ranges of first voice signal
Amplitude, obtain the 3rd voice signal;
Second amplification subelement, for according to the corresponding amplitude increase of different frequency range, increasing the 3rd voice signal not
With the amplitude at frequency range, the second voice signal is obtained.
11. a kind of mobile terminal, it is characterised in that including memory, processor and be stored on the memory and can be in institute
The voice processing program run on processor is stated, the voice processing program is realized such as claim during the computing device
The step of method of speech processing any one of 1 to 5.
12. a kind of computer-readable recording medium, is stored thereon with computer program, it is characterised in that the program is by processor
The step in the method for speech processing as any one of claim 1 to 5 is realized during execution.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710474944.6A CN107332984B (en) | 2017-06-21 | 2017-06-21 | Voice processing method and mobile terminal |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710474944.6A CN107332984B (en) | 2017-06-21 | 2017-06-21 | Voice processing method and mobile terminal |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107332984A true CN107332984A (en) | 2017-11-07 |
CN107332984B CN107332984B (en) | 2020-05-22 |
Family
ID=60195101
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710474944.6A Active CN107332984B (en) | 2017-06-21 | 2017-06-21 | Voice processing method and mobile terminal |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107332984B (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110580910A (en) * | 2018-06-08 | 2019-12-17 | 北京搜狗科技发展有限公司 | Audio processing method, device and equipment and readable storage medium |
CN110660403A (en) * | 2018-06-28 | 2020-01-07 | 北京搜狗科技发展有限公司 | Audio data processing method, device and equipment and readable storage medium |
CN110580910B (en) * | 2018-06-08 | 2024-04-26 | 北京搜狗科技发展有限公司 | Audio processing method, device, equipment and readable storage medium |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1783214A (en) * | 2004-12-01 | 2006-06-07 | 哈曼贝克自动系统-威美科公司 | Reverberation estimation and suppression system |
CN102572086A (en) * | 2010-12-31 | 2012-07-11 | 联想(北京)有限公司 | Voice gain regulation method and communication terminal |
CN103501375A (en) * | 2013-09-16 | 2014-01-08 | 华为终端有限公司 | Method and device for controlling sound effect |
CN104135574A (en) * | 2014-08-21 | 2014-11-05 | 广东欧珀移动通信有限公司 | Silent mode switching method and device for mobile terminal |
WO2014183529A1 (en) * | 2013-12-02 | 2014-11-20 | 中兴通讯股份有限公司 | Mobile terminal talk mode switching method, device and storage medium |
CN106357871A (en) * | 2016-09-29 | 2017-01-25 | 维沃移动通信有限公司 | Voice amplifying method and mobile terminal |
-
2017
- 2017-06-21 CN CN201710474944.6A patent/CN107332984B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1783214A (en) * | 2004-12-01 | 2006-06-07 | 哈曼贝克自动系统-威美科公司 | Reverberation estimation and suppression system |
CN102572086A (en) * | 2010-12-31 | 2012-07-11 | 联想(北京)有限公司 | Voice gain regulation method and communication terminal |
CN103501375A (en) * | 2013-09-16 | 2014-01-08 | 华为终端有限公司 | Method and device for controlling sound effect |
WO2014183529A1 (en) * | 2013-12-02 | 2014-11-20 | 中兴通讯股份有限公司 | Mobile terminal talk mode switching method, device and storage medium |
CN104135574A (en) * | 2014-08-21 | 2014-11-05 | 广东欧珀移动通信有限公司 | Silent mode switching method and device for mobile terminal |
CN106357871A (en) * | 2016-09-29 | 2017-01-25 | 维沃移动通信有限公司 | Voice amplifying method and mobile terminal |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110580910A (en) * | 2018-06-08 | 2019-12-17 | 北京搜狗科技发展有限公司 | Audio processing method, device and equipment and readable storage medium |
CN110580910B (en) * | 2018-06-08 | 2024-04-26 | 北京搜狗科技发展有限公司 | Audio processing method, device, equipment and readable storage medium |
CN110660403A (en) * | 2018-06-28 | 2020-01-07 | 北京搜狗科技发展有限公司 | Audio data processing method, device and equipment and readable storage medium |
CN110660403B (en) * | 2018-06-28 | 2024-03-08 | 北京搜狗科技发展有限公司 | Audio data processing method, device, equipment and readable storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN107332984B (en) | 2020-05-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106919313A (en) | The startup method and mobile terminal of a kind of application program | |
CN107272956A (en) | A kind of display methods, mobile terminal and computer-readable recording medium | |
CN106504777A (en) | A kind of processing method of recording data and mobile terminal | |
CN106303804A (en) | The control method of a kind of mike and mobile terminal | |
CN106708367A (en) | Display method of conversation interface and mobile terminal | |
CN107071119A (en) | A kind of sound removing method and mobile terminal | |
CN106341535A (en) | Audio playing control method and mobile terminal | |
CN106255000A (en) | A kind of audio signal sample method and mobile terminal | |
CN106685801A (en) | Control method of notification messages and mobile terminal | |
CN106973167A (en) | A kind of income prompting method and mobile terminal | |
CN106445116A (en) | Method for calling message notification bar, and mobile terminal | |
CN106254640A (en) | A kind of task start method based on alarm clock and mobile terminal | |
CN106557240A (en) | A kind of detection method and mobile terminal | |
CN107423018A (en) | A kind of multi-screen display method and terminal | |
CN106874046A (en) | The operating method and mobile terminal of a kind of application program | |
CN106843885A (en) | The method of controlling operation thereof and mobile terminal of a kind of mobile terminal | |
CN106254694A (en) | A kind of method of incoming call blocking, mobile terminal and core net | |
CN107690032A (en) | A kind of call control method and mobile terminal | |
CN106648242A (en) | Control method of touch-control operation and mobile terminal | |
CN107071127A (en) | A kind of way of recording and mobile terminal | |
CN106385489A (en) | Method for determining uplink voice data and mobile terminal | |
CN106210344A (en) | A kind of call mode method to set up and mobile terminal | |
CN107332984A (en) | A kind of method of speech processing and mobile terminal | |
CN106888330A (en) | The call method and mobile terminal of a kind of mobile terminal | |
CN107066860A (en) | A kind of fingerprint identification method and mobile terminal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |