CN104967717B - Noise-reduction method under terminal speech interactive mode and device - Google Patents
Noise-reduction method under terminal speech interactive mode and device Download PDFInfo
- Publication number
- CN104967717B CN104967717B CN201510274680.0A CN201510274680A CN104967717B CN 104967717 B CN104967717 B CN 104967717B CN 201510274680 A CN201510274680 A CN 201510274680A CN 104967717 B CN104967717 B CN 104967717B
- Authority
- CN
- China
- Prior art keywords
- noise
- noise reduction
- signal
- terminal
- voice signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/02—Constructional features of telephone sets
- H04M1/19—Arrangements of transmitters, receivers, or complete sets to prevent eavesdropping, to attenuate local noise or to prevent undesired transmission; Mouthpieces or receivers specially adapted therefor
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Telephone Function (AREA)
Abstract
The invention discloses the noise-reduction method under a kind of terminal speech interactive mode, including: when receiving voice messaging, obtain voice signal corresponding to described voice messaging and the position of described voice signal correspondence sound source;The noise reduction parameters of corresponding voice signal is determined according to described position;Corresponding voice signal is adjusted, so that described voice messaging is carried out noise reduction according to described noise reduction parameters.Invention additionally discloses the denoising device under a kind of terminal speech interactive mode.The present invention realizes the active noise reduction under speech interaction mode, produces noise jamming, improve interactive voice effect during reducing interactive voice.
Description
Technical field
The present invention relates to field of terminal, particularly relate to the noise-reduction method under terminal speech interactive mode and device.
Background technology
Along with the development of terminal technology, increasing terminal enters in the middle of daily life and work, and described terminal is as a example by mobile terminal, and user can pass through mobile terminal typing voice, or carries out the interactive voice scenes such as voice call.Currently in mobile terminal sound typing or voice call process, there is no corresponding noise reduction technology, when carrying out telephone operation the most in the handsfree mode, owing to environmental noise disturbs, cause more unpleasant to hear clear voice call content.Under current speech interaction mode, mainly by downstream signal noise reduction, downstream signal noise reduction is merely by suppressing the far-end noise in downstream signal, and the environmental noise to near-end is useless.Therefore, there is not the mode that voice is carried out active noise reduction, cause interactive voice process to produce noise jamming, affect the effect of interactive voice.
Summary of the invention
Present invention is primarily targeted at the noise-reduction method and device proposed under a kind of terminal speech interactive mode, aim to solve the problem that under current speech interaction mode, there is not the mode that voice is carried out active noise reduction, cause interactive voice process to produce noise jamming, the problem affecting the effect of interactive voice.
For achieving the above object, the noise-reduction method under a kind of terminal speech interactive mode that the present invention provides, including: when receiving voice messaging, obtain voice signal corresponding to described voice messaging and the position of described voice signal correspondence sound source;
The noise reduction parameters of corresponding voice signal is determined according to described position;
Corresponding voice signal is adjusted, so that described voice messaging is carried out noise reduction according to described noise reduction parameters.
Preferably, the step of the position of described acquisition described voice signal correspondence sound source includes:
Noise signal is identified from described voice signal;
Obtain the position of described noise signal correspondence sound source.
Preferably, the step of the position of described acquisition described noise signal correspondence sound source includes:
The phase contrast and the acoustic pressure that obtain described noise signal correspondence waveform are poor;
Sound source corresponding to the described noise signal three-dimensional space position relative to terminal is calculated according to described phase contrast and acoustic pressure difference.
Preferably, the described step determining noise reduction parameters that each voice signal is corresponding according to described position includes:
Described noise signal correspondence waveform is calculated relative to the phase contrast of terminal and time delay according to described three-dimensional space position;
The noise reduction parameters of corresponding noise signal is generated according to described phase contrast and time delay.
Preferably, the described step determining noise reduction parameters that each voice signal is corresponding according to described position, replace with:
Determine the state parameter that terminal is current;
The noise reduction parameters of corresponding voice signal is determined according to described state parameter and described position.
In addition, for achieving the above object, the present invention also proposes the denoising device under a kind of terminal speech interactive mode, including: acquisition module, for when receiving voice messaging, obtain voice signal corresponding to described voice messaging and the position of described voice signal correspondence sound source;
Determine module, for determining the noise reduction parameters of corresponding voice signal according to described position;
Noise reduction module, for adjusting corresponding voice signal according to described noise reduction parameters, so that described voice messaging is carried out noise reduction.
Preferably, described acquisition module includes:
Recognition unit, for identifying noise signal from described voice signal;
Acquiring unit, for obtaining the position of described noise signal correspondence sound source.
Preferably, described acquiring unit, including:
Obtain subelement, poor for the phase contrast and acoustic pressure obtaining described noise signal correspondence waveform;
Computation subunit, for calculating sound source corresponding to the described noise signal three-dimensional space position relative to terminal according to described phase contrast and acoustic pressure difference.
Preferably, described determine that module comprises determining that unit and signal generating unit,
Described determine unit, for calculating described noise signal correspondence waveform relative to the phase contrast of terminal and time delay according to described three-dimensional space position;
Described signal generating unit, for generating the noise reduction parameters of corresponding noise signal according to described phase contrast and time delay.
Preferably, described determine unit, be additionally operable to determine the state parameter that terminal is current;It is additionally operable to determine the noise reduction parameters of corresponding voice signal according to described state parameter and described position.
The sound source position by the voice signal according to the voice messaging received that the present invention proposes determines the noise reduction parameters of correspondence, and voice messaging is carried out noise reduction.Realize the active noise reduction under speech interaction mode, produce noise jamming during reducing interactive voice, improve interactive voice effect.
Accompanying drawing explanation
Fig. 1 is the hardware configuration signal of the mobile terminal realizing each embodiment of the present invention;
Fig. 2 is the wireless communication system schematic diagram of mobile terminal as shown in Figure 1;
Fig. 3 is the schematic flow sheet of the first embodiment of the noise-reduction method under terminal speech interactive mode of the present invention;
Fig. 4 is the schematic flow sheet that the present invention obtains position one embodiment of described voice signal correspondence sound source;
Fig. 5 is the schematic flow sheet that the present invention obtains position one embodiment of described noise signal correspondence sound source;
Fig. 6 is the schematic flow sheet that the present invention determines noise reduction parameters one embodiment of corresponding voice signal according to described position;
The schematic flow sheet of the second embodiment of the noise-reduction method under Fig. 7 terminal speech of the present invention interactive mode;
Fig. 8 is the high-level schematic functional block diagram of the preferred embodiment of the denoising device under terminal speech interactive mode of the present invention;
Fig. 9 is the refinement high-level schematic functional block diagram of acquisition module one embodiment in Fig. 8;
Figure 10 is the refinement high-level schematic functional block diagram of acquiring unit one embodiment in Fig. 9;
Figure 11 is the refinement high-level schematic functional block diagram determining module one embodiment in Fig. 8.
The realization of the object of the invention, functional characteristics and advantage will in conjunction with the embodiments, are described further referring to the drawings.
Detailed description of the invention
Should be appreciated that specific embodiment described herein, only in order to explain the present invention, is not intended to limit the present invention.
The mobile terminal realizing each embodiment of the present invention is described referring now to accompanying drawing.In follow-up description, use the suffix being used for representing such as " module ", " parts " or " unit " of element only for the explanation of the beneficially present invention, itself do not have specific meaning.Therefore, " module " can mixedly use with " parts ".
Mobile terminal can be implemented in a variety of manners.Such as, the terminal described in the present invention can include mobile terminal and the fixed terminal of such as numeral TV, desk computer etc. of such as mobile phone, smart phone, notebook computer, digit broadcasting receiver, PDA (personal digital assistant), PAD (panel computer), PMP (portable media player), guider etc..Hereinafter it is assumed that terminal is mobile terminal.However, it will be understood by those skilled in the art that, in addition to being used in particular for the element of mobile purpose, structure according to the embodiment of the present invention can also apply to the terminal of fixed type.
Fig. 1 is the hardware configuration signal of the mobile terminal realizing each embodiment of the present invention.
Mobile terminal 100 can include wireless communication unit 110, A/V (audio/video) input block 120, user input unit 130, sensing unit 140, output unit 150, memorizer 160, interface unit 170, controller 180 and power subsystem 190 etc..Fig. 1 shows the mobile terminal with various assembly, it should be understood that be not required for implementing all assemblies illustrated.Can alternatively implement more or less of assembly.Will be discussed in more detail below the element of mobile terminal.
Wireless communication unit 110 generally includes one or more assembly, and it allows the radio communication between mobile terminal 100 and wireless communication system or network.Such as, at least one during wireless communication unit can include broadcast reception module 111, mobile communication module 112, wireless Internet module 113, short range communication module 114 and positional information module 115.
Broadcast reception module 111 receives broadcast singal and/or broadcast related information via broadcast channel from external broadcasting management server.Broadcast channel can include satellite channel and/or terrestrial channel.Broadcast management server can be to generate and send the broadcast singal and/or broadcast related information and the server sending it to terminal generated before broadcast singal and/or the server of broadcast related information or reception.Broadcast singal can include TV broadcast singal, radio signals, data broadcasting signal etc..And, broadcast singal may further include the broadcast singal combined with TV or radio signals.Broadcast related information can also provide via mobile communications network, and in this case, broadcast related information can be received by mobile communication module 112.Broadcast singal can exist in a variety of manners, such as, it can exist with the electronic program guides (EPG) of DMB (DMB), the form of the electronic service guidebooks (ESG) etc. of digital video broadcast-handheld (DVB-H).Broadcast reception module 111 can be broadcasted by using various types of broadcast systems to receive signal.Especially, broadcast reception module 111 can be by using such as multimedia broadcasting-ground (DMB-T), DMB-satellite (DMB-S), DVB-hand-held (DVB-H), forward link media (MediaFLO@) Radio Data System, the digit broadcasting system of received terrestrial digital broadcasting integrated service (ISDB-T) etc. receive digital broadcasting.Broadcast reception module 111 may be constructed such that and is adapted to provide for the various broadcast systems of broadcast singal and above-mentioned digit broadcasting system.The broadcast singal and/or the broadcast related information that receive via broadcast reception module 111 can be stored in memorizer 160 (or other type of storage medium).
Mobile communication module 112 sends radio signals at least one in base station (such as, access point, node B etc.), exterior terminal and server and/or receives from it radio signal.Such radio signal can include voice call signal, video calling signal or the various types of data sending according to text and/or Multimedia Message and/or receiving.
Wireless Internet module 113 supports the Wi-Fi (Wireless Internet Access) of mobile terminal.This module can internally or externally be couple to terminal.Wi-Fi (Wireless Internet Access) technology involved by this module can include WLAN (WLAN) (Wi-Fi), Wibro (WiMAX), Wimax (worldwide interoperability for microwave access), HSDPA (high-speed downlink packet access) etc..
Short range communication module 114 is the module for supporting junction service.Some examples of short-range communication technology include bluetoothTM, RF identification (RFID), Infrared Data Association (IrDA), ultra broadband (UWB), purple honeybeeTMEtc..
Positional information module 115 is the module of positional information for checking or obtain mobile terminal.The typical case of positional information module is GPS (global positioning system).According to current technology, GPS module 115 calculates from the range information of three or more satellites and correct time information and for the Information application triangulation calculated, thus according to longitude, latitude with highly accurately calculate three-dimensional current location information.Currently, for calculating method three satellites of use of position and temporal information and by the position using an other satellite correction to calculate and the error of temporal information.Additionally, GPS module 115 can calculate velocity information by Continuous plus current location information in real time.
A/V input block 120 is used for receiving audio or video signal.A/V input block 120 can include camera 121 and mike 1220, and the view data of the camera 121 static images to being obtained by image capture apparatus in Video Capture pattern or image capture mode or video processes.Picture frame after process may be displayed on display unit 151.Picture frame after camera 121 processes can be stored in memorizer 160 (or other storage medium) or be transmitted via wireless communication unit 110, can provide two or more cameras 1210 according to the structure of mobile terminal.Mike 122 can receive sound (voice data) via mike in telephone calling model, logging mode, speech recognition mode etc. operational mode, and can be voice data by such acoustic processing.Audio frequency (voice) data after process can be converted to can be sent to via mobile communication module 112 the form output of mobile communication base station in the case of telephone calling model.Mike 122 can be implemented various types of noise and eliminate noise or the interference that (or suppression) algorithm produces with elimination (or suppression) during receiving and send audio signal.
User input unit 130 can generate key input data to control the various operations of mobile terminal according to the order of user's input.User input unit 130 allows user to input various types of information, and can include keyboard, metal dome, touch pad (such as, detection due to touched and cause resistance, pressure, the sensitive component of change of electric capacity etc.), roller, rocking bar etc..Especially, when touch pad is superimposed upon on display unit 151 as a layer, touch screen can be formed.
Sensing unit 140 detects the current state of mobile terminal 100, (such as, mobile terminal 100 open or close state), the position of mobile terminal 100, user for mobile terminal 100 contact (i.e., touch input) presence or absence, the orientation of mobile terminal 100, the acceleration or deceleration of mobile terminal 100 move and direction etc., and generate order or the signal of operation for controlling mobile terminal 100.Such as, when mobile terminal 100 is embodied as sliding-type mobile phone, sensing unit 140 can sense this sliding-type phone and open or close.It addition, sensing unit 140 can detect power subsystem 190 and whether provide whether electric power or interface unit 170 couple with external device (ED).Sensing unit 140 can include that this is described by proximity transducer 1410 by combining touch screen below.
Interface unit 170 is used as at least one external device (ED) and is connected, with mobile terminal 100, the interface that can pass through.Such as, external device (ED) can include wired or wireless head-band earphone port, external power source (or battery charger) port, wired or wireless FPDP, memory card port, have the port of device of identification module, audio frequency input/output (I/O) port, video i/o port, ear port etc. for connecting.Identification module can be that storage is for verifying that user uses the various information of mobile terminal 100 and can include subscriber identification module (UIM), client identification module (SIM), Universal Subscriber identification module (USIM) etc..It addition, the device (hereinafter referred to as " identifying device ") with identification module can be to take the form of smart card, therefore, identify that device can be connected with mobile terminal 100 via port or other attachment means.Interface unit 170 may be used for receiving from the input (such as, data message, electric power etc.) of external device (ED) and the one or more elements input received being transferred in mobile terminal 100 or may be used for transmitting data between mobile terminal and external device (ED).
Additionally, when mobile terminal 100 is connected with external base, interface unit 170 can serve as allowing to provide the path of mobile terminal 100 or can serve as to allow from the various command signals of base input from base electric power by it and is transferred to the path of mobile terminal by it.May serve as identifying whether mobile terminal is accurately fitted within the signal base from various command signals or the electric power of base input.Output unit 150 is configured to provide output signal (such as, audio signal, video signal, alarm signal, vibration signal etc.) with vision, audio frequency and/or tactile manner.Output unit 150 can include display unit 151, dio Output Modules 152, alarm unit 153 etc..
Display unit 151 may be displayed on the information processed in mobile terminal 100.Such as, when mobile terminal 100 is in telephone calling model, display unit 151 can show the user interface (UI) relevant with call or other communicate (such as, text messaging, multimedia file download etc.) or graphic user interface (GUI).When mobile terminal 100 is in video calling pattern or image capture mode, display unit 151 can show image and/or the image of reception, UI or GUI illustrating video or image and correlation function etc. of capture.
Meanwhile, when display unit 151 and touch pad the most superposed on one another with formed touch screen time, display unit 151 can serve as input equipment and output device.Display unit 151 can include at least one in liquid crystal display (LCD), thin film transistor (TFT) LCD (TFT-LCD), Organic Light Emitting Diode (OLED) display, flexible display, three-dimensional (3D) display etc..Some in these display may be constructed such that transparence to allow user to watch from outside, this is properly termed as transparent display, and typical transparent display can for example, TOLED (transparent organic light emitting diode) display etc..According to the specific embodiment wanted, mobile terminal 100 can include two or more display units (or other display device), such as, mobile terminal can include outernal display unit (not shown) and inner display unit (not shown).Touch screen can be used for detecting touch input pressure and touch input position and touch input area.
Dio Output Modules 152 can mobile terminal be in call signal receive under the isotype such as pattern, call mode, logging mode, speech recognition mode, broadcast reception mode time, that wireless communication unit 110 is received or storage in memorizer 160 voice data transducing audio signal and be output as sound.And, dio Output Modules 152 can provide the audio frequency output (such as, call signal receive sound, message sink sound etc.) relevant to the specific function of mobile terminal 100 execution.Dio Output Modules 152 can include speaker, buzzer etc..
Alarm unit 153 can provide output to notify event to mobile terminal 100.Typical event can include calling reception, message sink, key signals input, touch input etc..In addition to audio or video exports, alarm unit 153 can provide in a different manner and export the generation with notification event.Such as, alarm unit 153 can provide output with the form of vibration, when receiving calling, message or some other entrance communication (incoming communication), alarm unit 153 can provide sense of touch output (that is, vibration) to notify to user.By providing such sense of touch to export, even if when the mobile phone of user is in the pocket of user, user also is able to identify the generation of various event.Alarm unit 153 can also provide the output of the generation of notification event via display unit 151 or dio Output Modules 152.
Memorizer 160 can store the process performed by controller 180 and the software program controlling operation etc., or can temporarily store the data (such as, telephone directory, message, still image, video etc.) that oneself maybe will export through output.And, memorizer 160 can be to store vibration and the data of audio signal of the various modes about exporting when touching and being applied to touch screen.
Memorizer 160 can include the storage medium of at least one type, described storage medium includes flash memory, hard disk, multimedia card, card-type memorizer (such as, SD or DX memorizer etc.), random access storage device (RAM), static random-access memory (SRAM), read only memory (ROM), Electrically Erasable Read Only Memory (EEPROM), programmable read only memory (PROM), magnetic storage, disk, CD etc..And, mobile terminal 100 can be connected the network storage device of the storage function performing memorizer 160 and cooperate with by network.
Controller 180 generally controls the overall operation of mobile terminal.Such as, controller 180 performs the control relevant to voice call, data communication, video calling etc. and process.It addition, controller 180 can include the multi-media module 1810 for reproducing (or playback) multi-medium data, multi-media module 1810 can construct in controller 180, or it is so structured that separates with controller 180.Controller 180 can perform pattern recognition process, so that the handwriting input performed on the touchscreen or picture drafting input are identified as character or image.
Power subsystem 190 receives external power or internal power under the control of controller 180 and provides the suitable electric power operated needed for each element and assembly.
Various embodiment described herein can be implemented to use such as computer software, hardware or its any combination of computer-readable medium.Hardware is implemented, embodiment described herein can be by using application-specific IC (ASIC), digital signal processor (DSP), digital signal processing device (DSPD), programmable logic device (PLD), field programmable gate array (FPGA), processor, controller, microcontroller, microprocessor, being designed to perform at least one in the electronic unit of function described herein and implement, in some cases, such embodiment can be implemented in controller 180.Implementing for software, the embodiment of such as process or function can be implemented with allowing the single software module performing at least one function or operation.Software code can be implemented by the software application (or program) write with any suitable programming language, and software code can be stored in memorizer 160 and be performed by controller 180.
So far, oneself is through describing mobile terminal according to its function.Below, for the sake of brevity, the slide type mobile terminal in the various types of mobile terminals that will describe such as folded form, board-type, oscillating-type, slide type mobile terminal etc. is as example.Therefore, the present invention can be applied to any kind of mobile terminal, and is not limited to slide type mobile terminal.
Mobile terminal 100 may be constructed such that utilization operates via frame or the most wired and wireless communication system of packet transmission data and satellite-based communication system as shown in Figure 1.
The communication system being wherein operable to according to the mobile terminal of the present invention is described referring now to Fig. 2.
Such communication system can use different air interfaces and/or physical layer.Such as, the air interface used by communication system includes such as frequency division multiple access (FDMA), time division multiple acess (TDMA), CDMA (CDMA) and UMTS (UMTS) (especially, Long Term Evolution (LTE)), global system for mobile communications (GSM) etc..As non-limiting example, explained below relates to cdma communication system, but such teaching is equally applicable to other type of system.
With reference to Fig. 2, cdma wireless communication system can include that multiple mobile terminal 100, multiple base station (BS) 270, base station controller (BSC) 275 and mobile switching centre (MSC) 2800MSC280 are configured to and Public Switched Telephony Network (PSTN) 290 formation interface.MSC280 is also structured to and the BSC275 formation interface that can be couple to base station 270 via back haul link.If back haul link can construct according to any one in the interface that Ganji knows, described interface includes such as E1/T1, ATM, IP, PPP, frame relay, HDSL, ADSL or xDSL.It will be appreciated that system as shown in Figure 2 can include multiple BSC2750.
Each BS270 can service one or more subregion (or region), by each subregion of multidirectional antenna or the antenna covering pointing to specific direction radially away from BS270.Or, each subregion can be covered by two or more antennas for diversity reception.Each BS270 may be constructed such that support multiple frequencies distribution, and the distribution of each frequency has specific frequency spectrum (such as, 1.25MHz, 5MHz etc.).
Intersecting that subregion and frequency are distributed can be referred to as CDMA Channel.BS270 can also be referred to as base station transceiver subsystem (BTS) or other equivalent terms.In this case, term " base station " may be used for broadly representing single BSC275 and at least one BS270.Base station can also be referred to as " cellular station ".Or, each subregion of specific BS270 can be referred to as multiple cellular station.
As shown in Figure 2, broadcast singal is sent in system the mobile terminal 100 operated by broadcsting transmitter (BT) 295.Broadcast reception module 111 is arranged on mobile terminal 100 and sentences the broadcast singal that reception is sent by BT295 as shown in Figure 1.In fig. 2 it is shown that several global positioning systems (GPS) satellite 300.Satellite 300 helps to position at least one in multiple mobile terminals 100.
In fig. 2, depict multiple satellite 300, it is understood that be, it is possible to use any number of satellite obtains useful location information.GPS module 115 is generally configured to coordinate the location information wanted with acquisition with satellite 300 as shown in Figure 1.Substitute GPS tracking technique or outside GPS tracking technique, it is possible to use other technology of the position of mobile terminal can be followed the tracks of.It addition, at least one gps satellite 300 can optionally or additionally process satellite dmb transmission.
As a typical operation of wireless communication system, BS270 receives the reverse link signal from various mobile terminals 100.Mobile terminal 100 generally participates in call, information receiving and transmitting communicates with other type of.Each reverse link signal that certain base station 270 receives is processed in specific BS270.The data obtained are forwarded to the BSC275 being correlated with.BSC provides call resource distribution and the mobile management function of the coordination of soft switching process included between BS270.The data received also are routed to MSC280 by BSC275, and it provides the extra route service for forming interface with PSTN290.Similarly, PSTN290 Yu MSC280 forms interface, MSC Yu BSC275 forms interface, and BSC275 correspondingly controls BS270 so that forward link signals to be sent to mobile terminal 100.
Based on above-mentioned mobile terminal hardware configuration and communication system, each embodiment of noise-reduction method under terminal speech interactive mode of the present invention is proposed.
As it is shown on figure 3, first embodiment of the invention proposes the noise-reduction method under a kind of terminal speech interactive mode, including:
Step S10, when receiving voice messaging, obtains voice signal corresponding to described voice messaging and the position of described voice signal correspondence sound source;
The noise reduction process under terminal speech interactive mode in the embodiment of the present invention is preferably utilized under handsfree talk mode, the most also can be applied in recording, instant messaging interactive voice scene.
In the present embodiment, described terminal preferably mobile phone, pad etc. can carry out the electronic equipment of voice call, described terminal should at least assemble following device: for the outside speaker of hand-free function of conversing, for detecting the microphone array of sound source position, described microphone array is preferably 4, optimal setting positions is bottom and upper segment, one respectively, left and right, owing to current software can reach 0.1DBA for the accuracy of detection of sound pressure level, the accuracy of detection of phase contrast is up to 0.1 degree, so during shift in position, software has stronger stability, it is not necessary to change corresponding technology.Terminal installation four or above microphone array, when opening hands-free, microphone array detection voice messaging, when receiving voice messaging, obtain voice signal corresponding to described voice messaging and the position of described voice signal correspondence sound source.Described voice signal includes human voice signal and noise signal.
Concrete, with reference to Fig. 4, the process of the position of described acquisition described voice signal correspondence sound source includes:
Step S11, identifies noise signal from described voice signal;
Step S12, obtains the position of described noise signal correspondence sound source.
Distinguish voice and environmental noise according to general voice detecting algorithm VAD, from described voice signal, i.e. identify human voice signal and noise signal, the signal of typing is screened, it is judged that voice and noise.Determine voice and the position of noise correspondence sound source.
Concrete, with reference to Fig. 5, the process of the position of described acquisition described noise signal correspondence sound source includes:
Step S121, the phase contrast and the acoustic pressure that obtain described noise signal correspondence waveform are poor;
Step S122, calculates sound source corresponding to the described noise signal three-dimensional space position relative to terminal according to described phase contrast and acoustic pressure difference.
Phase contrast and acoustic pressure by calculating microphone array typing waveform are poor, calculate people from location sound source three-dimensional space position relative to terminal respectively, and noise source of sound is relative to the three-dimensional space position of terminal.Each typing waveform is owing to putting difference with sound source presence bit, square being inversely proportional to of sound pressure level intensity and distance, phase contrast is then relevant with reception delay, subtracted each other by each typing waveform, obtain phase contrast harmony pressure reduction, the speed formula propagated in atmosphere according to sound wave, the sound source positional distance to each MIC can be calculated, find intersection point by drawing circle, can determine whether the particular location that current sound source is positioned in the space relative to terminal.
Step S20, determines the noise reduction parameters of corresponding voice signal according to described position;
Determine the noise reduction parameters of corresponding voice signal according to described position, when there is multiple noise signal, sorting out according to noise spectrum sequence, extracting one by one, respectively superposition reverse compensation sound wave, i.e. described noise reduction parameters is the phase place of reverse compensation sound wave.
Concrete, with reference to Fig. 6, the process of the described noise reduction parameters determining corresponding voice signal according to described position may include that
Step S21, calculates described noise signal correspondence waveform relative to the phase contrast of terminal and time delay according to described three-dimensional space position;
Step S22, generates the noise reduction parameters of corresponding noise signal according to described phase contrast and time delay.
After determining that voice sound source and noise source are relative to the three-dimensional space position of terminal, it is calculated voice and the distance of noise sound source, when there is multiple noise sound source, the distance calculating voice and each noise sound source obtains multiple distance, according to each noise and the distance of voice sound source and the frequency of noise source of sound, calculate terminal typing waveform phase contrast between terminal MIC and human ear position and time delay, generate the noise reduction parameters of corresponding noise signal according to described phase contrast with time delay.Each noise is different because of position, so the noise reduction parameters obtained is the most different, when there is multiple noise, determines the noise reduction parameters that each noise is corresponding, i.e. obtain the phase place of multiple reverse compensation sound wave.
Step S30, adjusts corresponding voice signal according to described noise reduction parameters, so that described voice messaging is carried out noise reduction.
In the phase place that described noise reduction parameters is reverse compensation sound wave, by adding the phase place of reverse compensation sound wave, reach the purpose of noise reduction, be prevented effectively from the interference of environmental noise.The embodiment of the present invention produces the reverse sound wave equal with near-end ambient noise by noise reduction system, is neutralized by noise, thus realizes the effect of active noise reduction.
The present embodiment determines the noise reduction parameters of correspondence by the sound source position of the voice signal according to the voice messaging received, and voice messaging is carried out noise reduction.Realize the active noise reduction under speech interaction mode, produce noise jamming during reducing interactive voice, improve interactive voice effect.
With reference to Fig. 7, proposing the second embodiment of noise-reduction method under terminal speech interactive mode of the present invention, first embodiment based on the noise-reduction method under above-mentioned terminal speech interactive mode, described step S30 may include that
Step S23, determines the state parameter that terminal is current;
Step S24, determines the noise reduction parameters of corresponding voice signal according to described state parameter and described position.
In the present embodiment, should be noted that the placement state of terminal, i.e. determine the state parameter that terminal is current, such as, it is to face up, reverse side is upward, the situation of side, can be according to gravity sensor, the data that the instruments such as gyroscope provide, under different placing attitude, different bearing datas is provided, and then the phase place of the reverse compensation sound wave to superposition, thus reach the function of hands-free active noise reduction, the noise reduction parameters of corresponding voice signal is determined by the state parameter and described position combining active, such as, when terminal is front, the noise reduction parameters of corresponding voice signal is determined according to this state parameter of front and position.The present embodiment is under different placing attitude, i.e. when terminal is in different, it is provided that the different SOT states of termination, and then is modified the reverse compensation waveform of superposition.Improve noise reduction further.
The present invention further provides the denoising device under a kind of terminal speech interactive mode.
With reference to the high-level schematic functional block diagram that Fig. 8, Fig. 8 are the denoising device preferred embodiment under terminal speech interactive mode of the present invention.
Denoising device under the present embodiment a kind of terminal speech interactive mode, the denoising device under described terminal speech interactive mode includes: acquisition module 10,
Described acquisition module 10, for when receiving voice messaging, obtains voice signal corresponding to described voice messaging and the position of described voice signal correspondence sound source;
The noise reduction process under terminal speech interactive mode in the embodiment of the present invention is preferably utilized under handsfree talk mode, the most also can be applied in recording, instant messaging interactive voice scene.
In the present embodiment, described terminal preferably mobile phone, pad etc. can carry out the electronic equipment of voice call, described terminal should at least assemble following device: for the outside speaker of hand-free function of conversing, for detecting the microphone array of sound source position, described microphone array is preferably 4, optimal setting positions is bottom and upper segment, one respectively, left and right, owing to current software can reach 0.1DBA for the accuracy of detection of sound pressure level, the accuracy of detection of phase contrast is up to 0.1 degree, so during shift in position, software has stronger stability, it is not necessary to change corresponding technology.Terminal installation four or above microphone array, when opening hands-free, microphone array detection voice messaging, when receiving voice messaging, obtain voice signal corresponding to described voice messaging and the position of described voice signal correspondence sound source.Described voice signal includes human voice signal and noise signal.
With reference to Fig. 9, described acquisition module 10 includes recognition unit 11 and acquiring unit 12,
Described recognition unit 11, for identifying noise signal from described voice signal;
Described acquiring unit 12, for obtaining the position of described noise signal correspondence sound source.
Distinguish voice and environmental noise according to general voice detecting algorithm VAD, from described voice signal, i.e. identify human voice signal and noise signal, the signal of typing is screened, it is judged that voice and noise.Determine voice and the position of noise correspondence sound source.
Concrete, with reference to Figure 10, described acquiring unit 12 includes obtaining subelement 121 and computation subunit 122,
Described acquisition subelement 121, poor for the phase contrast and acoustic pressure obtaining described noise signal correspondence waveform;
Described computation subunit 122, for calculating sound source corresponding to the described noise signal three-dimensional space position relative to terminal according to described phase contrast and acoustic pressure difference.
Phase contrast and acoustic pressure by calculating microphone array typing waveform are poor, calculate people from location sound source three-dimensional space position relative to terminal respectively, and noise source of sound is relative to the three-dimensional space position of terminal.Each typing waveform is owing to putting difference with sound source presence bit, square being inversely proportional to of sound pressure level intensity and distance, phase contrast is then relevant with reception delay, subtracted each other by each typing waveform, obtain phase contrast harmony pressure reduction, the speed formula propagated in atmosphere according to sound wave, the sound source positional distance to each MIC can be calculated, find intersection point by drawing circle, can determine whether the particular location that current sound source is positioned in the space relative to terminal.
Described determine module 20, be additionally operable to determine according to described position the noise reduction parameters of corresponding voice signal;
Determine the noise reduction parameters of corresponding voice signal according to described position, when there is multiple noise signal, sorting out according to noise spectrum sequence, extracting one by one, respectively superposition reverse compensation sound wave, i.e. described noise reduction parameters is the phase place of reverse compensation sound wave.
Concrete, with reference to Figure 11, described determine that module 20 comprises determining that unit 21 and signal generating unit 22,
Described determine unit 21, for calculating described noise signal correspondence waveform relative to the phase contrast of terminal and time delay according to described three-dimensional space position;
Described signal generating unit 22, for generating the noise reduction parameters of corresponding noise signal according to described phase contrast and time delay.
After determining that voice sound source and noise source are relative to the three-dimensional space position of terminal, it is calculated voice and the distance of noise sound source, when there is multiple noise sound source, the distance calculating voice and each noise sound source obtains multiple distance, according to each noise and the distance of voice sound source and the frequency of noise source of sound, calculate terminal typing waveform phase contrast between terminal MIC and human ear position and time delay, generate the noise reduction parameters of corresponding noise signal according to described phase contrast with time delay.Each noise is different because of position, so the noise reduction parameters obtained is the most different, when there is multiple noise, determines the noise reduction parameters that each noise is corresponding, i.e. obtain the phase place of multiple reverse compensation sound wave.
Described noise reduction module 30, for adjusting corresponding voice signal according to described noise reduction parameters, so that described voice messaging is carried out noise reduction.
In the phase place that described noise reduction parameters is reverse compensation sound wave, by adding the phase place of reverse compensation sound wave, reach the purpose of noise reduction, be prevented effectively from the interference of environmental noise.The embodiment of the present invention produces the reverse sound wave equal with near-end ambient noise by noise reduction system, is neutralized by noise, thus realizes the effect of active noise reduction.
The present embodiment determines the noise reduction parameters of correspondence by the sound source position of the voice signal according to the voice messaging received, and voice messaging is carried out noise reduction.Realize the noise reduction under speech interaction mode, produce noise jamming during reducing interactive voice, improve interactive voice effect.
Further, described determine unit 21, be additionally operable to determine the state parameter that terminal is current;It is additionally operable to determine the noise reduction parameters of corresponding voice signal according to described state parameter and described position.
In the present embodiment, should be noted that the placement state of terminal, i.e. determine the state parameter that terminal is current, such as, it is to face up, reverse side is upward, the situation of side, can be according to gravity sensor, the data that the instruments such as gyroscope provide, under different placing attitude, different bearing datas is provided, and then the phase place of the reverse compensation sound wave to superposition, thus reach the function of hands-free active noise reduction, the noise reduction parameters of corresponding voice signal is determined by the state parameter and described position combining active, such as, when terminal is front, the noise reduction parameters of corresponding voice signal is determined according to this state parameter of front and position.The present embodiment is under different placing attitude, i.e. when terminal is in different, it is provided that the different SOT states of termination, and then is modified the reverse compensation waveform of superposition.Improve noise reduction further.
It should be noted that, in this article, term " includes ", " comprising " or its any other variant are intended to comprising of nonexcludability, so that include that the process of a series of key element, method, article or device not only include those key elements, but also include other key elements being not expressly set out, or also include the key element intrinsic for this process, method, article or device.In the case of there is no more restriction, statement " including ... " key element limited, it is not excluded that there is also other identical element in including the process of this key element, method, article or device.
The invention described above embodiment sequence number, just to describing, does not represent the quality of embodiment.
Through the above description of the embodiments, those skilled in the art is it can be understood that can add the mode of required general hardware platform by software to above-described embodiment method and realize, can certainly pass through hardware, but a lot of in the case of the former is more preferably embodiment.Based on such understanding, the part that prior art is contributed by technical scheme the most in other words can embody with the form of software product, this computer software product is stored in a storage medium (such as ROM/RAM, magnetic disc, CD), including some instructions with so that a station terminal equipment (can be mobile phone, computer, server, air-conditioner, or the network equipment etc.) perform the method described in each embodiment of the present invention.
These are only the preferred embodiments of the present invention; not thereby the scope of the claims of the present invention is limited; every equivalent structure utilizing description of the invention and accompanying drawing content to be made or equivalence flow process conversion; or directly or indirectly it is used in other relevant technical fields, the most in like manner it is included in the scope of patent protection of the present invention.
Claims (8)
1. the noise-reduction method under a terminal speech interactive mode, it is characterised in that including:
When receiving voice messaging, obtain voice signal corresponding to described voice messaging and described voice letter
The position of number corresponding sound source;
The noise reduction parameters of corresponding voice signal is determined according to described position;
Corresponding voice signal is adjusted, so that described voice messaging is carried out noise reduction according to described noise reduction parameters;
The step of the position of described acquisition described voice signal correspondence sound source includes:
Noise signal is identified from described voice signal;
Obtain the position of described noise signal correspondence sound source.
2. the noise-reduction method under terminal speech interactive mode as claimed in claim 1, it is characterised in that
The step of the position of described acquisition described noise signal correspondence sound source includes:
The phase contrast and the acoustic pressure that obtain described noise signal correspondence waveform are poor;
Sound source corresponding to described noise signal is calculated relative to the three of terminal according to described phase contrast and acoustic pressure difference
Dimension space position.
3. the noise-reduction method under terminal speech interactive mode as claimed in claim 2, it is characterised in that
The step of the described noise reduction parameters determining corresponding voice signal according to described position includes:
The described noise signal correspondence waveform phase place relative to terminal is calculated according to described three-dimensional space position
Difference and time delay;
The noise reduction parameters of corresponding voice signal is generated according to described phase contrast and time delay.
4. the noise-reduction method under the terminal speech interactive mode as described in any one of claims 1 to 3, its
It is characterised by that the step of the described noise reduction parameters determining corresponding voice signal according to described position replaces with:
Determine the state parameter of the current placement state of terminal;
State parameter and described position according to described placement state determine that the noise reduction of corresponding voice signal is joined
Number.
5. the denoising device under a terminal speech interactive mode, it is characterised in that including:
Acquisition module, for when receiving voice messaging, obtains the voice letter that described voice messaging is corresponding
Number and the position of described voice signal correspondence sound source;
Determine module, for determining the noise reduction parameters of corresponding voice signal according to described position;
Noise reduction module, for adjusting corresponding voice signal according to described noise reduction parameters, with to described voice
Information carries out noise reduction;
Described acquisition module includes:
Recognition unit, for identifying noise signal from described voice signal;
Acquiring unit, for obtaining the position of described noise signal correspondence sound source.
6. the denoising device under terminal speech interactive mode as claimed in claim 5, it is characterised in that
Described acquiring unit, including:
Obtain subelement, poor for the phase contrast and acoustic pressure obtaining described noise signal correspondence waveform;
Computation subunit, for calculating, according to described phase contrast and acoustic pressure difference, the sound that described noise signal is corresponding
Source is relative to the three-dimensional space position of terminal.
7. the denoising device under terminal speech interactive mode as claimed in claim 6, it is characterised in that
Described determine that module comprises determining that unit and signal generating unit,
Described determine unit, for calculating described noise signal correspondence ripple according to described three-dimensional space position
Shape is relative to the phase contrast of terminal and time delay;
Described signal generating unit, for generating the noise reduction ginseng of corresponding voice signal according to described phase contrast and time delay
Number.
8. the denoising device under terminal speech interactive mode as claimed in claim 7, it is characterised in that
Described determine unit, be additionally operable to determine the state parameter of the current placement state of terminal;It is additionally operable to according to institute
State the state parameter of placement state and described position determines the noise reduction parameters of corresponding voice signal.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510274680.0A CN104967717B (en) | 2015-05-26 | 2015-05-26 | Noise-reduction method under terminal speech interactive mode and device |
PCT/CN2016/083032 WO2016188394A1 (en) | 2015-05-26 | 2016-05-23 | Noise reduction method, noise reduction apparatus, and computer readable storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510274680.0A CN104967717B (en) | 2015-05-26 | 2015-05-26 | Noise-reduction method under terminal speech interactive mode and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104967717A CN104967717A (en) | 2015-10-07 |
CN104967717B true CN104967717B (en) | 2016-09-28 |
Family
ID=54221653
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510274680.0A Active CN104967717B (en) | 2015-05-26 | 2015-05-26 | Noise-reduction method under terminal speech interactive mode and device |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN104967717B (en) |
WO (1) | WO2016188394A1 (en) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104967717B (en) * | 2015-05-26 | 2016-09-28 | 努比亚技术有限公司 | Noise-reduction method under terminal speech interactive mode and device |
CN107146628A (en) * | 2017-04-07 | 2017-09-08 | 宇龙计算机通信科技(深圳)有限公司 | A kind of voice call processing method and mobile terminal |
CN107527615B (en) * | 2017-09-13 | 2021-01-15 | 联想(北京)有限公司 | Information processing method, device, equipment, system and server |
CN108564961A (en) * | 2017-11-29 | 2018-09-21 | 华北计算技术研究所(中国电子科技集团公司第十五研究所) | A kind of voice de-noising method of mobile communication equipment |
CN107945814A (en) * | 2017-11-29 | 2018-04-20 | 华北计算技术研究所(中国电子科技集团公司第十五研究所) | A kind of method of speech processing |
CN113808564B (en) * | 2020-06-12 | 2024-03-19 | 青岛海尔电冰箱有限公司 | Kitchen noise reduction method, refrigerator and computer readable storage medium |
US12119004B2 (en) | 2020-09-17 | 2024-10-15 | Zhejiang Tonghuashun Intelligent Technology Co., Ltd. | Systems and methods for voice audio data processing |
CN112185354A (en) * | 2020-09-17 | 2021-01-05 | 浙江同花顺智能科技有限公司 | Voice text display method, device, equipment and storage medium |
CN114155871A (en) * | 2021-12-16 | 2022-03-08 | 科大讯飞股份有限公司 | Voice noise reduction method and device, storage medium and equipment |
CN114176623B (en) * | 2021-12-21 | 2023-09-12 | 深圳大学 | Sound noise reduction method, system, noise reduction device and computer readable storage medium |
CN115359803B (en) * | 2022-10-21 | 2022-12-27 | 中诚华隆计算机技术有限公司 | Voice noise reduction optimization method and device based on chip |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103079148A (en) * | 2012-12-28 | 2013-05-01 | 中兴通讯股份有限公司 | Method and device for reducing noise of two microphones of terminal |
CN104012074A (en) * | 2011-12-12 | 2014-08-27 | 华为技术有限公司 | Smart audio and video capture systems for data processing systems |
CN104301537A (en) * | 2014-10-15 | 2015-01-21 | 龙旗电子(惠州)有限公司 | Noise reduction mobile phone and noise reduction method |
WO2015066926A1 (en) * | 2013-11-11 | 2015-05-14 | 赵春宁 | Noise reduction method |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150003626A1 (en) * | 2013-02-25 | 2015-01-01 | Max Sound Corporation | Active noise cancellation method for automobiles |
US9681219B2 (en) * | 2013-03-07 | 2017-06-13 | Nokia Technologies Oy | Orientation free handsfree device |
CN104575510B (en) * | 2015-02-04 | 2018-08-24 | 深圳酷派技术有限公司 | Noise-reduction method, denoising device and terminal |
CN104967717B (en) * | 2015-05-26 | 2016-09-28 | 努比亚技术有限公司 | Noise-reduction method under terminal speech interactive mode and device |
-
2015
- 2015-05-26 CN CN201510274680.0A patent/CN104967717B/en active Active
-
2016
- 2016-05-23 WO PCT/CN2016/083032 patent/WO2016188394A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104012074A (en) * | 2011-12-12 | 2014-08-27 | 华为技术有限公司 | Smart audio and video capture systems for data processing systems |
CN103079148A (en) * | 2012-12-28 | 2013-05-01 | 中兴通讯股份有限公司 | Method and device for reducing noise of two microphones of terminal |
WO2015066926A1 (en) * | 2013-11-11 | 2015-05-14 | 赵春宁 | Noise reduction method |
CN104301537A (en) * | 2014-10-15 | 2015-01-21 | 龙旗电子(惠州)有限公司 | Noise reduction mobile phone and noise reduction method |
Also Published As
Publication number | Publication date |
---|---|
WO2016188394A1 (en) | 2016-12-01 |
CN104967717A (en) | 2015-10-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104967717B (en) | Noise-reduction method under terminal speech interactive mode and device | |
CN104750420A (en) | Screen capturing method and device | |
CN106155695A (en) | The removing control device and method of background application | |
CN105760057A (en) | Screenshot device and method | |
CN105187521A (en) | Service processing device and method | |
CN106155694A (en) | A kind of application and the display packing and device applied of attending to anything else | |
CN104796956A (en) | Mobile terminal network switching method and mobile terminal | |
CN105138261A (en) | Shooting parameter adjustment apparatus and method | |
CN106168879A (en) | A kind of method that double-sided screen is mutual and terminal | |
CN105739998A (en) | Animation debug method and mobile terminal | |
CN104777982B (en) | Method and device for switching terminal input method | |
CN104820555A (en) | Transverse and vertical screen modes switching method and device, and terminal | |
CN105100416A (en) | Event reminding method and wearable device | |
CN104993180A (en) | Terminal charging method and device | |
CN106302086A (en) | A kind of different mobile terminal carries out the method for content synchronization, Apparatus and system | |
CN105357367A (en) | Keystroke identification device and method based on pressure sensor | |
CN106101455A (en) | Page jump management method, device and terminal | |
CN104881218A (en) | Mobile terminal screen scrolling method and mobile terminal screen scrolling device | |
CN106791141A (en) | A kind of method of adjustment and mobile terminal of sound effect parameters of conversing | |
CN104898940A (en) | Picture processing method and device | |
CN106302824A (en) | wireless WiFi control method, device and mobile terminal | |
CN106371682A (en) | Gesture recognition system based on proximity sensor and method thereof | |
CN105120067B (en) | The acquisition device of terminal sliding distance and method | |
CN106550133A (en) | Calling identification device and method | |
CN106231657A (en) | The method and device of dual-screen mobile terminal switching battery saving mode |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |