CN104967717A

CN104967717A - Noise reduction method and apparatus in terminal voice interaction mode

Info

Publication number: CN104967717A
Application number: CN201510274680.0A
Authority: CN
Inventors: 张圣杰; 申世安
Original assignee: Nubia Technology Co Ltd
Current assignee: Nubia Technology Co Ltd
Priority date: 2015-05-26
Filing date: 2015-05-26
Publication date: 2015-10-07
Anticipated expiration: 2035-05-26
Also published as: CN104967717B; WO2016188394A1

Abstract

The invention discloses a noise reduction method in terminal voice interaction mode, comprising obtaining a voice signal corresponding with voice information and a sound source position corresponding with the voice signal when receiving the voice information; determining noise reduction parameters corresponding with the voice signal in dependence on the position; and adjusting a corresponding voice signal according to the noise reduction parameters so as to perform noise reduction on the voice information. The invention also discloses a noise reduction apparatus in a terminal voice interaction mode. The noise reduction method and apparatus can realize active noise reduction in a voice interaction mode, reduce noise interference in a voice interaction process, and improve voice interaction effects.

Description

Noise-reduction method under terminal speech interactive mode and device

Technical field

The present invention relates to field of terminal, particularly relate to the noise-reduction method under terminal speech interactive mode and device.

Background technology

Along with the development of terminal technology, increasing terminal enters in the middle of daily life and work, and described terminal is for mobile terminal, and user can pass through mobile terminal typing voice, or carries out the interactive voice scenes such as voice call.At present, in mobile terminal sound typing or voice call process, there is no corresponding noise reduction technology, when carrying out telephone operation especially in the handsfree mode, due to environmental noise interference, cause more unpleasant to hear clear voice call content.Under current speech interaction mode, mainly by downstream signal noise reduction, downstream signal noise reduction is only by suppressing the far-end noise in downstream signal, is useless to the environmental noise of near-end.Therefore, there is not the mode of voice being carried out to active noise reduction, cause interactive voice process to produce noise jamming, affect the effect of interactive voice.

Summary of the invention

Main purpose of the present invention is to propose noise-reduction method under a kind of terminal speech interactive mode and device, under being intended to solve current speech interaction mode, there is not the mode of voice being carried out to active noise reduction, cause interactive voice process to produce noise jamming, affect the problem of the effect of interactive voice.

For achieving the above object, the noise-reduction method under a kind of terminal speech interactive mode provided by the invention, comprising: when receiving voice messaging, obtains the position of voice signal corresponding to described voice messaging and the corresponding sound source of described voice signal;

The noise reduction parameters of corresponding voice signal is determined according to described position;

The voice signal corresponding according to described noise reduction parameters adjustment, to carry out noise reduction to described voice messaging.

Preferably, the step of the position of the corresponding sound source of the described voice signal of described acquisition comprises:

Noise signal is identified from described voice signal;

Obtain the position of the corresponding sound source of described noise signal.

Preferably, the step of the position of the corresponding sound source of the described noise signal of described acquisition comprises:

The phase difference and the acoustic pressure that obtain the corresponding waveform of described noise signal are poor;

The three-dimensional space position of sound source corresponding to described noise signal relative to terminal is calculated according to described phase difference and acoustic pressure difference.

Preferably, describedly determine that the step of the noise reduction parameters that each voice signal is corresponding comprises according to described position:

The corresponding waveform of described noise signal is calculated relative to the phase difference of terminal and time delay according to described three-dimensional space position;

The noise reduction parameters of corresponding noise signal is generated according to described phase difference and time delay.

Preferably, the described step determining the noise reduction parameters that each voice signal is corresponding according to described position, replaces with:

Determine the state parameter that terminal is current;

The noise reduction parameters of corresponding voice signal is determined according to described state parameter and described position.

In addition, for achieving the above object, the present invention also proposes the denoising device under a kind of terminal speech interactive mode, comprising: acquisition module, for when receiving voice messaging, obtain the position of voice signal corresponding to described voice messaging and the corresponding sound source of described voice signal;

Determination module, for determining the noise reduction parameters of corresponding voice signal according to described position;

Noise reduction module, for the voice signal corresponding according to described noise reduction parameters adjustment, to carry out noise reduction to described voice messaging.

Preferably, described acquisition module comprises:

Recognition unit, for identifying noise signal from described voice signal;

Acquiring unit, for obtaining the position of the corresponding sound source of described noise signal.

Preferably, described acquiring unit, comprising:

Obtain subelement, poor for the phase difference and acoustic pressure obtaining the corresponding waveform of described noise signal;

Computation subunit, for calculating the three-dimensional space position of sound source corresponding to described noise signal relative to terminal according to described phase difference and acoustic pressure difference.

Preferably, described determination module comprises: determining unit and generation unit,

Described determining unit, for calculating the corresponding waveform of described noise signal relative to the phase difference of terminal and time delay according to described three-dimensional space position;

Described generation unit, for generating the noise reduction parameters of corresponding noise signal according to described phase difference and time delay.

Preferably, described determining unit, also for determining the state parameter that terminal is current; Also for determining the noise reduction parameters of corresponding voice signal according to described state parameter and described position.

What the present invention proposed determines corresponding noise reduction parameters by the sound source position of the voice signal according to the voice messaging received, and carries out noise reduction to voice messaging.Realize the active noise reduction under speech interaction mode, reduce in interactive voice process and produce noise jamming, improve interactive voice effect.

Accompanying drawing explanation

Fig. 1 is the hardware configuration signal of the mobile terminal realizing each embodiment of the present invention;

Fig. 2 is the wireless communication system schematic diagram of mobile terminal as shown in Figure 1;

Fig. 3 is the schematic flow sheet of the first embodiment of noise-reduction method under terminal speech interactive mode of the present invention;

Fig. 4 is the schematic flow sheet that the present invention obtains position one embodiment of the corresponding sound source of described voice signal;

Fig. 5 is the schematic flow sheet that the present invention obtains position one embodiment of the corresponding sound source of described noise signal;

Fig. 6 is the present invention determines noise reduction parameters one embodiment of corresponding voice signal schematic flow sheet according to described position;

The schematic flow sheet of the second embodiment of the noise-reduction method under Fig. 7 terminal speech interactive mode of the present invention;

Fig. 8 is the high-level schematic functional block diagram of the preferred embodiment of denoising device under terminal speech interactive mode of the present invention;

Fig. 9 is the refinement high-level schematic functional block diagram of acquisition module one embodiment in Fig. 8;

Figure 10 is the refinement high-level schematic functional block diagram of acquiring unit one embodiment in Fig. 9;

Figure 11 is the refinement high-level schematic functional block diagram of determination module one embodiment in Fig. 8.

The realization of the object of the invention, functional characteristics and advantage will in conjunction with the embodiments, are described further with reference to accompanying drawing.

Embodiment

Should be appreciated that specific embodiment described herein only in order to explain the present invention, be not intended to limit the present invention.

The mobile terminal realizing each embodiment of the present invention is described referring now to accompanying drawing.In follow-up description, use the suffix of such as " module ", " parts " or " unit " for representing element only in order to be conducive to explanation of the present invention, itself is specific meaning not.Therefore, " module " and " parts " can mixedly use.

Mobile terminal can be implemented in a variety of manners.Such as, the terminal described in the present invention can comprise the such as mobile terminal of mobile phone, smart phone, notebook computer, digit broadcasting receiver, PDA (personal digital assistant), PAD (panel computer), PMP (portable media player), guider etc. and the fixed terminal of such as digital TV, desktop computer etc.Below, suppose that terminal is mobile terminal.But it will be appreciated by those skilled in the art that except the element except being used in particular for mobile object, structure according to the embodiment of the present invention also can be applied to the terminal of fixed type.

Fig. 1 is the hardware configuration signal of the mobile terminal realizing each embodiment of the present invention.

Mobile terminal 100 can comprise wireless communication unit 110, A/V (audio/video) input unit 120, user input unit 130, sensing cell 140, output unit 150, memory 160, interface unit 170, controller 180 and power subsystem 190 etc.Fig. 1 shows the mobile terminal with various assembly, it should be understood that, does not require to implement all assemblies illustrated.Can alternatively implement more or less assembly.Will be discussed in more detail below the element of mobile terminal.

Wireless communication unit 110 generally includes one or more assembly, and it allows the radio communication between mobile terminal 100 and wireless communication system or network.Such as, wireless communication unit can comprise at least one in broadcast reception module 111, mobile communication module 112, wireless Internet module 113, short range communication module 114 and positional information module 115.

Broadcast reception module 111 via broadcast channel from external broadcasting management server receiving broadcast signal and/or broadcast related information.Broadcast channel can comprise satellite channel and/or terrestrial channel.Broadcast management server can be generate and send the server of broadcast singal and/or broadcast related information or the broadcast singal generated before receiving and/or broadcast related information and send it to the server of terminal.Broadcast singal can comprise TV broadcast singal, radio signals, data broadcasting signal etc.And broadcast singal may further include the broadcast singal combined with TV or radio signals.Broadcast related information also can provide via mobile communications network, and in this case, broadcast related information can be received by mobile communication module 112.Broadcast singal can exist in a variety of manners, such as, it can exist with the form of the electronic service guidebooks (ESG) of the electronic program guides of DMB (DMB) (EPG), digital video broadcast-handheld (DVB-H) etc.Broadcast reception module 111 can by using the broadcast of various types of broadcast system Received signal strength.Especially, broadcast reception module 111 can by using such as multimedia broadcasting-ground (DMB-T), DMB-satellite (DMB-S), digital video broadcasting-hand-held (DVB-H), forward link media (MediaFLO ^@) the digit broadcasting system receiving digital broadcast of Radio Data System, received terrestrial digital broadcasting integrated service (ISDB-T) etc.Broadcast reception module 111 can be constructed to be applicable to providing the various broadcast system of broadcast singal and above-mentioned digit broadcasting system.The broadcast singal received via broadcast reception module 111 and/or broadcast related information can be stored in memory 160 (or storage medium of other type).

Radio signal is sent at least one in base station (such as, access point, Node B etc.), exterior terminal and server and/or receives radio signals from it by mobile communication module 112.Various types of data that such radio signal can comprise voice call signal, video calling signal or send according to text and/or Multimedia Message and/or receive.

Wireless Internet module 113 supports the Wi-Fi (Wireless Internet Access) of mobile terminal.This module can be inner or be externally couple to terminal.Wi-Fi (Wireless Internet Access) technology involved by this module can comprise WLAN (WLAN) (Wi-Fi), Wibro (WiMAX), Wimax (worldwide interoperability for microwave access), HSDPA (high-speed downlink packet access) etc.

Short range communication module 114 is the modules for supporting junction service.Some examples of short-range communication technology comprise bluetooth ^tM, radio-frequency (RF) identification (RFID), Infrared Data Association (IrDA), ultra broadband (UWB), purple honeybee ^tMetc..

Positional information module 115 is the modules of positional information for checking or obtain mobile terminal.The typical case of positional information module is GPS (global positioning system).According to current technology, GPS module 115 calculates from the range information of three or more satellite and correct time information and for the Information application triangulation calculated, thus calculates three-dimensional current location information according to longitude, latitude and pin-point accuracy.Current, the method for calculating location and temporal information uses three satellites and by the error of the position that uses an other satellite correction calculation to go out and temporal information.In addition, GPS module 115 can carry out computational speed information by Continuous plus current location information in real time.

A/V input unit 120 is for audio reception or vision signal.A/V input unit 120 can comprise camera 121 and microphone 1220, and the view data of camera 121 to the static images obtained by image capture apparatus in Video Capture pattern or image capture mode or video processes.Picture frame after process may be displayed on display unit 151.Picture frame after camera 121 processes can be stored in memory 160 (or other storage medium) or via wireless communication unit 110 and send, and can provide two or more cameras 1210 according to the structure of mobile terminal.Such acoustic processing can via microphones sound (voice data) in telephone calling model, logging mode, speech recognition mode etc. operational mode, and can be voice data by microphone 122.Audio frequency (voice) data after process can be converted to the formatted output that can be sent to mobile communication base station via mobile communication module 112 when telephone calling model.Microphone 122 can be implemented various types of noise and eliminate (or suppress) algorithm and receiving and sending to eliminate (or suppression) noise or interference that produce in the process of audio signal.

User input unit 130 can generate key input data to control the various operations of mobile terminal according to the order of user's input.User input unit 130 allows user to input various types of information, and keyboard, the young sheet of pot, touch pad (such as, detecting the touch-sensitive assembly of the change of the resistance, pressure, electric capacity etc. that cause owing to being touched), roller, rocking bar etc. can be comprised.Especially, when touch pad is superimposed upon on display unit 151 as a layer, touch-screen can be formed.

Sensing cell 140 detects the current state of mobile terminal 100, (such as, mobile terminal 100 open or close state), the position of mobile terminal 100, user for mobile terminal 100 contact (namely, touch input) presence or absence, the orientation of mobile terminal 100, the acceleration or deceleration of mobile terminal 100 move and direction etc., and generate order or the signal of the operation for controlling mobile terminal 100.Such as, when mobile terminal 100 is embodied as sliding-type mobile phone, sensing cell 140 can sense this sliding-type phone and open or close.In addition, whether whether sensing cell 140 can detect power subsystem 190 provides electric power or interface unit 170 to couple with external device (ED).Sensing cell 140 can comprise proximity transducer 1410 and will be described this in conjunction with touch-screen below.

Interface unit 170 is used as at least one external device (ED) and is connected the interface that can pass through with mobile terminal 100.Such as, external device (ED) can comprise wired or wireless head-band earphone port, external power source (or battery charger) port, wired or wireless FPDP, memory card port, for connecting the port, audio frequency I/O (I/O) port, video i/o port, ear port etc. of the device with identification module.Identification module can be that storage uses the various information of mobile terminal 100 for authentication of users and can comprise subscriber identification module (UIM), client identification module (SIM), Universal Subscriber identification module (USIM) etc.In addition, the device (hereinafter referred to " recognition device ") with identification module can take the form of smart card, and therefore, recognition device can be connected with mobile terminal 100 via port or other jockey.Interface unit 170 may be used for receive from external device (ED) input (such as, data message, electric power etc.) and the input received be transferred to the one or more element in mobile terminal 100 or may be used for transmitting data between mobile terminal and external device (ED).

In addition, when mobile terminal 100 is connected with external base, interface unit 170 can be used as to allow by it electric power to be provided to the path of mobile terminal 100 from base or can be used as the path that allows to be transferred to mobile terminal by it from the various command signals of base input.The various command signal inputted from base or electric power can be used as and identify whether mobile terminal is arranged on the signal base exactly.Output unit 150 is constructed to provide output signal (such as, audio signal, vision signal, alarm signal, vibration signal etc.) with vision, audio frequency and/or tactile manner.Output unit 150 can comprise display unit 151, dio Output Modules 152, alarm unit 153 etc.

Display unit 151 may be displayed on the information of process in mobile terminal 100.Such as, when mobile terminal 100 is in telephone calling model, display unit 151 can show with call or other communicate (such as, text messaging, multimedia file are downloaded etc.) be correlated with user interface (UI) or graphic user interface (GUI).When mobile terminal 100 is in video calling pattern or image capture mode, display unit 151 can the image of display capture and/or the image of reception, UI or GUI that video or image and correlation function are shown etc.

Meanwhile, when display unit 151 and touch pad as a layer superposed on one another to form touch-screen time, display unit 151 can be used as input unit and output device.Display unit 151 can comprise at least one in liquid crystal display (LCD), thin-film transistor LCD (TFT-LCD), Organic Light Emitting Diode (OLED) display, flexible display, three-dimensional (3D) display etc.Some in these displays can be constructed to transparence and watch from outside to allow user, and this can be called transparent display, and typical transparent display can be such as TOLED (transparent organic light emitting diode) display etc.According to the specific execution mode wanted, mobile terminal 100 can comprise two or more display units (or other display unit), such as, mobile terminal can comprise outernal display unit (not shown) and inner display unit (not shown).Touch-screen can be used for detecting touch input pressure and touch input position and touch and inputs area.

When dio Output Modules 152 can be under the isotypes such as call signal receiving mode, call mode, logging mode, speech recognition mode, broadcast reception mode at mobile terminal, voice data convert audio signals that is that wireless communication unit 110 is received or that store in memory 160 and exporting as sound.And dio Output Modules 152 can provide the audio frequency relevant to the specific function that mobile terminal 100 performs to export (such as, call signal receives sound, message sink sound etc.).Dio Output Modules 152 can comprise loud speaker, buzzer etc.

Alarm unit 153 can provide and export that event informed to mobile terminal 100.Typical event can comprise calling reception, message sink, key signals input, touch input etc.Except audio or video exports, alarm unit 153 can provide in a different manner and export with the generation of notification event.Such as, alarm unit 153 can provide output with the form of vibration, when receive calling, message or some other enter communication (incoming communication) time, alarm unit 153 can provide sense of touch to export (that is, vibrating) to notify to user.By providing such sense of touch to export, even if when the mobile phone of user is in the pocket of user, user also can identify the generation of various event.Alarm unit 153 also can provide the output of the generation of notification event via display unit 151 or dio Output Modules 152.

Memory 160 software program that can store process and the control operation performed by controller 180 etc., or temporarily can store oneself through exporting the data (such as, telephone directory, message, still image, video etc.) that maybe will export.And, memory 160 can store about when touch be applied to touch-screen time the vibration of various modes that exports and the data of audio signal.

Memory 160 can comprise the storage medium of at least one type, described storage medium comprises flash memory, hard disk, multimedia card, card-type memory (such as, SD or DX memory etc.), random access storage device (RAM), static random-access memory (SRAM), read-only memory (ROM), Electrically Erasable Read Only Memory (EEPROM), programmable read only memory (PROM), magnetic storage, disk, CD etc.And mobile terminal 100 can be connected the memory function of execute store 160 network storage device with by network cooperates.

Controller 180 controls the overall operation of mobile terminal usually.Such as, controller 180 performs the control relevant to voice call, data communication, video calling etc. and process.In addition, controller 180 can comprise the multi-media module 1810 for reproducing (or playback) multi-medium data, and multi-media module 1810 can be configured in controller 180, or can be configured to be separated with controller 180.Controller 180 can pattern recognition process, is identified as character or image so that input is drawn in the handwriting input performed on the touchscreen or picture.

Power subsystem 190 receives external power or internal power and provides each element of operation and the suitable electric power needed for assembly under the control of controller 180.

Various execution mode described herein can to use such as computer software, the computer-readable medium of hardware or its any combination implements.For hardware implementation, execution mode described herein can by using application-specific IC (ASIC), digital signal processor (DSP), digital signal processing device (DSPD), programmable logic device (PLD), field programmable gate array (FPGA), processor, controller, microcontroller, microprocessor, being designed at least one performed in the electronic unit of function described herein and implementing, in some cases, such execution mode can be implemented in controller 180.For implement software, the execution mode of such as process or function can be implemented with allowing the independent software module performing at least one function or operation.Software code can be implemented by the software application (or program) write with any suitable programming language, and software code can be stored in memory 160 and to be performed by controller 180.

So far, oneself is through the mobile terminal according to its functional description.Below, for the sake of brevity, by the slide type mobile terminal that describes in various types of mobile terminals of such as folded form, board-type, oscillating-type, slide type mobile terminal etc. exemplarily.Therefore, the present invention can be applied to the mobile terminal of any type, and is not limited to slide type mobile terminal.

Mobile terminal 100 as shown in Figure 1 can be constructed to utilize and send the such as wired and wireless communication system of data via frame or grouping and satellite-based communication system operates.

Describe wherein according to the communication system that mobile terminal of the present invention can operate referring now to Fig. 2.

Such communication system can use different air interfaces and/or physical layer.Such as, the air interface used by communication system comprises such as frequency division multiple access (FDMA), time division multiple access (TDMA), code division multiple access (CDMA) and universal mobile telecommunications system (UMTS) (especially, Long Term Evolution (LTE)), global system for mobile communications (GSM) etc.As non-limiting example, description below relates to cdma communication system, but such instruction is equally applicable to the system of other type.

With reference to figure 2, cdma wireless communication system can comprise multiple mobile terminal 100, multiple base station (BS) 270, base station controller (BSC) 275 and mobile switching centre (MSC) 2800MSC280 and be constructed to form interface with Public Switched Telephony Network (PSTN) 290.MSC280 is also constructed to form interface with the BSC275 that can be couple to base station 270 via back haul link.Back haul link can construct according to any one in some interfaces that oneself knows, described interface comprises such as E1/T1, ATM, IP, PPP, frame relay, HDSL, ADSL or xDSL.Will be appreciated that system as shown in Figure 2 can comprise multiple BSC2750.

Each BS270 can serve one or more subregion (or region), by multidirectional antenna or point to specific direction each subregion of antenna cover radially away from BS270.Or each subregion can by two or more antenna covers for diversity reception.Each BS270 can be constructed to support multiple parallel compensate, and each parallel compensate has specific frequency spectrum (such as, 1.25MHz, 5MHz etc.).

Subregion can be called as CDMA Channel with intersecting of parallel compensate.BS270 also can be called as base station transceiver subsystem (BTS) or other equivalent terms.Under these circumstances, term " base station " may be used for broadly representing single BSC275 and at least one BS270.Base station also can be called as " cellular station ".Or each subregion of particular B S270 can be called as multiple cellular station.

As shown in Figure 2, broadcast singal is sent to the mobile terminal 100 at operate within systems by broadcsting transmitter (BT) 295.Broadcast reception module 111 as shown in Figure 1 is arranged on mobile terminal 100 and sentences the broadcast singal receiving and sent by BT295.In fig. 2, several global positioning system (GPS) satellite 300 is shown.Satellite 300 helps at least one in the multiple mobile terminal 100 in location.

In fig. 2, depict multiple satellite 300, but understand, the satellite of any number can be utilized to obtain useful locating information.GPS module 115 as shown in Figure 1 is constructed to coordinate to obtain the locating information wanted with satellite 300 usually.Substitute GPS tracking technique or outside GPS tracking technique, can use can other technology of position of tracking mobile terminal.In addition, at least one gps satellite 300 optionally or extraly can process satellite dmb transmission.

As a typical operation of wireless communication system, BS270 receives the reverse link signal from various mobile terminal 100.Mobile terminal 100 participates in call usually, information receiving and transmitting communicates with other type.Each reverse link signal that certain base station 270 receives is processed by particular B S270.The data obtained are forwarded to relevant BSC275.BSC provides call Resourse Distribute and comprises the mobile management function of coordination of the soft switching process between BS270.The data received also are routed to MSC280 by BSC275, and it is provided for the extra route service forming interface with PSTN290.Similarly, PSTN290 and MSC280 forms interface, and MSC and BSC275 forms interface, and BSC275 correspondingly control BS270 so that forward link signals is sent to mobile terminal 100.

Based on above-mentioned mobile terminal hardware configuration and communication system, each embodiment of noise-reduction method under terminal speech interactive mode of the present invention is proposed.

As shown in Figure 3, first embodiment of the invention proposes the noise-reduction method under a kind of terminal speech interactive mode, comprising:

Step S10, when receiving voice messaging, obtains the position of voice signal corresponding to described voice messaging and the corresponding sound source of described voice signal;

Under noise reduction process under terminal speech interactive mode in the embodiment of the present invention is preferably applied in handsfree talk mode, also can also be applied in other embodiments of the present invention in recording, instant messaging interactive voice scene.

In the present embodiment, described terminal is preferably mobile phone, pad etc. can carry out the electronic equipment of voice call, described terminal should at least assemble following device: for the outside speaker of hand-free function of conversing, for detecting the microphone array of sound source position, described microphone array is preferably 4, optimal setting positions is bottom and upper segment, one respectively, left and right, because current software can reach 0.1DBA for the accuracy of detection of sound pressure level, the accuracy of detection of phase difference can reach 0.1 degree, so during shift in position, software has stronger stability, do not need to change corresponding technology.The microphone array of terminal installation four or more, when opening hands-free, microphone array detects voice messaging, when receiving voice messaging, obtains the position of voice signal corresponding to described voice messaging and the corresponding sound source of described voice signal.Described voice signal comprises human voice signal and noise signal.

Concrete, with reference to figure 4, the process of the position of the corresponding sound source of the described voice signal of described acquisition comprises:

Step S11, identifies noise signal from described voice signal;

Step S12, obtains the position of the corresponding sound source of described noise signal.

Distinguish voice and environmental noise according to general voice detecting algorithm VAD, namely from described voice signal, identify human voice signal and noise signal, the signal of typing is screened, judge voice and noise.Determine the position of voice and the corresponding sound source of noise.

Concrete, with reference to figure 5, the process of the position of the corresponding sound source of the described noise signal of described acquisition comprises:

Step S121, the phase difference and the acoustic pressure that obtain the corresponding waveform of described noise signal are poor;

Step S122, calculates the three-dimensional space position of sound source corresponding to described noise signal relative to terminal according to described phase difference and acoustic pressure difference.

Poor by the phase difference and acoustic pressure that calculate microphone array typing waveform, compute location people sound source is relative to the three-dimensional space position of terminal respectively, and noise source of sound is relative to the three-dimensional space position of terminal.Each typing waveform due to sound source location difference, square being inversely proportional to of sound pressure level intensity and distance, phase difference is then relevant with reception delay, subtracted each other by each typing waveform, obtain phase difference harmony pressure reduction, according to the speed formula that sound wave is propagated in atmosphere, can be regarded as out the positional distance of sound source to each MIC, finding intersection point by drawing circle, can judge that current sound source is arranged in the particular location in space relative to terminal.

Step S20, determines the noise reduction parameters of corresponding voice signal according to described position;

Determine the noise reduction parameters of corresponding voice signal according to described position, when there is multiple noise signal, sorting out, extract one by one according to noise spectrum sequence, superpose reverse compensation sound wave respectively, that is, described noise reduction parameters is the phase place of reverse compensation sound wave.

Concrete, with reference to figure 6, describedly determine that according to described position the process of the noise reduction parameters of corresponding voice signal can comprise:

Step S21, calculates the corresponding waveform of described noise signal relative to the phase difference of terminal and time delay according to described three-dimensional space position;

Step S22, generates the noise reduction parameters of corresponding noise signal according to described phase difference and time delay.

After determining that voice sound source and noise source are relative to the three-dimensional space position of terminal, calculate the distance of voice and noise sound source, when there is multiple noise sound source, the distance calculating voice and each noise sound source obtains multiple distance, according to each noise and the distance of voice sound source and the frequency of noise source of sound, calculate the phase difference of terminal typing waveform between terminal MIC and people's ear position and time delay, generate the noise reduction parameters of corresponding noise signal according to described phase difference and time delay.Each noise, because of position difference, so the noise reduction parameters obtained is all different, when there is multiple noise, is determined the noise reduction parameters that each noise is corresponding, that is, is obtained the phase place of multiple reverse compensation sound wave.

Step S30, the voice signal corresponding according to described noise reduction parameters adjustment, to carry out noise reduction to described voice messaging.

In the phase place that described noise reduction parameters is reverse compensation sound wave, by adding the phase place of reverse compensation sound wave, reaching the object of noise reduction, effectively avoiding the interference of environmental noise.The embodiment of the present invention produces the reverse sound wave equal with near-end ambient noise by noise reduction system, is neutralized by noise, thus realizes the effect of active noise reduction.

The present embodiment determines corresponding noise reduction parameters by the sound source position of the voice signal according to the voice messaging received, and carries out noise reduction to voice messaging.Realize the active noise reduction under speech interaction mode, reduce in interactive voice process and produce noise jamming, improve interactive voice effect.

With reference to figure 7, propose the second embodiment of the noise-reduction method under terminal speech interactive mode of the present invention, based on the first embodiment of the noise-reduction method under above-mentioned terminal speech interactive mode, described step S30 can comprise:

Step S23, determines the state parameter that terminal is current;

Step S24, determines the noise reduction parameters of corresponding voice signal according to described state parameter and described position.

In the present embodiment, should be noted that the placement state of terminal, namely the state parameter that terminal is current is determined, such as, face up, reverse side upward, the situation of side, can according to gravity sensor, the data that the instruments such as gyroscope provide, under different placing attitude, different bearing datas is provided, and then the phase place to the reverse compensation sound wave superposed, thus reach the function of hands-free active noise reduction, the noise reduction parameters of corresponding voice signal is determined by the state parameter and described position combining active, such as, when terminal is front, the noise reduction parameters of corresponding voice signal is determined according to this state parameter of front and position.The present embodiment under different placing attitude, that is, under terminal is in different states, provides different SOT state of termination, and then revises the reverse compensation waveform of superposition.Further raising noise reduction.

The present invention further provides the denoising device under a kind of terminal speech interactive mode.

With reference to the high-level schematic functional block diagram that Fig. 8, Fig. 8 are the denoising device preferred embodiment under terminal speech interactive mode of the present invention.

Denoising device under a kind of terminal speech interactive mode of the present embodiment, the denoising device under described terminal speech interactive mode comprises: acquisition module 10,

Described acquisition module 10, for when receiving voice messaging, obtains the position of voice signal corresponding to described voice messaging and the corresponding sound source of described voice signal;

With reference to figure 9, described acquisition module 10 comprises recognition unit 11 and acquiring unit 12,

Described recognition unit 11, for identifying noise signal from described voice signal;

Described acquiring unit 12, for obtaining the position of the corresponding sound source of described noise signal.

Concrete, with reference to Figure 10, described acquiring unit 12 comprises acquisition subelement 121 and computation subunit 122,

Described acquisition subelement 121, poor for the phase difference and acoustic pressure obtaining the corresponding waveform of described noise signal;

Described computation subunit 122, for calculating the three-dimensional space position of sound source corresponding to described noise signal relative to terminal according to described phase difference and acoustic pressure difference.

Described determination module 20, also for determining the noise reduction parameters of corresponding voice signal according to described position;

Concrete, with reference to Figure 11, described determination module 20 comprises: determining unit 21 and generation unit 22,

Described determining unit 21, for calculating the corresponding waveform of described noise signal relative to the phase difference of terminal and time delay according to described three-dimensional space position;

Described generation unit 22, for generating the noise reduction parameters of corresponding noise signal according to described phase difference and time delay.

Described noise reduction module 30, for the voice signal corresponding according to described noise reduction parameters adjustment, to carry out noise reduction to described voice messaging.

The present embodiment determines corresponding noise reduction parameters by the sound source position of the voice signal according to the voice messaging received, and carries out noise reduction to voice messaging.Realize the noise reduction under speech interaction mode, reduce in interactive voice process and produce noise jamming, improve interactive voice effect.

Further, described determining unit 21, also for determining the state parameter that terminal is current; Also for determining the noise reduction parameters of corresponding voice signal according to described state parameter and described position.

It should be noted that, in this article, term " comprises ", " comprising " or its any other variant are intended to contain comprising of nonexcludability, thus make to comprise the process of a series of key element, method, article or device and not only comprise those key elements, but also comprise other key elements clearly do not listed, or also comprise by the intrinsic key element of this process, method, article or device.When not more restrictions, the key element limited by statement " comprising ... ", and be not precluded within process, method, article or the device comprising this key element and also there is other identical element.

The invention described above embodiment sequence number, just to describing, does not represent the quality of embodiment.

Through the above description of the embodiments, those skilled in the art can be well understood to the mode that above-described embodiment method can add required general hardware platform by software and realize, hardware can certainly be passed through, but in a lot of situation, the former is better execution mode.Based on such understanding, technical scheme of the present invention can embody with the form of software product the part that prior art contributes in essence in other words, this computer software product is stored in a storage medium (as ROM/RAM, magnetic disc, CD), comprising some instructions in order to make a station terminal equipment (can be mobile phone, computer, server, air conditioner, or the network equipment etc.) perform method described in each embodiment of the present invention.

These are only the preferred embodiments of the present invention; not thereby the scope of the claims of the present invention is limited; every utilize specification of the present invention and accompanying drawing content to do equivalent structure or equivalent flow process conversion; or be directly or indirectly used in other relevant technical fields, be all in like manner included in scope of patent protection of the present invention.

Claims

1. the noise-reduction method under terminal speech interactive mode, is characterized in that, comprising:

When receiving voice messaging, obtain the position of voice signal corresponding to described voice messaging and the corresponding sound source of described voice signal;

2. the noise-reduction method under terminal speech interactive mode as claimed in claim 1, is characterized in that, the step of the position of the corresponding sound source of the described voice signal of described acquisition comprises:

Noise signal is identified from described voice signal;

3. the noise-reduction method under terminal speech interactive mode as claimed in claim 2, is characterized in that, the step of the position of the corresponding sound source of the described noise signal of described acquisition comprises:

4. the noise-reduction method under terminal speech interactive mode as claimed in claim 3, is characterized in that, describedly determines that the step of the noise reduction parameters that each voice signal is corresponding comprises according to described position:

5. the noise-reduction method under the terminal speech interactive mode as described in any one of Claims 1-4, is characterized in that, the described step determining the noise reduction parameters that each voice signal is corresponding according to described position, replaces with:

Determine the state parameter that terminal is current;

6. the denoising device under terminal speech interactive mode, is characterized in that, comprising:

Acquisition module, for when receiving voice messaging, obtains the position of voice signal corresponding to described voice messaging and the corresponding sound source of described voice signal;

7. the denoising device under terminal speech interactive mode as claimed in claim 6, it is characterized in that, described acquisition module comprises:

Recognition unit, for identifying noise signal from described voice signal;

8. the denoising device under terminal speech interactive mode as claimed in claim 7, it is characterized in that, described acquiring unit, comprising:

9. the denoising device under terminal speech interactive mode as claimed in claim 8, it is characterized in that, described determination module comprises: determining unit and generation unit,

10. the denoising device under terminal speech interactive mode as claimed in claim 9, is characterized in that, described determining unit, also for determining the state parameter that terminal is current; Also for determining the noise reduction parameters of corresponding voice signal according to described state parameter and described position.