WO2016188394A1 - 降噪方法、降噪装置和计算机可读存储介质 - Google Patents

降噪方法、降噪装置和计算机可读存储介质 Download PDF

Info

Publication number
WO2016188394A1
WO2016188394A1 PCT/CN2016/083032 CN2016083032W WO2016188394A1 WO 2016188394 A1 WO2016188394 A1 WO 2016188394A1 CN 2016083032 W CN2016083032 W CN 2016083032W WO 2016188394 A1 WO2016188394 A1 WO 2016188394A1
Authority
WO
WIPO (PCT)
Prior art keywords
noise reduction
noise
signal
voice signal
sound source
Prior art date
Application number
PCT/CN2016/083032
Other languages
English (en)
French (fr)
Inventor
张圣杰
申世安
Original Assignee
努比亚技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 努比亚技术有限公司 filed Critical 努比亚技术有限公司
Publication of WO2016188394A1 publication Critical patent/WO2016188394A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/02Constructional features of telephone sets
    • H04M1/19Arrangements of transmitters, receivers, or complete sets to prevent eavesdropping, to attenuate local noise or to prevent undesired transmission; Mouthpieces or receivers specially adapted therefor

Definitions

  • Embodiments of the present invention relate to, but are not limited to, a noise reduction method and apparatus and a computer readable storage medium in a terminal voice interaction mode.
  • the downlink signal is mainly used for noise reduction, and the downlink signal noise reduction is only used to suppress the far-end noise in the downlink signal, which is useless for the near-end environmental noise. That is to say, during the voice input or voice call of the mobile terminal, the noise of the near-end environment is not denoised, so that especially when the call operation is performed in the hands-free mode, it is more difficult to listen due to environmental noise interference. Clear voice call content.
  • Embodiments of the present invention provide a noise reduction method, a noise reduction device, and a computer readable storage medium, which are capable of actively reducing noise in a voice interaction mode, reducing noise interference generated during a voice interaction process, thereby improving voice. Interaction effect.
  • a method for reducing noise according to an embodiment of the present invention includes: acquiring a voice signal corresponding to the received voice information and a location of the sound source corresponding to the voice signal;
  • the acquiring the location of the sound signal corresponding to the sound source comprises: identifying a noise signal from the voice signal; and acquiring a location of the sound signal corresponding to the sound source.
  • the acquiring the location of the noise signal corresponding to the sound source comprises: acquiring a phase difference and a sound pressure difference of the waveform corresponding to the noise signal;
  • determining, according to the location, the noise reduction parameter corresponding to the voice signal includes:
  • a noise reduction parameter corresponding to the noise signal is generated according to the phase difference and the delay.
  • the method before the adjusting the corresponding voice signal according to the determined noise reduction parameter, the method further includes: determining a current state parameter of the terminal;
  • Determining, according to the location, the noise reduction parameter corresponding to the voice signal comprises: determining a noise reduction parameter of the corresponding voice signal according to the state parameter and the location.
  • the embodiment of the present invention further provides a noise reduction method, including acquiring a voice signal corresponding to the received voice information and a location of the sound source corresponding to the voice signal;
  • the acquiring the location of the sound signal corresponding to the sound source comprises: identifying a noise signal from the voice signal; and acquiring a location of the sound signal corresponding to the sound source.
  • the acquiring the location of the noise signal corresponding to the sound source comprises: acquiring a phase difference and a sound pressure difference of the waveform corresponding to the noise signal;
  • determining, according to the location, the noise reduction parameter corresponding to the voice signal includes:
  • a noise reduction parameter corresponding to the noise signal is generated according to the phase difference and the delay.
  • the state parameter is a state in which the terminal is placed.
  • the embodiment of the invention further provides a noise reduction device, comprising an acquisition module, a first determination module, and a noise reduction module; wherein
  • An acquiring module configured to: when receiving the voice information, acquire a voice signal corresponding to the voice information and a location of the sound source corresponding to the voice signal;
  • a first determining module configured to determine a noise reduction parameter of the corresponding voice signal according to the obtained position
  • the noise reduction module is configured to adjust the corresponding voice signal according to the determined noise reduction parameter.
  • the obtaining module includes:
  • An identification unit configured to identify a noise signal from the voice signal
  • an acquiring unit configured to acquire a position of the sound source corresponding to the sound source.
  • the obtaining unit includes:
  • Obtaining a subunit configured to obtain a phase difference and a sound pressure difference of the waveform corresponding to the noise signal
  • the calculating subunit is configured to calculate a three-dimensional spatial position of the sound source corresponding to the noise signal according to the phase difference and the sound pressure difference.
  • the first determining module includes:
  • a first determining unit configured to calculate a phase difference and a delay of the corresponding waveform of the noise signal relative to the terminal according to the three-dimensional spatial position
  • the first generating unit is configured to generate a noise reduction parameter corresponding to the noise signal according to the phase difference and the delay.
  • the first determining unit is further configured to determine a current state parameter of the terminal, and determine a noise reduction parameter of the corresponding voice signal according to the state parameter and the location.
  • the embodiment of the invention further provides a noise reduction device, comprising an acquisition module, a second determination module, and a noise reduction module; wherein
  • An acquiring module configured to: when receiving the voice information, acquire a voice signal corresponding to the voice information and a location of the sound source corresponding to the voice signal;
  • a second determining module configured to determine a current state parameter of the terminal, and determine a noise reduction parameter of the corresponding voice signal according to the state parameter and the location;
  • the noise reduction module is configured to adjust the corresponding voice signal to the determined noise reduction parameter.
  • the obtaining module includes:
  • An identification unit configured to identify a noise signal from the voice signal
  • an acquiring unit configured to acquire a position of the sound source corresponding to the sound source.
  • the obtaining unit includes:
  • Obtaining a subunit configured to obtain a phase difference and a sound pressure difference of the waveform corresponding to the noise signal
  • the calculating subunit is configured to calculate a three-dimensional spatial position of the sound source corresponding to the noise signal according to the phase difference and the sound pressure difference.
  • the second determining module includes:
  • a second determining unit configured to calculate a phase difference and a delay of the corresponding waveform of the noise signal relative to the terminal according to the three-dimensional spatial position; and determine a current state parameter of the terminal;
  • the second generating unit is configured to generate a noise reduction parameter corresponding to the noise signal according to the phase difference and the delay, and determining the current state parameter of the terminal.
  • the embodiment of the present invention further provides a computer readable storage medium storing computer executable instructions for performing the method for implementing end-to-end call encryption according to any of the above.
  • the embodiment of the invention further provides a computer readable storage medium storing computer executable instructions for performing any of the above methods for implementing end-to-end call encryption.
  • the noise reduction parameter is determined by determining the corresponding noise reduction parameter according to the sound source position of the voice signal of the received voice information, and denoising the voice information.
  • the active noise reduction in the voice interaction mode is realized, the noise interference generated during the voice interaction process is reduced, and the voice interaction effect is improved.
  • FIG. 1 is a schematic structural diagram of hardware of a mobile terminal that implements various embodiments of the present invention
  • FIG. 2 is a schematic diagram of a wireless communication system of the mobile terminal shown in FIG. 1;
  • FIG. 3 is a schematic flowchart of a first embodiment of a method for reducing noise in a voice interaction mode of a terminal according to an embodiment of the present invention
  • FIG. 4 is a schematic flowchart of an embodiment of acquiring a location of a sound source corresponding to a voice signal according to an embodiment of the present invention
  • FIG. 5 is a schematic flowchart of an embodiment of acquiring a position of a sound source corresponding to a noise signal according to an embodiment of the present invention
  • FIG. 6 is a schematic flowchart of an embodiment of determining a noise reduction parameter of a corresponding voice signal according to the location according to the embodiment;
  • FIG. 7 is a schematic flowchart diagram of a second embodiment of a noise reduction method in a terminal voice interaction mode according to an embodiment of the present invention.
  • FIG. 8 is a schematic structural diagram of an embodiment of a noise reduction apparatus in a voice interaction mode of a terminal according to an embodiment of the present disclosure
  • FIG. 9 is a schematic structural diagram of an embodiment of an acquisition module of FIG. 8;
  • FIG. 10 is a schematic structural diagram of an embodiment of an acquiring unit in FIG. 9; FIG.
  • FIG. 11 is a schematic structural diagram of an embodiment of the determining module of FIG. 8.
  • FIG. 11 is a schematic structural diagram of an embodiment of the determining module of FIG. 8.
  • the mobile terminal can be implemented in various forms.
  • the terminal described in the present invention may include, for example, a mobile phone, a smart phone, a notebook computer, a digital broadcast receiver, a PDA (Personal Digital Assistant), a PAD (Tablet), a PMP (Portable Multimedia Player), a navigation device, etc.
  • Mobile terminals and fixed terminals such as digital TVs, desktop computers, and the like.
  • the terminal is a mobile terminal.
  • those skilled in the art will appreciate that configurations in accordance with embodiments of the present invention can be applied to fixed type terminals in addition to components that are specifically for mobile purposes.
  • FIG. 1 is a schematic diagram showing the hardware structure of a mobile terminal embodying various embodiments of the present invention.
  • the mobile terminal 100 may include a wireless communication unit 110, an A/V (Audio/Video) input unit 120, a user input unit 130, a sensing unit 140, an output unit 150, a memory 160, an interface unit 170, a controller 180, and a power supply unit 190. and many more.
  • Figure 1 illustrates a mobile terminal having various components, but it should be understood that not all illustrated components are required to be implemented. More or fewer components can be implemented instead. The elements of the mobile terminal will be described in detail below.
  • Wireless communication unit 110 typically includes one or more components that permit radio communication between mobile terminal 100 and a wireless communication system or network.
  • the wireless communication unit may include at least one of a broadcast receiving module 111, a mobile communication module 112, a wireless internet module 113, a short-range communication module 114, and a location information module 115.
  • the broadcast receiving module 111 receives a broadcast signal and/or broadcast associated information from an external broadcast management server via a broadcast channel.
  • the broadcast channel can include a satellite channel and/or a terrestrial channel.
  • the broadcast management server may be a server that generates and transmits a broadcast signal and/or broadcast associated information or a server that receives a previously generated broadcast signal and/or broadcast associated information and transmits it to the terminal.
  • the broadcast signal may include a TV broadcast signal, a radio broadcast signal, a data broadcast signal, and the like.
  • the broadcast signal may further include a broadcast signal combined with a TV or radio broadcast signal.
  • the broadcast associated information may also be provided via a mobile communication network, and in this case, the broadcast associated information may be received by the mobile communication module 112.
  • the broadcast signal may exist in various forms, for example, it may exist in the form of Digital Multimedia Broadcasting (DMB) Electronic Program Guide (EPG), Digital Video Broadcasting Handheld (DVB-H) Electronic Service Guide (ESG), and the like.
  • the broadcast receiving module 111 can receive a signal broadcast by using various types of broadcast systems.
  • the broadcast receiving module 111 can use forward link media (MediaFLO) by using, for example, multimedia broadcast-terrestrial (DMB-T), digital multimedia broadcast-satellite (DMB-S), digital video broadcast-handheld (DVB-H) @) data broadcasting system, integrated services digital broadcasting terrestrial (ISDB-T) digital broadcasting systems, etc. to receive digital broadcasts.
  • the broadcast receiving module 111 can be constructed as various broadcast systems suitable for providing broadcast signals as well as the above-described digital broadcast system.
  • the broadcast signal and/or broadcast associated information received via the broadcast receiving module 111 may be stored in the memory 160 (or other type of storage medium).
  • the mobile communication module 112 transmits the radio signals to and/or receives radio signals from at least one of a base station (e.g., an access point, a Node B, etc.), an external terminal, and a server.
  • a base station e.g., an access point, a Node B, etc.
  • Such radio signals may include voice call signals, video call signals, or various types of data transmitted and/or received in accordance with text and/or multimedia messages.
  • the wireless internet module 113 supports wireless internet access of the mobile terminal.
  • the module can be internally or externally coupled to the terminal.
  • the wireless Internet access technologies involved in the module may include WLAN (Wireless LAN) (Wi-Fi), Wibro (Wireless Broadband), Wimax (Worldwide Interoperability for Microwave Access), HSDPA (High Speed Downlink Packet Access), etc. .
  • the short range communication module 114 is a module for supporting short range communication.
  • Some examples of short-range communication technology include Bluetooth TM, a radio frequency identification (RFID), infrared data association (IrDA), ultra wideband (UWB), ZigBee, etc. TM.
  • the location information module 115 is a module for checking or acquiring location information of the mobile terminal.
  • a typical example of a location information module is GPS (Global Positioning System).
  • GPS Global Positioning System
  • the GPS module 115 calculates distance information and accurate time information from three or more satellites and applies triangulation to the calculated information to accurately calculate three-dimensional current position information based on longitude, latitude, and altitude.
  • the method for calculating position and time information uses three satellites and corrects the calculated position and time information errors by using another satellite.
  • the GPS module 115 is capable of calculating speed information by continuously calculating current position information in real time.
  • the A/V input unit 120 is for receiving an audio or video signal.
  • the A/V input unit 120 may include a camera 121 and a microphone 1220 that processes image data of still pictures or video obtained by the image capturing device in a video capturing mode or an image capturing mode.
  • the processed image frame can be displayed on the display unit 151.
  • the image frames processed by the camera 121 may be stored in the memory 160 (or other storage medium) or transmitted via the wireless communication unit 110, and two or more cameras 1210 may be provided according to the configuration of the mobile terminal.
  • the microphone 122 can be in a telephone call mode, a recording mode, and a language Sound (audio data) is received via a microphone in a sound recognition mode or the like, and such sound can be processed as audio data.
  • the processed audio (voice) data can be converted to a format output that can be transmitted to the mobile communication base station via the mobile communication module 112 in the case of a telephone call mode.
  • the microphone 122 can implement various types of noise cancellation (or suppression) algorithms to cancel (or suppress) noise or interference generated during the process of receiving and transmitting audio signals.
  • the user input unit 130 may generate key input data according to a command input by the user to control various operations of the mobile terminal.
  • the user input unit 130 allows the user to input various types of information, and may include a keyboard, a pot, a touch pad (eg, a touch sensitive component that detects changes in resistance, pressure, capacitance, etc. due to contact), a scroll wheel , rocker, etc.
  • a touch screen can be formed.
  • the sensing unit 140 detects the current state of the mobile terminal 100 (eg, the open or closed state of the mobile terminal 100), the location of the mobile terminal 100, the presence or absence of contact (ie, touch input) by the user with the mobile terminal 100, and the mobile terminal.
  • the sensing unit 140 can sense whether the slide type phone is turned on or off.
  • the sensing unit 140 can detect whether the power supply unit 190 provides power or whether the interface unit 170 is coupled to an external device.
  • Sensing unit 140 may include proximity sensor 1410 which will be described below in connection with a touch screen.
  • the interface unit 170 serves as an interface through which at least one external device can connect with the mobile terminal 100.
  • the external device may include a wired or wireless headset port, an external power (or battery charger) port, a wired or wireless data port, a memory card port, a port for connecting a device having an identification module, and an audio input/output. (I/O) port, video I/O port, headphone port, and more.
  • the identification module may be stored to verify various information used by the user using the mobile terminal 100 and may include a User Identification Module (UIM), a Customer Identification Module (SIM), a Universal Customer Identity Module (USIM), and the like.
  • the device having the identification module may take the form of a smart card, and thus the identification device may be connected to the mobile terminal 100 via a port or other connection device.
  • the interface unit 170 can be configured to receive input from an external device (eg, data information, power, etc.) and transmit the received input to one or more components within the mobile terminal 100 or can be used at the mobile terminal and external device Transfer data between.
  • Output unit 150 is configured to provide an output signal (eg, an audio signal, a video signal, an alarm signal, a vibration signal, etc.) in a visual, audio, and/or tactile manner.
  • the output unit 150 may include a display unit 151, an audio output module 152, an alarm unit 153, and the like.
  • the display unit 151 can display information processed in the mobile terminal 100. For example, when the mobile terminal 100 is in a phone call mode, the display unit 151 can display a user interface (UI) or a graphical user interface (GUI) related to a call or other communication (eg, text messaging, multimedia file download, etc.). When the mobile terminal 100 is in a video call mode or an image capturing mode, the display unit 151 may display a captured image and/or a received image, a UI or GUI showing a video or image and related functions, and the like.
  • UI user interface
  • GUI graphical user interface
  • the display unit 151 can function as an input device and an output device.
  • the display unit 151 may include at least one of a liquid crystal display (LCD), a thin film transistor LCD (TFT-LCD), an organic light emitting diode (OLED) display, a flexible display, a three-dimensional (3D) display, and the like.
  • LCD liquid crystal display
  • TFT-LCD thin film transistor LCD
  • OLED organic light emitting diode
  • a flexible display a three-dimensional (3D) display, and the like.
  • 3D three-dimensional
  • Some of these displays may be configured to be transparent to allow a user to view from the outside, which may be referred to as a transparent display, and a typical transparent display may be, for example, a TOLED (Transparent Organic Light Emitting Diode) display or the like.
  • TOLED Transparent Organic Light Emitting Diode
  • the mobile terminal 100 may include two or more display units (or other display devices), for example, the mobile terminal may include an external display unit (not shown) and an internal display unit (not shown) .
  • the touch screen can be used to detect touch input pressure as well as touch input position and touch input area.
  • the audio output module 152 may convert audio data received by the wireless communication unit 110 or stored in the memory 160 when the mobile terminal is in a call signal receiving mode, a call mode, a recording mode, a voice recognition mode, a broadcast receiving mode, and the like.
  • the audio signal is output as sound.
  • the audio output module 152 can provide audio output (eg, call signal reception sound, message reception sound, etc.) associated with a particular function performed by the mobile terminal 100.
  • the audio output module 152 can include a speaker, a buzzer, and the like.
  • the alarm unit 153 can provide an output to notify the mobile terminal 100 of the occurrence of an event. Typical events may include call reception, message reception, key signal input, touch input, and the like. In addition to the audio or video output, the alert unit 153 can provide an output in a different manner to notify the event Health. For example, the alarm unit 153 can provide an output in the form of vibrations, and when a call, message, or some other incoming communication is received, the alarm unit 153 can provide a tactile output (ie, vibration) to notify the user of it. By providing such a tactile output, the user is able to recognize the occurrence of various events even when the user's mobile phone is in the user's pocket. The alarm unit 153 can also provide an output of the notification event occurrence via the display unit 151 or the audio output module 152.
  • the memory 160 may store a software program or the like for processing and control operations performed by the controller 180, or may temporarily store data (for example, a phone book, a message, a still image, a video, etc.) that has been output or is to be output. Moreover, the memory 160 can store data regarding vibrations and audio signals of various manners that are output when a touch is applied to the touch screen.
  • the memory 160 may include at least one type of storage medium including a flash memory, a hard disk, a multimedia card, a card type memory (eg, SD or DX memory, etc.), a random access memory (RAM), a static random access memory ( SRAM), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), programmable read only memory (PROM), magnetic memory, magnetic disk, optical disk, and the like.
  • the mobile terminal 100 can cooperate with a network storage device that performs a storage function of the memory 160 through a network connection.
  • the controller 180 typically controls the overall operation of the mobile terminal. For example, the controller 180 performs the control and processing associated with voice calls, data communications, video calls, and the like. Additionally, the controller 180 can include a multimedia module 1810 for reproducing (or playing back) multimedia data, which can be constructed within the controller 180 or can be configured to be separate from the controller 180. The controller 180 may perform a pattern recognition process to recognize a handwriting input or a picture drawing input performed on the touch screen as a character or an image.
  • the power supply unit 190 receives external power or internal power under the control of the controller 180 and provides appropriate power required to operate the various components and components.
  • the various embodiments described herein can be implemented in a computer readable medium using, for example, computer software, hardware, or any combination thereof.
  • the embodiments described herein may be through the use of application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable gate arrays ( FPGA), processor, controller, microcontroller, microprocessor, in an electronic unit designed to perform the functions described herein
  • ASICs application specific integrated circuits
  • DSPs digital signal processors
  • DSPDs digital signal processing devices
  • PLDs programmable logic devices
  • FPGA field programmable gate arrays
  • processor controller, microcontroller, microprocessor
  • controller 180 programmable logic devices
  • implementations such as procedures or functions may be implemented with separate software modules that permit the execution of at least one function or operation.
  • the software code can be implemented by a software application (or program) written in any suitable programming language, which can be stored in memory 160 and executed by controller 180
  • the mobile terminal has been described in terms of its function.
  • a slide type mobile terminal among various types of mobile terminals such as a folding type, a bar type, a swing type, a slide type mobile terminal, and the like will be described as an example. Therefore, the present invention can be applied to any type of mobile terminal, and is not limited to a slide type mobile terminal.
  • the mobile terminal 100 as shown in FIG. 1 may be configured to operate using a communication system such as a wired and wireless communication system and a satellite-based communication system that transmits data via frames or packets.
  • a communication system such as a wired and wireless communication system and a satellite-based communication system that transmits data via frames or packets.
  • Such communication systems may use different air interfaces and/or physical layers.
  • air interfaces used by communication systems include, for example, Frequency Division Multiple Access (FDMA), Time Division Multiple Access (TDMA), Code Division Multiple Access (CDMA), and Universal Mobile Telecommunications System (UMTS) (in particular, Long Term Evolution (LTE)). ), Global System for Mobile Communications (GSM), etc.
  • FDMA Frequency Division Multiple Access
  • TDMA Time Division Multiple Access
  • CDMA Code Division Multiple Access
  • UMTS Universal Mobile Telecommunications System
  • LTE Long Term Evolution
  • GSM Global System for Mobile Communications
  • the following description relates to a CDMA communication system, but such teachings are equally applicable to other types of systems.
  • a CDMA wireless communication system may include a plurality of mobile terminals 100, a plurality of base stations (BS) 270, a base station controller (BSC) 275, and a mobile switching center (MSC) 2800 MSC 280 configured to communicate with a public switched telephone network (PSTN). 290 forms an interface.
  • the MSC 280 is also configured to interface with a BSC 275 that can be coupled to the base station 270 via a backhaul line.
  • the backhaul line can be constructed in accordance with any of a number of well known interfaces including, for example, E1/T1, ATM, IP, PPP, Frame Relay, HDSL, ADSL, or xDSL. It will be appreciated that the system as shown in FIG. 2 may include multiple BSC 2750s.
  • Each BS 270 can serve one or more partitions (or regions), each of which is covered by a multi-directional antenna or an antenna directed to a particular direction radially away from the BS 270. Alternatively, each partition may be covered by two or more antennas for diversity reception. Each BS 270 can be configured to support multiple frequency allocations, and each frequency allocation has a particular frequency spectrum (eg, 1.25 MHz, 5 MHz, etc.).
  • BS270 can also be called base Station Transceiver Subsystem (BTS) or other equivalent terminology.
  • BTS base Station Transceiver Subsystem
  • the term "base station” can be used to generally refer to a single BSC 275 and at least one BS 270.
  • a base station can also be referred to as a "cell station.”
  • each partition of a particular BS 270 may be referred to as a plurality of cellular stations.
  • a broadcast transmitter (BT) 295 transmits a broadcast signal to the mobile terminal 100 operating within the system.
  • a broadcast receiving module 111 as shown in FIG. 1 is provided at the mobile terminal 100 to receive a broadcast signal transmitted by the BT 295.
  • GPS Global Positioning System
  • the satellite 300 helps locate at least one of the plurality of mobile terminals 100.
  • a plurality of satellites 300 are depicted, but it is understood that useful positioning information can be obtained using any number of satellites.
  • the GPS module 115 as shown in Figure 1 is typically configured to cooperate with the satellite 300 to obtain desired positioning information. Instead of GPS tracking technology or in addition to GPS tracking technology, other techniques that can track the location of the mobile terminal can be used. Additionally, at least one GPS satellite 300 can selectively or additionally process satellite DMB transmissions.
  • BS 270 receives inverted link signals from various mobile terminals 100.
  • Mobile terminal 100 typically participates in calls, messaging, and other types of communications.
  • Each inverted link signal received by a particular base station 270 is processed within a particular BS 270.
  • the obtained data is forwarded to the relevant BSC 275.
  • the BSC provides call resource allocation and coordinated mobility management functions including a soft handoff procedure between the BSs 270.
  • the BSC 275 also routes the received data to the MSC 280, which provides additional routing services for interfacing with the PSTN 290.
  • PSTN 290 interfaces with MSC 280, which forms an interface with BSC 275, and BSC 275 controls BS 270 accordingly to transmit forward link signals to mobile terminal 100.
  • the first embodiment of the present invention provides a method for reducing noise in a voice interaction mode of a terminal.
  • the method includes:
  • Step S10 Acquire a voice signal corresponding to the received voice information and a location of the sound source corresponding to the voice signal;
  • the noise reduction process in the voice interaction mode of the terminal in the embodiment of the present invention is preferably applied in the hands-free call mode, and can also be applied in the recording and instant communication voice interaction scenarios in other embodiments of the present invention.
  • the terminal is preferably an electronic device capable of making a voice call, such as a mobile phone or a pad, and the terminal should be equipped with at least the following device: an external speaker for the hands-free function of the call, for checking
  • a microphone array for measuring the position of the sound source the microphone array is preferably four, and the optimal setting position is the upper side and the lower side of the terminal, and one of the left side and the right side is respectively set, because the detection software provided in the related art is for sound pressure
  • the detection accuracy of the level can reach 0.1DBA, and the detection accuracy of the phase difference can reach 0.1 degree. Therefore, when the position changes, the detection software has strong stability and does not need to change the corresponding technology.
  • the terminal is equipped with four or more microphone arrays.
  • the microphone array When the hands-free is turned on, the microphone array detects voice information, and when receiving the voice information, acquires a voice signal corresponding to the voice information and a position of the sound source corresponding to the voice signal. .
  • the voice signal includes a voice signal and a noise signal.
  • the process of acquiring the voice signal corresponding to the location of the sound source in step S10 includes:
  • Step S11 identifying a noise signal from the voice signal
  • Step S12 Acquire a position of the noise signal corresponding to the sound source.
  • the vocal and environmental noise are distinguished, that is, the vocal signal and the noise signal are recognized from the voice signal, and the input signal is screened to determine the vocal and noise. Determine the location of the sound source corresponding to the vocal and noise.
  • step S12 includes:
  • Step S121 acquiring a phase difference and a sound pressure difference of the waveform corresponding to the noise signal
  • Step S122 calculating a three-dimensional spatial position of the sound source corresponding to the noise signal with respect to the terminal according to the phase difference and the sound pressure difference.
  • the three-dimensional spatial position of the positioning human sound source with respect to the terminal and the three-dimensional spatial position of the noise sound source with respect to the terminal are respectively calculated.
  • the input waveform has a position difference with the sound source, the sound pressure level intensity is inversely proportional to the square of the distance, and the phase difference is related to the reception delay.
  • the phase difference and the sound pressure difference are obtained by subtracting the respective input waveforms, according to the sound wave.
  • the velocity formula of propagation in the air can calculate the position distance of the sound source to each MIC. By drawing a circle to find the intersection point, the specific position of the current sound source relative to the terminal in the space can be determined.
  • Step S20 determining a noise reduction parameter of the corresponding voice signal according to the location
  • the noise reduction parameter of the corresponding speech signal is inverse compensation The phase of the sound wave.
  • step S20 may include:
  • Step S21 calculating a phase difference and a delay of the corresponding waveform of the noise signal with respect to the terminal according to the three-dimensional spatial position;
  • Step S22 generating a noise reduction parameter corresponding to the noise signal according to the phase difference and the delay.
  • the distance between the vocal and the noise sound source is calculated, and when there are multiple noise sound sources, the distance between the human voice and each noise sound source is calculated.
  • a plurality of distances are calculated according to the distance between each noise and the vocal sound source and the frequency of the noise source, and the phase difference and delay between the terminal MIC and the human ear position are calculated according to the phase difference and the delay.
  • the noise reduction parameters corresponding to each noise are determined, that is, the phases of the plurality of inverted compensation sound waves are obtained.
  • Step S30 adjusting a corresponding voice signal according to the noise reduction parameter to perform noise reduction on the voice information.
  • the phase of the acoustic wave is inversely compensated, and the phase of the acoustic wave is compensated by adding an inversion to achieve the purpose of noise reduction, thereby effectively avoiding interference of environmental noise.
  • the reverse sound wave equal to the external noise of the near end is generated by the noise reduction system, and the noise is neutralized, thereby realizing the effect of active noise reduction.
  • the corresponding noise reduction parameter is determined according to the sound source position of the voice signal of the received voice information, and the voice information is denoised, thereby realizing active noise reduction in the voice interaction mode, and reducing the generation of the voice interaction process. Noise interference, which enhances the voice interaction.
  • the step S30 may include:
  • Step S23 determining a current state parameter of the terminal
  • Step S24 determining a noise reduction parameter of the corresponding voice signal according to the state parameter and the position.
  • the placement state of the terminal that is, determining the current state parameter of the terminal, for example, the situation of facing up, facing up, and side, and providing data according to instruments such as a gravity sensor and a gyroscope.
  • different orientation data is provided, and the phase of the acoustic wave is compensated for the superimposed inversion, thereby achieving the function of hands-free active noise reduction, and the corresponding speech signal is determined by combining the active state parameters and the position.
  • Noise reduction parameters for example, when the terminal is front, The noise reduction parameter of the corresponding speech signal is determined according to the state parameter and the position of the front side.
  • different terminal states are provided in different placement postures, that is, when the terminals are in different states, and the superposed inverse compensation waveforms are corrected. The noise reduction effect is further improved.
  • the embodiment of the invention further provides a computer readable storage medium storing computer executable instructions for performing the noise reduction method of any of the above.
  • the invention further provides a noise reduction device in a terminal voice interaction mode.
  • FIG. 8 is a schematic diagram of a functional module of a noise reduction device in a voice interaction mode of a terminal according to the present invention, which includes at least an acquisition module 10, a first determination module 20, and a first noise reduction module 30, where
  • the acquiring module 10 is configured to: when receiving the voice information, acquire a voice signal corresponding to the voice information and a location of the voice source corresponding to the voice source;
  • the noise reduction process in the voice interaction mode of the terminal in the embodiment of the present invention is preferably applied in the hands-free call mode, and can also be applied in the recording and instant communication voice interaction scenarios in other embodiments of the present invention.
  • the terminal is preferably an electronic device capable of voice call, such as a mobile phone or a pad, and the terminal should be equipped with at least the following device: an external speaker for the hands-free function of the call, for detecting the position of the sound source.
  • the microphone array the number of the microphone arrays is preferably four, and the optimal setting position is the upper side and the lower side of the terminal, and one of the left side and the right side is respectively set.
  • the detection software for the sound pressure level can be accurately detected by the related art. When the value reaches 0.1DBA, the detection accuracy of the phase difference can reach 0.1 degree. Therefore, when the position changes, the detection software has strong stability and does not need to change the corresponding technology.
  • the terminal is equipped with four or more microphone arrays.
  • the microphone array When the hands-free is turned on, the microphone array detects voice information, and when receiving the voice information, acquires a voice signal corresponding to the voice information and a position of the sound source corresponding to the voice signal. .
  • the voice signal includes a voice signal and a noise signal.
  • the acquiring module 10 includes at least an identifying unit 11 and an obtaining unit 12,
  • the identification unit 11 is configured to acquire a voice signal corresponding to the voice information when the voice information is received, and identify a noise signal from the voice signal;
  • the acquiring unit 12 is configured to acquire a position of the noise signal corresponding to the sound source.
  • the vocal and environmental noise are distinguished according to the general VAD algorithm, that is, the vocal signal and the noise signal are recognized from the voice signal, and the input signal is screened to determine the vocal and noise. Indeed The location of the sound source and noise corresponds to the sound source.
  • the obtaining unit 12 includes at least: an obtaining subunit 121 and a calculating subunit 122,
  • the acquiring subunit 121 is configured to acquire a phase difference and a sound pressure difference of the waveform corresponding to the noise signal;
  • the calculating sub-unit 122 is configured to calculate a three-dimensional spatial position of the sound source corresponding to the noise signal according to the phase difference and the sound pressure difference.
  • the three-dimensional spatial position of the positioning human sound source with respect to the terminal and the three-dimensional spatial position of the noise sound source with respect to the terminal are respectively calculated.
  • the input waveform has a position difference with the sound source, the sound pressure level intensity is inversely proportional to the square of the distance, and the phase difference is related to the reception delay.
  • the phase difference and the sound pressure difference are obtained by subtracting the respective input waveforms, according to the sound wave.
  • the velocity formula of propagation in the air can calculate the position distance of the sound source to each MIC. By drawing a circle to find the intersection point, the specific position of the current sound source relative to the terminal in the space can be determined.
  • the first determining module 20 is configured to determine a noise reduction parameter of the corresponding voice signal according to the location;
  • the noise reduction parameter of the corresponding speech signal is inverse compensation The phase of the sound wave.
  • the first determining module 20 includes at least: a first determining unit 21 and a first generating unit 22,
  • the first determining unit 21 is configured to calculate a phase difference and a delay of the corresponding waveform of the noise signal relative to the terminal according to the three-dimensional spatial position;
  • the first generating unit 22 is configured to generate a noise reduction parameter corresponding to the noise signal according to the phase difference and the delay.
  • the distance between the vocal and the noise sound source is calculated, and when there are multiple noise sound sources, the distance between the human voice and each noise sound source is calculated.
  • a plurality of distances are calculated according to the distance between each noise and the vocal sound source and the frequency of the noise source, and the phase difference and delay between the terminal MIC and the human ear position are calculated according to the phase difference and the delay.
  • Corresponding to the noise reduction parameters of the noise signal Every noise is different because of its location, so The noise reduction parameters are different. When there are multiple noises, the noise reduction parameters corresponding to each noise are determined, that is, the phases of the plurality of inverted compensation sound waves are obtained.
  • the noise reduction module 30 is configured to adjust a corresponding voice signal according to the noise reduction parameter to perform noise reduction on the voice information.
  • the phase of the acoustic wave is inversely compensated, and the phase of the acoustic wave is compensated by adding an inversion to achieve the purpose of noise reduction, thereby effectively avoiding interference of environmental noise.
  • the reverse sound wave equal to the external noise of the near end is generated by the noise reduction system, and the noise is neutralized, thereby realizing the effect of active noise reduction.
  • the corresponding noise reduction parameter is determined according to the sound source position of the voice signal of the received voice information, and the voice information is denoised.
  • the noise reduction in the voice interaction mode is realized, and the noise interference generated during the voice interaction process is reduced, thereby improving the voice interaction effect.
  • the first determining unit 21 is further configured to determine a current state parameter of the terminal, and determine a noise reduction parameter of the corresponding voice signal according to the state parameter and the location.
  • the noise reduction device includes an acquisition module, a second determination module, and a noise reduction module;
  • An acquiring module configured to: when receiving the voice information, acquire a voice signal corresponding to the voice information and a location of the sound source corresponding to the voice signal;
  • a second determining module configured to determine a current state parameter of the terminal, and determine a noise reduction parameter of the corresponding voice signal according to the state parameter and the location;
  • the noise reduction module is configured to adjust the corresponding voice signal according to the determined noise reduction parameter.
  • the obtaining module includes:
  • An identification unit configured to identify a noise signal from the voice signal
  • an acquiring unit configured to acquire a position of the sound source corresponding to the sound source.
  • the obtaining unit includes:
  • Obtaining a subunit configured to obtain a phase difference and a sound pressure difference of the waveform corresponding to the noise signal
  • the calculating subunit is configured to calculate a three-dimensional spatial position of the sound source corresponding to the noise signal according to the phase difference and the sound pressure difference.
  • the second determining module includes:
  • a second determining unit configured to calculate a phase difference and a delay of the corresponding waveform of the noise signal relative to the terminal according to the three-dimensional spatial position; and determine a current state parameter of the terminal;
  • the second generating unit is configured to generate a noise reduction parameter corresponding to the noise signal according to the phase difference and the delay, and the current state parameter of the terminal.
  • the placement state of the terminal that is, determining the current state parameter of the terminal, for example, the situation of facing up, facing up, and side, and providing data according to instruments such as a gravity sensor and a gyroscope.
  • different orientation data is provided, and the phase of the acoustic wave is compensated for the superimposed inversion, thereby achieving the function of hands-free active noise reduction, and the corresponding speech signal is determined by combining the active state parameters and the position.
  • the noise reduction parameter for example, when the terminal is front, determines the noise reduction parameter of the corresponding voice signal according to the state parameter and the position of the front side.
  • different terminal states are provided in different placement postures, that is, when the terminals are in different states, and the superposed inverse compensation waveforms are corrected. The noise reduction effect is further improved.
  • the foregoing embodiment method can be implemented by means of software plus a necessary general hardware platform, and of course, can also be through hardware, but in many cases, the former is better.
  • Implementation Based on such understanding, the technical solution of the present invention, which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk,
  • the optical disc includes a number of instructions for causing a terminal device (which may be a cell phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the methods described in various embodiments of the present invention.
  • the noise reduction method, the noise reduction device, and the computer readable storage medium provided by the embodiment of the present invention include: when receiving the voice information, acquiring the voice signal corresponding to the voice information and the location of the sound source corresponding to the voice signal Determining a noise reduction parameter of the corresponding voice signal according to the position; and adjusting a corresponding voice signal according to the noise reduction parameter to perform noise reduction on the voice information.
  • the embodiment of the invention implements active noise reduction in the voice interaction mode, reduces noise interference generated during the voice interaction process, and improves the voice interaction effect.

Abstract

一种终端语音交互模式下的降噪方法,包括:在接收到语音信息时,获取所述语音信息对应的语音信号及所述语音信号对应声源的位置;根据所述位置确定对应语音信号的降噪参数;根据所述降噪参数调整对应的语音信号,以对所述语音信息进行降噪。本发明还公开一种终端语音交互模式下的降噪装置。本发明实现语音交互模式下的主动降噪,降低语音交互过程中产生噪音干扰,提高语音交互效果。

Description

降噪方法、降噪装置和计算机可读存储介质 技术领域
本发明实施例涉及但不限于终端语音交互模式下的降噪方法及装置计算机可读存储介质。
背景技术
随着终端技术的不断发展,越来越多的终端进入人们的日常生活和工作当中,以移动终端为例,用户可以通过移动终端进入录入语音,或者进行语音通话等语音交互场景。目前的语音交互模式下,主要是通过下行信号降噪,下行信号降噪仅仅是通过对下行信号中的远端噪声进行抑制,对近端的环境噪音是没有用处的。也就是说,在移动终端语音录入或语音通话过程中,对近端的环境噪音并没有进行降噪处理,这样,尤其是在免提模式下进行通话操作时,由于环境噪音干扰,导致较难听清语音通话内容。
因此,由于相关技术中,在终端语音交互模式下,不存在对语音进行主动降噪的方式,导致了语音交互过程产生噪音干扰,影响了语音交互的效果。
发明内容
以下是对本文详细描述的主题的概述。本概述并非是为了限制权利要求的保护范围。
本发明实施例提供了一种降噪方法、降噪装置计算机可读存储介质,能够在语音交互模式下,对语音进行主动降噪的方式,降低语音交互过程产生的噪音干扰,从而提升语音交互效果。
本发明实施例提供的一种降噪方法,包括:获取接收到的语音信息对应的语音信号及所述语音信号对应声源的位置;
根据获得的位置确定对应语音信号的降噪参数;
根据确定的降噪参数调整对应的语音信号。
可选地,所述获取所述语音信号对应声源的位置包括:从所述语音信号中识别出噪音信号;获取所述噪音信号对应声源的位置。
可选地,所述获取所述噪音信号对应声源的位置包括:获取所述噪音信号对应波形的相位差及声压差;
根据获得的相位差及声压差计算所述噪音信号对应的声源相对于终端的三维空间位置。
可选地,所述根据所述位置确定所述语音信号对应的降噪参数包括:
根据所述三维空间位置计算出所述噪音信号对应波形相对于终端的相位差及延时;
根据所述相位差及延时生成对应噪音信号的降噪参数。
可选地,所述根据确定的降噪参数调整对应的语音信号之前还包括:确定终端当前的状态参数;
所述根据所述位置确定所述语音信号对应的降噪参数包括:根据所述状态参数及所述位置确定对应语音信号的降噪参数。
本发明实施例又提供了一种降噪方法,包括获取接收到的语音信息对应的语音信号及所述语音信号对应声源的位置;
根据获得的位置确定对应语音信号的降噪参数;
确定终端当前的状态参数,根据所述状态参数及所述位置确定对应语音信号的降噪参数。
可选地,所述获取所述语音信号对应声源的位置包括:从所述语音信号中识别出噪音信号;获取所述噪音信号对应声源的位置。
可选地,所述获取所述噪音信号对应声源的位置包括:获取所述噪音信号对应波形的相位差及声压差;
根据获得的相位差及声压差计算所述噪音信号对应的声源相对于终端的三维空间位置。
可选地,所述根据所述位置确定所述语音信号对应的降噪参数包括:
根据所述三维空间位置计算出所述噪音信号对应波形相对于终端的相位差及延时;
根据所述相位差及延时生成对应噪音信号的降噪参数。
可选地,所述状态参数天所述终端的摆放状态。
本发明实施例还提供了一种降噪装置,包括获取模块、第一确定模块,及降噪模块;其中,
获取模块,设置为在接收到语音信息时,获取所述语音信息对应的语音信号及所述语音信号对应声源的位置;
第一确定模块,设置为根据获得的位置确定对应语音信号的降噪参数;
降噪模块,设置为根据确定的降噪参数调整对应的语音信号。
可选地,所述获取模块包括:
识别单元,设置为从所述语音信号中识别出噪音信号;
获取单元,设置为获取所述噪音信号对应声源的位置。
可选地,所述获取单元包括:
获取子单元,设置为获取所述噪音信号对应波形的相位差及声压差;
计算子单元,设置为根据所述相位差及声压差计算所述噪音信号对应的声源相对于终端的三维空间位置。
可选地,所述第一确定模块包括:
第一确定单元,设置为根据所述三维空间位置计算出所述噪音信号对应波形相对于终端的相位差及延时;
第一生成单元,设置为根据所述相位差及延时生成对应噪音信号的降噪参数。
可选地,所述第一确定单元,还设置为确定终端当前的状态参数;根据所述状态参数及所述位置确定对应语音信号的降噪参数。
本发明实施例再提供了一种降噪装置,包括获取模块、第二确定模块,及降噪模块;其中,
获取模块,设置为在接收到语音信息时,获取所述语音信息对应的语音信号及所述语音信号对应声源的位置;
第二确定模块,设置为确定终端当前的状态参数,根据所述状态参数及所述位置确定对应语音信号的降噪参数;
降噪模块,设置为确定的降噪参数调整对应的语音信号。
可选地,所述获取模块包括:
识别单元,设置为从所述语音信号中识别出噪音信号;
获取单元,设置为获取所述噪音信号对应声源的位置。
可选地,所述获取单元包括:
获取子单元,设置为获取所述噪音信号对应波形的相位差及声压差;
计算子单元,设置为根据所述相位差及声压差计算所述噪音信号对应的声源相对于终端的三维空间位置。
可选地,所述第二确定模块包括:
第二确定单元,设置为根据所述三维空间位置计算出所述噪音信号对应波形相对于终端的相位差及延时;以及确定终端当前的状态参数;
第二生成单元,设置为根据所述相位差及延时,以及确定终端当前的状态参数生成对应噪音信号的降噪参数。
本发明实施例还提供了一种计算机可读存储介质,存储有计算机可执行指令,所述计算机可执行指令用于执行上述任一项的实现端到端通话加密的方法。
本发明实施例再提供了一种计算机可读存储介质,存储有计算机可执行指令,所述计算机可执行指令用于执行上述任一实现端到端通话加密的方法。
本发明实施例提出的通过根据接收的语音信息的语音信号的声源位置来确定对应的降噪参数,对语音信息进行降噪。实现了语音交互模式下的主动降噪,降低了语音交互过程中产生的噪音干扰,提升了语音交互效果。
本发明实施例的其它特征和优点将在随后的说明书中阐述,并且,部分地从说明书中变得显而易见,或者通过实施本发明而了解。本发明的目的和其他优点可通过在说明书、权利要求书以及附图中所特别指出的结构来实现和获得。
在阅读并理解了附图和详细描述后,可以明白其他方面。
附图概述
此处所说明的附图用来提供对本发明实施例的进一步理解,构成本申请的一部分,本发明的示意性实施例及其说明用于解释本发明,并不构成对本发明的不当限定。在附图中:
图1为实现本发明各个实施例的移动终端的硬件结构示意;
图2为如图1所示的移动终端的无线通信系统示意图;
图3为本发明实施例中终端语音交互模式下的降噪方法的第一实施例的流程示意图;
图4为本发明实施例中获取所述语音信号对应声源的位置一实施例的流程示意图;
图5为本发明实施例中获取所述噪音信号对应声源的位置一实施例的流程示意图;
图6为本发明实施例中根据所述位置确定对应语音信号的降噪参数一实施例的流程示意图;
图7本发明实施例中终端语音交互模式下的降噪方法的第二实施例的流程示意图;
图8为本发明实施例中终端语音交互模式下的降噪装置的实施例的组成结构示意图;
图9为图8中获取模块的一实施例的组成结构示意图;
图10为9中获取单元的一实施例的组成结构示意图;
图11为图8中确定模块的一实施例的组成结构示意图。
本发明的较佳实施方式
为使本发明的目的、技术方案和优点更加清楚明白,下文中将结合附图对本发明的实施例进行详细说明。需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互任意组合。
现在将参考附图描述实现本发明各个实施例的移动终端。在后续的描述中,使用用于表示元件的诸如"模块"、"部件"或"单元"的后缀仅为了有利于本发明的说明,其本身并没有特定的意义。因此,"模块"与"部件"可以混合地 使用。
移动终端可以以各种形式来实施。例如,本发明中描述的终端可以包括诸如移动电话、智能电话、笔记本电脑、数字广播接收器、PDA(个人数字助理)、PAD(平板电脑)、PMP(便携式多媒体播放器)、导航装置等等的移动终端以及诸如数字TV、台式计算机等等的固定终端。下面,假设终端是移动终端。然而,本领域技术人员将理解的是,除了特别用于移动目的的元件之外,根据本发明的实施方式的构造也能够应用于固定类型的终端。
图1为实现本发明各个实施例的移动终端的硬件结构示意。
移动终端100可以包括无线通信单元110、A/V(音频/视频)输入单元120、用户输入单元130、感测单元140、输出单元150、存储器160、接口单元170、控制器180和电源单元190等等。图1示出了具有各种组件的移动终端,但是应理解的是,并不要求实施所有示出的组件。可以替代地实施更多或更少的组件。将在下面详细描述移动终端的元件。
无线通信单元110通常包括一个或多个组件,其允许移动终端100与无线通信系统或网络之间的无线电通信。例如,无线通信单元可以包括广播接收模块111、移动通信模块112、无线互联网模块113、短程通信模块114和位置信息模块115中的至少一个。
广播接收模块111经由广播信道从外部广播管理服务器接收广播信号和/或广播相关信息。广播信道可以包括卫星信道和/或地面信道。广播管理服务器可以是生成并发送广播信号和/或广播相关信息的服务器或者接收之前生成的广播信号和/或广播相关信息并且将其发送给终端的服务器。广播信号可以包括TV广播信号、无线电广播信号、数据广播信号等等。而且,广播信号可以进一步包括与TV或无线电广播信号组合的广播信号。广播相关信息也可以经由移动通信网络提供,并且在该情况下,广播相关信息可以由移动通信模块112来接收。广播信号可以以各种形式存在,例如,其可以以数字多媒体广播(DMB)的电子节目指南(EPG)、数字视频广播手持(DVB-H)的电子服务指南(ESG)等等的形式而存在。广播接收模块111可以通过使用各种类型的广播系统接收信号广播。特别地,广播接收模块111可以通过使用诸如多媒体广播-地面(DMB-T)、数字多媒体广播-卫星(DMB-S)、数字视频广播-手持 (DVB-H),前向链路媒体(MediaFLO@)的数据广播系统、地面数字广播综合服务(ISDB-T)等等的数字广播系统接收数字广播。广播接收模块111可以被构造为适合提供广播信号的各种广播系统以及上述数字广播系统。经由广播接收模块111接收的广播信号和/或广播相关信息可以存储在存储器160(或者其它类型的存储介质)中。
移动通信模块112将无线电信号发送到基站(例如,接入点、节点B等等)、外部终端以及服务器中的至少一个和/或从其接收无线电信号。这样的无线电信号可以包括语音通话信号、视频通话信号、或者根据文本和/或多媒体消息发送和/或接收的各种类型的数据。
无线互联网模块113支持移动终端的无线互联网接入。该模块可以内部或外部地耦接到终端。该模块所涉及的无线互联网接入技术可以包括WLAN(无线LAN)(Wi-Fi)、Wibro(无线宽带)、Wimax(全球微波互联接入)、HSDPA(高速下行链路分组接入)等等。
短程通信模块114是用于支持短程通信的模块。短程通信技术的一些示例包括蓝牙TM、射频识别(RFID)、红外数据协会(IrDA)、超宽带(UWB)、紫蜂TM等等。
位置信息模块115是用于检查或获取移动终端的位置信息的模块。位置信息模块的典型示例是GPS(全球定位系统)。根据当前的技术,GPS模块115计算来自三个或更多卫星的距离信息和准确的时间信息并且对于计算的信息应用三角测量法,从而根据经度、纬度和高度准确地计算三维当前位置信息。当前,用于计算位置和时间信息的方法使用三颗卫星并且通过使用另外的一颗卫星校正计算出的位置和时间信息的误差。此外,GPS模块115能够通过实时地连续计算当前位置信息来计算速度信息。
A/V输入单元120用于接收音频或视频信号。A/V输入单元120可以包括相机121和麦克风1220,相机121对在视频捕获模式或图像捕获模式中由图像捕获装置获得的静态图片或视频的图像数据进行处理。处理后的图像帧可以显示在显示单元151上。经相机121处理后的图像帧可以存储在存储器160(或其它存储介质)中或者经由无线通信单元110进行发送,可以根据移动终端的构造提供两个或更多相机1210。麦克风122可以在电话通话模式、记录模式、语 音识别模式等等运行模式中经由麦克风接收声音(音频数据),并且能够将这样的声音处理为音频数据。处理后的音频(语音)数据可以在电话通话模式的情况下转换为可经由移动通信模块112发送到移动通信基站的格式输出。麦克风122可以实施各种类型的噪声消除(或抑制)算法以消除(或抑制)在接收和发送音频信号的过程中产生的噪声或者干扰。
用户输入单元130可以根据用户输入的命令生成键输入数据以控制移动终端的各种操作。用户输入单元130允许用户输入各种类型的信息,并且可以包括键盘、锅仔片、触摸板(例如,检测由于被接触而导致的电阻、压力、电容等等的变化的触敏组件)、滚轮、摇杆等等。特别地,当触摸板以层的形式叠加在显示单元151上时,可以形成触摸屏。
感测单元140检测移动终端100的当前状态,(例如,移动终端100的打开或关闭状态)、移动终端100的位置、用户对于移动终端100的接触(即,触摸输入)的有无、移动终端100的取向、移动终端100的加速或减速移动和方向等等,并且生成用于控制移动终端100的操作的命令或信号。例如,当移动终端100实施为滑动型移动电话时,感测单元140可以感测该滑动型电话是打开还是关闭。另外,感测单元140能够检测电源单元190是否提供电力或者接口单元170是否与外部装置耦接。感测单元140可以包括接近传感器1410将在下面结合触摸屏来对此进行描述。
接口单元170用作至少一个外部装置与移动终端100连接可以通过的接口。例如,外部装置可以包括有线或无线头戴式耳机端口、外部电源(或电池充电器)端口、有线或无线数据端口、存储卡端口、用于连接具有识别模块的装置的端口、音频输入/输出(I/O)端口、视频I/O端口、耳机端口等等。识别模块可以是存储用于验证用户使用移动终端100的各种信息并且可以包括用户识别模块(UIM)、客户识别模块(SIM)、通用客户识别模块(USIM)等等。另外,具有识别模块的装置(下面称为"识别装置")可以采取智能卡的形式,因此,识别装置可以经由端口或其它连接装置与移动终端100连接。接口单元170可以用于接收来自外部装置的输入(例如,数据信息、电力等等)并且将接收到的输入传输到移动终端100内的一个或多个元件或者可以用于在移动终端和外部装置之间传输数据。
另外,当移动终端100与外部底座连接时,接口单元170可以用作允许通 过其将电力从底座提供到移动终端100的路径或者可以用作允许从底座输入的各种命令信号通过其传输到移动终端的路径。从底座输入的各种命令信号或电力可以用作用于识别移动终端是否准确地安装在底座上的信号。输出单元150被构造为以视觉、音频和/或触觉方式提供输出信号(例如,音频信号、视频信号、警报信号、振动信号等等)。输出单元150可以包括显示单元151、音频输出模块152、警报单元153等等。
显示单元151可以显示在移动终端100中处理的信息。例如,当移动终端100处于电话通话模式时,显示单元151可以显示与通话或其它通信(例如,文本消息收发、多媒体文件下载等等)相关的用户界面(UI)或图形用户界面(GUI)。当移动终端100处于视频通话模式或者图像捕获模式时,显示单元151可以显示捕获的图像和/或接收的图像、示出视频或图像以及相关功能的UI或GUI等等。
同时,当显示单元151和触摸板以层的形式彼此叠加以形成触摸屏时,显示单元151可以用作输入装置和输出装置。显示单元151可以包括液晶显示器(LCD)、薄膜晶体管LCD(TFT-LCD)、有机发光二极管(OLED)显示器、柔性显示器、三维(3D)显示器等等中的至少一种。这些显示器中的一些可以被构造为透明状以允许用户从外部观看,这可以称为透明显示器,典型的透明显示器可以例如为TOLED(透明有机发光二极管)显示器等等。根据特定想要的实施方式,移动终端100可以包括两个或更多显示单元(或其它显示装置),例如,移动终端可以包括外部显示单元(未示出)和内部显示单元(未示出)。触摸屏可用于检测触摸输入压力以及触摸输入位置和触摸输入面积。
音频输出模块152可以在移动终端处于呼叫信号接收模式、通话模式、记录模式、语音识别模式、广播接收模式等等模式下时,将无线通信单元110接收的或者在存储器160中存储的音频数据转换音频信号并且输出为声音。而且,音频输出模块152可以提供与移动终端100执行的特定功能相关的音频输出(例如,呼叫信号接收声音、消息接收声音等等)。音频输出模块152可以包括扬声器、蜂鸣器等等。
警报单元153可以提供输出以将事件的发生通知给移动终端100。典型的事件可以包括呼叫接收、消息接收、键信号输入、触摸输入等等。除了音频或视频输出之外,警报单元153可以以不同的方式提供输出以通知事件的发 生。例如,警报单元153可以以振动的形式提供输出,当接收到呼叫、消息或一些其它进入通信(incoming communication)时,警报单元153可以提供触觉输出(即,振动)以将其通知给用户。通过提供这样的触觉输出,即使在用户的移动电话处于用户的口袋中时,用户也能够识别出各种事件的发生。警报单元153也可以经由显示单元151或音频输出模块152提供通知事件的发生的输出。
存储器160可以存储由控制器180执行的处理和控制操作的软件程序等等,或者可以暂时地存储己经输出或将要输出的数据(例如,电话簿、消息、静态图像、视频等等)。而且,存储器160可以存储关于当触摸施加到触摸屏时输出的各种方式的振动和音频信号的数据。
存储器160可以包括至少一种类型的存储介质,所述存储介质包括闪存、硬盘、多媒体卡、卡型存储器(例如,SD或DX存储器等等)、随机访问存储器(RAM)、静态随机访问存储器(SRAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、可编程只读存储器(PROM)、磁性存储器、磁盘、光盘等等。而且,移动终端100可以与通过网络连接执行存储器160的存储功能的网络存储装置协作。
控制器180通常控制移动终端的总体操作。例如,控制器180执行与语音通话、数据通信、视频通话等等相关的控制和处理。另外,控制器180可以包括用于再现(或回放)多媒体数据的多媒体模块1810,多媒体模块1810可以构造在控制器180内,或者可以构造为与控制器180分离。控制器180可以执行模式识别处理,以将在触摸屏上执行的手写输入或者图片绘制输入识别为字符或图像。
电源单元190在控制器180的控制下接收外部电力或内部电力并且提供操作各元件和组件所需的适当的电力。
这里描述的各种实施方式可以以使用例如计算机软件、硬件或其任何组合的计算机可读介质来实施。对于硬件实施,这里描述的实施方式可以通过使用特定用途集成电路(ASIC)、数字信号处理器(DSP)、数字信号处理装置(DSPD)、可编程逻辑装置(PLD)、现场可编程门阵列(FPGA)、处理器、控制器、微控制器、微处理器、被设计为执行这里描述的功能的电子单元中的 至少一种来实施,在一些情况下,这样的实施方式可以在控制器180中实施。对于软件实施,诸如过程或功能的实施方式可以与允许执行至少一种功能或操作的单独的软件模块来实施。软件代码可以由以任何适当的编程语言编写的软件应用程序(或程序)来实施,软件代码可以存储在存储器160中并且由控制器180执行。
至此,己经按照其功能描述了移动终端。下面,为了简要起见,将描述诸如折叠型、直板型、摆动型、滑动型移动终端等等的各种类型的移动终端中的滑动型移动终端作为示例。因此,本发明能够应用于任何类型的移动终端,并且不限于滑动型移动终端。
如图1中所示的移动终端100可以被构造为利用经由帧或分组发送数据的诸如有线和无线通信系统以及基于卫星的通信系统来操作。
现在将参考图2描述其中根据本发明的移动终端能够操作的通信系统。
这样的通信系统可以使用不同的空中接口和/或物理层。例如,由通信系统使用的空中接口包括例如频分多址(FDMA)、时分多址(TDMA)、码分多址(CDMA)和通用移动通信系统(UMTS)(特别地,长期演进(LTE))、全球移动通信系统(GSM)等等。作为非限制性示例,下面的描述涉及CDMA通信系统,但是这样的教导同样适用于其它类型的系统。
参考图2,CDMA无线通信系统可以包括多个移动终端100、多个基站(BS)270、基站控制器(BSC)275和移动交换中心(MSC)2800MSC280被构造为与公共电话交换网络(PSTN)290形成接口。MSC280还被构造为与可以经由回程线路耦接到基站270的BSC275形成接口。回程线路可以根据若干己知的接口中的任一种来构造,所述接口包括例如E1/T1、ATM,IP、PPP、帧中继、HDSL、ADSL或xDSL。将理解的是,如图2中所示的系统可以包括多个BSC2750。
每个BS270可以服务一个或多个分区(或区域),由多向天线或指向特定方向的天线覆盖的每个分区放射状地远离BS270。或者,每个分区可以由用于分集接收的两个或更多天线覆盖。每个BS270可以被构造为支持多个频率分配,并且每个频率分配具有特定频谱(例如,1.25MHz,5MHz等等)。
分区与频率分配的交叉可以被称为CDMA信道。BS270也可以被称为基 站收发器子系统(BTS)或者其它等效术语。在这样的情况下,术语"基站"可以用于笼统地表示单个BSC275和至少一个BS270。基站也可以被称为"蜂窝站"。或者,特定BS270的各分区可以被称为多个蜂窝站。
如图2中所示,广播发射器(BT)295将广播信号发送给在系统内操作的移动终端100。如图1中所示的广播接收模块111被设置在移动终端100处以接收由BT295发送的广播信号。在图2中,示出了几个全球定位系统(GPS)卫星300。卫星300帮助定位多个移动终端100中的至少一个。
在图2中,描绘了多个卫星300,但是理解的是,可以利用任何数目的卫星获得有用的定位信息。如图1中所示的GPS模块115通常被构造为与卫星300配合以获得想要的定位信息。替代GPS跟踪技术或者在GPS跟踪技术之外,可以使用可以跟踪移动终端的位置的其它技术。另外,至少一个GPS卫星300可以选择性地或者额外地处理卫星DMB传输。
作为无线通信系统的一个典型操作,BS270接收来自各种移动终端100的反相链路信号。移动终端100通常参与通话、消息收发和其它类型的通信。特定基站270接收的每个反相链路信号被在特定BS270内进行处理。获得的数据被转发给相关的BSC275。BSC提供通话资源分配和包括BS270之间的软切换过程的协调的移动管理功能。BSC275还将接收到的数据路由到MSC280,其提供用于与PSTN290形成接口的额外的路由服务。类似地,PSTN290与MSC280形成接口,MSC与BSC275形成接口,并且BSC275相应地控制BS270以将正向链路信号发送到移动终端100。
如图3所示,本发明第一实施例提出一种终端语音交互模式下的降噪方法,在接收到语音信息时,包括:
步骤S10,获取接收到的语音信息对应的语音信号及所述语音信号对应声源的位置;
本发明实施例中的终端语音交互模式下的降噪过程优选为应用在免提通话模式下,在本发明其他实施例中也还可以应用在录音、即时通信语音交互场景中。
在本实施例中,所述终端优选为手机、pad等能进行语音通话的电子设备,所述终端应至少装配如下器件:用于通话免提功能的外放喇叭,用于检 测声源位置的麦克风阵列,所述麦克风阵列优选为4个,最佳设置位置为终端的上侧、下侧,左侧和右侧分别设置一个,由于相关技术中提供的检测软件对于声压级的检测精度可以达到0.1DBA,相位差的检测精度可达0.1度,所以位置变动时,检测软件具有较强的稳定性,不需要变动相应的技术。终端装备四个或四个以上的麦克风阵列,在开启免提时,麦克风阵列检测语音信息,在接收到语音信息时,获取所述语音信息对应的语音信号及所述语音信号对应声源的位置。其中,语音信号包括人声信号及噪音信号。
具体的,参考图4,步骤S10中的所述获取语音信号对应声源的位置的过程包括:
步骤S11,从所述语音信号中识别出噪音信号;
步骤S12,获取所述噪音信号对应声源的位置。
根据通用的人声侦测算法(VAD)区分出人声与环境噪音,即从所述语音信号中识别出人声信号及噪音信号,对录入的信号进行筛选,判断出人声和噪音。确定人声及噪音对应声源的位置。
具体的,参考图5,步骤S12包括:
步骤S121,获取所述噪音信号对应波形的相位差及声压差;
步骤S122,根据所述相位差及声压差计算所述噪音信号对应的声源相对于终端的三维空间位置。
通过计算麦克风阵列录入波形的相位差及声压差,分别计算定位人声音源相对于终端的三维空间位置,噪声音源相对于终端的三维空间位置。各录入波形由于和声源存在位置差异,声压级强度和距离的平方成反比,而相位差则是和接收延时有关,通过各个录入波形相减,得到相位差和声压差,根据声波在空气中传播的速度公式,可算出声源对各个MIC的位置距离,通过画圆寻找交点,可判断当前声源相对于终端位于空间中的具体位置。
步骤S20,根据所述位置确定对应语音信号的降噪参数;
根据所述位置确定对应语音信号的降噪参数,在存在多个噪音信号时,根据噪音频谱序列进行归类,逐一提取,分别叠加反相补偿声波,即,所述降噪参数为反相补偿声波的相位。
具体的,参考图6,步骤S20可以包括:
步骤S21,根据所述三维空间位置计算出所述噪音信号对应波形相对于终端的相位差及延时;
步骤S22,根据所述相位差及延时生成对应噪音信号的降噪参数。
在确定人声声源和噪声声源相对于终端的三维空间位置后,计算得到人声和噪音声源的距离,在存在多个噪音声源时,计算人声与各个噪音声源的距离得到多个距离,根据各个噪音与人声声源的距离及噪声音源的频率,计算出终端录入波形在终端MIC和人耳位置之间的相位差及延时,根据所述相位差和延时生成对应噪音信号的降噪参数。每个噪音因位置不同,所以得到的降噪参数均不同,在存在多个噪音时,确定每个噪音对应的降噪参数,即,得到多个反相补偿声波的相位。
步骤S30,根据所述降噪参数调整对应的语音信号,以对所述语音信息进行降噪。
在所述降噪参数为反相补偿声波的相位,通过加入反相补偿声波的相位,达到降噪的目的,有效避免环境噪音的干扰。本发明实施例通过降噪系统产生与近端外界噪音相等的反向声波,将噪音中和,从而实现了主动降噪的效果。
本实施例通过根据接收的语音信息的语音信号的声源位置来确定对应的降噪参数,对语音信息进行了降噪,实现了语音交互模式下的主动降噪,降低了语音交互过程中产生噪音干扰,从而提升了语音交互效果。
参考图7,提出本发明终端语音交互模式下的降噪方法的第二实施例,基于上述终端语音交互模式下的降噪方法的第一实施例,所述步骤S30可以包括:
步骤S23,确定终端当前的状态参数;
步骤S24,根据所述状态参数及所述位置确定对应语音信号的降噪参数。
在本实施例中,需要注意终端的摆放状态,即确定终端当前的状态参数,例如,是正面朝上、反面朝上、侧方的情况,可根据重力传感器、陀螺仪等仪器提供的数据,在不同的摆放姿态下,提供不同的方位数据,进而对叠加的反相补偿声波的相位,从而达到免提主动降噪的功能,通过结合主动的状态参数及所述位置确定对应语音信号的降噪参数,例如,在终端为正面时, 根据正面这个状态参数及位置确定对应语音信号的降噪参数。本实施例在不同的摆放姿态下,即,在终端处于不同的状态下,提供不同的终端状态,进而对叠加的反相补偿波形进行了修正。进一步提高了降噪效果。
本发明实施例还提供了一种计算机可读存储介质,存储有计算机可执行指令,所述计算机可执行指令用于执行上述任一项的降噪方法。
本发明进一步提供一种终端语音交互模式下的降噪装置。
参照图8,图8为本发明终端语音交互模式下的降噪装置较佳实施例的功能模块示意图,至少包括:获取模块10、第一确定模块20以及第一降噪模块30,其中,
所述获取模块10,设置为在接收到语音信息时,获取所述语音信息对应的语音信号及所述语音信号对应声源的位置;
本发明实施例中的终端语音交互模式下的降噪过程优选为应用在免提通话模式下,在本发明其他实施例中也还可以应用在录音、即时通信语音交互场景中。
在本实施例中,所述终端优选为手机、pad等能进行语音通话的电子设备,所述终端应至少装配如下器件:用于通话免提功能的外放喇叭,用于检测声源位置的麦克风阵列,所述麦克风阵列优选为4个,最佳设置位置为终端的上侧、下侧,左侧和右侧分别设置一个,由于相关技术中提供的检测软件对于声压级的检测精度可以达到0.1DBA,相位差的检测精度可达0.1度,所以位置变动时,检测软件具有较强的稳定性,不需要变动相应的技术。终端装备四个或四个以上的麦克风阵列,在开启免提时,麦克风阵列检测语音信息,在接收到语音信息时,获取所述语音信息对应的语音信号及所述语音信号对应声源的位置。其中,语音信号包括人声信号及噪音信号。
参考图9,所述获取模块10至少包括识别单元11和获取单元12,
所述识别单元11,设置为在接收到语音信息时,获取所述语音信息对应的语音信号,从所述语音信号中识别出噪音信号;
所述获取单元12,设置为获取所述噪音信号对应声源的位置。
根据通用的VAD算法区分出人声与环境噪音,即从所述语音信号中识别出人声信号及噪音信号,对录入的信号进行筛选,判断出人声和噪音。确 定人声及噪音对应声源的位置。
具体的,参考图10,所述获取单元12至少包括:获取子单元121和计算子单元122,
所述获取子单元121,设置为获取所述噪音信号对应波形的相位差及声压差;
所述计算子单元122,设置为根据所述相位差及声压差计算所述噪音信号对应的声源相对于终端的三维空间位置。
通过计算麦克风阵列录入波形的相位差及声压差,分别计算定位人声音源相对于终端的三维空间位置,噪声音源相对于终端的三维空间位置。各录入波形由于和声源存在位置差异,声压级强度和距离的平方成反比,而相位差则是和接收延时有关,通过各个录入波形相减,得到相位差和声压差,根据声波在空气中传播的速度公式,可算出声源对各个MIC的位置距离,通过画圆寻找交点,可判断当前声源相对于终端位于空间中的具体位置。
如图8所示,第一确定模块20,设置为根据所述位置确定对应语音信号的降噪参数;
根据所述位置确定对应语音信号的降噪参数,在存在多个噪音信号时,根据噪音频谱序列进行归类,逐一提取,分别叠加反相补偿声波,即,所述降噪参数为反相补偿声波的相位。
具体的,参考图11,所述第一确定模块20至少包括:第一确定单元21和第一生成单元22,
所述第一确定单元21,设置为根据所述三维空间位置计算出所述噪音信号对应波形相对于终端的相位差及延时;
所述第一生成单元22,设置为根据所述相位差及延时生成对应噪音信号的降噪参数。
在确定人声声源和噪声声源相对于终端的三维空间位置后,计算得到人声和噪音声源的距离,在存在多个噪音声源时,计算人声与各个噪音声源的距离得到多个距离,根据各个噪音与人声声源的距离及噪声音源的频率,计算出终端录入波形在终端MIC和人耳位置之间的相位差及延时,根据所述相位差和延时生成对应噪音信号的降噪参数。每个噪音因位置不同,所以得到 的降噪参数均不同,在存在多个噪音时,确定每个噪音对应的降噪参数,即,得到多个反相补偿声波的相位。
如图8所示,降噪模块30,设置为根据所述降噪参数调整对应的语音信号,以对所述语音信息进行降噪。
在所述降噪参数为反相补偿声波的相位,通过加入反相补偿声波的相位,达到降噪的目的,有效避免环境噪音的干扰。本发明实施例通过降噪系统产生与近端外界噪音相等的反向声波,将噪音中和,从而实现了主动降噪的效果。
本实施例通过根据接收的语音信息的语音信号的声源位置来确定对应的降噪参数,对语音信息进行了降噪。实现了语音交互模式下的降噪,降低了语音交互过程中产生噪音干扰,从而提升了语音交互效果。
可选地,所述第一确定单元21,还设置为确定终端当前的状态参数;根据所述状态参数及所述位置确定对应语音信号的降噪参数。
在本发明实施例的另一降噪装置实施例中,降噪装置包括获取模块、第二确定模块,及降噪模块;其中,
获取模块,设置为在接收到语音信息时,获取所述语音信息对应的语音信号及所述语音信号对应声源的位置;
第二确定模块,设置为确定终端当前的状态参数,根据所述状态参数及所述位置确定对应语音信号的降噪参数;
降噪模块,设置为根据确定的降噪参数调整对应的语音信号。
其中,
所述获取模块包括:
识别单元,设置为从所述语音信号中识别出噪音信号;
获取单元,设置为获取所述噪音信号对应声源的位置。
所述获取单元包括:
获取子单元,设置为获取所述噪音信号对应波形的相位差及声压差;
计算子单元,设置为根据所述相位差及声压差计算所述噪音信号对应的声源相对于终端的三维空间位置。
其中,所述第二确定模块包括:
第二确定单元,设置为根据所述三维空间位置计算出所述噪音信号对应波形相对于终端的相位差及延时;以及确定终端当前的状态参数;
第二生成单元,设置为根据所述相位差及延时,以及终端当前的状态参数生成对应噪音信号的降噪参数。
在本实施例中,需要注意终端的摆放状态,即确定终端当前的状态参数,例如,是正面朝上、反面朝上、侧方的情况,可根据重力传感器、陀螺仪等仪器提供的数据,在不同的摆放姿态下,提供不同的方位数据,进而对叠加的反相补偿声波的相位,从而达到免提主动降噪的功能,通过结合主动的状态参数及所述位置确定对应语音信号的降噪参数,例如,在终端为正面时,根据正面这个状态参数及位置确定对应语音信号的降噪参数。本实施例在不同的摆放姿态下,即,在终端处于不同的状态下,提供不同的终端状态,进而对叠加的反相补偿波形进行修正。进一步提高了降噪效果。
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者装置不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者装置所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者装置中还存在另外的相同要素。
上述本发明实施例序号仅仅为了描述,不代表实施例的优劣。
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本发明各个实施例所述的方法。
以上仅为本发明的优选实施例,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所作的等效结构或等效流程变换,或直接或间 接运用在其他相关的技术领域,均同理包括在本发明的专利保护范围内。
工业实用性
本发明实施例提出的降噪方法、降噪装置和计算机可读存储介质,其方法包括:在接收到语音信息时,获取所述语音信息对应的语音信号及所述语音信号对应声源的位置;根据所述位置确定对应语音信号的降噪参数;根据所述降噪参数调整对应的语音信号,以对所述语音信息进行降噪。本发明实施例实现了语音交互模式下的主动降噪,降低了语音交互过程中产生的噪音干扰,从而提升了语音交互效果。

Claims (20)

  1. 一种降噪方法,包括:获取接收到的语音信息对应的语音信号及所述语音信号对应声源的位置;
    根据获得的位置确定对应语音信号的降噪参数;
    根据确定的降噪参数调整对应的语音信号。
  2. 根据权利要求1所述的降噪方法,其中,所述获取所述语音信号对应声源的位置包括:从所述语音信号中识别出噪音信号;获取所述噪音信号对应声源的位置。
  3. 根据权利要求2所述的降噪方法,其中,所述获取所述噪音信号对应声源的位置包括:获取所述噪音信号对应波形的相位差及声压差;
    根据获得的相位差及声压差计算所述噪音信号对应的声源相对于终端的三维空间位置。
  4. 根据权利要求3所述的降噪方法,其中,所述根据所述位置确定所述语音信号对应的降噪参数包括:
    根据所述三维空间位置计算出所述噪音信号对应波形相对于终端的相位差及延时;
    根据所述相位差及延时生成对应噪音信号的降噪参数。
  5. 根据权利要求1至4任一项所述的降噪方法,所述根据确定的降噪参数调整对应的语音信号之前还包括:确定终端当前的状态参数;
    所述根据所述位置确定所述语音信号对应的降噪参数包括:根据所述状态参数及所述位置确定对应语音信号的降噪参数。
  6. 一种降噪方法,包括获取接收到的语音信息对应的语音信号及所述语音信号对应声源的位置;
    根据获得的位置确定对应语音信号的降噪参数;
    确定终端当前的状态参数,根据所述状态参数及所述位置确定对应语音信号的降噪参数。
  7. 根据权利要求6所述的降噪方法,其特征在于,其中,所述获取所述 语音信号对应声源的位置包括:从所述语音信号中识别出噪音信号;获取所述噪音信号对应声源的位置。
  8. 根据权利要求7所述的降噪方法,其中,所述获取所述噪音信号对应声源的位置包括:获取所述噪音信号对应波形的相位差及声压差;
    根据获得的相位差及声压差计算所述噪音信号对应的声源相对于终端的三维空间位置。
  9. 根据权利要求8所述的降噪方法,其中,所述根据所述位置确定所述语音信号对应的降噪参数包括:
    根据所述三维空间位置计算出所述噪音信号对应波形相对于终端的相位差及延时;
    根据所述相位差及延时生成对应噪音信号的降噪参数。
  10. 根据权利要求6所述的方法,其特征在于,所述状态参数天所述终端的摆放状态。
  11. 一种降噪装置,包括获取模块、第一确定模块,及降噪模块;其中,
    获取模块,设置为在接收到语音信息时,获取所述语音信息对应的语音信号及所述语音信号对应声源的位置;
    第一确定模块,设置为根据获得的位置确定对应语音信号的降噪参数;
    降噪模块,设置为根据确定的降噪参数调整对应的语音信号。
  12. 根据权利要求11所述的降噪装置,其中,所述获取模块包括:
    识别单元,设置为从所述语音信号中识别出噪音信号;
    获取单元,设置为获取所述噪音信号对应声源的位置。
  13. 根据权利要求12所述的降噪装置,其中,所述获取单元包括:
    获取子单元,设置为获取所述噪音信号对应波形的相位差及声压差;
    计算子单元,设置为根据所述相位差及声压差计算所述噪音信号对应的声源相对于终端的三维空间位置。
  14. 根据权利要求13所述的降噪装置,其中,所述第一确定模块包括:
    第一确定单元,设置为根据所述三维空间位置计算出所述噪音信号对应 波形相对于终端的相位差及延时;
    第一生成单元,设置为根据所述相位差及延时生成对应噪音信号的降噪参数。
  15. 根据权利要求14所述的降噪装置,其特征在于,所述第一确定单元,还设置为确定终端当前的状态参数;根据所述状态参数及所述位置确定对应语音信号的降噪参数。
  16. 一种降噪装置,包括获取模块、第二确定模块,及降噪模块;其中,
    获取模块,设置为在接收到语音信息时,获取所述语音信息对应的语音信号及所述语音信号对应声源的位置;
    第二确定模块,设置为确定终端当前的状态参数,根据所述状态参数及所述位置确定对应语音信号的降噪参数;
    降噪模块,设置为确定的降噪参数调整对应的语音信号。
  17. 根据权利要求16所述的降噪装置,其中,所述获取模块包括:
    识别单元,设置为从所述语音信号中识别出噪音信号;
    获取单元,设置为获取所述噪音信号对应声源的位置。
  18. 根据权利要求17所述的降噪装置,其中,所述获取单元包括:
    获取子单元,设置为获取所述噪音信号对应波形的相位差及声压差;
    计算子单元,设置为根据所述相位差及声压差计算所述噪音信号对应的声源相对于终端的三维空间位置。
  19. 根据权利要求18所述的降噪装置,其中,所述第二确定模块包括:
    第二确定单元,设置为根据所述三维空间位置计算出所述噪音信号对应波形相对于终端的相位差及延时;以及确定终端当前的状态参数;
    第二生成单元,设置为根据所述相位差及延时,以及确定终端当前的状态参数生成对应噪音信号的降噪参数。
  20. 一种计算机可读存储介质,存储有计算机可执行指令,所述计算机可执行指令用于执行权1~权5,和/或权6~权10任一项的实现端到端通话加密的方法。
PCT/CN2016/083032 2015-05-26 2016-05-23 降噪方法、降噪装置和计算机可读存储介质 WO2016188394A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510274680.0A CN104967717B (zh) 2015-05-26 2015-05-26 终端语音交互模式下的降噪方法及装置
CN201510274680.0 2015-05-26

Publications (1)

Publication Number Publication Date
WO2016188394A1 true WO2016188394A1 (zh) 2016-12-01

Family

ID=54221653

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/083032 WO2016188394A1 (zh) 2015-05-26 2016-05-23 降噪方法、降噪装置和计算机可读存储介质

Country Status (2)

Country Link
CN (1) CN104967717B (zh)
WO (1) WO2016188394A1 (zh)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113808564A (zh) * 2020-06-12 2021-12-17 青岛海尔电冰箱有限公司 厨房降噪方法、冰箱及计算机可读存储介质
CN114176623A (zh) * 2021-12-21 2022-03-15 深圳大学 声音降噪方法、系统、降噪设备及计算机可读存储介质
CN115359803A (zh) * 2022-10-21 2022-11-18 中诚华隆计算机技术有限公司 一种基于芯片实现的语音降噪优化方法及装置

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104967717B (zh) * 2015-05-26 2016-09-28 努比亚技术有限公司 终端语音交互模式下的降噪方法及装置
CN107146628A (zh) * 2017-04-07 2017-09-08 宇龙计算机通信科技(深圳)有限公司 一种语音通话处理方法及移动终端
CN107527615B (zh) * 2017-09-13 2021-01-15 联想(北京)有限公司 信息处理方法、装置、设备、系统及服务器
CN108564961A (zh) * 2017-11-29 2018-09-21 华北计算技术研究所(中国电子科技集团公司第十五研究所) 一种移动通信设备的语音降噪方法
CN107945814A (zh) * 2017-11-29 2018-04-20 华北计算技术研究所(中国电子科技集团公司第十五研究所) 一种语音处理方法
CN112185354A (zh) * 2020-09-17 2021-01-05 浙江同花顺智能科技有限公司 一种语音文本的显示方法、装置、设备及存储介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103079148A (zh) * 2012-12-28 2013-05-01 中兴通讯股份有限公司 一种终端双麦克风降噪的方法及装置
EP2775738A1 (en) * 2013-03-07 2014-09-10 Nokia Corporation Orientation free handsfree device
US20150003626A1 (en) * 2013-02-25 2015-01-01 Max Sound Corporation Active noise cancellation method for automobiles
CN104575510A (zh) * 2015-02-04 2015-04-29 深圳酷派技术有限公司 降噪方法、降噪装置和终端
CN104967717A (zh) * 2015-05-26 2015-10-07 努比亚技术有限公司 终端语音交互模式下的降噪方法及装置

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9246543B2 (en) * 2011-12-12 2016-01-26 Futurewei Technologies, Inc. Smart audio and video capture systems for data processing systems
CN105103219B (zh) * 2013-11-11 2019-08-09 赵春宁 降低噪音的方法
CN104301537A (zh) * 2014-10-15 2015-01-21 龙旗电子(惠州)有限公司 一种降噪手机及降噪方法

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103079148A (zh) * 2012-12-28 2013-05-01 中兴通讯股份有限公司 一种终端双麦克风降噪的方法及装置
US20150003626A1 (en) * 2013-02-25 2015-01-01 Max Sound Corporation Active noise cancellation method for automobiles
EP2775738A1 (en) * 2013-03-07 2014-09-10 Nokia Corporation Orientation free handsfree device
CN104575510A (zh) * 2015-02-04 2015-04-29 深圳酷派技术有限公司 降噪方法、降噪装置和终端
CN104967717A (zh) * 2015-05-26 2015-10-07 努比亚技术有限公司 终端语音交互模式下的降噪方法及装置

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113808564A (zh) * 2020-06-12 2021-12-17 青岛海尔电冰箱有限公司 厨房降噪方法、冰箱及计算机可读存储介质
CN113808564B (zh) * 2020-06-12 2024-03-19 青岛海尔电冰箱有限公司 厨房降噪方法、冰箱及计算机可读存储介质
CN114176623A (zh) * 2021-12-21 2022-03-15 深圳大学 声音降噪方法、系统、降噪设备及计算机可读存储介质
CN114176623B (zh) * 2021-12-21 2023-09-12 深圳大学 声音降噪方法、系统、降噪设备及计算机可读存储介质
CN115359803A (zh) * 2022-10-21 2022-11-18 中诚华隆计算机技术有限公司 一种基于芯片实现的语音降噪优化方法及装置

Also Published As

Publication number Publication date
CN104967717B (zh) 2016-09-28
CN104967717A (zh) 2015-10-07

Similar Documents

Publication Publication Date Title
WO2016188394A1 (zh) 降噪方法、降噪装置和计算机可读存储介质
WO2016034055A1 (zh) 移动终端及其操作方法、计算机存储介质
WO2016173468A1 (zh) 组合操作方法和装置、触摸屏操作方法及电子设备
WO2017107597A1 (zh) 灵敏度校准方法、装置及移动终端、存储介质
WO2016029766A1 (zh) 移动终端及其操作方法和计算机存储介质
WO2017071481A1 (zh) 一种移动终端及其实现分屏的方法
CN106488376B (zh) 一种对移动终端的音频元件进行故障诊断的方法和装置
WO2017020771A1 (zh) 终端控制装置及方法
WO2016188379A1 (zh) 一种信息处理方法及装置、终端、存储介质
WO2017071310A1 (zh) 视频通话系统、装置和方法
WO2016173395A1 (zh) 启动音效的方法、终端设备及计算机存储介质
WO2016058458A1 (zh) 电池电量的管理方法、移动终端和计算机存储介质
WO2017143855A1 (zh) 具有截屏功能的装置和截屏方法
WO2016155509A1 (zh) 移动终端的握持方式判断方法及装置
CN106157970B (zh) 一种音频识别方法以及终端
WO2017088631A1 (zh) 一种移动终端及其增减调节方法及装置、存储介质
WO2017143854A1 (zh) 一种移动终端及控制音量的方法、计算机可读存储介质
WO2016188495A1 (zh) 语音切换方法、终端、服务器及系统、存储介质
WO2017185863A1 (zh) 一种音频控制方法及终端、计算机存储介质
CN105975753A (zh) 计算用户移动速度的方法及移动终端
WO2016155425A1 (zh) 数据发送操作撤销方法、装置及计算机存储介质
WO2017113893A1 (zh) 一种搜网的方法及装置
CN105430258A (zh) 一种自拍合影的方法和装置
CN104951229A (zh) 屏幕截图方法和装置
WO2018032917A1 (zh) 一种移动终端、获取对焦值的方法以及计算机可读存储介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16799282

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16799282

Country of ref document: EP

Kind code of ref document: A1