CN111356908B

CN111356908B - Noise reduction method and terminal

Info

Publication number: CN111356908B
Application number: CN201780097001.4A
Authority: CN
Inventors: 曹军; 张玉磊; 王成永; 桂明建
Original assignee: Shenzhen Transsion Communication Co Ltd
Current assignee: Shenzhen Transsion Communication Co Ltd
Priority date: 2017-09-29
Filing date: 2017-09-29
Publication date: 2022-05-24
Anticipated expiration: 2037-09-29
Also published as: CN111356908A; WO2019061323A1

Abstract

The embodiment of the invention discloses a noise reduction method and a terminal, wherein the method comprises the following steps: the method comprises the steps that a terminal detects a target distance, wherein the target distance comprises the distance from a mouth of a user to a main microphone of the terminal and the distance from the mouth of the user to a secondary microphone of the terminal; the terminal is matched with a reference value corresponding to the target distance, a plurality of distances are prestored in the terminal, each distance in the plurality of distances corresponds to a reference value of the terminal, the reference values are preset differences used for representing sound signals, and the plurality of distances comprise the target distance; and the terminal filters the environmental noise according to an actual value and the reference value, wherein the actual value is a difference value between the sound signal of the main microphone and the sound signal of the auxiliary microphone. The embodiment of the invention can improve the noise reduction performance.

Description

Noise reduction method and terminal

Technical Field

The invention relates to the technical field of computers, in particular to a noise reduction method and a terminal.

Background

In daily communication of people, voice communication is an indispensable communication mode, such as telephone communication, sending voice messages and the like. In the process of voice communication by a user through a terminal, a microphone is responsible for recording voice information of the user, and because the microphone records noise of the environment where the user is located while recording the voice information of the user, the noise of the environment where the user is located is always a non-negligible factor influencing the voice communication quality. In order to solve this problem, the skilled person proposes a dual-microphone noise reduction method, and the principle of dual-microphone noise reduction is as follows: the terminal is provided with two microphones, one microphone is a main microphone close to the mouth of the user, the intensity of the collected human voice signal is higher, the other microphone is an auxiliary microphone far away from the mouth of the user, and the intensity of the collected human voice signal is lower; the strength of the two human voice signals collected by the terminal is approximately different by 6dB, and the strength of the collected ambient noise is almost the same, so that the ambient noise collected by the main microphone and the ambient noise collected by the auxiliary microphone can be offset by subtracting the strength of the sound signal collected by the auxiliary microphone from the strength of the sound signal collected by the main microphone, and the noise reduction is realized; in addition, if the signal strength difference generated by the subtraction is 6dB, the signal strength difference of 6dB belongs to human voice.

In the process of actually applying the dual-microphone noise reduction method, the strength difference of the human voice signals received by the two microphones is probably not 6dB, and at this time, if the noise reduction is performed according to the above principle, the noise reduction performance may be greatly reduced.

Disclosure of Invention

The embodiment of the invention provides a noise reduction method and a terminal, which can improve the noise reduction performance.

In a first aspect, an embodiment of the present invention provides a noise reduction method, where the method includes:

the terminal detects a target distance, wherein the target distance comprises the distance from the mouth of a user to a main microphone of the terminal and/or comprises the distance from the mouth of the user to a secondary microphone of the terminal;

the terminal is matched with a reference value corresponding to the target distance, a plurality of distances are prestored in the terminal, a reference value is respectively corresponding to each distance in the plurality of distances, the reference value is a preset difference value used for representing sound signals, and the plurality of distances comprise the target distance;

the terminal filters the environmental noise according to an actual value and the reference value, wherein the actual value is a difference value between the sound signal of the main microphone and the sound signal of the auxiliary microphone.

In a second aspect, an embodiment of the present invention provides a terminal, where the terminal includes a unit configured to perform the method of the first aspect.

In a third aspect, an embodiment of the present invention provides another terminal, where the terminal includes a processor and a memory, where the processor and the memory are connected to each other, where the memory is used to store program instructions, and the processor is used to call the program instructions in the memory to execute the method of the first aspect.

In a fourth aspect, embodiments of the present invention provide a computer-readable storage medium storing program instructions that, when executed by a processor, cause the processor to perform the method of the first aspect.

In the embodiment of the present invention, a plurality of distances are pre-stored in the terminal, and each of the plurality of distances corresponds to a reference value, that is, the reference value used for filtering the human voice changes with the distance from the mouth of the user to the microphone (including the main microphone and the sub-microphone) of the terminal, compared to the prior art that the same reference value is used in any case, the noise reduction performance is better, and the filtered human voice is more realistic.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the description of the embodiments are briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without creative efforts.

Fig. 1 is a schematic flow chart of a noise reduction method according to an embodiment of the present invention;

fig. 2 is a scene schematic diagram of a target distance measuring method according to an embodiment of the present invention;

fig. 3 is a schematic view of a scene of another method for measuring a target distance according to an embodiment of the present invention;

fig. 4 is a schematic structural diagram of a terminal according to an embodiment of the present invention;

fig. 5 is a schematic structural diagram of another terminal according to an embodiment of the present invention;

fig. 6 is a schematic structural diagram of another terminal according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

It will be understood that the terms "comprises" and/or "comprising," when used in this specification and the appended claims, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.

It is also to be understood that the terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used in the specification of the present invention and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise.

It should be further understood that the term "and/or" as used in this specification and the appended claims refers to and includes any and all possible combinations of one or more of the associated listed items.

As used in this specification and the appended claims, the term "if" may be interpreted contextually as "when … …" or "upon" or "in response to a determination" or "in response to a detection". Similarly, the phrase "if it is determined" or "if a [ described condition or event ] is detected" may be interpreted contextually to mean "upon determining" or "in response to determining" or "upon detecting [ described condition or event ]" or "in response to detecting [ described condition or event ]".

In particular implementations, the terminals described in embodiments of the invention include, but are not limited to, other portable devices such as mobile phones, laptop computers, or tablet computers having touch sensitive surfaces (e.g., touch screen displays and/or touch pads). It should also be understood that in some embodiments, the device is not a portable communication device, but is a desktop computer having a touch-sensitive surface (e.g., a touch screen display and/or a touchpad).

In the discussion that follows, a terminal that includes a display and a touch-sensitive surface is described. However, it should be understood that the terminal may include one or more other physical user interface devices such as a physical keyboard, mouse, and/or joystick.

The terminal supports various applications, such as one or more of the following: a drawing application, a presentation application, a word processing application, a website creation application, a disc burning application, a spreadsheet application, a gaming application, a telephone application, a video conferencing application, an email application, an instant messaging application, an exercise support application, a photo management application, a digital camera application, a web browsing application, a digital music player application, and/or a digital video player application.

Various applications that may be executed on the terminal may use at least one common physical user interface device, such as a touch-sensitive surface. One or more functions of the touch-sensitive surface and corresponding information displayed on the terminal can be adjusted and/or changed between applications and/or within respective applications. In this way, a common physical architecture (e.g., touch-sensitive surface) of the terminal can support various applications with user interfaces that are intuitive and transparent to the user.

In the embodiment of the present invention, the terminal may include various devices such as a Mobile phone, a tablet computer, a Personal Digital Assistant (PDA), a Mobile Internet Device (MID), and an intelligent wearable Device (e.g., an intelligent watch and an intelligent bracelet), which are not limited in the embodiment of the present invention.

Referring to fig. 1, a schematic flow chart of a noise reduction method according to an embodiment of the present invention is shown, where the method may include:

101. the terminal detects the target distance.

The terminal comprises a main microphone and a secondary microphone, wherein the main microphone collects (or detects) the voice of the person and the sound of the surrounding environment obviously, the secondary microphone collects the sound of the surrounding environment obviously but does not collect the voice of the person obviously, so that the voice can be filtered according to the voice signals collected by the main microphone and the voice signals collected by the secondary microphone, for example, when the terminal is a mobile phone, the voice can be filtered during the hands-free conversation. The target distance in the embodiment of the present invention includes a distance from the mouth of the user to the primary microphone of the terminal and includes a distance from the mouth of the user to the secondary microphone of the terminal.

Optionally, the terminal obtains the mouth image of the user through the camera, specifically: the terminal matches the acquired distance corresponding to the mouth image of the user, the terminal prestores a plurality of mouth images, and each mouth image in the plurality of mouth images corresponds to its own distance. Optionally, the camera of the terminal acquires the mouth image of the user in real time, and if the mouth image of the user changes in size and/or displacement, the terminal re-matches the distance corresponding to the changed mouth image of the user, and takes the distance corresponding to the changed mouth image of the user re-matched by the terminal as the target distance. By acquiring the mouth image of the user in real time, the problem that the target distance is inaccurate to acquire due to the movement of the user can be avoided, and the noise reduction performance is improved.

The terminal can detect the target distance through various schemes, and four alternatives are listed below.

In the first scheme, the terminal detects the distance from the mouth of the user to the terminal through an infrared sensor, and the detected distance from the mouth of the user to the terminal can be referred to as a target distance. Fig. 2 is a scene schematic diagram of a target distance measuring method according to an embodiment of the present invention. An infrared sensor of the terminal emits infrared rays, the infrared rays are shielded by the mouth of a user in the transmission process and are reflected, and the infrared sensor receives the reflected infrared rays; the terminal calculates the target distance based on the time difference between the emission of the infrared ray and the reception of the infrared ray and the propagation velocity of the infrared ray. Optionally, the terminal re-emits the infrared rays at preset time intervals (e.g., 1 second, 2 seconds, 3 seconds, etc.); after the terminal re-receives the infrared ray, calculating the time difference between re-emission and re-reception of the infrared ray again; when the recalculated time difference and the previous time difference change, the terminal calculates a distance based on the recalculated time difference and the infrared ray propagation speed, and the distance calculated based on the recalculated time difference and the infrared ray propagation speed is a target distance. Therefore, the problem that the target distance is inaccurate to obtain due to the movement of a user can be avoided by re-emitting the infrared rays at preset intervals, and the noise reduction performance is improved.

A second scheme, the terminal has at least one secondary microphone, the terminal determines the secondary microphone with the largest difference between the distance from the mouth of the user in the at least one secondary microphone and the distance from the primary microphone of the terminal to the mouth of the user as a target secondary microphone, the target distance includes the distance from the mouth of the user to the primary microphone of the terminal and the distance from the mouth of the user to the target secondary microphone, and the terminal has a plurality of secondary microphones and detects the distance from the mouth of the user to each secondary microphone in the at least one secondary microphone and the distance from the mouth of the user to the primary microphone of the terminal; the target distance includes a distance from the mouth of the user to a primary microphone of the terminal and a distance from the mouth of the user to a target secondary microphone, and the target secondary microphone is a secondary microphone of the plurality of secondary microphones whose difference between the distance from the mouth of the user and the distance from the primary microphone of the terminal to the mouth of the user is the largest. Fig. 3 is a schematic view of a scene of another target distance measuring method according to an embodiment of the present invention. Fig. 3 includes a main microphone and a plurality of sub microphones, and fig. 3 illustrates an example in which the plurality of sub microphones includes a sub microphone 1 and a sub microphone 2. It can be seen that, by determining the target secondary microphone, the problem of weakening human voice signals caused by noise reduction processing is avoided when the distance from the mouth of the user to the primary microphone of the terminal is equivalent to the distance from the mouth of the user to the secondary microphone of the terminal, so that the noise reduction performance is improved.

In a third aspect, the secondary microphone of the terminal may be movable, and if a difference between a distance from the mouth of the user to the primary microphone of the terminal and a distance from the mouth of the user to the secondary microphone of the terminal is not greater than a second predetermined value, the terminal adjusts a position of the secondary microphone such that a difference between the distance from the mouth of the user to the primary microphone of the terminal and a distance from the mouth of the user to the secondary microphone of the terminal is greater than the second predetermined value. Wherein, the second preset value can be any one of the equidistance values of 0.5 cm, 1 cm and 1.5 cm. Therefore, by adjusting the position of the secondary microphone, the problem of weakening human voice signals caused by noise reduction processing under the condition that the distance from the mouth of the user to the main microphone of the terminal is equivalent to the distance from the mouth of the user to the secondary microphone of the terminal is avoided, and the noise reduction performance is improved.

According to the fourth scheme, the terminal acquires the mouth state of the user through a camera; if the mouth state of the user is a motion state and the sound signals acquired by the main microphone and the auxiliary microphone are weaker than a third preset value, the terminal outputs prompt information to prompt the user to input sound again, and the third preset threshold value is a preset reference contrast value; the actual value is the difference between the re-input sound signal captured by the primary microphone and the re-input sound signal captured by the secondary microphone. When the sound signals acquired by the main microphone and the auxiliary microphone are weaker than a third preset value, the sound volume value of the sound signals input by the user is too small, and if the noise reduction processing is carried out, the human sound signals are easy to eliminate. Therefore, by the method for outputting the prompt information, the user can be prompted to re-input the sound signal under the condition that the volume value of the input sound signal is too small, the problem that the noise signal is not accurately eliminated due to the fact that the volume value of the sound signal input by the user is too small can be avoided, and the noise reduction performance is improved.

102. And the terminal is matched with the reference value corresponding to the target distance.

Specifically, the terminal prestores a plurality of distances, and each distance in the plurality of distances corresponds to a reference value, the reference value is a difference value which is preset and used for representing the sound signals, the plurality of distances comprises the target distance, for example, in practical application, when the difference value between the sound signal of the main microphone and the sound signal of the auxiliary microphone is found to be 6db, the human voice can be well separated and divided, and then the reference value can be set to be 6 db; as another example, in practical applications, it is found that when the difference between the sound signal of the main microphone and the sound signal of the sub microphone is 7db, the divided voice can be well separated, and then the reference value can be set to 7 db. In summary, the reference value belongs to a value set in advance for reference comparison.

103. The terminal filters out the environmental noise according to the actual value and the reference value.

Wherein the actual value is a difference between the detected sound signal of the primary microphone and the detected sound signal of the secondary microphone. Optionally, if the absolute value of the difference between the actual value and the reference value is smaller than a first preset value (e.g., 0dB, 0.5dB, 1dB, 1.5dB, etc.), the terminal regards the actual value as a useful signal. This first preset threshold value can be seen out for the value that is used for reference contrast that sets up in advance, and through the aforesaid mode, the terminal can be according to this actual value and this reference value filtering ambient noise, has avoided the inaccurate problem of filtering ambient noise that arouses according to the poor filtering ambient noise of the human voice signal intensity that fixed main microphone and vice microphone acquireed, promotes the performance of making an uproar.

Optionally, if a difference between a distance from the mouth of the user to the primary microphone of the terminal and a distance from the mouth of the user to the secondary microphone of the terminal is greater than a second preset value, the terminal filters the ambient noise according to the actual value and the reference value, and the second preset threshold value is a preset value for reference comparison; and if the difference between the distance from the mouth of the user to the main microphone of the terminal and the distance from the mouth of the user to the auxiliary microphone of the terminal is not more than a second preset value, the terminal takes the sound acquired by the main microphone as a useful signal. It should be noted that, in the case that the distance from the mouth of the user to the main microphone of the terminal is equivalent to the distance from the mouth of the user to the sub-microphone of the terminal, the sound signals acquired by the main microphone and the sub-microphone are similar, and if the noise reduction processing is performed, the problem of weakening or even eliminating the human sound signal can be caused.

In the method shown in fig. 1, a plurality of distances are pre-stored in the terminal, and each distance in the plurality of distances corresponds to a reference value, namely, the reference value used for filtering the human voice changes along with the change of the distance from the mouth of the user to the microphone (including the main microphone and the auxiliary microphone) of the terminal, compared with the prior art that the same reference value is used in any case, the noise reduction performance is better, and the filtered human voice is more real.

The method of embodiments of the present invention is described above and the apparatus of embodiments of the present invention is further described below.

Fig. 4 is a schematic block diagram of a terminal according to an embodiment of the present invention. The terminal comprises a detection unit 401, a matching unit 402 and a filtering unit 403, each of which is described below.

The detection unit 401 is configured to detect a target distance, which includes a distance from the mouth of the user to the primary microphone of the terminal and a distance from the mouth of the user to the secondary microphone of the terminal.

The matching unit 402 is configured to match a reference value corresponding to the target distance, where the matching unit 402 prestores a plurality of distances, and each of the plurality of distances corresponds to a reference value, where the reference values are preset to represent differences between sound signals, and the plurality of distances includes the target distance.

The filtering unit 403 is configured to filter the ambient noise according to an actual value and the reference value, where the actual value is a difference between the sound signal of the primary microphone and the sound signal of the secondary microphone.

In another alternative, the filtering unit 403 filters the environmental noise according to the actual value and the reference value, specifically: if the absolute value of the difference between the actual value and the reference value is smaller than a first predetermined value, the filtering unit 403 uses the actual value as a useful signal. Therefore, the problem that the strength difference of human voice signals acquired by the main microphone and the auxiliary microphone is not 6dB due to the fact that the position of the terminal is not fixed can be solved, and the noise reduction performance is improved.

In another alternative, the filtering unit 403 filters the environmental noise according to the actual value and the reference value, specifically: if the difference between the distance from the mouth of the user to the primary microphone of the terminal and the distance from the mouth of the user to the secondary microphone of the terminal is greater than a second preset value, the filtering unit 403 filters the ambient noise according to the actual value and the reference value; if the difference between the distance from the mouth of the user to the main microphone of the terminal and the distance from the mouth of the user to the sub-microphone of the terminal is not greater than a second preset value, the filtering unit 403 uses the sound obtained by the main microphone as a useful signal. It should be understood that, in the case that the distance from the mouth of the user to the main microphone of the terminal is equivalent to the distance from the mouth of the user to the sub-microphone of the terminal, the sound signals obtained by the main microphone and the sub-microphone are similar, and if the noise reduction processing is performed, the problem of weakening of the human voice signal can be caused.

In another optional scheme, the detecting unit 401 detects the target distance, specifically: firstly, acquiring an image of a mouth of a user through a camera; then, the distance corresponding to the acquired mouth image of the user is matched, a plurality of mouth images are prestored in the terminal, each piece of mouth image information in the plurality of mouth images corresponds to its own distance, and the distance corresponding to the mouth image of the user acquired by the detection unit 401 is the target distance.

In another optional scheme, the detecting unit 401 detects the target distance, specifically: and detecting the distance from the mouth of the user to the terminal through an infrared sensor, wherein the distance from the mouth of the user to the terminal is a target distance. In another optional scheme, the terminal includes multiple microphones, and the detection unit 401 detects the target distance, specifically: detecting a distance from the user's mouth to each of a plurality of sub-microphones and a distance from the user's mouth to a primary microphone of the terminal; the target distance includes a distance from the mouth of the user to a primary microphone of the terminal and a distance from the mouth of the user to a target secondary microphone, and the target secondary microphone is a secondary microphone of the plurality of secondary microphones whose difference between the distance from the mouth of the user and the distance from the primary microphone of the terminal to the mouth of the user is the largest. It can be seen that, by determining the target secondary microphone by the detection unit 401, the problem of weakening human voice signals caused by noise reduction processing when the distance from the mouth of the user to the primary microphone of the terminal is equivalent to the distance from the mouth of the user to the secondary microphone of the terminal is avoided, so as to improve the performance of noise reduction.

In yet another optional scheme, the terminal further includes a prompting unit 404, where the prompting unit 404 is configured to, after the detecting unit 401 detects the target distance, obtain a state of the mouth of the user through a camera before the matching unit 402 matches the reference value corresponding to the target distance; if the mouth state of the user is a motion state and the sound signals acquired by the primary microphone and the secondary microphone are weaker than a third preset value, the prompt unit 404 outputs prompt information to prompt the user to input sound again; the actual value is the difference between the re-input sound signal captured by the primary microphone and the re-input sound signal captured by the secondary microphone. Therefore, by the method for outputting the prompt message, the problem of inaccurate noise signal elimination caused by the fact that the volume value of the sound signal input by the user is too small can be avoided, and the noise reduction performance is improved.

It should be noted that the implementation of each operation may also correspond to the corresponding description of the method embodiment shown in fig. 1.

In the terminal 40 shown in fig. 4, a plurality of distances are pre-stored in the terminal, and each of the plurality of distances has a corresponding reference value, that is, the reference value used for filtering human voice varies with the distance from the mouth of the user to the microphones (including the primary microphone and the secondary microphone) of the terminal, compared to the prior art that the same reference value is used in any case, the noise reduction performance is better, and the filtered human voice is more realistic.

Referring to fig. 5, a schematic block diagram of another terminal provided in the embodiment of the present invention is shown. The terminal in the present embodiment shown in fig. 5 may include: one or more processors 501; one or more input devices 502, one or more output devices 503, and memory 505. The processor 501, the input device 502, the output device 503, and the memory 505 are connected by a bus 505. The memory 502 is used to store instructions.

The Processor 501 may be a Central Processing Unit (CPU), and may be other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), an off-the-shelf Programmable Gate Array (FPGA) or other Programmable logic device, discrete Gate or transistor logic, discrete hardware components, etc. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.

The input device 502 may include a touch screen, a fingerprint sensor (for collecting fingerprint information of a user and direction information of a fingerprint), an infrared sensor, a camera, etc., for example, the input device 502 in some operations below refers to a camera, the input device 502 in some operations below refers to an infrared sensor, etc., and the output device 503 may include a display (LCD, etc., or display screen), a speaker, a camera, etc., for example, the output device 503 in some operations below refers to a display screen, the output device 503 in some operations below refers to a speaker, etc.

The memory 505 may include both read-only memory and random access memory and provides instructions and data to the processor 501. A portion of memory 505 may also include non-volatile random access memory. For example, memory 505 may also store information of the device type.

The processor 501 is operable to execute program instructions stored by the memory 505 to perform the following operations:

detecting a target distance through the input device 502, the target distance including a distance from a mouth of the user to a primary microphone of the terminal and including a distance from the mouth of the user to a secondary microphone of the terminal; the input device 502 may be a camera for acquiring an image of the mouth of the user to obtain a target distance, and the input device 502 may also be an infrared sensor for obtaining the target distance by an infrared distance measurement principle.

The memory 505 is pre-stored with a plurality of distances, and each of the plurality of distances corresponds to a reference value, which is preset to represent a difference between the sound signals, and the plurality of distances includes the target distance.

And filtering out the environmental noise according to an actual value and the reference value, wherein the actual value is a difference value between the sound signal of the main microphone and the sound signal of the auxiliary microphone.

In yet another alternative, the processor 501 filters the environmental noise according to the actual value and the reference value, and may be: and if the absolute value of the difference value between the actual value and the reference value is smaller than a first preset value, the terminal takes the actual value as a useful signal. Therefore, the problem that the strength difference of the human voice signals acquired by the main microphone and the auxiliary microphone is not fixed due to the fact that the position of the terminal is not fixed can be solved, and the noise reduction performance is improved.

In yet another alternative, the processor 501 filters the ambient noise according to the actual value and the reference value, and may further be: if the difference between the distance from the mouth of the user to the primary microphone of the terminal and the distance from the mouth of the user to the secondary microphone of the terminal is greater than a second preset value, the processor 501 filters the ambient noise according to the actual value and the reference value; if the difference between the distance from the mouth of the user to the primary microphone of the terminal and the distance from the mouth of the user to the secondary microphone of the terminal is not greater than a second preset value, the processor 501 takes the sound obtained by the primary microphone as a useful signal. It should be understood that, in the case that the distance from the mouth of the user to the main microphone of the terminal is equivalent to the distance from the mouth of the user to the sub-microphone of the terminal, the sound signals obtained by the main microphone and the sub-microphone are similar, and if the noise reduction processing is performed, the problem of weakening of the human voice signal can be caused.

In yet another alternative, the processor 501 detects a target distance through the input device 502 (the input device 502 is a camera here), specifically: acquiring a mouth image of a user through a camera; the distance corresponding to the acquired mouth image of the user is matched, a plurality of mouth images are prestored in the memory 505, each piece of mouth image information in the plurality of mouth images corresponds to its own distance, and the acquired distance corresponding to the mouth image of the user is the target distance.

In yet another alternative, the processor 501 detects a target distance through the input device 502 (the input device 502 is an infrared sensor here), specifically: the processor 501 detects the distance from the mouth of the user to the terminal, which is the target distance, through an infrared sensor. In yet another alternative, the terminal includes at least one, specifically, may include multiple microphones, and the processor 501 detects the target distance through the input device 502 (the input device 502 may be an infrared sensor, a camera, or another device capable of detecting the target distance), specifically: determining a target secondary microphone as a secondary microphone, wherein the secondary microphone with the largest difference between the distance from the at least one secondary microphone to the mouth of the user and the distance from the primary microphone of the terminal to the mouth of the user is the largest, and the target distance comprises the distance from the mouth of the user to the primary microphone of the terminal and the distance from the mouth of the user to the target secondary microphone, and particularly, the distance from the mouth of the user to each secondary microphone of the plurality of secondary microphones and the distance from the mouth of the user to the primary microphone of the terminal can be detected; the target distance includes a distance from the mouth of the user to a primary microphone of the terminal and a distance from the mouth of the user to a target secondary microphone, and the target secondary microphone is a secondary microphone of the plurality of secondary microphones whose difference between the distance from the mouth of the user and the distance from the primary microphone of the terminal to the mouth of the user is the largest. It can be seen that, by determining the target secondary microphone by the detection unit 401, the problem of weakening human voice signals caused by noise reduction processing when the distance from the mouth of the user to the primary microphone of the terminal is equivalent to the distance from the mouth of the user to the secondary microphone of the terminal is avoided, so as to improve the performance of noise reduction.

In another optional solution, the terminal includes multiple microphones, and the processor 501 detects the target distance through the input device 502, specifically: after the processor 501 detects the target distance through the input device 502 (the input device 502 may be an infrared sensor, a camera, or another device capable of detecting the target distance), before the processor 501 matches the reference value corresponding to the target distance, the method further includes: acquiring the state of the mouth of a user through a camera; if the mouth state of the user is a motion state and the sound signals acquired by the main microphone and the auxiliary microphone are weaker than a third preset value, outputting prompt information to prompt the user to input sound again; the actual value is the difference between the re-input sound signal captured by the primary microphone and the re-input sound signal captured by the secondary microphone. Therefore, by the method for outputting the prompt message, the problem of inaccurate noise signal elimination caused by the fact that the volume value of the sound signal input by the user is too small can be avoided, and the noise reduction performance is improved.

It should be noted that, the implementation of each operation may also correspond to the corresponding description of the method embodiment shown in fig. 1.

In the terminal 50 shown in fig. 5, a plurality of distances are pre-stored in the terminal, and each of the plurality of distances has a reference value, namely, the reference value used for filtering the human voice changes along with the distance from the mouth of the user to the microphone (including the main microphone and the sub-microphone) of the terminal, compared with the prior art that the same reference value is used in any case, the noise reduction performance is better, and the filtered human voice is more real.

As shown in fig. 6, fig. 6 is a schematic structural diagram of another terminal (e.g., a mobile phone) provided in the embodiment of the present invention. The terminal 600 may include: baseband chip 610, memory 616 (one or more computer-readable storage media), Radio Frequency (RF) module 616, and peripheral system 617. These components may communicate over one or more communication buses 614.

The peripheral system 617 mainly serves to implement an interactive function between the terminal 610 and a user/external environment, and mainly includes input and output devices of the terminal 600. In a specific implementation, the peripheral system 617 may include: a touch screen (which may also be a touch display screen) controller 618, a camera controller 619, an audio controller 620, and a sensor management module 621. Each controller may be coupled to a corresponding peripheral device (e.g., a touch screen 623, a camera 624, an audio circuit 626, and a sensor 626). It should be noted that the peripheral system 617 may also include other I/O peripherals.

The baseband chip 610 may integrally include: one or more processors 611, a clock module 622, and a power management module 613. The clock module 622 integrated in the baseband chip 610 is mainly used for generating clocks required for data transmission and timing control for the processor 611. The power management module 613 integrated in the baseband chip 610 is mainly used to provide stable and high-precision voltage for the processor 611, the rf module 616 and peripheral systems.

A Radio Frequency (RF) module 616 for receiving and transmitting RF signals, mainly integrates a receiver and a transmitter of the terminal 600. Radio Frequency (RF) module 616 communicates with communication networks and other communication devices via radio frequency signals. In particular implementations, the Radio Frequency (RF) module 616 may include, but is not limited to: an antenna system, an RF transceiver, one or more amplifiers, a tuner, one or more oscillators, a digital signal processor, a CODEC chip, a SIM card, a storage medium, and the like. In some embodiments, the Radio Frequency (RF) module 616 may be implemented on a separate chip.

A memory 616 is coupled to the processor 611 for storing various software programs and/or sets of instructions. In particular implementations, memory 616 may include high-speed random access memory and may also include non-volatile memory, such as one or more magnetic disk storage devices, flash memory devices, or other non-volatile solid-state storage devices. The memory 616 may store an operating system (hereinafter referred to simply as a system), such as an embedded operating system like ANDROID, IOS, WINDOWS, or LINUX. The memory 616 may also store network communication programs that may be used to communicate with one or more additional devices, one or more terminal devices, and one or more network devices. The memory 616 may also store a user interface program, which may vividly display the content of the application program through a graphical operation interface, and receive the control operation of the application program from the user through input controls such as menus, dialog boxes, and buttons.

The memory 616 may also store one or more application programs. As shown in fig. 6, these applications may include: social applications (e.g., Facebook), image management applications (e.g., photo album), shopping applications (e.g., Taobao), browsers (e.g., Safari, Google Chrome), and so forth.

Processor 611 may be used to read and execute computer readable instructions herein. In particular, the processor 611 may be configured to call a program stored in the memory 616, for example, a program for implementing the noise reduction method provided in the present application, and execute instructions contained in the program, thereby implementing the method embodiment shown in fig. 1.

It should be understood that terminal 600 is only one example provided by embodiments of the present invention and that terminal 600 may have more or less components than shown, may combine two or more components, or may have a different configuration implementation of components. In another embodiment of the present invention, a chip system is provided, which includes at least one processor, a memory and an interface circuit, the memory, the transceiver and the at least one processor being interconnected by a line, the at least one memory having instructions stored therein; when the instructions are executed by the processor, the method of the embodiment shown in fig. 1 is implemented.

In another embodiment of the invention, a computer program product is provided, on which the method of the embodiment shown in fig. 1 is implemented when the computer program product is run on a processor.

In a further embodiment of the invention, a computer-readable storage medium is provided, which stores a computer program which, when executed by a processor, implements the method of the embodiment shown in fig. 1.

The computer readable storage medium may be an internal storage unit of the terminal according to any of the foregoing embodiments, for example, a hard disk or a memory of the terminal. The computer readable storage medium may also be an external storage device of the terminal, such as a plug-in hard disk provided on the terminal, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), and the like. Further, the computer readable storage medium may also include both an internal storage unit and an external storage device of the terminal. The computer-readable storage medium is used for storing the computer program and other programs and data required by the terminal. The computer readable storage medium may also be used to temporarily store data that has been output or is to be output.

Those of ordinary skill in the art will appreciate that the elements and algorithm steps of the examples described in connection with the embodiments disclosed herein may be embodied in electronic hardware, computer software, or combinations of both, and that the components and steps of the examples have been described in a functional general in the foregoing description for the purpose of illustrating clearly the interchangeability of hardware and software. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.

It can be clearly understood by those skilled in the art that, for convenience and simplicity of description, the specific working process of the terminal and the unit described above may refer to the corresponding process in the foregoing method embodiment, and details are not described herein again.

In the several embodiments provided in the present invention, it should be understood that the disclosed terminal and method may be implemented in other manners. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the unit is only one logical functional division, and other divisions may be realized in practice, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may also be an electrical, mechanical or other form of connection.

The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiments of the present invention.

In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.

The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention essentially contributes to the prior art, or all or part of the technical solution can be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.

While the invention has been described with reference to specific embodiments, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1. A method of noise reduction, comprising:

the method comprises the steps that a terminal detects a target distance, wherein the target distance comprises the distance from a mouth of a user to a main microphone of the terminal and the distance from the mouth of the user to a secondary microphone of the terminal;

the terminal is matched with a reference value corresponding to the target distance, a plurality of distances are prestored in the terminal, each distance in the plurality of distances corresponds to a reference value, the reference values are preset differences used for representing sound signals, and the plurality of distances comprise the target distance;

the terminal filters environmental noise according to an actual value and the reference value, wherein the actual value is a difference value between the sound signal of the main microphone and the sound signal of the auxiliary microphone;

wherein, vice microphone is at least one vice microphone, the terminal detection target distance includes:

determining a secondary microphone of the at least one secondary microphone, which has a largest difference between a distance to the mouth of the user and a distance to the mouth of the user from a primary microphone of the terminal, as a target secondary microphone, wherein the target distance includes the distance to the mouth of the user from the primary microphone of the terminal and the distance to the target secondary microphone from the mouth of the user;

and if the difference between the distance from the mouth of the user to the main microphone of the terminal and the distance from the mouth of the user to the target auxiliary microphone of the terminal is smaller than or equal to a second preset value, the terminal adjusts the position of the target auxiliary microphone so that the difference between the distance from the mouth of the user to the main microphone of the terminal and the distance from the mouth of the user to the target auxiliary microphone of the terminal is larger than the second preset value.

2. The method of claim 1, wherein the terminal filters the ambient noise according to the actual value and the reference value, comprising:

and if the absolute value of the difference value between the actual value and the reference value is smaller than a first preset value, the terminal takes the actual value as a useful signal.

3. The method of claim 1, wherein the terminal filters the ambient noise according to the actual value and the reference value, and comprises:

if the difference between the distance from the mouth of the user to the main microphone of the terminal and the distance from the mouth of the user to the auxiliary microphone of the terminal is larger than a second preset value, filtering the environmental noise by the terminal according to an actual value and the reference value;

and if the difference between the distance from the mouth of the user to the main microphone of the terminal and the distance from the mouth of the user to the auxiliary microphone of the terminal is not larger than a second preset value, the terminal takes the sound acquired by the main microphone as a useful signal.

4. The method according to any one of claims 1 to 3, wherein the terminal detects a target distance, comprising:

the terminal acquires an image of the mouth of the user through a camera;

the terminal matches the acquired distance corresponding to the mouth image of the user, the terminal prestores a plurality of mouth images, each mouth image in the mouth images corresponds to the own distance, and the acquired distance corresponding to the mouth image of the user is the target distance.

5. The method according to any one of claims 1 to 3, wherein the terminal detects a target distance, and comprises:

the terminal detects the distance from the mouth of the user to the terminal through an infrared sensor, and the distance from the mouth of the user to the terminal is a target distance.

6. The method according to any one of claims 1 to 3, wherein after the terminal detects a target distance and before the terminal matches a reference value corresponding to the target distance, further comprising:

the terminal acquires the state of the mouth of the user through a camera;

and if the mouth state of the user is a motion state and the sound signals acquired by the main microphone and the auxiliary microphone are weaker than a third preset value, the terminal outputs prompt information to prompt the user to input sound again.

7. A terminal, characterized in that it comprises means for performing the method of any of claims 1 to 6.

8. A terminal comprising a processor and a memory, wherein the memory is configured to store program instructions and the processor is configured to invoke the program instructions to perform the method of any of claims 1 to 6.

9. A computer-readable storage medium, characterized in that the computer storage medium stores program instructions that, when executed by a processor, cause the processor to perform the method of any of claims 1 to 6.