CN110364176A - Audio signal processing method and device - Google Patents

Audio signal processing method and device Download PDF

Info

Publication number
CN110364176A
CN110364176A CN201910774202.4A CN201910774202A CN110364176A CN 110364176 A CN110364176 A CN 110364176A CN 201910774202 A CN201910774202 A CN 201910774202A CN 110364176 A CN110364176 A CN 110364176A
Authority
CN
China
Prior art keywords
signal
road
area
terminal device
sound
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910774202.4A
Other languages
Chinese (zh)
Inventor
于利标
张静
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Baidu Online Network Technology Beijing Co Ltd
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201910774202.4A priority Critical patent/CN110364176A/en
Publication of CN110364176A publication Critical patent/CN110364176A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02082Noise filtering the noise being echo, reverberation of the speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166Microphone arrays; Beamforming

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

This application discloses audio signal processing method and devices, are related to voice technology field.Specific implementation are as follows: terminal device carries out space filtering to the collected road n omnidirectional microphone array signal using the space filtering coefficient in the area m Ge Yin, generates the enhanced signal in the road m, and m and n are positive integer;Terminal device carries out pre- enhancing processing to the enhanced signal in the road m respectively, and pre- enhancing processing includes that echo cancellor and noise are estimated, obtains the road m in advance and enhance treated signal;Energy highest and the highest signal all the way of signal-to-noise ratio are selected in terminal device enhances that treated from the road m in advance signal, noise suppressed and automatic growth control are carried out to selected signal all the way, the voice signal that generates that treated.To obtain high-quality voice signal, and do not limited by usage scenario.

Description

Audio signal processing method and device
Technical field
This application involves the voice process technologies in voice technology field.
Background technique
With popularizing for smart machine such as intelligent sound box and smart television, far field interactive voice becomes these smart machines Standard configuration, in order to promote the interactive experience under the scene of far field, smart machine is provided with omnidirectional microphone array, according to different product Mode of appearance, the linear array that common omnidirectional microphone Array Model is made of 4 omnidirectional microphones, 3,4 or 6 The circular array of omnidirectional microphone composition.As intellectual product enters family, speech communication function on intellectual product increasingly As rigid demand, speech communication requires sound quality relatively high, especially far field communication more sensitive to being distorted, sound source distance Microphone distance is remoter, and signal-to-noise ratio is lower, and reverberation influences bigger.Moreover, the microphone array and loudspeaker height collection of intellectual product On Cheng Yi cavity, in order to ensure the echo signal of microphone acquisition will not cause the serious mistake of reference signal because of spilling Very, the sensitivity of microphone is all very low, and the local voice signal of acquisition passes through sample quantization, the significant bit of each sampling point Also very few (less than 10 bit), thus the quantizing noise significantly affected to quality can be generated.Therefore it needs to voice signal Enhancing, to improve the voice quality of communication.
In the prior art, voice signals enhancement is mainly carried out to smart machine using single microphone signal enhancing method, The process of single microphone signal enhancing method are as follows: to the data of microphone acquisition carry out echo cancellor, stationary noise inhibits and oneself Dynamic gain control, obtains enhanced voice signal, and still, single microphone Signal Enhanced Technology is only suitable for short distance and quiet ring Communication under border, usage scenario are limited.
Summary of the invention
The application provides a kind of audio signal processing method and device, to promote quality of speech signal, solves existing method The limited problem of usage scenario.
In a first aspect, the application provides a kind of audio signal processing method, comprising:
Terminal device carries out the collected road n omnidirectional microphone array signal using the space filtering coefficient in the area m Ge Yin Space filtering generates the enhanced signal in the road m, and m and n are positive integer, and n is the microphone for forming the omnidirectional microphone array Number;
The terminal device carries out pre- enhancing processing, the pre- enhancing processing packet to the enhanced signal in the road m respectively Echo cancellor and noise estimation are included, the road m is obtained in advance and enhance treated signal;
The terminal device selects energy highest and signal-to-noise ratio highest one from the signal that enhances that treated in advance of the road m Road signal carries out noise suppressed and automatic growth control to selected signal all the way, the voice signal that generates that treated.
One embodiment in above-mentioned application has the following advantages that or the utility model has the advantages that uses the area m Ge Yin by terminal device Space filtering coefficient to the collected road n omnidirectional microphone array signal carry out space filtering, generate the enhanced letter in the road m Number, then terminal device carries out pre- enhancing processing to the enhanced signal in the road m respectively, and pre- enhancing processing includes echo cancellor and makes an uproar Sound estimation, obtains the road m in advance and enhances treated signal, selects energy in last terminal device enhances that treated from the road m in advance signal Highest and the highest signal all the way of signal-to-noise ratio are measured, noise suppressed and automatic growth control are carried out to selected signal all the way, it is raw At treated voice signal.To obtain high-quality voice signal, pass through the fixed beam technology of direction-agile and space Multitone Division promotes the signal-to-noise ratio of communication voice and inhibits the influence of ambient noise and reverberation, improves sound quality, and do not used Scene restriction.
Optionally, the terminal device is using the space filtering coefficient in the area m Ge Yin to the collected road n omnidirectional microphone battle array Column signal carries out space filtering, comprising:
Terminal device is complete to the road n in each sound area respectively using the space filtering coefficient in each sound area in the area the m Ge Yin Space filtering is carried out to microphone array signals.
Optionally, the terminal device is using the space filtering coefficient in the area m Ge Yin to the collected road n omnidirectional microphone battle array Before column signal carries out space filtering, the method also includes:
The terminal device is available according to the shape and microphone number of omnidirectional microphone array and the terminal device Central processor CPU computing capability and free memory carry out sound Division to acoustic space, obtain the area m Ge Yin;
The terminal device is that each sound area generates fixed space filtering coefficient according to preset rules, obtains the area m Ge Yin Space filtering coefficient.
One embodiment in above-mentioned application has the following advantages that or the utility model has the advantages that the space filtering coefficient in each sound area is raw After be it is fixed, pass through the space multitone Division side for being based on fixed beam technology (area Ji Yin space filtering coefficient is fixed) Case, does not depend on that positioning etc. is other to be easy algorithm affected by environment, therefore sound quality is more stable.
Optionally, the terminal device stores the space filtering coefficient in the area the m Ge Yin.
One embodiment in above-mentioned application has the following advantages that or the utility model has the advantages that terminal device stores the area m Ge Yin It after space filtering coefficient, can directly be used in communication process, compare other microphone array technical solutions, the present embodiment The computation complexity of method is low, and the storage resource needed is also seldom.
Optionally, the terminal device carries out sound Division, after obtaining sound Division information, the side to acoustic space Method further include:
The terminal device adjusts the centric angle in sound area according to the probability of the sound source appearance position of statistics.
The one embodiment stated in application has the following advantages that or the utility model has the advantages that the centric angle in sound area can be according to statistics Sound source appearance position probability carry out dynamic adjustment, to can reach signal enhancing ability as big as possible.
Optionally, the acoustic space includes in 360 degree of plane spaces, 180 degree plane space and 90 degree of plane spaces Any one.
Optionally, the terminal device selects energy highest and signal-to-noise ratio from the signal that enhances that treated in advance of the road m Before highest signal all the way, the method also includes:
Enhancing the road m treated in advance between the area Liang Geyin, signal is smoothed, the road m after obtaining smoothing processing Signal;
The terminal device selects energy highest and signal-to-noise ratio highest one from the signal that enhances that treated in advance of the road m Road signal, comprising:
Energy highest is selected from the road the m signal after smoothing processing for the terminal device and signal-to-noise ratio is highest believes all the way Number.
Second aspect, the application provide a kind of speech signal processing device, comprising:
Filter module, for using the space filtering coefficient in the area m Ge Yin to the collected road n omnidirectional microphone array signal Space filtering is carried out, generates the enhanced signal in the road m, m and n are positive integer, and n is the Mike for forming the omnidirectional microphone array The number of wind;
Pre- enhancing processing module, for carrying out pre- enhancing processing, the pre- enhancing respectively to the enhanced signal in the road m Processing includes that echo cancellor and noise are estimated, obtains the road m in advance and enhance treated signal;
Processing module, for selecting energy highest from the signal that enhances that treated in advance of the road m and signal-to-noise ratio is highest Signal all the way carries out noise suppressed and automatic growth control to selected signal all the way, the voice signal that generates that treated.
Optionally, the filter module is used for:
Using the space filtering coefficient in each sound area in the area the m Ge Yin respectively to the road the n omnidirectional microphone in each sound area Array signal carries out space filtering.
Optionally, described device further include:
Sound zoning sub-module, for using the space filtering coefficient in the area m Ge Yin to the collected road n in the filter module Before omnidirectional microphone array signal carries out space filtering, according to the shape of omnidirectional microphone array and microphone number with it is described The available central processor CPU computing capability of terminal device and free memory carry out sound Division to acoustic space, obtain m Sound area;
Filter factor generation module is obtained for being that each sound area generates fixed space filtering coefficient according to preset rules To the space filtering coefficient in the area m Ge Yin.
Optionally, the processing module is also used to: storing the space filtering coefficient in the area the m Ge Yin.
Optionally, the processing module is also used to:
Sound Division is carried out to acoustic space in the sound zoning sub-module, after obtaining sound Division information, according to system The centric angle in the probability adjustment sound area of the sound source appearance position of meter.
Optionally, the acoustic space includes in 360 degree of plane spaces, 180 degree plane space and 90 degree of plane spaces Any one.
Optionally, the processing module is also used to:
Before selecting energy highest and the highest signal all the way of signal-to-noise ratio in the signal that enhances that treated in advance of the road m, Enhancing the road m treated in advance between the area Liang Geyin, signal is smoothed, the road the m signal after obtaining smoothing processing;
The processing module is used for: selecting energy highest and signal-to-noise ratio highest one from the road the m signal after smoothing processing Road signal.
The dress of Speech processing provided by each possible embodiment of above-mentioned second aspect and above-mentioned second aspect It sets, its advantages may refer to brought by each optional embodiment of above-mentioned first aspect and first aspect beneficial to effect Fruit, details are not described herein.
The third aspect, the application provide a kind of terminal device, comprising:
At least one processor;And
The memory being connect at least one described processor communication;Wherein,
The memory is stored with the instruction that can be executed by least one described processor, and described instruction is by described at least one A processor executes, so that at least one described processor is able to carry out first aspect and each optional embodiment of first aspect Any one of described in method.
Fourth aspect, the application provide a kind of non-transitory computer-readable storage medium for being stored with computer instruction, institute Computer instruction is stated for making the computer execute any one of first aspect and each optional embodiment of first aspect institute The method stated.
Detailed description of the invention
Attached drawing does not constitute the restriction to the application for more fully understanding this programme.Wherein:
Fig. 1 is a kind of application scenarios schematic diagram of the application;
Fig. 2 is a kind of flow chart of audio signal processing method embodiment provided by the present application;
Fig. 3 is a kind of flow chart of audio signal processing method embodiment provided by the present application;
Fig. 4 is a kind of audio signal processing method process schematic provided by the present application;
Fig. 5 is a kind of structural schematic diagram of speech signal processing device embodiment provided by the present application;
Fig. 6 is a kind of structural schematic diagram of speech signal processing device embodiment provided by the present application;
Fig. 7 is the block diagram according to the terminal device of the audio signal processing method of the embodiment of the present application.
Specific embodiment
It explains below in conjunction with exemplary embodiment of the attached drawing to the application, including the various of the embodiment of the present application Details should think them only exemplary to help understanding.Therefore, those of ordinary skill in the art should recognize It arrives, it can be with various changes and modifications are made to the embodiments described herein, without departing from the scope and spirit of the present application.Together Sample, for clarity and conciseness, descriptions of well-known functions and structures are omitted from the following description.
In the embodiment of the present application, " illustrative " or " such as " etc. words for indicate make example, illustration or explanation, this Shen Please be described as in embodiment " illustrative " or " such as " any embodiment or scheme be not necessarily to be construed as than other realities Apply example or scheme more preferably or more advantage.Specifically, use " illustrative " or " such as " etc. words be intended to it is specific just Related notion is presented in formula.
Far field communication is easy to be influenced by sound source distance, ambient noise and reverberation, can also generate and significantly affect to quality Quantizing noise, it is therefore desirable to voice signals enhancement, to improve the voice quality of communication, in the prior art use single microphone Signal enhancing method to smart machine carry out voice signals enhancement, but single microphone Signal Enhanced Technology be only suitable for closely and Communication under quiet environment, usage scenario are limited, to solve this problem, the application provide a kind of audio signal processing method and Device carries out the collected road n omnidirectional microphone array signal using the space filtering coefficient in the area m Ge Yin by terminal device Space filtering generates the enhanced signal in the road m, then carries out pre- enhancing processing respectively to the enhanced signal in the road m, at pre- enhancing Reason includes that echo cancellor and noise are estimated, obtains the road m in advance and enhance treated signal, finally enhances that treated believes in advance from the road m Energy highest and the highest signal all the way of signal-to-noise ratio are selected in number, and noise suppressed and automatic increasing are carried out to selected signal all the way Benefit control, generates treated voice signal.To obtain high-quality voice signal, pass through the fixed beam skill of direction-agile Art and space multitone Division promote the signal-to-noise ratio of communication voice and inhibit the influence of ambient noise and reverberation, improve sound quality, and It is not limited by usage scenario.With reference to the accompanying drawing by specific embodiment, to the audio signal processing method of the embodiment of the present application Specific implementation process be described in detail.
Fig. 1 is a kind of application scenarios schematic diagram of the application, as shown in Figure 1, terminal device is equipped with omnidirectional microphone battle array Column, according to the mode of appearance of different product, linear battle array that common omnidirectional microphone Array Model is made of 4 omnidirectional microphones Column, the circular array, etc. of 3,4 or 6 omnidirectional microphones composition, in communication process, by built in terminal device Speech signal processing device collects the road n omnidirectional microphone array signal, and n is for forming the microphone of omnidirectional microphone array Number, speech signal processing device is to the collected road n omnidirectional microphone array signal using at voice signal provided by the present application Reason method is handled, then output treated voice signal, to obtain high-quality voice signal, Lai Jinhang voice is logical News.Terminal device in the application for example can be intelligent sound box, Mobile player and smart television etc..
Fig. 2 is a kind of flow chart of audio signal processing method embodiment provided by the present application, the execution in the present embodiment Main body is terminal device, is specifically as follows the module built in terminal device, as shown in Fig. 2, the method for the present embodiment may include:
S101, terminal device are using the space filtering coefficient in the area m Ge Yin to the collected road n omnidirectional microphone array signal Space filtering is carried out, the enhanced signal in the road m is generated, m and n are positive integer, and n is the microphone for forming omnidirectional microphone array Number.
Specifically, terminal device is equipped with omnidirectional microphone array, and in communication process, it is complete that terminal device collects the road n To after microphone array signals, using the area m Ge Yin space filtering coefficient to the collected road n omnidirectional microphone array signal into Row space filtering generates the enhanced signal in the road m, i.e. signal after the generation road m space filtering, not the space filtering system in unisonance area Number is different.
Wherein, terminal device is using the space filtering coefficient in the area m Ge Yin to the collected road n omnidirectional microphone array signal Space filtering is carried out, is specifically as follows:
Terminal device is using the space filtering coefficient in each sound area in the area m Ge Yin respectively to the road the n omnidirectional wheat in each sound area Gram wind array signal carries out space filtering, which is known as the enhancing of multitone area fixed beam.
The space filtering coefficient in the area m Ge Yin therein can be in line computation, be also possible to the generation in the system free time After be stored in terminal device, in communication process directly using storage the area m Ge Yin space filtering coefficient.The following detailed description of In the process of line computation.
As a kind of enforceable mode, before S101, the method for the present embodiment further include:
Terminal device is according to the shape and microphone number of omnidirectional microphone array and the available central processing of terminal device Device (Central Processing Unit, CPU) computing capability and free memory carry out sound Division to acoustic space, and m is a Sound area, while the main lobe width in sound area and the centric angle in sound area can be obtained, optionally, acoustic space therein includes 360 degree Any one of plane space, 180 degree plane space and 90 degree of plane spaces, such as the acoustic space of intelligent sound box is 360 degree Plane space, the acoustic space of smart television are smart television front 180 degree plane space.For example, 360 degree of plane spaces are drawn Be divided into 4,6 perhaps 8 areas Ge Yin 180 degree plane space is divided into 3 or 4 areas Ge Yin, etc..Then, terminal device It is that each sound area generates fixed space filtering coefficient according to preset rules, obtains the space filtering coefficient in the area m Ge Yin, it is therein Preset rules for example can be delay and technology, can also be the existing method for generating space filtering coefficient, no longer superfluous herein It states.In the present embodiment, the space filtering coefficient in each sound area generate after be it is fixed, by being based on fixed beam technology (i.e. sound Area's space filtering coefficient is fixed) space multitone zoning offshoot program, do not depend on positioning etc. it is other be easy algorithm affected by environment, Therefore sound quality is more stable.
Specifically, the strategy of acoustic space sound Division is mainly (such as round by the shape of omnidirectional microphone array Array and linear array etc.) and the factors such as computing resource (i.e. the available CPU computing capability of terminal device and free memory) shadow It rings.Number of microphone is more, can provide stronger wave beam enhancing ability;Circular array also provides higher increasing than linear array Strong ability, more computing resources can also ensure that the enhancing ability of wave beam is fully used, and elementary tactics is number of microphone More, computing resource is more, can divide the area Yue Duoyin, and the main lobe width in each sound area can be narrower, provides more increasings Strong ability.
Further, after the space filtering coefficient for generating the area m Ge Yin, the method for the present embodiment can also include: that terminal is set The space filtering coefficient in the standby storage area m Ge Yin.After terminal device stores the space filtering coefficient in the area m Ge Yin, in communication process In can directly use, compare other microphone array technical solutions, the computation complexity of the method for the present embodiment is low, needs Storage resource is also seldom.
Further, terminal device carries out sound Division, after obtaining sound Division information, the present embodiment to acoustic space Method can also include:
Terminal device adjusts the centric angle in sound area according to the probability of the sound source appearance position of statistics.The centric angle in sound area Dynamic adjustment can be carried out according to the probability of the sound source appearance position of statistics, to can reach signal enhancing energy as big as possible Power.
S102, terminal device carry out pre- enhancing processing to the enhanced signal in the road m respectively, and pre- enhancing processing disappears including echo Except estimating with noise, the road m is obtained in advance and enhances treated signal.
Energy highest is selected in S103, terminal device enhance that treated from the road m in advance signal and signal-to-noise ratio is highest all the way Signal carries out noise suppressed and automatic growth control to selected signal all the way, the voice signal that generates that treated.
Since signal source may will do it across sound Qu Yidong, optionally, the method for the present embodiment is from the road m, enhancing is handled in advance When selecting energy highest and the highest signal all the way of signal-to-noise ratio in signal afterwards, can also include:
Enhancing the road m treated in advance between the area Liang Geyin, signal is smoothed.
Audio signal processing method provided in this embodiment uses the space filtering coefficient in the area m Ge Yin by terminal device Space filtering is carried out to the collected road n omnidirectional microphone array signal, generates the enhanced signal in the road m, then terminal device Pre- enhancing processing is carried out respectively to the enhanced signal in the road m, pre- enhancing processing includes that echo cancellor and noise are estimated, obtains the road m Pre- enhancing treated signal selects energy highest and signal-to-noise ratio in last terminal device enhances that treated from the road m in advance signal Highest signal all the way carries out noise suppressed and automatic growth control to selected signal all the way, the voice that generates that treated Signal.To obtain high-quality voice signal, promoted by the fixed beam technology and space multitone Division of direction-agile It communicates the signal-to-noise ratio of voice and inhibits the influence of ambient noise and reverberation, improve sound quality, and do not limited by usage scenario.
And quality of speech signal is promoted using the existing omnidirectional microphone array of terminal device in the present embodiment, without volume Outer hardware requirement, further, in this embodiment not depended on by the space multitone zoning offshoot program based on fixed beam technology Positioning etc. is other to be easy algorithm affected by environment, and sound quality is more stable.
A specific embodiment is used below, and the technical solution of embodiment of the method shown in Fig. 2 is described in detail.
Fig. 3 is a kind of flow chart of audio signal processing method embodiment provided by the present application, the execution in the present embodiment Main body is terminal device, is specifically as follows the module built in terminal device, as shown in figure 3, the method for the present embodiment may include:
S201, terminal device are according to the shape and microphone number and the available CPU of terminal device of omnidirectional microphone array Computing capability and free memory carry out sound Division to acoustic space, obtain the area m Ge Yin.
Optionally, after obtaining sound Division information, terminal device can also be according to the general of the sound source appearance position of statistics Rate adjusts the centric angle in sound area, to can reach signal enhancing ability as big as possible.
S202, terminal device are that each sound area generates fixed space filtering coefficient according to preset rules, obtain the area m Ge Yin Space filtering coefficient.
S203, terminal device store the space filtering coefficient in the area m Ge Yin.
S204, terminal device are complete to the road n in each sound area respectively using the space filtering coefficient in each sound area in the area m Ge Yin Space filtering is carried out to microphone array signals, generates the enhanced signal in the road m, m and n are positive integer, and n is composition omnidirectional Mike The number of the microphone of wind array.
Optionally, S204 can be directly executed after S202, that is, is exactly space filtering coefficient by way of in line computation. As another enforceable mode, S201-S203 can be executed in the terminal device system free time, generate space filtering coefficient It after storage, can directly be used in communication process, compare other microphone array technical solutions, the meter of the method for the present embodiment Calculation complexity is low, and the storage resource needed is also seldom.
Fig. 4 is a kind of audio signal processing method process schematic provided by the present application, as shown in figure 4, collecting to obtain the road n Omnidirectional microphone array signal MCI0, MCI1 ..., MCIn, terminal device using each sound area in the area m Ge Yin space filtering system Number carries out space filtering to the road the n omnidirectional microphone array signal in each sound area respectively, which is known as multitone area fixed beam Enhancing.
S205, terminal device carry out pre- enhancing processing to the enhanced signal in the road m respectively, and pre- enhancing processing disappears including echo Except estimating with noise, the road m is obtained in advance and enhances treated signal.
Energy highest is selected in S206, terminal device enhance that treated from the road m in advance signal and signal-to-noise ratio is highest all the way Signal selects top-quality signal all the way, carry out post filtering processing to selected signal all the way, i.e. progress noise suppressed And automatic growth control, generate treated voice signal.
After generating treated voice signal, output treated voice signal.
Since signal source may will do it across sound Qu Yidong, optionally, selected in the signal that enhances that treated in advance from the road m When energy highest and the highest signal all the way of signal-to-noise ratio, the signal that can also enhance in advance the road m that treated between the area Liang Geyin It is smoothed.
Audio signal processing method provided in this embodiment, fixed beam technology and space multitone area by direction-agile The signal-to-noise ratio for promoting communication voice and the influence for inhibiting ambient noise and reverberation are divided, sound quality is improved, to obtain high-quality Voice signal, and do not limited by usage scenario.And it is mentioned in the present embodiment using the existing omnidirectional microphone array of terminal device Quality of speech signal is risen, without additional hardware requirement, further, in this embodiment passing through the space based on fixed beam technology Multitone zoning offshoot program, do not depend on positioning etc. it is other be easy algorithm affected by environment, sound quality is more stable.
Fig. 5 is a kind of structural schematic diagram of speech signal processing device embodiment provided by the present application, as shown in figure 5, this The speech signal processing device 100 of embodiment may include: filter module 101, pre- enhancing processing module 102 and processing module 103, wherein filter module 101 is used for the space filtering coefficient using the area m Ge Yin to the collected road n omnidirectional microphone array Signal carries out space filtering, generates the enhanced signal in the road m, and m and n are positive integer, and n is the Mike for forming omnidirectional microphone array The number of wind;
Pre- enhancing processing module 102 for carrying out pre- enhancing processing to the enhanced signal in the road m respectively, pre- enhancing processing packet Echo cancellor and noise estimation are included, the road m is obtained in advance and enhance treated signal;
Processing module 103 is for selecting energy highest and signal-to-noise ratio highest one in the signal that enhances that treated in advance from the road m Road signal carries out noise suppressed and automatic growth control to selected signal all the way, the voice signal that generates that treated.
Further, filter module 101 is used for: using the space filtering coefficient in each sound area in the area m Ge Yin respectively to every The road the n omnidirectional microphone array signal in the area Ge Yin carries out space filtering.
Above method embodiment can be performed in device provided in this embodiment, implements principle and technical effect, can join See above method embodiment, details are not described herein again for the present embodiment.
Fig. 6 is a kind of structural schematic diagram of speech signal processing device embodiment provided by the present application, as shown in fig. 6, this It can also include: sound zoning sub-module 104 and filtering system further on the basis of the device of embodiment device shown in Fig. 5 Number generation modules 105, wherein sound zoning sub-module 104 is used in filter module using the space filtering coefficient in the area m Ge Yin to adopting Before the road the n omnidirectional microphone array signal collected carries out space filtering, according to the shape and microphone of omnidirectional microphone array Number and the available central processor CPU computing capability of terminal device and free memory carry out sound Division to acoustic space, obtain To the area m Ge Yin.Filter factor generation module 105 is used to be that each sound area generates fixed space filtering system according to preset rules Number, obtains the space filtering coefficient in the area m Ge Yin.
Optionally, processing module 103 is also used to: the space filtering coefficient in the storage area m Ge Yin.
Optionally, processing module 103 is also used to: being carried out sound Division to acoustic space in sound zoning sub-module, is obtained sound After Division information, the centric angle in sound area is adjusted according to the probability of the sound source appearance position of statistics.
Optionally, acoustic space includes any in 360 degree of plane spaces, 180 degree plane space and 90 degree of plane spaces ?.
Optionally, processing module 103 is also used to: selecting energy highest and noise in the signal that enhances that treated in advance from the road m Before highest signal all the way, enhancing the road m that treated in advance between the area Liang Geyin, signal is smoothed, and is put down Sliding treated the road m signal;Processing module 103 is used for: selecting energy highest and signal-to-noise ratio from the road the m signal after smoothing processing Highest signal all the way.
Above method embodiment can be performed in device provided by the embodiments of the present application, implements principle and technical effect, It can be found in above method embodiment, details are not described herein again for the present embodiment.
According to an embodiment of the present application, present invention also provides a kind of terminal devices and a kind of readable storage medium storing program for executing.
As shown in fig. 7, Fig. 7 is the block diagram according to the terminal device of the audio signal processing method of the embodiment of the present application.Eventually End equipment is intended to indicate that various forms of digital computers, such as, laptop computer, desktop computer, workbench, a number Word assistant, server, blade server, mainframe computer and other suitable computer.Terminal device also may indicate that respectively The mobile device of kind form, such as, personal digital assistant, cellular phone, smart phone, wearable device and other similar meters Calculate device.Component, their connection and relationship shown in this article and their function are merely exemplary, and are not intended to Limit the realization of the application that is described herein and/or requiring.
As shown in fig. 7, the terminal device includes: one or more processors 701, memory 702, and each for connecting The interface of component, including high-speed interface and low-speed interface.All parts are interconnected using different buses, and can be pacified It installs in other ways on public mainboard or as needed.Processor can to the instruction executed in terminal device into Row processing, including storage in memory or on memory (such as, to be coupled to interface in external input/output device Display equipment) on show GUI graphical information instruction.In other embodiments, if desired, can be by multiple processors And/or multiple bus is used together with multiple memories with multiple memories.It is also possible to multiple terminal devices are connected, it is each Equipment provides the necessary operation in part (for example, as server array, one group of blade server or multiprocessor system System).In Fig. 7 by taking a processor 701 as an example.
Memory 702 is non-transitory computer-readable storage medium provided herein.Wherein, memory is stored with The instruction that can be executed by least one processor, so that at least one processor executes Speech processing provided herein Method.The non-transitory computer-readable storage medium of the application stores computer instruction, and the computer instruction is for making computer Execute audio signal processing method provided herein.
Memory 702 is used as a kind of non-transitory computer-readable storage medium, can be used for storing non-instantaneous software program, non- Instantaneous computer executable program and module, as the corresponding program of audio signal processing method in the embodiment of the present application refers to Order/module (for example, attached filter module shown in fig. 5 101, pre- enhancing processing module 102 and processing module 103).Processor 701 non-instantaneous software program, instruction and the modules being stored in memory 702 by operation, thereby executing each of server Kind functional application and data processing, i.e. audio signal processing method in realization above method embodiment.
Memory 702 may include storing program area and storage data area, wherein storing program area can store operation system Application program required for system, at least one function;Storage data area can be stored according to the terminal device of Speech processing Use created data etc..In addition, memory 702 may include high-speed random access memory, it can also include non-instantaneous Memory, for example, at least a disk memory, flush memory device or other non-instantaneous solid-state memories.In some implementations In example, optional memory 702 includes the memory remotely located relative to processor 701, these remote memories can pass through It is connected to the network to the terminal device of Speech processing.The example of above-mentioned network include but is not limited to internet, intranet, Local area network, mobile radio communication and combinations thereof.
The terminal device of the present embodiment can also include: input unit 703 and output device 704.Processor 701, storage Device 702, input unit 703 and output device 704 can be connected by bus or other modes, to be connected by bus in Fig. 7 It is connected in example.
Input unit 703 can receive the number or character information of input, and generate and the terminal device of the present embodiment User setting and function control related key signals input, such as touch screen, keypad, mouse, track pad, touch tablet, refer to Show the input units such as bar, one or more mouse button, trace ball, control stick.Output device 704 may include that display is set Standby, auxiliary lighting apparatus (for example, LED) and haptic feedback devices (for example, vibrating motor) etc..The display equipment may include but It is not limited to, liquid crystal display (LCD), light emitting diode (LED) display and plasma scope.In some embodiments In, display equipment can be touch screen.
The various embodiments of system and technology described herein can be in digital electronic circuitry, integrated circuit system It is realized in system, dedicated ASIC (specific integrated circuit), computer hardware, firmware, software, and/or their combination.These are various Embodiment may include: to implement in one or more computer program, which can be It executes and/or explains in programmable system containing at least one programmable processor, which can be dedicated Or general purpose programmable processors, number can be received from storage system, at least one input unit and at least one output device According to and instruction, and data and instruction is transmitted to the storage system, at least one input unit and this at least one output Device.
These calculation procedures (also referred to as program, software, software application or code) include the machine of programmable processor Instruction, and can use programming language, and/or the compilation/machine language of level process and/or object-oriented to implement these Calculation procedure.As used herein, term " machine readable media " and " computer-readable medium " are referred to for referring to machine It enables and/or data is supplied to any computer program product, equipment, and/or the device of programmable processor (for example, disk, light Disk, memory, programmable logic device (PLD)), including, receive the machine readable of the machine instruction as machine-readable signal Medium.Term " machine-readable signal " is referred to for machine instruction and/or data to be supplied to any of programmable processor Signal.
In order to provide the interaction with user, system and technology described herein, the computer can be implemented on computers The display device for showing information to user is included (for example, CRT (cathode-ray tube) or LCD (liquid crystal display) monitoring Device);And keyboard and indicator device (for example, mouse or trace ball), user can by the keyboard and the indicator device come Provide input to computer.The device of other types can be also used for providing the interaction with user;For example, being supplied to user's Feedback may be any type of sensory feedback (for example, visual feedback, audio feedback or touch feedback);And it can use Any form (including vocal input, voice input or tactile input) receives input from the user.
System described herein and technology can be implemented including the computing system of background component (for example, as data Server) or the computing system (for example, application server) including middleware component or the calculating including front end component System is (for example, the subscriber computer with graphic user interface or web browser, user can pass through graphical user circle Face or the web browser to interact with the embodiment of system described herein and technology) or including this backstage portion In any combination of computing system of part, middleware component or front end component.Any form or the number of medium can be passed through Digital data communicates (for example, communication network) and is connected with each other the component of system.The example of communication network includes: local area network (LAN), wide area network (WAN) and internet.
Computer system may include client and server.Client and server is generally off-site from each other and usually logical Communication network is crossed to interact.By being run on corresponding computer and each other with the meter of client-server relation Calculation machine program generates the relationship of client and server.
According to the technical solution of the embodiment of the present application, by terminal device using the space filtering coefficient in the area m Ge Yin to adopting The road the n omnidirectional microphone array signal collected carries out space filtering, generates the enhanced signal in the road m, then terminal device is to the road m Enhanced signal carries out pre- enhancing processing respectively, and pre- enhancing processing includes that echo cancellor and noise estimate that obtain the road m enhances in advance Treated signal selects energy highest in last terminal device enhances that treated from the road m in advance signal and signal-to-noise ratio is highest Signal all the way carries out noise suppressed and automatic growth control to selected signal all the way, the voice signal that generates that treated.From And high-quality voice signal is obtained, communication voice is promoted by the fixed beam technology and space multitone Division of direction-agile Signal-to-noise ratio and inhibit the influence of ambient noise and reverberation, improve sound quality, and do not limited by usage scenario.
It should be understood that various forms of processes illustrated above can be used, rearrangement increases or deletes step.Example Such as, each step recorded in the application of this hair can be performed in parallel or be sequentially performed the order that can also be different and execute, As long as it is desired as a result, being not limited herein to can be realized technical solution disclosed in the present application.
Above-mentioned specific embodiment does not constitute the limitation to the application protection scope.Those skilled in the art should be bright White, according to design requirement and other factors, various modifications can be carried out, combination, sub-portfolio and substitution.It is any in the application Spirit and principle within made modifications, equivalent substitutions and improvements etc., should be included within the application protection scope.

Claims (22)

1. a kind of audio signal processing method characterized by comprising
Terminal device carries out space to the collected road n omnidirectional microphone array signal using the space filtering coefficient in the area m Ge Yin Filtering generates the enhanced signal in the road m, and m and n are positive integer, and n is for forming the microphone of the omnidirectional microphone array Number;
The terminal device carries out pre- enhancing processing to the enhanced signal in the road m respectively, and the pre- enhancing processing includes back Sound is eliminated and noise estimation, obtains the road m in advance and enhance treated signal;
Energy highest is selected from the signal that enhances that treated in advance of the road m for the terminal device and signal-to-noise ratio is highest believes all the way Number, noise suppressed and automatic growth control are carried out to selected signal all the way, the voice signal that generates that treated.
2. the method according to claim 1, wherein the terminal device uses the space filtering system in the area m Ge Yin It is several that space filtering is carried out to the collected road n omnidirectional microphone array signal, comprising:
The terminal device is complete to the road n in each sound area respectively using the space filtering coefficient in each sound area in the area the m Ge Yin Space filtering is carried out to microphone array signals.
3. the method according to claim 1, wherein the terminal device uses the space filtering system in the area m Ge Yin Before several progress space filterings to the collected road n omnidirectional microphone array signal, the method also includes:
Shape and microphone number and the terminal device available center of the terminal device according to omnidirectional microphone array Processor CPU computing capability and free memory carry out sound Division to acoustic space, obtain the area m Ge Yin;
The terminal device is that each sound area generates fixed space filtering coefficient according to preset rules, obtains the sky in the area m Ge Yin Between filter factor.
4. according to the method described in claim 3, it is characterized in that, the terminal device stores the space filter in the area the m Ge Yin Wave system number.
5. according to the method described in claim 3, it is characterized in that, the terminal device to acoustic space carry out sound Division, After obtaining sound Division information, the method also includes:
The terminal device adjusts the centric angle in sound area according to the probability of the sound source appearance position of statistics.
6. according to the described in any item methods of claim 3-5, which is characterized in that the acoustic space includes that 360 degree of planes are empty Between, any one of 180 degree plane space and 90 degree of plane spaces.
7. the method according to claim 1, wherein the terminal device enhances in advance from the road m, treated Before selecting energy highest and the highest signal all the way of signal-to-noise ratio in signal, the method also includes:
Enhancing the road m treated in advance between the area Liang Geyin, signal is smoothed, the road the m letter after obtaining smoothing processing Number;
Energy highest is selected from the signal that enhances that treated in advance of the road m for the terminal device and signal-to-noise ratio is highest believes all the way Number, comprising:
The terminal device selects energy highest and the highest signal all the way of signal-to-noise ratio from the road the m signal after smoothing processing.
8. a kind of speech signal processing device characterized by comprising
Filter module, for using the space filtering coefficient in the area m Ge Yin to carry out the collected road n omnidirectional microphone array signal Space filtering generates the enhanced signal in the road m, and m and n are positive integer, and n is the microphone for forming the omnidirectional microphone array Number;
Pre- enhancing processing module, for carrying out pre- enhancing processing, the pre- enhancing processing respectively to the enhanced signal in the road m Estimate including echo cancellor and noise, obtains the road m in advance and enhance treated signal;
Processing module, for selecting energy highest from the signal that enhances that treated in advance of the road m and signal-to-noise ratio is highest all the way Signal carries out noise suppressed and automatic growth control to selected signal all the way, the voice signal that generates that treated.
9. device according to claim 8, which is characterized in that the filter module is used for:
Using the space filtering coefficient in each sound area in the area the m Ge Yin respectively to the road the n omnidirectional microphone array in each sound area Signal carries out space filtering.
10. device according to claim 8, which is characterized in that described device further include:
Sound zoning sub-module, for using the space filtering coefficient in the area m Ge Yin to the collected road n omnidirectional in the filter module Before microphone array signals carry out space filtering, according to the shape of omnidirectional microphone array and microphone number and the terminal The available central processor CPU computing capability of equipment and free memory carry out sound Division to acoustic space, obtain the area m Ge Yin;
Filter factor generation module obtains m for being that each sound area generates fixed space filtering coefficient according to preset rules The space filtering coefficient in sound area.
11. device according to claim 10, which is characterized in that the processing module is also used to: storing the area the m Ge Yin Space filtering coefficient.
12. device according to claim 10, which is characterized in that the processing module is also used to:
Sound Division is carried out to acoustic space in the sound zoning sub-module, after obtaining sound Division information, according to statistics The centric angle in the probability adjustment sound area of sound source appearance position.
13. the described in any item devices of 0-12 according to claim 1, which is characterized in that the acoustic space includes 360 degree of planes Any one of space, 180 degree plane space and 90 degree of plane spaces.
14. device according to claim 8, which is characterized in that the processing module is also used to:
Before selecting energy highest and the highest signal all the way of signal-to-noise ratio in the signal that enhances that treated in advance of the road m, two Enhancing the road m treated between the area Ge Yin in advance, signal is smoothed, the road the m signal after obtaining smoothing processing;
The processing module is used for: selecting energy highest from the road the m signal after smoothing processing and signal-to-noise ratio is highest believes all the way Number.
15. a kind of terminal device characterized by comprising
At least one processor;And
The memory being connect at least one described processor communication;Wherein,
The memory is stored with the instruction that can be executed by least one described processor, and described instruction is by described at least one It manages device to execute, so that at least one described processor is able to carry out method of any of claims 1-7.
16. a kind of non-transitory computer-readable storage medium for being stored with computer instruction, which is characterized in that the computer refers to It enables for making the computer perform claim require method described in any one of 1-7.
17. a kind of audio signal processing method characterized by comprising
Terminal device carries out space to the collected road n omnidirectional microphone array signal using the space filtering coefficient in the area m Ge Yin Filtering generates the enhanced signal in the road m, and m and n are positive integer, and n is for forming the microphone of the omnidirectional microphone array Number;
The terminal device carries out pre- enhancing processing to the enhanced signal in the road m respectively, and the pre- enhancing processing includes back Sound is eliminated and noise estimation, obtains the road m in advance and enhance treated signal;
The terminal device selects top-quality signal all the way from the signal that enhances that treated in advance of the road m, to selected Signal all the way carry out noise suppressed and automatic growth control, the voice signal that generates that treated.
18. according to the method for claim 17, which is characterized in that the top-quality signal all the way be energy highest and The highest signal all the way of signal-to-noise ratio.
19. method described in 7 or 18 according to claim 1, which is characterized in that the terminal device uses the space in the area m Ge Yin Filter factor carries out space filtering to the collected road n omnidirectional microphone array signal, comprising:
The terminal device is complete to the road n in each sound area respectively using the space filtering coefficient in each sound area in the area the m Ge Yin Space filtering is carried out to microphone array signals.
20. according to the method for claim 17, which is characterized in that the terminal device uses the space filtering in the area m Ge Yin Before coefficient carries out space filtering to the collected road n omnidirectional microphone array signal, the method also includes:
Shape and microphone number and the terminal device available center of the terminal device according to omnidirectional microphone array Processor CPU computing capability and free memory carry out sound Division to acoustic space, obtain the area m Ge Yin;
The terminal device is that each sound area generates fixed space filtering coefficient according to preset rules, obtains the sky in the area m Ge Yin Between filter factor.
21. according to the method for claim 20, which is characterized in that the terminal device stores the space in the area the m Ge Yin Filter factor.
22. according to the method for claim 20, which is characterized in that the terminal device carries out sound zoning to acoustic space Point, after obtaining sound Division information, the method also includes:
The terminal device adjusts the centric angle in sound area according to the probability of the sound source appearance position of statistics.
CN201910774202.4A 2019-08-21 2019-08-21 Audio signal processing method and device Pending CN110364176A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910774202.4A CN110364176A (en) 2019-08-21 2019-08-21 Audio signal processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910774202.4A CN110364176A (en) 2019-08-21 2019-08-21 Audio signal processing method and device

Publications (1)

Publication Number Publication Date
CN110364176A true CN110364176A (en) 2019-10-22

Family

ID=68224963

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910774202.4A Pending CN110364176A (en) 2019-08-21 2019-08-21 Audio signal processing method and device

Country Status (1)

Country Link
CN (1) CN110364176A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111161750A (en) * 2019-12-13 2020-05-15 西安讯飞超脑信息科技有限公司 Voice processing method and related device
CN112951261A (en) * 2021-03-02 2021-06-11 北京声智科技有限公司 Sound source positioning method and device and voice equipment
CN113053406A (en) * 2021-05-08 2021-06-29 北京小米移动软件有限公司 Sound signal identification method and device

Citations (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102137318A (en) * 2010-01-22 2011-07-27 华为终端有限公司 Method and device for controlling adapterization
CN102387273A (en) * 2011-07-08 2012-03-21 歌尔声学股份有限公司 Method and device for inhibiting residual echoes
CN102945672A (en) * 2012-09-29 2013-02-27 深圳市国华识别科技开发有限公司 Voice control system for multimedia equipment, and voice control method
CN104053088A (en) * 2013-03-11 2014-09-17 联想(北京)有限公司 Microphone array adjustment method, microphone array and electronic device
CN104049721A (en) * 2013-03-11 2014-09-17 联想(北京)有限公司 Information processing method and electronic equipment
CN104301664A (en) * 2013-07-19 2015-01-21 松下电器产业株式会社 Directivity control system, directivity control method, sound collection system and sound collection control method
US20150302869A1 (en) * 2014-04-17 2015-10-22 Arthur Charles Tomlin Conversation, presence and context detection for hologram suppression
US20170013357A1 (en) * 2015-07-07 2017-01-12 Oki Electric Industry Co., Ltd. Sound collection apparatus and method
CN106653041A (en) * 2017-01-17 2017-05-10 北京地平线信息技术有限公司 Audio signal processing equipment and method as well as electronic equipment
CN106710603A (en) * 2016-12-23 2017-05-24 上海语知义信息技术有限公司 Speech recognition method and system based on linear microphone array
CN107277699A (en) * 2017-07-21 2017-10-20 歌尔科技有限公司 A kind of sound pick-up method and device
CN108475511A (en) * 2015-12-17 2018-08-31 亚马逊技术公司 Adaptive beamformer for creating reference channel
CN108694957A (en) * 2018-04-08 2018-10-23 湖北工业大学 The echo cancelltion design method formed based on circular microphone array beams
CN109545230A (en) * 2018-12-05 2019-03-29 百度在线网络技术(北京)有限公司 Acoustic signal processing method and device in vehicle
CN109920433A (en) * 2019-03-19 2019-06-21 上海华镇电子科技有限公司 The voice awakening method of electronic equipment under noisy environment
CN109949810A (en) * 2019-03-28 2019-06-28 华为技术有限公司 A kind of voice awakening method, device, equipment and medium
CN110010126A (en) * 2019-03-11 2019-07-12 百度国际科技(深圳)有限公司 Audio recognition method, device, equipment and storage medium

Patent Citations (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102137318A (en) * 2010-01-22 2011-07-27 华为终端有限公司 Method and device for controlling adapterization
CN102387273A (en) * 2011-07-08 2012-03-21 歌尔声学股份有限公司 Method and device for inhibiting residual echoes
CN102945672A (en) * 2012-09-29 2013-02-27 深圳市国华识别科技开发有限公司 Voice control system for multimedia equipment, and voice control method
CN104053088A (en) * 2013-03-11 2014-09-17 联想(北京)有限公司 Microphone array adjustment method, microphone array and electronic device
CN104049721A (en) * 2013-03-11 2014-09-17 联想(北京)有限公司 Information processing method and electronic equipment
CN104301664A (en) * 2013-07-19 2015-01-21 松下电器产业株式会社 Directivity control system, directivity control method, sound collection system and sound collection control method
US20150302869A1 (en) * 2014-04-17 2015-10-22 Arthur Charles Tomlin Conversation, presence and context detection for hologram suppression
US20170013357A1 (en) * 2015-07-07 2017-01-12 Oki Electric Industry Co., Ltd. Sound collection apparatus and method
CN108475511A (en) * 2015-12-17 2018-08-31 亚马逊技术公司 Adaptive beamformer for creating reference channel
CN106710603A (en) * 2016-12-23 2017-05-24 上海语知义信息技术有限公司 Speech recognition method and system based on linear microphone array
CN106653041A (en) * 2017-01-17 2017-05-10 北京地平线信息技术有限公司 Audio signal processing equipment and method as well as electronic equipment
CN107277699A (en) * 2017-07-21 2017-10-20 歌尔科技有限公司 A kind of sound pick-up method and device
CN108694957A (en) * 2018-04-08 2018-10-23 湖北工业大学 The echo cancelltion design method formed based on circular microphone array beams
CN109545230A (en) * 2018-12-05 2019-03-29 百度在线网络技术(北京)有限公司 Acoustic signal processing method and device in vehicle
CN110010126A (en) * 2019-03-11 2019-07-12 百度国际科技(深圳)有限公司 Audio recognition method, device, equipment and storage medium
CN109920433A (en) * 2019-03-19 2019-06-21 上海华镇电子科技有限公司 The voice awakening method of electronic equipment under noisy environment
CN109949810A (en) * 2019-03-28 2019-06-28 华为技术有限公司 A kind of voice awakening method, device, equipment and medium

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111161750A (en) * 2019-12-13 2020-05-15 西安讯飞超脑信息科技有限公司 Voice processing method and related device
CN111161750B (en) * 2019-12-13 2022-09-06 西安讯飞超脑信息科技有限公司 Voice processing method and related device
CN112951261A (en) * 2021-03-02 2021-06-11 北京声智科技有限公司 Sound source positioning method and device and voice equipment
CN112951261B (en) * 2021-03-02 2022-07-01 北京声智科技有限公司 Sound source positioning method and device and voice equipment
CN113053406A (en) * 2021-05-08 2021-06-29 北京小米移动软件有限公司 Sound signal identification method and device

Similar Documents

Publication Publication Date Title
US10560783B2 (en) Methods, apparatuses and computer program products for facilitating directional audio capture with multiple microphones
US11880628B2 (en) Screen mirroring display method and electronic device
CN110364176A (en) Audio signal processing method and device
CN109119093A (en) Voice de-noising method, device, storage medium and mobile terminal
CN106657681B (en) A kind of control method, device and the mobile terminal of mobile terminal refresh rate
EP3852106A1 (en) Sound processing method, apparatus and device
CN104422922A (en) Method and device for realizing sound source localization by utilizing mobile terminal
CN111402868B (en) Speech recognition method, device, electronic equipment and computer readable storage medium
CN111968658B (en) Speech signal enhancement method, device, electronic equipment and storage medium
WO2019128639A1 (en) Method for detecting audio signal beat points of bass drum, and terminal
CN108152788A (en) Sound-source follow-up method, sound-source follow-up equipment and computer readable storage medium
WO2021114847A1 (en) Internet calling method and apparatus, computer device, and storage medium
US11284151B2 (en) Loudness adjustment method and apparatus, and electronic device and storage medium
CN110931035B (en) Audio processing method, device, equipment and storage medium
CN110501918A (en) Intelligent electrical appliance control, device, electronic equipment and storage medium
CN110109899A (en) Internet of things data complementing method, apparatus and system
CN114187922A (en) Audio detection method and device and terminal equipment
JP2019057320A (en) Method, computer program, computer-readable recording medium, and apparatus
US20170171524A1 (en) Techniques for improving stereo block matching with the pyramid method
CN111933167A (en) Noise reduction method and device for electronic equipment, storage medium and electronic equipment
US8924206B2 (en) Electrical apparatus and voice signals receiving method thereof
CN110069641B (en) Image processing method and device and electronic equipment
CN102750126B (en) Pronunciation inputting method and terminal
CN108269223B (en) Webpage graph drawing method and terminal
CN113593602B (en) Audio processing method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191022

RJ01 Rejection of invention patent application after publication