CN105931648A - Audio signal de-reverberation method and device - Google Patents

Audio signal de-reverberation method and device Download PDF

Info

Publication number
CN105931648A
CN105931648A CN201610474006.1A CN201610474006A CN105931648A CN 105931648 A CN105931648 A CN 105931648A CN 201610474006 A CN201610474006 A CN 201610474006A CN 105931648 A CN105931648 A CN 105931648A
Authority
CN
China
Prior art keywords
filter
signal
audio signal
sub
channel audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610474006.1A
Other languages
Chinese (zh)
Other versions
CN105931648B (en
Inventor
崔玮玮
宋辉
徐杨飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201610474006.1A priority Critical patent/CN105931648B/en
Publication of CN105931648A publication Critical patent/CN105931648A/en
Application granted granted Critical
Publication of CN105931648B publication Critical patent/CN105931648B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02082Noise filtering the noise being echo, reverberation of the speech

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Quality & Reliability (AREA)
  • Stereophonic System (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)

Abstract

The invention discloses an audio signal de-reverberation method and device. The method comprises the steps that a single channel audio signal is acquired, wherein the single channel audio signal comprises an early reverberation signal and a late reverberation signal; whether the single channel audio signal is a speech signal is determined; if so, the variance of the joint probability density distribution of the early reverberation signal is updated, and the filter coefficient of a subband filter is updated based on the variance of the joint probability density distribution of the early reverberation signal, wherein the subband filter is used to filter out the late reverberation signal in the single channel audio signal; and the de-reverberation single channel audio signal is determined based on the updated filter coefficient. According to the scheme provided by the invention, the late reverberation signal in the inputted audio signal can be filtered out to improve the accuracy of subsequent speech recognition.

Description

Audio signal solution reverberation method and device
Technical field
The application relates to field of computer technology, is specifically related to Audio Signal Processing field, especially Relate to audio signal solution reverberation method and device.
Background technology
During audio frequency especially Speech processing, if for gathering adopting of audio signal Acquisition means (such as, mike) distance sound source position farther out, the sound that harvester receives Frequently signal will inevitably be affected by reverberation.The existence of reverberation not only can reduce audio frequency letter Number sense of hearing quality, and the precision of existing voice identification system can be caused drastically to decline.
Reverberation can be decomposed into reverberation in early days and late reverberation, wherein to audio quality and identification What system accuracy impact was bigger is late reverberation, and the main target therefore solving reverberation is how to reduce Late reverberation.
In the prior art, some have been there are for filtering the voice signal that harvester collects In the algorithm of late reverberation.But, the most all there is following problem in these algorithms:
1) when solving for the filter coefficient of the wave filter filtering late reverberation, need to obtain Whole voice data, thus cause the real-time rate of algorithm relatively low, and then can cause solving reverberation and calculate The time delay of method is higher.And in voice communication and field of speech recognition, to solving reverberation algorithm Requirement of real-time is higher.
2), when solving for the filter coefficient of the wave filter filtering late reverberation, it will usually Relate to the calculating of matrix inversion.And once matrix is irreversible during matrix inversion, then try to achieve Filter coefficient the most inaccurate, and then impact solve reverberation performance.Additionally, matrix inversion Operand is relatively big, and the real-time also resulting in understanding reverberation algorithm from another point of view is poor.
Summary of the invention
The purpose of the application is to propose the audio signal solution reverberation method of a kind of improvement and device, Solve the technical problem that background section above is mentioned.
First aspect, this application provides a kind of audio signal solution reverberation method, including: obtain Channel audio signal, channel audio signal includes reverb signal and late reverberation signal in early days; Judge whether channel audio signal is voice signal;The most then update early stage reverb signal The variance of joint probability density distribution, and joint probability density based on early stage reverb signal distribution Variance update sub-filter filter coefficient, wherein, sub-filter is used for filtering list The late reverberation signal comprised in channel audio signal;And based on the filter coefficient after updating Determine the audio signal after solving reverberation.
In certain embodiments, it is judged that whether channel audio signal is that voice signal includes: logical Cross voice activity detection technology and judge whether channel audio signal is voice signal.
In certain embodiments, the variance of the joint probability density distribution of reverb signal in early days is more The transposed matrix of the filter coefficient of the sub-filter before new and the input signal of sub-filter The product of vector and the spectral coefficient x of the channel audio signal received in tt,fDifference exhausted To value square;Filter coefficient g after renewalf(t+1) for update before filter coefficient with more New change amount sum;Wherein, updating variable quantity is that the first renewal running parameter and second updates change Change the ratio of parameter;First renewal running parameter is the iteration step length of sub-filter, sub-band filter Error between actual output and the desired output of device and the input signal of sub-filter to The product of amount;Second update running parameter be sub-filter input signal vector transposition with The product of the input signal vector of sub-filter;The actual output of sub-filter is defeated with expectation Error between going out is equal to the spectral coefficient of the channel audio signal received in t and t The ratio of variance of joint probability density distribution of early stage reverb signal deduct the subband filter before renewal The transposition of the filter coefficient of ripple device and the product of the input signal vector of sub-filter.
In certain embodiments, audio signal d after reverberation is solvedt,fEqual to receive in t The spectral coefficient of channel audio signal deducts the filter coefficient of the sub-filter in t+1 moment The product of the input signal vector of transposition and sub-filter.In certain embodiments, judging After whether channel audio signal is voice signal, method also includes: if it is not, then will update Sub-filter before the variance of the joint probability density distribution of front early stage reverb signal and renewal Filter coefficient as update after early stage reverb signal joint probability density distribution variance Filter coefficient with the sub-filter after renewal.
In certain embodiments, method also includes: judge actual output and the phase of sub-filter Hope the error between output whether meet Error Absolute Value square more than receiving in t The absolute value of the spectral coefficient of channel audio signal square with the product of predetermined threshold value K;If so, Then filter coefficient is set to null vector;Wherein, K > 1.
Second aspect, this application provides a kind of audio signal solution reverberation unit, including: obtain Module, is configured to obtain channel audio signal, and channel audio signal includes reverberation in early days Signal and late reverberation signal;Judge module, is configured to whether judge channel audio signal For voice signal;First more new module, if being configured to channel audio signal is voice signal, Then update the variance of the joint probability density distribution of reverb signal in early days, and believe based on early stage reverberation Number joint probability density distribution variance update sub-filter filter coefficient, wherein, Sub-filter is for filtering the late reverberation signal comprised in channel audio signal;And really Cover half block, is configured to determine the audio signal after solving reverberation based on the filter coefficient after updating.
In certain embodiments, it is judged that module is configured to further: pass through voice activity detection Technology judges whether channel audio signal is voice signal.
In certain embodiments, the variance of the joint probability density distribution of reverb signal in early days is: The input letter of the transposed matrix of the filter coefficient of the sub-filter before renewal and sub-filter The spectral coefficient x of number vectorial product and the channel audio signal received in tt,fDifference Absolute value square;Filter coefficient g after renewalf(t+1) for update before filter coefficient with Update variable quantity sum;Wherein, updating variable quantity is that the first renewal running parameter and second updates The ratio of running parameter;First renewal running parameter is the iteration step length of sub-filter, subband filter Error between actual output and the desired output of ripple device and the input signal of sub-filter The product of vector;Second transposition updating the input signal vector that running parameter is sub-filter Product with the input signal vector of sub-filter;The actual output of sub-filter and expectation Error between output is equal to the spectral coefficient of the channel audio signal received in t and t The ratio of the variance of the joint probability density distribution of the early stage reverb signal in moment deducts the son before renewal The transposition of the filter coefficient of band filter and the product of the input signal vector of sub-filter.
In certain embodiments, audio signal d after reverberation is solvedt,fEqual to receive in t The spectral coefficient of channel audio signal deducts the filter coefficient of the sub-filter in t+1 moment The product of the input signal vector of transposition and sub-filter.
In certain embodiments, device also includes the second more new module;Second more new module configuration If for channel audio signal not being voice signal, then the early stage reverb signal before updating The filter coefficient of the sub-filter before the variance of joint probability density distribution and renewal is as more Sub-band filter after the variance of the joint probability density distribution of the early stage reverb signal after Xin and renewal The filter coefficient of device.
In certain embodiments, device also includes zero setting module;Zero setting module is configured to judge Whether the error between actual output and the desired output of sub-filter meets Error Absolute Value Square more than the channel audio signal received in t spectral coefficient absolute value square Product with predetermined threshold value K;And the most then filter coefficient is set to null vector;Wherein, K > 1.
The audio signal solution reverberation method of the application offer and device, by voice based on input Signal is continuously updated variance and the subband filter of the joint probability density distribution of reverb signal in early days The parameter of ripple device so that sub-filter filtering can be by the channel audio signal of input Late reverberation target signal filter, thus improve the accuracy rate of subsequent speech recognition.
Additionally, the audio signal solution reverberation method of the application and device, the connection of reverb signal in early days Renewal and the sub-filter parameter of closing the variance of probability density distribution update required amount of calculation relatively Little, and renewal process is only relevant to the channel audio signal in a period of time, has higher Real-time.
Accompanying drawing explanation
By reading retouching in detail with reference to made non-limiting example is made of the following drawings Stating, other features, purpose and advantage will become more apparent upon:
Fig. 1 is that the application can apply to exemplary system architecture figure therein;
Fig. 2 is the flow process of an embodiment of the audio signal solution reverberation method according to the application Figure;
Fig. 3 is the flow process of another embodiment of the audio signal solution reverberation method according to the application Figure;
Fig. 4 is that the structure of an embodiment of the audio signal solution reverberation unit according to the application is shown It is intended to;
Fig. 5 is adapted for the computer for the terminal unit or server realizing the embodiment of the present application The structural representation of system.
Detailed description of the invention
With embodiment, the application is described in further detail below in conjunction with the accompanying drawings.It is appreciated that , specific embodiment described herein is used only for explaining related invention, rather than to this Bright restriction.It also should be noted that, for the ease of describe, accompanying drawing illustrate only with About the part that invention is relevant.
It should be noted that in the case of not conflicting, the embodiment in the application and embodiment In feature can be mutually combined.Describe this below with reference to the accompanying drawings and in conjunction with the embodiments in detail Application.
Fig. 1 shows audio signal solution reverberation method or the audio signal solution that can apply the application The exemplary system architecture 100 of the embodiment of reverberation unit.
As it is shown in figure 1, system architecture 100 can include terminal unit 101,102,103, Network 104 and server 105.Network 104 is in order at terminal unit 101,102,103 and The medium of communication link is provided between server 105.Network 104 can include various connection class Type, the most wired, wireless communication link or fiber optic cables etc..
User can use terminal unit 101,102,103 by network 104 and server 105 Alternately, to receive or to send message etc..Can be provided with on terminal unit 101,102,103 Various telecommunication customer ends are applied, and such as web browser applications, shopping class application, searching class are answered With, JICQ, mailbox client, social platform software etc..
Terminal unit 101,102,103 can be ability various having and gathering audio signal Electronic equipment, includes but not limited to smart mobile phone, panel computer, MP3 player (Moving Picture Experts Group Audio Layer III, dynamic image expert's compression standard audio layer Face 3), (Moving Picture Experts Group Audio Layer IV, dynamic image is special for MP4 Family's compression standard audio frequency aspect 4) player, pocket computer on knee and desk computer etc..
Server 105 can be to provide the server of various service, such as to terminal unit 101, 102,103 audio signals gathered carry out the audio processing service device processed.Audio processing service The data such as the audio signal received can be analyzed waiting and process by device, and by result (example Voice data as processed through reverberation removal) feed back to terminal unit.
It should be noted that the audio signal solution reverberation method that the embodiment of the present application is provided both may be used To be performed by terminal unit 101,102,103, it is also possible to performed by server 105, also may be used With a part of step by terminal unit 101,102,103 perform and another part step by servicing Device 105 performs.Correspondingly, audio signal solution reverberation unit both can be arranged at terminal unit 101, 102, in 103, it is also possible to be arranged in server 105, or a part of module is arranged at end In end equipment 101,102,103, another part module is arranged in server.
It should be understood that the number of terminal unit, network and the server in Fig. 1 is only signal Property.According to realizing needs, can have any number of terminal unit, network and server.
With continued reference to Fig. 2, it is shown that according to one of the audio signal solution reverberation method of the application The flow process 200 of embodiment.Described audio signal solution reverberation method, comprises the following steps:
Step 210, obtains channel audio signal, and wherein, channel audio signal includes early Phase reverb signal and late reverberation signal.
In the present embodiment, audio signal solution reverberation method runs on electronic equipment (example thereon Server as shown in Figure 1) can by wired connection mode or radio connection from Family utilizes it to carry out the terminal of audio signal sample and receives channel audio signal.It may be noted that It is that above-mentioned radio connection can include but not limited to that 3G/4G connects, WiFi connects, indigo plant Tooth connects, WiMAX connects, Zigbee connects, UWB (ultra wideband) connects, And other currently known or exploitation in the future radio connection.
During it is said that in general, there is a certain distance between equipment and the sound source gathering sound, adopt The audio signal that the equipment of collection sound collects will be affected by reverberation.Reverberation is generally according to directly Reach sound (namely directly being conveyed to gather the audio signal of the equipment of sound from sound source) and be transferred to Time difference between the reverberation of the equipment gathering sound is reverberation in early days and late reverberation.Such as, Arrival in 30ms (millisecond) after direct sound wave can be gathered the reverberation of equipment of sound as in early days Reverberation, and the reverberation of the equipment gathering sound will be arrived more than 30ms as late reverberation.Early Voice amplitude, phase delay, formant are affected less by phase reverberation, and late reverberation then can be right The impact of voice amplitude, phase delay, formant is bigger, and can cause the phase of linguistic syllables Inter-masking, these all reduce speech intelligibility, bring the biggest difficulty to speech recognition.
Step 220, it is judged that whether channel audio signal is voice signal.
The audio signal solution reverberation method of the present embodiment, it is intended to the voice signal collected is carried out Solution reverberation processes, and then the voice signal after processing can be reflected more realistically, and user is actual The voice said.And then promote the accuracy rate of the signal processing such as subsequent speech recognition.
Additionally, voice signal and other audio signal exist more significantly difference, therefore, logical Cross in step, judge whether the channel audio signal got is voice signal, can be rear In continuous process step, carry out the operations such as corresponding filtering only for voice signal, thus improve The solution reverberation treatment effeciency of voice signal and real-time.
In some optional implementations, for example, it is possible to by VAD (Voice Activity Detection, voice activity detection) technology carries out the identification of voice signal, thus judges Whether the channel audio signal got in step 210 is voice signal.
Step 230, the most then update the side of the joint probability density distribution of reverb signal in early days Difference, and the variance renewal sub-filter of joint probability density based on early stage reverb signal distribution Filter coefficient.
The variance of the joint probability density distribution of reverb signal can be the subband filter before updating in early days The product of the transposed matrix of the filter coefficient of ripple device and the input signal vector of sub-filter with Spectral coefficient x at the channel audio signal that t receivest,fDifference absolute value square.
Exemplarily, the variance of the joint probability density distribution of reverb signal in early days can meet:
Wherein, xt,fThe spectral coefficient of the channel audio signal for receiving in t,For more The transposition of the filter coefficient of the sub-filter before new,Input for sub-filter is believed Number vector.
Filter coefficient g after renewalf(t+1) for the filter coefficient before updating and renewal variable quantity Sum.Wherein, update variable quantity be the first renewal running parameter and second update running parameter it Ratio.First updates the reality that running parameter is the iteration step length of sub-filter, sub-filter Error between output and desired output and the product of the input signal vector of sub-filter. Second transposition updating the input signal vector that running parameter is sub-filter and sub-filter The product of input signal vector.Mistake between actual output and the desired output of sub-filter The poor spectral coefficient equal to the channel audio signal received in t and the early stage reverberation of t The ratio of the variance of the joint probability density distribution of signal deducts the filtering of the sub-filter before renewal The transposition of device coefficient and the product of the input signal vector of sub-filter.
Exemplarily, the filter coefficient g after renewalf(t+1) can meet:
g f ( t + 1 ) = g f ( t ) + μ × e ( t ) × x ^ → t - D , f x ^ → T t - D , f x ^ → t - D , f - - - ( 2 )
Wherein, μ is the iteration step length of sub-filter, and e (t) is the actual output of sub-filter And the error e (t) between desired output meets:
e ( t ) = x ^ t , f - g f T × x ^ → t - D , f - - - ( 3 )
Here, the initial value g of the filter coefficient of sub-filterf(0) can be that single order number is The full null vector of N.
Wherein, N is the number of taps of sub-filter, and meets:
N=L-D+1.
L is the frame time length ratio of reverberation time and channel audio signal one frame, and D is early The reverberation time of phase reverberation and the frame time length ratio of channel audio signal one frame.And reverberation Time such as can be defined as indoor sound and reach steady statue, and sound source is remaining sound after stopping sounding Sound absorbs through sound-absorbing material in room repeatedly, the time needed for average sound pressure level decay 60dB.
Additionally, the input signal vector of sub-filterCan have a following form of expression:
x → t - D , f = [ x t - D , f , x t - D - 1 , f , ... , x t - L + 1 , f ] .
In other words,Can be understood as sub-filter to connect to the t-L+1 moment from the t-D moment The input signal vector that the input signal received is formed.
Additionally, in formula (3):
x ^ t , f = x t , f σ t , f ;
Correspondingly, in formula (2) and formula (3):
x ^ → t - D , f = [ x ^ t - D , f , x ^ t - D - 1 , f , ... , x ^ t - L + 1 , f ] .
By above-mentioned formula (1)~(3) it can be seen that in the associating to early stage reverb signal When the variance of probability density distribution is updated, the filter coefficient of employing is the filtering of t Device coefficient (i.e. gfOr the g in formula (2)f(t)), at the connection completing reverb signal in early days After the renewal of the variance closing probability density distribution, then the coefficient of wave filter is updated (i.e. G in formula (2)f(t+1))。
Step 240, determines the audio signal after solving reverberation based on the filter coefficient after updating.
Use formula as above (1)~(3) joint probability density to early stage reverb signal After the variance of distribution and the filter coefficient of sub-filter are updated, solve the sound after reverberation Frequently signal dt,fEqual to when the spectral coefficient of the channel audio signal that t receives deducts t+1 The transposition of filter coefficient and the input signal of described sub-filter of the sub-filter carved to The product of amount.
Exemplarily, audio signal d after reverberation is solvedT, fCan meet:
d t , f = x t , f - g f T ( t + 1 ) × x → t - D , f - - - ( 4 )
Additionally, in some optional implementations, the audio signal solution reverberation side of the present embodiment Method can also comprise the following steps that
Judge whether the error between actual output and the desired output of sub-filter meets error Absolute square more than the spectral coefficient of the channel audio signal received in t of absolute value Value square with the product of predetermined threshold value K.In other words, it is judged that the reality of sub-filter is defeated Go out whether the error e (t) between desired output meets | e (t) |2> K × | xt,f|2
The most then filter coefficient is set to null vector.
Wherein, K is the threshold value pre-set, and meets K > 1.
Additionally, work as | e (t) |2> K × | xt,f|2Time, it is believed that sub-filter dissipates, at next step When the variance being distributed the joint probability density of early stage reverb signal is updated, formula (1) gfFor null vector.
Using the audio signal solution reverberation method of the present embodiment, it solves amount of calculation needed for reverberation process Less, and calculating process only needs between t-D~the t-L+1 moment channel audio signal of input And the channel audio signal of present frame input so that the reality solving reverberation process of the present embodiment Time property is stronger.
With further reference to Fig. 3, it illustrates another embodiment of audio signal solution reverberation method Flow process 300.The flow process 300 of this audio signal solution reverberation method.
The present embodiment can have relative with step 210 in embodiment illustrated in fig. 2~step 240 The step 310 answered~step 340.
Unlike embodiment illustrated in fig. 2, the present embodiment may further comprise:
Step 350, the early stage if judged result in step 320 is no, then before updating The wave filter of the sub-filter before the variance of the joint probability density distribution of reverb signal and renewal Coefficient as update after early stage reverb signal joint probability density distribution variance and update after The filter coefficient of sub-filter.
In other words, if step 320 is judged the single channel audio got in step 310 Signal is not voice signal, does not the most update the side of the joint probability density distribution of reverb signal in early days Difference and the filter coefficient of sub-filter, and connection based on the early stage reverb signal before updating The filter coefficient of the variance and sub-filter of closing probability density distribution obtains in step 310 The channel audio signal got carries out solving reverberation and processes (step 340).
Compared with the embodiment shown in Fig. 2, the audio signal solution reverberation method of the present embodiment is permissible Avoid the output abnormality that in step 320, the error of judged result may cause, promote warp further The late reverberation filter effect of the audio signal after the method process of the present embodiment.
With further reference to Fig. 4, as to the realization of method shown in above-mentioned each figure, the application provides A kind of embodiment of audio signal solution reverberation unit, this device embodiment with shown in Fig. 2 Embodiment of the method corresponding, this device specifically can apply in various electronic equipment.
As shown in Figure 4, the audio signal solution reverberation unit 400 described in the present embodiment includes obtaining Module 410, judge module 420, first more new module 430 and determine module 440:
Wherein:
Acquisition module 410 is configurable to obtain channel audio signal, channel audio signal Including reverb signal and late reverberation signal in early days.
Judge module 420 is configurable to judge whether channel audio signal is voice signal.
If it is voice signal that the first more new module 430 is configurable to channel audio signal, then Update the variance of the joint probability density distribution of reverb signal in early days, and based on early stage reverb signal Joint probability density distribution variance update sub-filter filter coefficient, wherein, son Band filter is for filtering the late reverberation signal comprised in channel audio signal.
After determining that module 440 is configurable to determine solution reverberation based on the filter coefficient after updating Audio signal.
In some optional implementations, it is judged that module 420 can be configured to further: logical Cross voice activity detection technology and judge whether channel audio signal is voice signal.
In some optional implementations,
The variance of the joint probability density distribution of reverb signal can be the subband filter before updating in early days The product of the transposed matrix of the filter coefficient of ripple device and the input signal vector of sub-filter with Spectral coefficient x at the channel audio signal that t receivest,fDifference absolute value square.
Exemplarily, the variance of the joint probability density distribution of reverb signal in early days can meet:
Wherein, xt,fThe spectral coefficient of the channel audio signal for receiving in t,For more The transposition of the filter coefficient of the sub-filter before new,Defeated for described sub-filter Enter signal vector;
Filter coefficient g after renewalf(t+1) for the filter coefficient before updating and renewal variable quantity Sum.Wherein, update variable quantity be the first renewal running parameter and second update running parameter it Ratio.First updates the reality that running parameter is the iteration step length of sub-filter, sub-filter Error between output and desired output and the product of the input signal vector of sub-filter. Second transposition updating the input signal vector that running parameter is sub-filter and sub-filter The product of input signal vector.Mistake between actual output and the desired output of sub-filter The poor spectral coefficient equal to the channel audio signal received in t and the early stage reverberation of t The ratio of the variance of the joint probability density distribution of signal deducts the filtering of the sub-filter before renewal The transposition of device coefficient and the product of the input signal vector of sub-filter.
Exemplarily, the filter coefficient g after renewalf(t+1) can meet:
g f ( t + 1 ) = g f ( t ) + μ × e ( t ) × x ^ → t - D , f x ^ → T t - D , f x ^ → t - D , f ;
Wherein, μ is the iteration step length of sub-filter, and e (t) is the actual output of sub-filter And the error e (t) between desired output meets:
e ( t ) = x ^ t , f - g f T × x ^ → t - D , f .
In some optional implementations, solve audio signal d after reverberationt,fEqual in t The spectral coefficient of the channel audio signal received deducts the filtering of the sub-filter in t+1 moment The product of the input signal vector of the transposition of device coefficient and described sub-filter.
Exemplarily, audio signal d after reverberation is solvedt,fCan meet:
d t , f = x t , f - g f T ( t + 1 ) × x → t - D , f .
In some optional implementations, the audio signal solution reverberation unit of the present embodiment also may be used To farther include the second more new module (not shown).
If it is not voice signal that the second more new module is configurable to channel audio signal, then will Subband filter before the variance of the joint probability density distribution of the early stage reverb signal before renewal and renewal The filter coefficient of ripple device as update after early stage reverb signal joint probability density distribution The filter coefficient of the sub-filter after variance and renewal.
In some optional implementations, the audio signal solution reverberation unit of the present embodiment also may be used To farther include zero setting module (not shown).
Zero setting module is configurable to judge between actual output and the desired output of sub-filter Error whether meet Error Absolute Value square more than the single channel audio that receives in t The absolute value of the spectral coefficient of signal square with the product of predetermined threshold value K.In other words, it is judged that Whether the error e (t) between actual output and the desired output of sub-filter meets | e (t) |2> K ×|xt,f|2;And the most then filter coefficient is set to null vector;Wherein, K is for set in advance The threshold value put, and meet K > 1.
It will be understood by those skilled in the art that above-mentioned audio signal solution reverberation unit 400 is all right Including some other known features, such as processor, memorizer etc., in order to unnecessarily obscure Embodiment of the disclosure, structure known to these is the most not shown.
Below with reference to Fig. 5, it illustrates the terminal unit be suitable to for realizing the embodiment of the present application Or the structural representation of the computer system 500 of server.
As it is shown in figure 5, computer system 500 includes CPU (CPU) 501, its Can be according to the program being stored in read only memory (ROM) 502 or from storage part 508 It is loaded into the program in random access storage device (RAM) 503 and performs various suitable action And process.In RAM 503, also storage has system 500 to operate required various program sums According to.CPU 501, ROM 502 and RAM 503 are connected with each other by bus 504.Input / output (I/O) interface 505 is also connected to bus 504.
It is connected to I/O interface 505: include the importation 506 of keyboard, mouse etc. with lower component; Including such as cathode ray tube (CRT), liquid crystal display (LCD) etc. and speaker etc. Output part 507;Storage part 508 including hard disk etc.;And include such as LAN card, The communications portion 509 of the NIC of modem etc..Communications portion 509 is via such as The network of the Internet performs communication process.Driver 510 is connected to I/O interface also according to needs 505.Detachable media 511, such as disk, CD, magneto-optic disk, semiconductor memory etc., Be arranged on as required in driver 510, in order to the computer program read from it according to Needs are mounted into storage part 508.
Especially, according to embodiment of the disclosure, the process described above with reference to flow chart is permissible It is implemented as computer software programs.Such as, embodiment of the disclosure and include a kind of computer journey Sequence product, it includes the computer program being tangibly embodied on machine readable media, described meter Calculation machine program comprises the program code for performing the method shown in flow chart.In such enforcement In example, this computer program can be downloaded and installed from network by communications portion 509, And/or be mounted from detachable media 511.
Flow chart in accompanying drawing and block diagram, it is illustrated that according to the various embodiment of the application system, Architectural framework in the cards, function and the operation of method and computer program product.This point On, each square frame in flow chart or block diagram can represent a module, program segment or code A part, a part for described module, program segment or code comprise one or more for Realize the executable instruction of the logic function of regulation.It should also be noted that at some as replacement In realization, the function marked in square frame can also be sent out to be different from the order marked in accompanying drawing Raw.Such as, two square frames succeedingly represented can essentially perform substantially in parallel, they Sometimes can also perform in the opposite order, this is depending on involved function.It is also noted that It is, the square frame in each square frame in block diagram and/or flow chart and block diagram and/or flow chart Combination, can realize by the special hardware based system of the function or operation that perform regulation, Or can realize with the combination of specialized hardware with computer instruction.
Being described in the embodiment of the present application involved module can be real by the way of software Existing, it is also possible to realize by the way of hardware.Described module can also be arranged on process In device, for example, it is possible to be described as: a kind of processor include acquisition module, judge module, the One more new module and determine module.Wherein, the title of these modules the most not structure The restriction of this module itself in pairs, such as, acquisition module is also described as " obtaining single-pass The module of audio channel signal ".
As on the other hand, present invention also provides a kind of nonvolatile computer storage media, This nonvolatile computer storage media can be described in above-described embodiment included in device Nonvolatile computer storage media;Can also be individualism, be unkitted allocate in terminal non- Volatile computer storage medium.Above-mentioned nonvolatile computer storage media storage have one or The multiple program of person, when one or more program is performed by an equipment so that equipment: obtain Taking channel audio signal, channel audio signal includes reverb signal and late reverberation letter in early days Number;Judge whether channel audio signal is voice signal;The most then update the letter of reverberation in early days Number the variance of joint probability density distribution, and joint probability density based on early stage reverb signal The variance of distribution updates the filter coefficient of sub-filter, and wherein, sub-filter is used for filtering Except the late reverberation signal comprised in channel audio signal;And based on the wave filter after updating Coefficient determines the channel audio signal solving reverberation.
Above description is only the preferred embodiment of the application and saying institute's application technology principle Bright.It will be appreciated by those skilled in the art that invention scope involved in the application, do not limit In the technical scheme of the particular combination of above-mentioned technical characteristic, also should contain simultaneously without departing from In the case of described inventive concept, above-mentioned technical characteristic or its equivalent feature carry out combination in any And other technical scheme formed.Such as features described above and (but not limited to) disclosed herein The technical characteristic with similar functions is replaced mutually and the technical scheme that formed.

Claims (12)

1. an audio signal solution reverberation method, it is characterised in that including:
Obtain channel audio signal, described channel audio signal include in early days reverb signal and Late reverberation signal;
Judge whether described channel audio signal is voice signal;
The most then update the variance of the joint probability density distribution of described early stage reverb signal, and The variance of joint probability density based on described early stage reverb signal distribution updates sub-filter Filter coefficient, wherein, described sub-filter is used for filtering in described channel audio signal The late reverberation signal comprised;And
The channel audio signal solving reverberation is determined based on the described filter coefficient after updating.
Method the most according to claim 1, it is characterised in that the described single-pass of described judgement Whether audio channel signal is that voice signal includes:
Judge whether described channel audio signal is that voice is believed by voice activity detection technology Number.
Method the most according to claim 1, it is characterised in that:
The variance of the joint probability density distribution of described early stage reverb signal is the described son before updating The transposed matrix of the filter coefficient of band filter and the input signal vector of described sub-filter The spectral coefficient x of product and the channel audio signal received in tt,fThe absolute value of difference Square;
Filter coefficient g after renewalf(t+1) for the filter coefficient before updating and renewal variable quantity Sum;
Wherein, described renewal variable quantity is that the first renewal running parameter and second updates running parameter Ratio;
Described first renewal running parameter is the iteration step length of described sub-filter, described subband Error between actual output and the desired output of wave filter and described sub-filter defeated Enter the product of signal vector;
Described second transposition updating the input signal vector that running parameter is described sub-filter Product with the input signal vector of described sub-filter;
Error between actual output and the desired output of described sub-filter is equal in t The spectral coefficient of the channel audio signal received is general with combining of the early stage reverb signal of t The ratio of the variance of rate Density Distribution deducts the filter coefficient of the described sub-filter before renewal The product of the input signal vector of transposition and described sub-filter.
Method the most according to claim 1, it is characterised in that the sound after described solution reverberation Frequently signal dt,fEqual to when the spectral coefficient of the channel audio signal that t receives deducts t+1 The transposition of filter coefficient and the input signal of described sub-filter of the sub-filter carved to The product of amount.
5. according to the method described in claim 1-4 any one, it is characterised in that described After judging whether described channel audio signal is voice signal, described method also includes:
If it is not, the side that the joint probability density of the described early stage reverb signal before then updating is distributed The filter coefficient of the sub-filter before difference and renewal is believed as the early stage reverberation after described renewal Number the variance of joint probability density distribution and the filter coefficient of sub-filter after updating.
6. according to the method described in claim 1-4 any one, it is characterised in that described side Method also includes:
Judge whether the error between actual output and the desired output of described sub-filter meets Error Absolute Value square more than the spectral coefficient of the channel audio signal that receives in t Absolute value square with the product of predetermined threshold value K;
The most described filter coefficient is set to null vector;
Wherein, K > 1.
7. an audio signal solution reverberation unit, it is characterised in that including:
Acquisition module, is configured to obtain channel audio signal, described channel audio signal Including reverb signal and late reverberation signal in early days;
Judge module, is configured to judge whether described channel audio signal is voice signal;
First more new module, if being configured to described channel audio signal is voice signal, then Update the variance of the joint probability density distribution of described early stage reverb signal, and based on described early stage The variance of the joint probability density distribution of reverb signal updates the filter coefficient of sub-filter, Wherein, described sub-filter mixes for filtering the late period comprised in described channel audio signal Ring signal;And
Determine module, be configured to determine solution reverberation based on the described filter coefficient after updating Channel audio signal.
Device the most according to claim 7, it is characterised in that described judge module enters one Step is configured to:
Judge whether described channel audio signal is that voice is believed by voice activity detection technology Number.
Device the most according to claim 7, it is characterised in that:
The variance of the joint probability density distribution of described early stage reverb signal is:
The transposed matrix of the filter coefficient of the described sub-filter before renewal is filtered with described subband The spectrum of the product of the input signal vector of ripple device and the channel audio signal received in t Coefficient xt,fDifference absolute value square;
Filter coefficient g after renewalf(t+1) for the filter coefficient before updating and renewal variable quantity Sum;
Wherein, described renewal variable quantity is that the first renewal running parameter and second updates running parameter Ratio;
Described first renewal running parameter is the iteration step length of described sub-filter, described subband Error between actual output and the desired output of wave filter and described sub-filter defeated Enter the product of signal vector;
Described second transposition updating the input signal vector that running parameter is described sub-filter Product with the input signal vector of described sub-filter;
Error between actual output and the desired output of described sub-filter is equal in t The spectral coefficient of the channel audio signal received is general with combining of the early stage reverb signal of t The ratio of the variance of rate Density Distribution deducts the filter coefficient of the described sub-filter before renewal The product of the input signal vector of transposition and described sub-filter.
Device the most according to claim 7, it is characterised in that after described solution reverberation Audio signal dt,fT+1 is deducted equal to the spectral coefficient of the channel audio signal received in t The transposition of the filter coefficient of the sub-filter in moment and the input signal of described sub-filter The product of vector.
11. according to the device described in claim 7-10 any one, it is characterised in that described Device also includes the second more new module;
If it is not voice letter that the described second more new module is configured to described channel audio signal Number, then the variance and more of the joint probability density distribution of the described early stage reverb signal before updating The filter coefficient of the sub-filter before new is as the connection of the early stage reverb signal after described renewal The filter coefficient of the sub-filter after the variance of conjunction probability density distribution and renewal.
12. according to the device described in claim 7-10 any one, it is characterised in that described Device also includes zero setting module;
It is defeated with expectation that described zero setting module is configured to the actual output judging described sub-filter Error between going out whether meet Error Absolute Value square more than the single-pass that receives in t The absolute value of the spectral coefficient of audio channel signal square with the product of predetermined threshold value K;And
The most described filter coefficient is set to null vector;
Wherein, K > 1.
CN201610474006.1A 2016-06-24 2016-06-24 Audio signal solution reverberation method and device Active CN105931648B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610474006.1A CN105931648B (en) 2016-06-24 2016-06-24 Audio signal solution reverberation method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610474006.1A CN105931648B (en) 2016-06-24 2016-06-24 Audio signal solution reverberation method and device

Publications (2)

Publication Number Publication Date
CN105931648A true CN105931648A (en) 2016-09-07
CN105931648B CN105931648B (en) 2019-05-03

Family

ID=56829221

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610474006.1A Active CN105931648B (en) 2016-06-24 2016-06-24 Audio signal solution reverberation method and device

Country Status (1)

Country Link
CN (1) CN105931648B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107328410A (en) * 2017-06-30 2017-11-07 百度在线网络技术(北京)有限公司 Method and automobile computer for positioning automatic driving vehicle
CN107328411A (en) * 2017-06-30 2017-11-07 百度在线网络技术(北京)有限公司 Vehicle positioning system and automatic driving vehicle
CN111489760A (en) * 2020-04-01 2020-08-04 腾讯科技(深圳)有限公司 Speech signal dereverberation processing method, speech signal dereverberation processing device, computer equipment and storage medium
CN113223543A (en) * 2021-06-10 2021-08-06 北京小米移动软件有限公司 Speech enhancement method, apparatus and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009535674A (en) * 2006-05-01 2009-10-01 日本電信電話株式会社 Method and apparatus for speech dereverberation based on stochastic model of sound source and room acoustics
CN103033815A (en) * 2012-12-19 2013-04-10 中国科学院声学研究所 Detection Method and detection device of distance expansion target based on reverberation covariance matrix
US20130297298A1 (en) * 2012-05-04 2013-11-07 Sony Computer Entertainment Inc. Source separation using independent component analysis with mixed multi-variate probability density function
CN104995676A (en) * 2013-02-14 2015-10-21 杜比实验室特许公司 Signal decorrelation in an audio processing system
US20160005413A1 (en) * 2013-02-14 2016-01-07 Dolby Laboratories Licensing Corporation Audio Signal Enhancement Using Estimated Spatial Parameters
WO2016014254A1 (en) * 2014-07-23 2016-01-28 Pcms Holdings, Inc. System and method for determining audio context in augmented-reality applications

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009535674A (en) * 2006-05-01 2009-10-01 日本電信電話株式会社 Method and apparatus for speech dereverberation based on stochastic model of sound source and room acoustics
US20130297298A1 (en) * 2012-05-04 2013-11-07 Sony Computer Entertainment Inc. Source separation using independent component analysis with mixed multi-variate probability density function
CN103033815A (en) * 2012-12-19 2013-04-10 中国科学院声学研究所 Detection Method and detection device of distance expansion target based on reverberation covariance matrix
CN104995676A (en) * 2013-02-14 2015-10-21 杜比实验室特许公司 Signal decorrelation in an audio processing system
US20160005413A1 (en) * 2013-02-14 2016-01-07 Dolby Laboratories Licensing Corporation Audio Signal Enhancement Using Estimated Spatial Parameters
WO2016014254A1 (en) * 2014-07-23 2016-01-28 Pcms Holdings, Inc. System and method for determining audio context in augmented-reality applications

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107328410A (en) * 2017-06-30 2017-11-07 百度在线网络技术(北京)有限公司 Method and automobile computer for positioning automatic driving vehicle
CN107328411A (en) * 2017-06-30 2017-11-07 百度在线网络技术(北京)有限公司 Vehicle positioning system and automatic driving vehicle
CN111489760A (en) * 2020-04-01 2020-08-04 腾讯科技(深圳)有限公司 Speech signal dereverberation processing method, speech signal dereverberation processing device, computer equipment and storage medium
CN111489760B (en) * 2020-04-01 2023-05-16 腾讯科技(深圳)有限公司 Speech signal dereverberation processing method, device, computer equipment and storage medium
CN113223543A (en) * 2021-06-10 2021-08-06 北京小米移动软件有限公司 Speech enhancement method, apparatus and storage medium

Also Published As

Publication number Publication date
CN105931648B (en) 2019-05-03

Similar Documents

Publication Publication Date Title
CN107680586B (en) Far-field speech acoustic model training method and system
CN109155006A (en) The audio analysis based on frequency is carried out using neural network
CN105931648A (en) Audio signal de-reverberation method and device
CN102870156B (en) Audio communication device, method for outputting an audio signal, and communication system
CN108108902A (en) A kind of risk case alarm method and device
CN110428842A (en) Speech model training method, device, equipment and computer readable storage medium
CN106409310A (en) Audio signal classification method and device
CN103220440A (en) Noise suppressing device, noise suppressing method, and program
CN108877783A (en) The method and apparatus for determining the audio types of audio data
CN107277178A (en) Method and apparatus for pushed information
CN109242043A (en) Method and apparatus for generating information prediction model
CN114492831A (en) Method and device for generating federal learning model
CN107451869A (en) Method and apparatus for pushed information
CN108446658A (en) The method and apparatus of facial image for identification
CN109584895B (en) Voice de-noising method and device
CN113793620B (en) Voice noise reduction method, device and equipment based on scene classification and storage medium
CN106169961A (en) The network parameter processing method and processing device of neutral net based on artificial intelligence
CN109087124A (en) A kind of application program Value Prediction Methods and device
CN108062423A (en) Information-pushing method and device
CN109165570A (en) Method and apparatus for generating information
CN106340310B (en) Speech detection method and device
CN116485527A (en) Financial credit risk analysis method and device
Catellier et al. Wenets: A convolutional framework for evaluating audio waveforms
CN106341564A (en) Method and device for processing signal data and intelligent conference equipment
CN106488554A (en) A kind of fingerprint database method for building up and system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant