CN105931648B - Audio signal solution reverberation method and device - Google Patents

Audio signal solution reverberation method and device Download PDF

Info

Publication number
CN105931648B
CN105931648B CN201610474006.1A CN201610474006A CN105931648B CN 105931648 B CN105931648 B CN 105931648B CN 201610474006 A CN201610474006 A CN 201610474006A CN 105931648 B CN105931648 B CN 105931648B
Authority
CN
China
Prior art keywords
filter
signal
sub
audio signal
channel audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610474006.1A
Other languages
Chinese (zh)
Other versions
CN105931648A (en
Inventor
崔玮玮
宋辉
徐杨飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201610474006.1A priority Critical patent/CN105931648B/en
Publication of CN105931648A publication Critical patent/CN105931648A/en
Application granted granted Critical
Publication of CN105931648B publication Critical patent/CN105931648B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02082Noise filtering the noise being echo, reverberation of the speech

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Quality & Reliability (AREA)
  • Stereophonic System (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)

Abstract

This application discloses a kind of audio signal solution reverberation method and devices.The method includes: acquisition channel audio signal, and channel audio signal includes early stage reverb signal and late reverberation signal;Judge whether channel audio signal is voice signal;If, then update the variance of the joint probability density distribution of early stage reverb signal, and the variance based on the distribution of the joint probability density of early stage reverb signal updates the filter coefficient of sub-filter, wherein sub-filter is for filtering out the late reverberation signal for including in channel audio signal;And the channel audio signal of solution reverberation is determined based on updated filter coefficient.The scheme of the application, can be by the late reverberation target signal filter in the audio signal of input, to improve the accuracy rate of subsequent speech recognition.

Description

Audio signal solution reverberation method and device
Technical field
This application involves field of computer technology, and in particular to Audio Signal Processing field more particularly to audio signal solution Reverberation method and device.
Background technique
During audio especially Speech processing, if the acquisition device for acquiring audio signal is (for example, wheat Gram wind) position apart from sound source farther out, the audio signal that acquisition device receives will be inevitably by the influence of reverberation.It is mixed Loud presence can not only reduce the sense of hearing quality of audio signal, but also under will lead to the precision of existing voice identifying system sharply Drop.
Reverberation can be decomposed into early stage reverberation and late reverberation, wherein on audio quality and identifying system precision influence compared with Big is late reverberation, therefore the main target for solving reverberation is how to reduce late reverberation.
In the prior art, more existing for filtering out the late reverberation in the collected voice signal of acquisition device Algorithm.However, these algorithms usually all have the following problems:
1) when solving the filter coefficient of the filter for filtering out late reverberation, need to obtain whole audio numbers According to, it is lower so as to cause the real-time rate of algorithm, and then the delay that will lead to solution reverberation algorithm is relatively high.And in voice communication and language It is higher to the requirement of real-time of solution reverberation algorithm in sound identification field.
2), when solving the filter coefficient of the filter for filtering out late reverberation, it will usually be related to matrix inversion It calculates.Once and matrix is irreversible during matrix inversion, then the filter coefficient acquired just inaccuracy, and then it is mixed to influence solution Loud performance.In addition, the operand of matrix inversion is larger, the real-time for also resulting in understanding reverberation algorithm from another point of view is poor.
Summary of the invention
The purpose of the application is to propose a kind of improved audio signal solution reverberation method and device, to solve background above The technical issues of technology segment is mentioned.
In a first aspect, this application provides a kind of audio signal solution reverberation methods, comprising: channel audio signal is obtained, Channel audio signal includes early stage reverb signal and late reverberation signal;Judge whether channel audio signal is voice letter Number;If so, updating the variance of the joint probability density distribution of early stage reverb signal, and the joint based on early stage reverb signal is general The variance of rate Density Distribution updates the filter coefficient of sub-filter, wherein sub-filter is for filtering out single channel audio The late reverberation signal for including in signal;And the audio signal after solution reverberation is determined based on updated filter coefficient.
In some embodiments, judge that channel audio signal whether be voice signal includes: to pass through voice activity detection Technology judges whether channel audio signal is voice signal.
In some embodiments, the variance of the joint probability density distribution of early stage reverb signal is the sub-band filter before updating The product of the input signal vector of the transposed matrix and sub-filter of the filter coefficient of device and the single-pass received in t moment The spectral coefficient x of audio channel signalt,fAbsolute value of the difference square;Updated filter coefficient gfIt (t+1) is the filter before update The sum of wave device coefficient and update variable quantity;Wherein, updating variable quantity is that the first update running parameter and second update running parameter The ratio between;First update running parameter be the iteration step length of sub-filter, the reality output of sub-filter and desired output it Between error and sub-filter input signal vector product;Second updates running parameter for the defeated of sub-filter Enter the product of the transposition of signal vector and the input signal vector of sub-filter;The reality output of sub-filter and expectation are defeated Error between out is equal to the spectral coefficient of the channel audio signal received in t moment and the early stage reverb signal of t moment The ratio between the variance of joint probability density distribution subtracts the transposition and sub-band filter of the filter coefficient of the sub-filter before updating The product of the input signal vector of device.
In some embodiments, the audio signal d after reverberation is solvedt,fEqual to the channel audio signal received in t moment Spectral coefficient subtract t+1 moment sub-filter filter coefficient transposition and sub-filter input signal vector Product.In some embodiments, judging whether channel audio signal is method after voice signal further include: if it is not, then The filter system of sub-filter before variance and update that the joint probability density of early stage reverb signal before update is distributed The filtering of variance and updated sub-filter that number is distributed as the joint probability density of updated early stage reverb signal Device coefficient.
In some embodiments, method further include: judge the mistake between the reality output of sub-filter and desired output Whether difference meets the absolute value of the spectral coefficient for square being greater than the channel audio signal received in t moment of Error Absolute Value Product square with preset threshold K;If so, filter coefficient is set to null vector;Wherein, K > 1.
Second aspect, this application provides a kind of audio signal solution reverberation units, comprising: obtains module, is configured to obtain Channel audio signal is taken, channel audio signal includes early stage reverb signal and late reverberation signal;Judgment module, configuration are used In judging whether channel audio signal is voice signal;First update module, if being configured to channel audio signal is language Sound signal then updates the variance of the joint probability density distribution of early stage reverb signal, and the joint based on early stage reverb signal is general The variance of rate Density Distribution updates the filter coefficient of sub-filter, wherein sub-filter is for filtering out single channel audio The late reverberation signal for including in signal;And determining module, it is configured to determine that solution is mixed based on updated filter coefficient Audio signal after sound.
In some embodiments, judgment module is further configured to: judging single channel by voice activity detection technology Whether audio signal is voice signal.
In some embodiments, the variance of the joint probability density distribution of early stage reverb signal are as follows: the subband filter before update The product of the input signal vector of the transposed matrix and sub-filter of the filter coefficient of wave device and the list received in t moment The spectral coefficient x of channel audio signalt,fAbsolute value of the difference square;Updated filter coefficient gf(t+1) before to update The sum of filter coefficient and update variable quantity;Wherein, updating variable quantity is the first update running parameter and the second more new change ginseng The ratio between number;First updates the reality output and desired output that running parameter is the iteration step length of sub-filter, sub-filter Between error and sub-filter input signal vector product;Second updates running parameter for sub-filter The product of the input signal vector of the transposition and sub-filter of input signal vector;The reality output and expectation of sub-filter Error between output is equal to the spectral coefficient of the channel audio signal received in t moment and the early stage reverb signal of t moment Joint probability density distribution the ratio between variance subtract the sub-filter before updating filter coefficient transposition and subband filter The product of the input signal vector of wave device.
In some embodiments, the audio signal d after reverberation is solvedt,fEqual to the channel audio signal received in t moment Spectral coefficient subtract t+1 moment sub-filter filter coefficient transposition and sub-filter input signal vector Product.
In some embodiments, device further includes the second update module;If the second update module is configured to single channel sound Frequency signal is not voice signal, then before the variance and update that are distributed the joint probability density of the early stage reverb signal before update The variance and update that the filter coefficient of sub-filter is distributed as the joint probability density of updated early stage reverb signal The filter coefficient of sub-filter afterwards.
In some embodiments, device further includes zero setting module;Zero setting module is configured to judge the reality of sub-filter What whether the error between border output and desired output met Error Absolute Value square is greater than the single channel sound that receives in t moment The product square with preset threshold K of the absolute value of the spectral coefficient of frequency signal;And if so, by filter coefficient be set to zero to Amount;Wherein, K > 1.
Audio signal solution reverberation method and device provided by the present application, are continuously updated by the voice signal based on input The variance of the joint probability density distribution of early stage reverb signal and the parameter of sub-filter, so that sub-filter filters energy Enough by the late reverberation target signal filter in the channel audio signal of input, to improve the accuracy rate of subsequent speech recognition.
In addition, the audio signal solution reverberation method and device of the application, the joint probability density distribution of early stage reverb signal Variance update and sub-filter parameter update needed for calculation amount it is smaller, and renewal process only with the list in a period of time Channel audio signal is related, real-time with higher.
Detailed description of the invention
By reading a detailed description of non-restrictive embodiments in the light of the attached drawings below, the application's is other Feature, objects and advantages will become more apparent upon:
Fig. 1 is that this application can be applied to exemplary system architecture figures therein;
Fig. 2 is the flow chart according to one embodiment of the audio signal solution reverberation method of the application;
Fig. 3 is the flow chart according to another embodiment of the audio signal solution reverberation method of the application;
Fig. 4 is the structural schematic diagram according to one embodiment of the audio signal solution reverberation unit of the application;
Fig. 5 is adapted for the structural representation of the computer system for the terminal device or server of realizing the embodiment of the present application Figure.
Specific embodiment
The application is described in further detail with reference to the accompanying drawings and examples.It is understood that this place is retouched The specific embodiment stated is used only for explaining related invention, rather than the restriction to the invention.It also should be noted that in order to Convenient for description, part relevant to related invention is illustrated only in attached drawing.
It should be noted that in the absence of conflict, the features in the embodiments and the embodiments of the present application can phase Mutually combination.The application is described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
Fig. 1 is shown can be using the implementation of the audio signal solution reverberation method or audio signal solution reverberation unit of the application The exemplary system architecture 100 of example.
As shown in Figure 1, system architecture 100 may include terminal device 101,102,103, network 104 and server 105. Network 104 between terminal device 101,102,103 and server 105 to provide the medium of communication link.Network 104 can be with Including various connection types, such as wired, wireless communication link or fiber optic cables etc..
User can be used terminal device 101,102,103 and be interacted by network 104 with server 105, to receive or send out Send message etc..Various telecommunication customer end applications can be installed, such as web browser is answered on terminal device 101,102,103 With, shopping class application, searching class application, instant messaging tools, mailbox client, social platform software etc..
Terminal device 101,102,103 can be with acquisition audio signal ability various electronic equipments, including but It is not limited to smart phone, tablet computer, MP3 player (Moving Picture Experts Group Audio Layer III, dynamic image expert's compression standard audio level 3), MP4 (Moving Picture Experts Group Audio Layer IV, dynamic image expert's compression standard audio level 4) player, pocket computer on knee and desktop computer etc. Deng.
Server 105 can be to provide the server of various services, such as to the sound that terminal device 101,102,103 acquires The audio processing service device that frequency signal is handled.Audio processing service device can carry out the data such as the audio signal received The processing such as analysis, and processing result (such as audio data through reverberation removal processing) is fed back into terminal device.
It should be noted that audio signal solution reverberation method both can be by terminal device provided by the embodiment of the present application 101, it 102,103 executes, can also be executed, can be held by terminal device 101,102,103 with a part of step by server 105 Row and another part step is executed by server 105.Correspondingly, audio signal solution reverberation unit both can be set in terminal device 101, in 102,103, it also can be set in server 105 or a part of module be set to terminal device 101,102,103 In and another part module is set in server.
It should be understood that the number of terminal device, network and server in Fig. 1 is only schematical.According to realization need It wants, can have any number of terminal device, network and server.
With continued reference to Fig. 2, the process of one embodiment of the audio signal solution reverberation method according to the application is shown 200.The audio signal solution reverberation method, comprising the following steps:
Step 210, channel audio signal is obtained, wherein channel audio signal includes early stage reverb signal and advanced stage Reverb signal.
In the present embodiment, electronic equipment (such as the service shown in FIG. 1 of audio signal solution reverberation method operation thereon Device) it can be received from user using its terminal for carrying out audio signal sample by wired connection mode or radio connection Channel audio signal.It should be pointed out that above-mentioned radio connection can include but is not limited to 3G/4G connection, WiFi connects Connect, bluetooth connection, WiMAX connection, Zigbee connection, UWB (ultra wideband) connection and other it is currently known or will Come the radio connection developed.
In general, when, there are when a certain distance, the equipment for acquiring sound is adopted between the equipment and sound source of acquisition sound The audio signal collected is by the influence by reverberation.Reverberation usually (namely is directly conveyed to acquisition sound from sound source according to direct sound wave The audio signal of the equipment of sound) and be transferred to acquisition sound equipment reverberation between time difference be early stage reverberation and advanced stage Reverberation.For example, the reverberation of the equipment of acquisition sound can will be reached in 30ms after direct sound wave (millisecond) as early stage reverberation, and incite somebody to action The reverberation for the equipment for acquiring sound is reached as late reverberation more than 30ms.Early stage reverberation is to voice amplitude, phase delay, resonance Peak influence is smaller, and late reverberation then can be bigger on voice amplitude, phase delay, formant influence, and will lead to language The phase inter-masking of syllable, these all reduce speech intelligibility, and very big difficulty is brought to speech recognition.
Step 220, judge whether channel audio signal is voice signal.
The audio signal solution reverberation method of the present embodiment, it is intended to solution reverberation processing is carried out to collected voice signal, into And enables that treated voice signal more realistically reflects the practical voice said of user.And then promote subsequent speech recognition The accuracy rate of equal signal processings.
In addition, therefore voice signal and other audio signals are obtained in the presence of more significant difference by judging in step Whether the channel audio signal got is voice signal, can be carried out in subsequent processing step only for voice signal The operations such as corresponding filtering, to improve the solution reverberation treatment effeciency and real-time of voice signal.
In some optional implementations, for example, VAD (Voice Activity Detection, voice can be passed through Activity detection) technology carries out the identification of voice signal, so that the channel audio signal that judgement is got in step 210 is No is voice signal.
Step 230, if so, updating the variance of the joint probability density distribution of early stage reverb signal, and it is mixed based on early stage The variance for ringing the joint probability density distribution of signal updates the filter coefficient of sub-filter.
The variance of the joint probability density distribution of early stage reverb signal can be the filter of the sub-filter before updating The product of the input signal vector of the transposed matrix and sub-filter of coefficient and the channel audio signal received in t moment Spectral coefficient xt,fAbsolute value of the difference square.
Illustratively, the variance of the joint probability density distribution of early stage reverb signal can satisfy:
Wherein, xt,fSpectral coefficient for the channel audio signal received in t moment,For the sub-band filter before update The transposition of the filter coefficient of device,For the input signal vector of sub-filter.
Updated filter coefficient gf(t+1) for the filter coefficient before updating and the sum of variable quantity is updated.Wherein, more New change amount is that the first update running parameter and second update the ratio between running parameter.First updates running parameter for sub-filter Iteration step length, sub-filter reality output and desired output between error and sub-filter input signal The product of vector.Second updates the input of the transposition and sub-filter of the input signal vector that running parameter is sub-filter The product of signal vector.Error between the reality output and desired output of sub-filter is equal to the list received in t moment The ratio between the variance of the joint probability density distribution of the early stage reverb signal of the spectral coefficient and t moment of channel audio signal subtracts update The product of the input signal vector of the transposition and sub-filter of the filter coefficient of preceding sub-filter.
Illustratively, updated filter coefficient gf(t+1) it can satisfy:
Wherein, μ is the iteration step length of sub-filter, and e (t) is between the reality output and desired output of sub-filter Error e (t) meet:
Herein, the initial value g of the filter coefficient of sub-filterf(0) it can be single order number is N complete null vector.
Wherein, N is the number of taps of sub-filter, and is met:
N=L-D+1.
L is the frame time length ratio of reverberation time and one frame of channel audio signal, and D is the reverberation time of early stage reverberation With the frame time length ratio of one frame of channel audio signal.And the reverberation time can for example be defined as indoor sound and reach stable State, remnant voice is absorbed through sound-absorbing material repeatedly in the room after sound source stops sounding, and average sound pressure level is decayed needed for 60dB Time.
In addition, the input signal vector of sub-filterIt can have the following form of expression:
In other words,Can be understood as sub-filter from the t-D moment to t-L+1 reception to input signal The input signal vector of formation.
In addition, in formula (3):
Correspondingly, in formula (2) and formula (3):
By above-mentioned formula (1)~(3) as can be seen that in the side of the joint probability density distribution to early stage reverb signal When difference is updated, the filter coefficient used is filter coefficient (the i.e. g of t momentfOr the g in formula (2)f(t)), exist It completes the update of the variance of the joint probability density distribution of early stage reverb signal and then the coefficient of filter is updated (the g i.e. in formula (2)f(t+1))。
Step 240, the audio signal after solution reverberation is determined based on updated filter coefficient.
Using formula as described above (1)~(3) to the variance of the joint probability density of early stage reverb signal distribution and Audio signal d after the filter coefficient of sub-filter is updated, after solving reverberationt,fEqual to the single-pass received in t moment The spectral coefficient of audio channel signal subtracts the transposition and the sub-filter of the filter coefficient of the sub-filter at t+1 moment The product of input signal vector.
Illustratively, the audio signal d after reverberation is solvedT, fIt can satisfy:
In addition, the audio signal solution reverberation method of the present embodiment can also include such as in some optional implementations Under step:
Judge whether the error between the reality output of sub-filter and desired output meets square of Error Absolute Value Greater than the product square with preset threshold K of the absolute value of the spectral coefficient of the channel audio signal received in t moment.Namely It is to say, judges whether the error e (t) between the reality output of sub-filter and desired output meets | e (t) |2> K × | xt,f |2
If so, filter coefficient is set to null vector.
Wherein, K is pre-set threshold value, and meets K > 1.
In addition, working as | e (t) |2> K × | xt,f|2When, it is believed that sub-filter diverging believes early stage reverberation in next step Number joint probability density distribution variance when being updated, the g of formula (1)fFor null vector.
Using the audio signal solution reverberation method of the present embodiment, calculation amount needed for solving reverberation process is smaller, and operation Journey only need t-D~t-L+1 moment between the channel audio signal that inputs and the channel audio signal of present frame input, So that the real-time of the solution reverberation process of the present embodiment is stronger.
With further reference to Fig. 3, it illustrates the processes 300 of another embodiment of audio signal solution reverberation method.The sound The process 300 of frequency signal solution reverberation method.
Can have in the present embodiment step 310 corresponding with step 210~step 240 in embodiment illustrated in fig. 2~ Step 340.
Unlike embodiment illustrated in fig. 2, the present embodiment be may further comprise:
Step 350, if judging result in step 320 be it is no, the joint of the early stage reverb signal before update is general The connection of the variance of rate Density Distribution and the filter coefficient of the sub-filter before update as updated early stage reverb signal Close the variance of probability density distribution and the filter coefficient of updated sub-filter.
In other words, if judging the channel audio signal got in step 310 in step 320 not is voice letter Number, then the variance of the joint probability density distribution of early stage reverb signal and the filter coefficient of sub-filter are not updated, and The variance of joint probability density distribution based on the early stage reverb signal before update and the filter coefficient pair of sub-filter The channel audio signal got in step 310 carries out solution reverberation processing (step 340).
Compared with embodiment shown in Fig. 2, the audio signal solution reverberation method of the present embodiment can be sentenced to avoid in step 320 The output abnormality that the error of disconnected result may cause further promotes the method through the present embodiment treated the evening of audio signal Phase reverberation filter effect.
With further reference to Fig. 4, as the realization to method shown in above-mentioned each figure, this application provides a kind of audio signal solutions One embodiment of reverberation unit, the Installation practice is corresponding with embodiment of the method shown in Fig. 2, which can specifically answer For in various electronic equipments.
As shown in figure 4, audio signal solution reverberation unit 400 described in the present embodiment includes obtaining module 410, judgment module 420, the first update module 430 and determining module 440:
Wherein:
It obtains module 410 to be configurable to obtain channel audio signal, channel audio signal includes early stage reverberation letter Number and late reverberation signal.
Judgment module 420 is configurable to judge whether channel audio signal is voice signal.
If it is voice signal that the first update module 430, which is configurable to channel audio signal, early stage reverberation letter is updated Number joint probability density distribution variance, and based on the joint probability density of early stage reverb signal distribution variance update subband The filter coefficient of filter, wherein sub-filter is for filtering out the late reverberation signal for including in channel audio signal.
Determining module 440 is configurable to determine the audio signal after solution reverberation based on updated filter coefficient.
In some optional implementations, judgment module 420, which can be further configured, to be used for: by voice activity detection skill Art judges whether channel audio signal is voice signal.
In some optional implementations,
The variance of the joint probability density distribution of early stage reverb signal can be the filter of the sub-filter before updating The product of the input signal vector of the transposed matrix and sub-filter of coefficient and the channel audio signal received in t moment Spectral coefficient xt,fAbsolute value of the difference square.
Illustratively, the variance of the joint probability density distribution of early stage reverb signal can satisfy:
Wherein, xt,fSpectral coefficient for the channel audio signal received in t moment,For the sub-band filter before update The transposition of the filter coefficient of device,For the input signal vector of the sub-filter;
Updated filter coefficient gf(t+1) for the filter coefficient before updating and the sum of variable quantity is updated.Wherein, more New change amount is that the first update running parameter and second update the ratio between running parameter.First updates running parameter for sub-filter Iteration step length, sub-filter reality output and desired output between error and sub-filter input signal The product of vector.Second updates the input of the transposition and sub-filter of the input signal vector that running parameter is sub-filter The product of signal vector.Error between the reality output and desired output of sub-filter is equal to the list received in t moment The ratio between the variance of the joint probability density distribution of the early stage reverb signal of the spectral coefficient and t moment of channel audio signal subtracts update The product of the input signal vector of the transposition and sub-filter of the filter coefficient of preceding sub-filter.
Illustratively, updated filter coefficient gf(t+1) it can satisfy:
Wherein, μ is the iteration step length of sub-filter, and e (t) is between the reality output and desired output of sub-filter Error e (t) meet:
Audio signal d in some optional implementations, after solving reverberationt,fEqual to the single channel received in t moment The spectral coefficient of audio signal subtracts the defeated of the transposition of the filter coefficient of the sub-filter at t+1 moment and the sub-filter Enter the product of signal vector.
Illustratively, the audio signal d after reverberation is solvedt,fIt can satisfy:
In some optional implementations, the audio signal solution reverberation unit of the present embodiment can further include Two update module (not shown)s.
If it is not voice signal that the second update module, which is configurable to channel audio signal, the early stage before update is mixed The filter coefficient of sub-filter before ringing the variance and update of the joint probability density distribution of signal is as updated morning The variance of the joint probability density distribution of phase reverb signal and the filter coefficient of updated sub-filter.
In some optional implementations, the audio signal solution reverberation unit of the present embodiment, which can further include, to be set Zero module (not shown).
Zero setting module is configurable to judge whether the error between the reality output of sub-filter and desired output is full The absolute value of the spectral coefficient for square being greater than the channel audio signal received in t moment of sufficient Error Absolute Value square with it is pre- If the product of threshold k.In other words, judge whether the error e (t) between the reality output of sub-filter and desired output is full Foot | e (t) |2> K × | xt,f|2;And if so, filter coefficient is set to null vector;Wherein, K is pre-set threshold value, And meet K > 1.
It will be understood by those skilled in the art that above-mentioned audio signal solution reverberation unit 400 can also include some other public affairs Know structure, such as processor, memory etc., in order to unnecessarily obscure embodiment of the disclosure, these well known structures are in Fig. 4 In be not shown.
Below with reference to Fig. 5, it illustrates the calculating of the terminal device or server that are suitable for being used to realize the embodiment of the present application The structural schematic diagram of machine system 500.
As shown in figure 5, computer system 500 includes central processing unit (CPU) 501, it can be read-only according to being stored in Program in memory (ROM) 502 or be loaded into the program in random access storage device (RAM) 503 from storage section 508 and Execute various movements appropriate and processing.In RAM 503, also it is stored with system 500 and operates required various programs and data. CPU 501, ROM 502 and RAM 503 are connected with each other by bus 504.Input/output (I/O) interface 505 is also connected to always Line 504.
I/O interface 505 is connected to lower component: the importation 506 including keyboard, mouse etc.;It is penetrated including such as cathode The output par, c 507 of spool (CRT), liquid crystal display (LCD) etc. and loudspeaker etc.;Storage section 508 including hard disk etc.; And the communications portion 509 of the network interface card including LAN card, modem etc..Communications portion 509 via such as because The network of spy's net executes communication process.Driver 510 is also connected to I/O interface 505 as needed.Detachable media 511, such as Disk, CD, magneto-optic disk, semiconductor memory etc. are mounted on as needed on driver 510, in order to read from thereon Computer program be mounted into storage section 508 as needed.
Particularly, in accordance with an embodiment of the present disclosure, it may be implemented as computer above with reference to the process of flow chart description Software program.For example, embodiment of the disclosure includes a kind of computer program product comprising be tangibly embodied in machine readable Computer program on medium, the computer program include the program code for method shown in execution flow chart.At this In the embodiment of sample, which can be downloaded and installed from network by communications portion 509, and/or from removable Medium 511 is unloaded to be mounted.
Flow chart and block diagram in attached drawing are illustrated according to the system of the various embodiments of the application, method and computer journey The architecture, function and operation in the cards of sequence product.In this regard, each box in flowchart or block diagram can generation A part of one module, program segment or code of table, a part of the module, program segment or code include one or more Executable instruction for implementing the specified logical function.It should also be noted that in some implementations as replacements, institute in box The function of mark can also occur in a different order than that indicated in the drawings.For example, two boxes succeedingly indicated are practical On can be basically executed in parallel, they can also be executed in the opposite order sometimes, and this depends on the function involved.Also it wants It is noted that the combination of each box in block diagram and or flow chart and the box in block diagram and or flow chart, Ke Yiyong The dedicated hardware based system of defined functions or operations is executed to realize, or can be referred to specialized hardware and computer The combination of order is realized.
Being described in module involved in the embodiment of the present application can be realized by way of software, can also be by hard The mode of part is realized.Described module also can be set in the processor, for example, can be described as: a kind of processor packet It includes and obtains module, judgment module, the first update module and determining module.Wherein, the title of these modules is under certain conditions simultaneously The restriction to the module itself is not constituted, is also described as " obtaining the mould of channel audio signal for example, obtaining module Block ".
As on the other hand, present invention also provides a kind of nonvolatile computer storage media, the non-volatile calculating Machine storage medium can be nonvolatile computer storage media included in device described in above-described embodiment;It is also possible to Individualism, without the nonvolatile computer storage media in supplying terminal.Above-mentioned nonvolatile computer storage media is deposited One or more program is contained, when one or more program is executed by an equipment, so that equipment: obtaining single channel sound Frequency signal, channel audio signal include early stage reverb signal and late reverberation signal;Judge channel audio signal whether be Voice signal;If so, updating the variance of the joint probability density distribution of early stage reverb signal, and based on early stage reverb signal The variance of joint probability density distribution updates the filter coefficient of sub-filter, wherein sub-filter is for filtering out single-pass The late reverberation signal for including in audio channel signal;And the single channel sound of solution reverberation is determined based on updated filter coefficient Frequency signal.
Above description is only the preferred embodiment of the application and the explanation to institute's application technology principle.Those skilled in the art Member is it should be appreciated that invention scope involved in the application, however it is not limited to technology made of the specific combination of above-mentioned technical characteristic Scheme, while should also cover in the case where not departing from the inventive concept, it is carried out by above-mentioned technical characteristic or its equivalent feature Any combination and the other technical solutions formed.Such as features described above has similar function with (but being not limited to) disclosed herein Can technical characteristic replaced mutually and the technical solution that is formed.

Claims (10)

1. a kind of audio signal solution reverberation method characterized by comprising
Channel audio signal is obtained, the channel audio signal includes early stage reverb signal and late reverberation signal;
Judge whether the channel audio signal is voice signal;
If so, updating the variance of the joint probability density distribution of the early stage reverb signal, and believed based on the early stage reverberation Number joint probability density distribution variance update sub-filter filter coefficient, wherein the sub-filter is used for Filter out the late reverberation signal for including in the channel audio signal;And
The channel audio signal of solution reverberation is determined based on the updated filter coefficient;The judgement single channel sound Whether frequency signal is that voice signal includes:
Judge whether the channel audio signal is voice signal by voice activity detection technology.
2. according to the method described in claim 1, it is characterized by:
The variance of the joint probability density distribution of the early stage reverb signal is the filter of the sub-filter before updating The product of the input signal vector of the transposed matrix of coefficient and the sub-filter and the single channel audio received in t moment The spectral coefficient x of signalt,fAbsolute value of the difference square;
Updated filter coefficient gf(t+1) for the filter coefficient before updating and the sum of variable quantity is updated;
Wherein, the update variable quantity is that the first update running parameter and second update the ratio between running parameter;
Described first update running parameter be the iteration step length of the sub-filter, the sub-filter reality output and The product of the input signal vector of error and the sub-filter between desired output;
Described second updates the transposition and the sub-filter of the input signal vector that running parameter is the sub-filter Input signal vector product;
Error between the reality output and desired output of the sub-filter is equal to the single channel audio received in t moment The ratio between the variance of the joint probability density distribution of the early stage reverb signal of the spectral coefficient and t moment of signal subtracts described before updating The product of the input signal vector of the transposition of the filter coefficient of sub-filter and the sub-filter.
3. the method according to claim 1, wherein the audio signal d after the solution reverberationt,fEqual in t moment The spectral coefficient of the channel audio signal received subtract the transposition of the filter coefficient of the sub-filter at t+1 moment with it is described The product of the input signal vector of sub-filter.
4. method according to claim 1 to 3, which is characterized in that in the judgement single channel audio letter After number whether being voice signal, the method also includes:
If it is not, the subband before the variance and update that are then distributed the joint probability density of the early stage reverb signal before update is filtered After variance and update that the filter coefficient of wave device is distributed as the joint probability density of the updated early stage reverb signal Sub-filter filter coefficient.
5. method according to claim 1 to 3, which is characterized in that the method also includes:
Judge whether the error between the reality output and desired output of the sub-filter meets square of Error Absolute Value Greater than the product square with preset threshold K of the absolute value of the spectral coefficient of the channel audio signal received in t moment;
If so, the filter coefficient is set to null vector;
Wherein, K > 1.
6. a kind of audio signal solution reverberation unit characterized by comprising
Obtain module, be configured to obtain channel audio signal, the channel audio signal include early stage reverb signal and Late reverberation signal;
Judgment module is configured to judge whether the channel audio signal is voice signal;
First update module updates the early stage reverberation letter if being configured to the channel audio signal is voice signal Number joint probability density distribution variance, and based on the joint probability density of the early stage reverb signal distribution variance update The filter coefficient of sub-filter, wherein the sub-filter includes for filtering out in the channel audio signal Late reverberation signal;And
Determining module is configured to determine the channel audio signal of solution reverberation based on the updated filter coefficient;Institute Judgment module is stated further to be configured to:
Judge whether the channel audio signal is voice signal by voice activity detection technology.
7. device according to claim 6, it is characterised in that:
The variance of the joint probability density distribution of the early stage reverb signal are as follows:
The input signal of the transposed matrix of the filter coefficient of the sub-filter before update and the sub-filter to The spectral coefficient x of the product of amount and the channel audio signal received in t momentt,fAbsolute value of the difference square;
Updated filter coefficient gf(t+1) for the filter coefficient before updating and the sum of variable quantity is updated;
Wherein, the update variable quantity is that the first update running parameter and second update the ratio between running parameter;
Described first update running parameter be the iteration step length of the sub-filter, the sub-filter reality output and The product of the input signal vector of error and the sub-filter between desired output;
Described second updates the transposition and the sub-filter of the input signal vector that running parameter is the sub-filter Input signal vector product;
Error between the reality output and desired output of the sub-filter is equal to the single channel audio received in t moment The ratio between the variance of the joint probability density distribution of the early stage reverb signal of the spectral coefficient and t moment of signal subtracts described before updating The product of the input signal vector of the transposition of the filter coefficient of sub-filter and the sub-filter.
8. device according to claim 6, which is characterized in that the audio signal d after the solution reverberationt,fEqual in t moment The spectral coefficient of the channel audio signal received subtract the transposition of the filter coefficient of the sub-filter at t+1 moment with it is described The product of the input signal vector of sub-filter.
9. according to device described in claim 6-8 any one, which is characterized in that described device further includes the second update mould Block;
If it is not voice signal that second update module, which is configured to the channel audio signal, described in front of update Described in the filter coefficient of sub-filter before the variance and update of the joint probability density distribution of early stage reverb signal is used as The variance of the joint probability density distribution of updated early stage reverb signal and the filter coefficient of updated sub-filter.
10. according to device described in claim 6-8 any one, which is characterized in that described device further includes zero setting module;
Whether the zero setting module is configured to judge the error between the reality output and desired output of the sub-filter Meet the absolute value of the spectral coefficient for square being greater than the channel audio signal received in t moment of Error Absolute Value square with The product of preset threshold K;And
If so, the filter coefficient is set to null vector;
Wherein, K > 1.
CN201610474006.1A 2016-06-24 2016-06-24 Audio signal solution reverberation method and device Active CN105931648B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610474006.1A CN105931648B (en) 2016-06-24 2016-06-24 Audio signal solution reverberation method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610474006.1A CN105931648B (en) 2016-06-24 2016-06-24 Audio signal solution reverberation method and device

Publications (2)

Publication Number Publication Date
CN105931648A CN105931648A (en) 2016-09-07
CN105931648B true CN105931648B (en) 2019-05-03

Family

ID=56829221

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610474006.1A Active CN105931648B (en) 2016-06-24 2016-06-24 Audio signal solution reverberation method and device

Country Status (1)

Country Link
CN (1) CN105931648B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107328410B (en) * 2017-06-30 2020-07-28 百度在线网络技术(北京)有限公司 Method for locating an autonomous vehicle and vehicle computer
CN107328411B (en) * 2017-06-30 2020-07-28 百度在线网络技术(北京)有限公司 Vehicle-mounted positioning system and automatic driving vehicle
CN111489760B (en) * 2020-04-01 2023-05-16 腾讯科技(深圳)有限公司 Speech signal dereverberation processing method, device, computer equipment and storage medium
CN113223543B (en) * 2021-06-10 2023-04-28 北京小米移动软件有限公司 Speech enhancement method, device and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009535674A (en) * 2006-05-01 2009-10-01 日本電信電話株式会社 Method and apparatus for speech dereverberation based on stochastic model of sound source and room acoustics
CN103033815A (en) * 2012-12-19 2013-04-10 中国科学院声学研究所 Detection Method and detection device of distance expansion target based on reverberation covariance matrix
CN104995676A (en) * 2013-02-14 2015-10-21 杜比实验室特许公司 Signal decorrelation in an audio processing system
WO2016014254A1 (en) * 2014-07-23 2016-01-28 Pcms Holdings, Inc. System and method for determining audio context in augmented-reality applications

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8886526B2 (en) * 2012-05-04 2014-11-11 Sony Computer Entertainment Inc. Source separation using independent component analysis with mixed multi-variate probability density function
TWI618051B (en) * 2013-02-14 2018-03-11 杜比實驗室特許公司 Audio signal processing method and apparatus for audio signal enhancement using estimated spatial parameters

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009535674A (en) * 2006-05-01 2009-10-01 日本電信電話株式会社 Method and apparatus for speech dereverberation based on stochastic model of sound source and room acoustics
CN103033815A (en) * 2012-12-19 2013-04-10 中国科学院声学研究所 Detection Method and detection device of distance expansion target based on reverberation covariance matrix
CN104995676A (en) * 2013-02-14 2015-10-21 杜比实验室特许公司 Signal decorrelation in an audio processing system
WO2016014254A1 (en) * 2014-07-23 2016-01-28 Pcms Holdings, Inc. System and method for determining audio context in augmented-reality applications

Also Published As

Publication number Publication date
CN105931648A (en) 2016-09-07

Similar Documents

Publication Publication Date Title
CN105931648B (en) Audio signal solution reverberation method and device
CN109121057B (en) Intelligent hearing aid method and system
CN110428842A (en) Speech model training method, device, equipment and computer readable storage medium
CN108196820A (en) For adjusting the method and apparatus of play parameter
CN110444202B (en) Composite voice recognition method, device, equipment and computer readable storage medium
CN106165015B (en) Apparatus and method for facilitating watermarking-based echo management
CN111986691B (en) Audio processing method, device, computer equipment and storage medium
CN111863015A (en) Audio processing method and device, electronic equipment and readable storage medium
CN109410918A (en) For obtaining the method and device of information
CN109817222A (en) A kind of age recognition methods, device and terminal device
CN112562648A (en) Adaptive speech recognition method, apparatus, device and medium based on meta learning
CN112259116A (en) Method and device for reducing noise of audio data, electronic equipment and storage medium
CN110428835A (en) A kind of adjusting method of speech ciphering equipment, device, storage medium and speech ciphering equipment
EP4040764A2 (en) Method and apparatus for in-vehicle call, device, computer readable medium and product
CN114898762A (en) Real-time voice noise reduction method and device based on target person and electronic equipment
CN113555032A (en) Multi-speaker scene recognition and network training method and device
CN105701686A (en) Voiceprint advertisement implementation method and device
CN112992190B (en) Audio signal processing method and device, electronic equipment and storage medium
CN110169082A (en) Combining audio signals output
CN106340310B (en) Speech detection method and device
CN111508500B (en) Voice emotion recognition method, system, device and storage medium
CN115295024A (en) Signal processing method, signal processing device, electronic apparatus, and medium
CN107798556A (en) For updating method, equipment and the storage medium of situation record
CN113823312A (en) Speech enhancement model generation method and device and speech enhancement method and device
Liu et al. Speech enhancement with stacked frames and deep neural network for VoIP applications

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant