WO2021190274A1 - Method and device for determining state of echo sound field, storage medium, and terminal - Google Patents

Method and device for determining state of echo sound field, storage medium, and terminal Download PDF

Info

Publication number
WO2021190274A1
WO2021190274A1 PCT/CN2021/079181 CN2021079181W WO2021190274A1 WO 2021190274 A1 WO2021190274 A1 WO 2021190274A1 CN 2021079181 W CN2021079181 W CN 2021079181W WO 2021190274 A1 WO2021190274 A1 WO 2021190274A1
Authority
WO
WIPO (PCT)
Prior art keywords
signal
determined
state
sound field
echo
Prior art date
Application number
PCT/CN2021/079181
Other languages
French (fr)
Chinese (zh)
Inventor
叶顺舟
Original Assignee
紫光展锐(重庆)科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 紫光展锐(重庆)科技有限公司 filed Critical 紫光展锐(重庆)科技有限公司
Publication of WO2021190274A1 publication Critical patent/WO2021190274A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M9/00Arrangements for interconnection not involving centralised switching
    • H04M9/08Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
    • H04M9/082Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic using echo cancellers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M7/00Arrangements for interconnection between switching centres
    • H04M7/006Networks other than PSTN/ISDN providing telephone service, e.g. Voice over Internet Protocol (VoIP), including next generation networks with a packet-switched transport layer
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02082Noise filtering the noise being echo, reverberation of the speech

Definitions

  • the present invention relates to the technical field of acoustic echo cancellation, in particular to a method and device for determining the state of an echo sound field, a storage medium, and a terminal.
  • AEC Acoustic Echo Canceler
  • the robustness and stability of the corresponding AEC technology are greatly challenged. For example, if the update of adaptive filtering is not controlled in dual-talk and no-speech scenarios, it will face the risk of divergence and misalignment.At the same time, when the echo path changes, if the update speed is not increased, the convergence speed will be too slow, resulting in The residual echo; similarly, in the non-linear or residual echo processing, if the single talk and the dual state are not distinguished, it will often lead to the damage of the effective speech and reduce the performance of the dual talk.
  • Double Talk State in the echo sound field state is particularly important.
  • Conventional Double Talk Detection (DTD) methods can be roughly divided into three categories: energy-based detection, correlation-based detection, and Detection based on echo path.
  • the energy-based detection is the simplest, which is extremely dependent on the stability of the echo signal strength, the near-end speech signal strength and the background noise strength, and the misjudgment rate is very high;
  • the correlation-based detection is limited by the characteristics of the device, when the speaker is nonlinear
  • the performance of this method drops sharply; based on the detection of the echo path, such as estimating the horn impulse response, variable impulse response, etc., the performance becomes worse when the echo path changes.
  • the technical problem solved by the present invention is to provide a method and device for determining the state of the echo sound field, a storage medium, and a terminal, which can effectively improve the accuracy of determining the state of the echo path change.
  • an embodiment of the present invention provides a method for determining the state of an echo sound field, which includes the following steps: acquiring a signal to be determined; determining the far-end signal X n (k) and the near-end signal D n ( k) and the filter coefficient W n (k); at least according to the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k), the filter update degree Cef update is determined At least according to the filter update degree Cef update being greater than the preset update degree threshold Thrd update , it is determined whether the echo sound field state of the signal to be determined is the echo path change state.
  • the method for determining the echo sound field state further includes: determining whether the echo sound field state of the signal to be determined is far, at least according to the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update Single talk status.
  • determining the filter update degree Cef update at least according to the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k) includes: according to the far-end signal X n (k), near-end signal D n (k) and the filter coefficients W n (k), determining the residual signal E n (k); according to the residual signal E n (k), to determine the updated Filter coefficient W n+1 (k); determine the filter update degree Cef update according to the filter coefficient W n (k) and the updated filter coefficient W n+1 (k).
  • the method for determining the echo sound field state further includes: performing voice activation detection on the near-end signal D n (k), To obtain the near-end voice activation flag DVflag; if the near-end voice activation flag DVflag is not equal to 1, it is determined that the echo sound field state of the signal to be determined is an idle state.
  • the method for determining the echo sound field state further includes: performing voice activation detection on the far-end signal X n (k), To obtain the far-end voice activation flag XVflag; if the far-end voice activation flag XVflag is not equal to 1, it is determined that the echo sound field state of the signal to be determined is the near-end single talk state.
  • the method for determining the echo sound field state further includes: determining the echo suppression ratio Err of the signal to be determined; if said If the echo suppression ratio Err is greater than the preset echo threshold Thrd err , it is determined that the echo sound field state of the signal to be determined is the remote single talk state.
  • determining the echo suppression ratio Err of the signal to be determined includes: determining the residual signal according to the far-end signal X n (k), the near-end signal D n (k), and the filter coefficient W n (k). the difference signal E n (k); according to the end signal D n (k) and the residual signal E n (k), the echo signal suppression ratio determined Err.
  • k is the frequency index of the signal to be determined.
  • the echo sound field state determination method further includes: determining the normalized cross-correlation values C YE and C DE ; if C DE Is greater than the first preset cross-correlation threshold Thrd1 coh and C YE is less than the second preset cross-correlation threshold Thrd2 coh , then it is determined that the echo sound field state of the signal to be determined is a dual-talk state; wherein, the first preset cross-correlation threshold The correlation threshold Thrd1 coh is greater than or equal to the second preset cross-correlation threshold Thrd2 coh .
  • it further includes one or more of the following: if the filter update degree Cef update is greater than the preset update degree threshold Thrd update , determining that the echo sound field state of the signal to be determined is the echo path change state; If the update degree Cef update is less than or equal to the preset update degree threshold Thrd update , it is determined that the echo sound field state of the signal to be determined is the remote single talk state.
  • M and L are the frequency band indexes of the signal to be determined.
  • the normalized cross-correlation values C YE and C DE are normalized cross-correlation values in the linear region; where M and L are frequency band indexes of the linear region.
  • the method for determining the echo sound field state further includes: adjusting the update step size ⁇ n (k) of the signal to be determined according to the echo sound field state of the signal to be determined; wherein the update step size ⁇ n ( k) is used to indicate the update step size of the filter coefficient W n (k).
  • an echo adaptive filter is used to adjust the update step size ⁇ n (k) of the signal to be determined.
  • the method for determining the echo sound field state further includes: determining whether to perform non-linear processing on the signal to be determined according to the echo sound field state of the signal to be determined.
  • determining whether to perform nonlinear processing on the signal to be determined includes one or more of the following: if it is determined that the echo sound field state of the signal to be determined is a dual-talk state, reducing the degree of nonlinear processing; If the echo sound field state of the signal to be determined is the echo path change state, then the nonlinear processing of the signal to be determined is enhanced; if it is determined that the echo sound field state of the signal to be determined is the near-end single talk state, stop talking to the Non-linear processing of the signal to be determined; if it is determined that the echo sound field state of the signal to be determined is an idle state, then the non-linear processing of the signal to be determined is stopped.
  • a post-processing non-linear processing unit is used to perform non-linear processing on the signal to be determined.
  • the method for determining the echo sound field state further includes: determining, according to the echo sound field state of the signal to be determined, to reduce the noise update speed of the signal to be determined or to increase the non-stationary noise suppression capability of the signal to be determined .
  • determining to reduce the noise update speed or to improve the non-stationary noise suppression capability includes one or more of the following: if it is determined that the echo sound field state of the signal to be determined is the near-end single talk state, reducing the signal to be determined Noise update speed; if it is determined that the echo sound field state of the signal to be determined is a dual-talk state, reduce the noise update speed of the signal to be determined; if it is determined that the echo sound field state of the signal to be determined is a remote single talk state, The non-stationary noise suppression capability of the signal to be determined is improved; if it is determined that the echo sound field state of the signal to be determined is an echo path change state, the non-stationary noise suppression capability of the signal to be determined is improved.
  • a post-processing noise suppression unit is used to reduce the noise update speed of the signal to be determined or to improve the non-stationary noise suppression capability of the signal to be determined.
  • the method for determining the state of the echo sound field further includes: determining the temporary sound field state of the signal to be determined; and determining to maintain the dual sound field state of the signal to be determined according to the echo sound field state and the temporary sound field state of the signal to be determined. Talking status output or output of delaying echo path change for the signal to be determined.
  • the output determined to maintain the dual-talk state output for the signal to be determined or to suspend the echo path change for the signal to be determined includes one or more of the following: if the echo sound field state of the signal to be determined is dual-talk State, the temporary sound field state is the remote single-talk state, the signal to be determined is maintained in the dual-talk state output through the hold time; if the echo sound field state of the signal to be determined is the dual-talk state, the temporary sound field state If it is the echo path change state, the output of the echo path change is suspended for the signal to be determined through the start time.
  • an embodiment of the present invention provides an echo sound field state determination device, which includes: an acquisition module for acquiring a signal to be determined; a signal determination module for determining the far-end signal X n ( k), the near-end signal D n (k) and the filter coefficient W n (k); the update degree determination module is used to determine at least the far-end signal X n (k), the near-end signal D n (k) and the The filter coefficient W n (k) determines the filter update degree Cef update ; the state determination module is used to determine the echo sound field state of the signal to be determined at least according to the filter update degree Cef update being greater than the preset update degree threshold Thrd update Whether it is the state of echo path change.
  • an embodiment of the present invention provides a storage medium on which computer instructions are stored, and the computer instructions execute the steps of the method for determining the state of the echo sound field when the computer instructions are executed.
  • an embodiment of the present invention provides a terminal, including a memory and a processor, the memory stores computer instructions that can run on the processor, and the processor executes the computer instructions when the computer instructions are run. The steps of the method for determining the state of the echo sound field.
  • the filter update degree Cef update is set to be greater than the preset update degree threshold Thrd update to determine whether the echo sound field state of the signal to be determined is the echo path change state, appropriate parameters can be set,
  • the signal to be determined is actually the state of the echo path change.
  • the echo sound field state is simply divided into a single-talk state and a dual-talk state for detection, which makes it easier to detect the echo path change state.
  • the misjudgment is the dual-talk state.
  • the solution of the embodiment of the present invention can effectively improve the accuracy of the judgment of the echo path change state, and in the subsequent steps, there is an opportunity to use more parameters to judge more echo sound field states, and more Effectively realize multi-feature detection and improve the completeness of the judgment of the echo sound field state.
  • the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update , it is determined that the echo sound field state of the signal to be determined is the remote single talk state, which is easier to change than in the prior art
  • the change state of the echo path is misjudged as the dual-talk state, and the solution of the embodiment of the present invention can further effectively improve the accuracy of the determination of the change state of the echo path.
  • the echo sound field state of the signal to be determined is idle.
  • the near-end voice activation flag DVflag is not 1, it can be considered that there is no voice at the near end, otherwise it means that the near end There is voice at the end, and the signal to be determined needs to be further judged.
  • the echo sound field state of the signal to be determined is the near-end single talk state, and when the far-end voice activation flag XVflag is not 1, it can be considered that there is no signal at the far end, There is no echo signal in the near-end signal, and the current state is the near-end single-talk state. Otherwise, it indicates that there is echo in the near-end signal, and further judgment on the signal to be determined is required.
  • the echo suppression ratio Err is greater than the preset echo threshold Thrd err , it indicates that the relative amplitude of the residual signal is very small, and most of the near-end signal components are determined to be echo signals, which have been eliminated by the adaptive filter AF, and the current state It is the far-end single-talk state, otherwise it indicates that the relative amplitude of the residual signal is still high and the component in the near-end signal is uncertain, and further judgments on the signal to be determined need to be made.
  • the normalized cross-correlation value to make the near-end signal and the residual signal component further determines, in the filter converges, the residual data E n (k) corresponding to the de-correlated echo signal, at this time if the C DE is greater than the threshold Thrd1 coh , indicating that the near-end signal contains many components that are not related to echo, but if the filter does not converge, the residual signal will also contain a large amount of echo components. This conclusion is not valid; therefore, C YE is used to further Confirm that if C YE is less than the threshold Thrd2 coh, it means that there are few echo components in the residual signal.
  • the near-end signal contains components that are not related to echo.
  • the current state is dual-talk. Status, otherwise it means that the signal component cannot be determined, and further judgment on the signal to be determined is required.
  • the echo path change state according to the filter convergence Cef update being greater than the threshold Thrd update , and judging according to the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update
  • It is the far-end single-talk state which can indicate that the filter is in a fast update state. Since the previous judgment has ruled out the deterministic dual-talk state, the interference of the near-end voice signal to the filter is not too high. Due to convergence or echo path change, the current state is the echo path change state. Otherwise, the current feature has no obvious distinction and is regarded as an uncertain state. In the embodiment of the present invention, it is determined as the remote single talk state.
  • the normalized cross-correlation values C YE and C DE are the normalized cross-correlation values in the linear region; where M and L are the frequency band indexes of the linear region, and the accuracy of judgment can be improved by taking the value in the linear region .
  • the update step size ⁇ n (k) can be increased to speed up the update and fast convergence; when the signal to be determined is the dual-talk state DTS, adjust ⁇ n (k) Slow down the update to ensure the robustness of the filter; when the signal to be determined is the remote single talk state FSTS, ⁇ n (k) takes the normal value without special adjustment; when the signal to be determined is In the idle state IDS or the near-end single talk state NSTS, ⁇ n (k) is taken as 0, and the update is stopped to prevent divergence, thereby improving the signal transmission quality.
  • the degree of nonlinear processing can be reduced when the signal to be determined is in the dual-talk state, so that effective speech is not damaged, and dual-talk performance is ensured; when the signal to be determined is the echo path change state PCS, the degree of nonlinear processing can be enhanced , To prevent the leakage of residual echo; when the signal to be determined is near-end single talk NSTS and idle state IDS, stop non-linear processing to avoid causing near-end voice and environmental sound distortion; when the signal to be determined is far-end No special processing is done in the single-talk state FSTS, and the residual echo is normally suppressed, thereby improving the signal transmission quality.
  • the noise update speed can be slowed down to ensure the intelligibility of the effective voice; when the signal to be determined is the far-end single-talk and echo path changes When the non-stationary noise suppression ability is improved, the residual echo is suppressed; when the signal to be determined is in the idle state, that is, the background noise IDS state, no special processing is performed, and the background noise is normally tracked, thereby improving the signal transmission quality .
  • Figure 1 is a schematic diagram of the structure of an AEC system in the prior art
  • FIG. 2 is a flowchart of a method for determining the state of an echo sound field in an embodiment of the present invention
  • FIG. 3 is a flowchart of another method for determining the state of an echo sound field in an embodiment of the present invention.
  • Figure 4 is a schematic structural diagram of an AEC system in an embodiment of the present invention.
  • Fig. 5 is a schematic structural diagram of a device for determining an echo sound field state in an embodiment of the present invention.
  • a typical AEC system includes an adaptive filter AF for linear echo processing and a nonlinear part for residual echo processing.
  • Fig. 1 is a schematic structural diagram of an AEC system in the prior art.
  • the signal x(n) passes through the speaker (statistical process control, SPK) to obtain the signal h(n). After (MIC), the signal d(n) is output.
  • Short-time Fourier transform short-time Fourier transform, or short-term Fourier transform, STFT
  • STFT short-time Fourier transform
  • the adaptive filter adaptive filters, AF
  • the filter coefficient can be updated according to the filter coefficient W n (k) to obtain W n+1 (k).
  • residual signal E n (k) may be nonlinear input processing unit (Non-linear programming, NLP) and post-processing noise suppression unit (Noise suppression, NS).
  • NLP nonlinear programming
  • NS post-processing noise suppression unit
  • the detection of the dual-talk state in the echo sound field state is particularly important.
  • Conventional dual-talk detection methods can be roughly divided into three categories: energy-based detection, correlation-based detection, and echo path-based detection.
  • the energy-based detection is the simplest, which is extremely dependent on the stability of the echo signal strength, the near-end speech signal strength and the background noise strength, and the misjudgment rate is very high;
  • the correlation-based detection is limited by the characteristics of the device, when the speaker is nonlinear
  • the performance of this method drops sharply; based on the detection of the echo path, such as estimating the horn impulse response, variable impulse response, etc., the performance becomes worse when the echo path changes.
  • the accuracy of determining the state of the echo sound field is low, which in turn affects the effect of echo cancellation.
  • the inventors of the present invention have discovered through research that the existing methods for determining the state of the echo sound field simply divide the state of the echo sound field into a single talk state (Single Talk State, STS) and a double talk state (Double Talk State, DTS).
  • STS Single Talk State
  • DTS Double Talk State
  • PCS path Change State
  • the filter update degree Cef update is set to be greater than the preset update degree threshold Thrd update to determine whether the echo sound field state of the signal to be determined is the echo path change state, appropriate parameters can be set,
  • the signal to be determined is actually the state of the echo path change.
  • the echo sound field state is simply divided into a single-talk state and a dual-talk state for detection, which makes it easier to detect the echo path change state.
  • the misjudgment is the dual-talk state, and the solution of the embodiment of the present invention can effectively improve the accuracy of the judgment of the echo path change state.
  • FIG. 2 is a flowchart of a method for determining the state of an echo sound field in an embodiment of the present invention.
  • the method for determining the state of the echo sound field includes steps S21 to S24:
  • Step S21 Obtain a signal to be determined
  • Step S22 Determine the far-end signal, the near-end signal, and filter coefficients of the signal to be determined
  • Step S23 Determine the filter update degree at least according to the far-end signal, the near-end signal and the filter coefficient
  • Step S24 Determine whether the echo sound field state of the signal to be determined is the echo path change state at least according to the filter update degree being greater than the preset update degree threshold.
  • the method can be implemented in the form of a software program that runs on a processor integrated inside a chip or a chip module.
  • the to-be-determined signals with different echo sound field states may include different signals, for example, may include the signal obtained after the sound emitted by the speaker of the communication terminal is picked up by the microphone of the terminal, and may also include only the remote Signal.
  • the echo cancellation can be achieved more effectively.
  • step S22 the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k) of the signal to be determined are determined.
  • conventional techniques may be used to determine the far-end signal X n (k), the near-end signal D n (k), and the filter coefficient W n (k) of the signal to be determined. For example, short-time Fourier transform is performed on the signal d(n) and signal x(n) shown in FIG. 1 to obtain the near-end signal D n (k) and the far-end signal X n (k). Determine the filter coefficient W n (k) by an appropriate method.
  • step S23 the filter update degree Cef update is determined.
  • the step of determining the filter update degree Cef update may include: end signal X n (k), near-end signal D n (k) and the filter coefficients W n (k), determining the residual signal E n (k); according to the residual signal E n (k), determine the update After the filter coefficient W n+1 (k); according to the filter coefficient W n (k) and the updated filter coefficient W n+1 (k), determine the filter update degree Cef update .
  • step S24 it may be determined whether the echo sound field state of the signal to be determined is the echo path change state at least according to the filter update degree Cef update being greater than the preset update degree threshold Thrd update.
  • the filter update degree Cef update is greater than the preset update degree threshold Thrd update , it can be determined that the echo sound field state of the signal to be determined is the echo path change state .
  • the filter update degree Cef update is set to be greater than the preset update degree threshold Thrd update to determine whether the echo sound field state of the signal to be determined is the echo path change state, appropriate parameters can be set,
  • the signal to be determined is actually the state of the echo path change.
  • the echo sound field state is simply divided into a single-talk state and a dual-talk state for detection, which makes it easier to detect the echo path change state.
  • the misjudgment is the dual-talk state, and the solution of the embodiment of the present invention can effectively improve the accuracy of the judgment of the echo path change state.
  • the method for determining the echo sound field state may further include: determining whether the echo sound field state of the signal to be determined is far, at least according to the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update Single talk status.
  • the filter update degree Cef update is less than or equal to the preset update degree threshold Thrd update , it can be determined that the echo sound field state of the signal to be determined is remote single talk state.
  • the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update , it is determined that the echo sound field state of the signal to be determined is the remote single talk state, which is compared with the current state.
  • the solution of the embodiment of the present invention can further effectively improve the accuracy of the judgment of the change state of the echo path.
  • the method for determining the echo sound field state may further include: performing voice activation detection on the near-end signal D n (k), To obtain the near-end voice activation flag DVflag; if the near-end voice activation flag DVflag is not equal to 1, it is determined that the echo sound field state of the signal to be determined is an idle state.
  • the near-end signal D n (k) is subjected to voice activation detection, and the echo sound field state of the signal to be determined is determined to be idle according to the near-end voice activation flag DVflag
  • the state step can also be set to be executed after step S24.
  • the embodiment of the present invention does not limit the sequence of the step of judging the near-end voice activation flag DVflag and the step S24.
  • the echo sound field state of the signal to be determined is idle.
  • the near-end voice activation flag DVflag is not 1, it can be considered that the near-end has no Voice, otherwise it means that there is voice at the near end, and further judgment on the to-be-determined signal is needed.
  • the method for determining the echo sound field state may further include: performing voice activation detection on the far-end signal X n (k), To obtain the far-end voice activation flag XVflag; if the far-end voice activation flag XVflag is not equal to 1, it is determined that the echo sound field state of the signal to be determined is the near-end single talk state.
  • the voice activation detection is performed on the far-end signal X n (k), and the echo sound field state of the signal to be determined is judged to be close according to the far-end voice activation flag XVflag.
  • the step of the single-talk state can also be set to be executed after step S24.
  • the embodiment of the present invention does not limit the sequence of the step of determining the remote voice activation flag XVflag and the step S24.
  • voice activation detection technology can adopt well-known technologies, such as energy detection, zero-crossing rate detection, spectral entropy detection, pitch detection, etc., which are not specifically limited in the embodiment of the present invention.
  • the echo sound field state of the signal to be determined is the near-end single talk state, and it can be considered that when the far-end voice activation flag XVflag is not 1, There is no signal at the far end, no echo signal in the near-end signal, and the current state is the near-end single talk state. Otherwise, it indicates that there is echo in the near-end signal, and further judgment on the to-be-determined signal is required.
  • the method for determining the echo sound field state may further include: determining the echo suppression ratio Err of the signal to be determined; if said If the echo suppression ratio Err is greater than the preset echo threshold Thrd err , it is determined that the echo sound field state of the signal to be determined is the remote single talk state.
  • the step of determining the echo suppression ratio Err of the signal to be determined and determining that the echo sound field state of the signal to be determined is the remote single talk state can also be set in step S24. Execute afterwards.
  • the embodiment of the present invention does not limit the sequence of the step of determining the echo suppression ratio Err of the signal to be determined and the step S24.
  • the step of determining the echo suppression ratio Err of the signal to be determined may include: according to the far-end signal X n (k), the near-end signal D n (k), and the filter coefficient W n (k) determining residual signal E n (k); according to the end signal D n (k) and the residual signal E n (k), the echo signal suppression ratio determined Err.
  • k is the frequency index of the signal to be determined.
  • the echo suppression ratio Err is greater than the preset echo threshold Thrd err , it indicates that the relative amplitude of the residual signal is very small, and most of the near-end signal components are determined to be echo signals, which have been determined by the adaptive filter. AF is eliminated, and the current state is the far-end single-talk state. Otherwise, it indicates that the relative amplitude of the residual signal is still high and the component in the near-end signal is uncertain, and further judgment on the signal to be determined is required.
  • the threshold Thrd err reference value may be 12 to 20 dB.
  • the echo sound field state determination method may further include: determining the normalized cross-correlation values C YE and C DE ; if C DE is greater than If the first preset cross-correlation threshold Thrd1 coh and C YE is less than the second preset cross-correlation threshold Thrd2 coh , it is determined that the echo sound field state of the signal to be determined is a dual-talk state; wherein, the first preset cross-correlation The threshold Thrd1 coh is greater than or equal to the second preset cross-correlation threshold Thrd2 coh .
  • M and L are the frequency band indexes of the signal to be determined.
  • the residual signal of the near-end signal component further determined by normalizing the cross-correlation value, at the convergence of the filter, the residual data E n (k) corresponding to the echo signal is decorrelated
  • C DE is greater than the threshold Thrd1 coh , it means that the near-end signal contains many components that are not related to echo.
  • the filter does not converge, the residual signal will also contain a large amount of echo components, and this conclusion is not valid; C YE is used for further confirmation. If C YE is less than the threshold Thrd2 coh, it means that there are few echo components in the residual signal.
  • the near-end signal contains components that are not related to echo.
  • the current state is a dual-talk state, otherwise it means that the signal component cannot be determined, and the signal to be determined needs to be further judged.
  • the normalized cross-correlation values C YE and C DE are normalized cross-correlation values in the linear region; wherein M and L are frequency band indexes of the linear region.
  • the normalized cross-correlation values C YE and C DE are the normalized cross-correlation values of the linear region; where M and L are the frequency band indexes of the linear region.
  • M and L as the frequency band index corresponding to the linear region, since the nonlinear distortion of the device has harmonic characteristics and is often distributed in the middle and high frequencies, the present invention gives the reference frequency range, and M corresponds to the low frequency band in 100 ⁇ In the 300Hz interval, L corresponds to the high frequency band in the 2500 ⁇ 3000Hz interval. This range is only a reference value, and the actual use is not limited by this.
  • the filter update degree Cef update is greater than the preset update degree threshold Thrd update , it is determined that the echo sound field state of the signal to be determined is the echo path change state; if the filter update degree Cef update is less than or equal to the The preset update threshold Thrd update determines that the echo sound field state of the signal to be determined is the remote single talk state.
  • the step of judging that the filter update degree Cef update is greater than the preset update degree threshold Thrd update may be set after judging the dual-talk state.
  • the echo suppression ratio Err is the relative cancellation amount of the echo signal, which avoids the influence of the echo signal strength; the normalized cross-correlation quantities C YE and C DE are normalized and have nothing to do with the signal strength of the far and near ends.
  • the linear region calculation is used to reduce the influence of device distortion; the filter update degree Cef update uses a certain degree of robustness of the AF itself to reflect the change intensity of the echo path. Therefore, the comprehensive use of these features can effectively solve the influence of uncertain factors such as echo signal strength changes, far and near-end signal strength changes, device distortion, and echo path changes on the detection accuracy.
  • the echo path change state according to the filter convergence Cef update being greater than the threshold Thrd update , and according to the filter update degree Cef update being less than or equal to the preset update degree Threshold Thrd update , judged as the far-end single-talk state, can indicate that the filter is in the fast update state. Since the previous judgment has ruled out the deterministic dual-talk state, the interference of the near-end voice signal to the filter is not too high. The update can only be caused by non-convergence or echo path change.
  • the current state is the echo path change state. Otherwise, the current feature has no obvious distinction and is regarded as an uncertain state. In the embodiment of the present invention, it can be determined as the remote single talk state .
  • the reference value of Thrd1 coh may be 0.3 to 0.5, and the reference value of Thrd2 coh may be 0.1 to 0.3.
  • FIG. 3 is a flowchart of another method for determining the state of an echo sound field in an embodiment of the present invention.
  • the another method for determining the state of the echo sound field may include step S301 to step S311, and each step will be described below.
  • step S301 it is judged whether DVflag is equal to 1; when the judgment result is yes, step S302 can be executed; otherwise, step S303 can be executed.
  • step S302 it is judged whether XVflag is equal to 1; when the judgment result is yes, step S304 can be executed; otherwise, step S305 can be executed.
  • step S303 it is determined that the state of the echo sound field is the idle state (IDS).
  • step S304 it is judged whether Err is greater than Thrd err ; when the judgment result is yes, step S306 can be executed; otherwise, step S307 can be executed.
  • step S305 it is determined that the state of the echo sound field is the near-end single talk state (NSTS).
  • NSTS near-end single talk state
  • step S306 it is determined that the state of the echo sound field is the far-end single talk state (FSTS).
  • FSTS far-end single talk state
  • step S307 it is judged whether C DE is greater than Thrd1 coh and C YE is less than Thrd2 coh ; when the judgment result is yes, step S308 can be executed; otherwise, step S309 can be executed.
  • step S308 it is determined that the state of the echo sound field is a dual talk state (DTS).
  • DTS dual talk state
  • step S309 it is determined that Cef update is greater than Thrd update ; when the determination result is yes, step S310 can be executed; otherwise, step S311 can be executed.
  • step S310 it is determined that the echo sound field state is the echo path change state (PCS).
  • PCS echo path change state
  • step S311 it is determined that the state of the echo sound field is the far-end single talk state (FSTS).
  • FSTS far-end single talk state
  • sequence number of each step in this embodiment does not represent a limitation on the execution order of each step.
  • the order of steps between steps S301, S302, S304, S307, and S309 is not limited.
  • step S309 may be set after S307 to improve the accuracy of judging the change state of the echo path.
  • the selected features and decision methods are robust against uncertain factors such as signal strength changes (far and near ends, echo signals), device distortion and echo path changes, and the combined use of multiple features Makes the detection accuracy higher and the performance more reliable.
  • the method for determining the echo sound field state may further include adjusting the update step size ⁇ n (k) of the signal to be determined according to the echo sound field state of the signal to be determined; wherein the update step size ⁇ n (k) ) Is used to indicate the update step size of the filter coefficient W n (k).
  • an echo adaptive filter may be used to adjust the update step size ⁇ n (k) of the signal to be determined.
  • the value of the update step ⁇ n (k) can be increased to speed up the update and fast convergence; when the signal to be determined is in the dual-talk state DTS When, adjust ⁇ n (k) to slow down the update to ensure the robustness of the filter; when the signal to be determined is the remote single talk state FSTS, ⁇ n (k) takes the normal value without special adjustment; When the signal to be determined is in the idle state IDS or the near-end single talk state NSTS, ⁇ n (k) is set to 0, and the update is stopped to prevent divergence, thereby improving the signal transmission quality.
  • the method for determining the state of the echo sound field may further include: determining whether to perform nonlinear processing on the signal to be determined according to the state of the echo sound field of the signal to be determined.
  • the step of determining whether to perform nonlinear processing on the signal to be determined may include one or more of the following: if it is determined that the echo sound field state of the signal to be determined is a dual-talk state, reducing the degree of nonlinear processing; If it is determined that the echo sound field state of the signal to be determined is the echo path change state, the nonlinear processing of the signal to be determined is enhanced; if it is determined that the echo sound field state of the signal to be determined is the near-end single talk state, stop Non-linear processing of the signal to be determined; if it is determined that the echo sound field state of the signal to be determined is an idle state, the non-linear processing of the signal to be determined is stopped.
  • a post-processing non-linear processing unit can be used to perform non-linear processing on the signal to be determined.
  • the degree of non-linear processing can be reduced when the signal to be determined is in the dual-talk state, so that the effective voice is not damaged, and the dual-talk performance is ensured; when the signal to be determined is the echo path change state PCS Enhance the degree of non-linear processing to prevent leakage of residual echo; when the signal to be determined is near-end single talk NSTS and idle state IDS, stop non-linear processing to avoid causing near-end voice and environmental sound distortion; When it is determined that the signal is in the far-end single talk state FSTS, no special processing is performed, and the residual echo is normally suppressed, thereby improving the signal transmission quality.
  • the method for determining the state of the echo sound field may further include: according to the state of the echo sound field of the signal to be determined, determining to reduce the noise update speed of the signal to be determined or to increase the non-stationary noise suppression capability of the signal to be determined .
  • the step of determining to reduce the noise update speed or to improve the non-stationary noise suppression capability may include one or more of the following: if it is determined that the echo sound field state of the signal to be determined is the near-end single talk state, then the standby state is reduced.
  • the non-stationary noise suppression capability of the signal to be determined is improved; if it is determined that the echo sound field state of the signal to be determined is the echo path change state, the non-stationary noise suppression capability of the signal to be determined is improved.
  • a post-processing noise suppression unit is used to reduce the noise update speed of the signal to be determined or to improve the non-stationary noise suppression capability of the signal to be determined.
  • the noise update speed can be slowed down to ensure the intelligibility of effective speech; when the signal to be determined is the far-end single-talk state When the echo path is changed, the non-stationary noise suppression capability is improved, and the residual echo is suppressed; when the signal to be determined is in the idle state, that is, the background noise IDS state, no special processing is performed, and the background noise is normally tracked. Thereby improving the quality of signal transmission.
  • Fig. 4 is a schematic structural diagram of an AEC system in an embodiment of the present invention.
  • the signal x(n) passes through the loudspeaker (SPK) to obtain the signal h(n), which has echo, and the voice signal (voice) and noise signal (noise) after passing through the microphone (MIC) Output signal d(n).
  • SPK loudspeaker
  • MIC microphone
  • the short-time Fourier transform is performed on the signal d(n) and signal x(n) to obtain the near-end signal D n (k) and the far-end signal X n (k).
  • the adaptive filter (AF ) can be calculated far-end signal X n (k) with the filter coefficients W n (k) the echo estimation signal Y n (k), and the near-end signal D n (k) obtained by subtracting the residual signal E n ( k).
  • the filter coefficient can be updated according to the filter coefficient W n (k) to obtain W n+1 (k).
  • Further far-end signal may be X n (k), near-end signal D n (k), the echo estimation signal Y n (k), the residual signal E n (k) with the filter coefficients W n (k) back to the input sound field
  • the state detection unit ESD performs signal feature calculation, and makes the echo sound field state judgment based on the calculation result, and obtains the specific echo sound field state.
  • the echo state can be subdivided into five sound field states: far-end single-talk state FSTS, near-end single-talk state NSTS, dual-talk state DTS, echo path change state PCS, and IDS in idle state (ie, background noise).
  • an adaptive filter AF and a post-processing non-linear processing unit (NLP) and a post-processing noise suppression unit (NS) can be set to obtain a specific sound field state through ESD, and perform corresponding processing.
  • the method for determining the state of the echo sound field may further include: determining the temporary sound field state of the signal to be determined; and determining to maintain the dual sound field state of the signal to be determined according to the echo sound field state and the temporary sound field state of the signal to be determined. Talking status output or output of delaying echo path change for the signal to be determined.
  • the DTS output is maintained through the holding time Thold to protect the near-end voice to the greatest extent.
  • the output determined to maintain the dual-talk state output for the signal to be determined or to suspend the echo path change for the signal to be determined includes one or more of the following: if the echo sound field state of the signal to be determined is dual-talk State, the temporary sound field state is the remote single-talk state, the signal to be determined is maintained in the dual-talk state output through the hold time; if the echo sound field state of the signal to be determined is the dual-talk state, the temporary sound field state If it is the echo path change state, the output of the echo path change is suspended for the signal to be determined through the start time.
  • the output of the PCS will be suspended through the start time Tstart. At this time, the state output is forced to be the remote single talk FSTS to reduce the risk of filter divergence A compromise effect with suppressing echo residue.
  • the value of Thold and Tstart can be set between 20 and 100 ms.
  • FIG. 5 is a schematic structural diagram of a device for determining an echo sound field state in an embodiment of the present invention.
  • the apparatus for determining the state of the echo sound field may include:
  • the obtaining module 51 is used to obtain the signal to be determined
  • the signal determining module 52 is configured to determine the far-end signal X n (k), the near-end signal D n (k), and the filter coefficient W n (k) of the signal to be determined;
  • the update degree determination module 53 is configured to determine the filter update degree Cef update at least according to the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k);
  • the state determination module 54 is configured to determine whether the echo sound field state of the signal to be determined is the echo path change state at least according to the filter update degree Cef update being greater than the preset update degree threshold Thrd update.
  • the foregoing device may correspond to a chip with data processing function in user equipment, such as a baseband chip; or a chip module including a chip with data processing function in user equipment, or a user equipment.
  • the embodiment of the present invention also provides a storage medium on which computer instructions are stored, and the computer instructions execute the steps of the foregoing method when the computer instructions are executed.
  • the storage medium may be a computer-readable storage medium, for example, it may include non-volatile memory (non-volatile) or non-transitory (non-transitory) memory, and may also include optical disks, mechanical hard drives, solid state hard drives, and the like.
  • An embodiment of the present invention also provides a terminal, including a memory and a processor, the memory stores computer instructions that can run on the processor, and the processor executes the steps of the above method when the computer instructions are executed.
  • the terminal includes, but is not limited to, terminal devices such as mobile phones, computers, and tablets.
  • modules/units contained in the various devices and products described in the above embodiments they may be software modules/units, hardware modules/units, or part software modules/units and part hardware modules/units.
  • the various modules/units contained therein can be implemented in the form of hardware such as circuits, or at least part of the modules/units can be implemented in the form of software programs. Runs on the integrated processor inside the chip, and the remaining (if any) part of the modules/units can be implemented by hardware methods such as circuits; for each device and product applied to or integrated in the chip module, the modules/units contained therein can be All are implemented by hardware such as circuits.
  • Different modules/units can be located in the same component (such as a chip, circuit module, etc.) or different components of the chip module, or at least part of the modules/units can be implemented by software programs.
  • the software program runs on the processor integrated inside the chip module, and the remaining (if any) part of the modules/units can be implemented by hardware methods such as circuits; for each device and product applied to or integrated in the terminal, the modules contained therein
  • the modules/units can all be implemented by hardware such as circuits, and different modules/units can be located in the same component (for example, chip, circuit module, etc.) or different components in the terminal, or at least part of the modules/units can be implemented in the form of software programs Implementation, the software program runs on the processor integrated inside the terminal, and the remaining (if any) part of the modules/units can be implemented by hardware such as circuits.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
  • Telephone Function (AREA)

Abstract

A method and device for determining a state of an echo sound field, a storage medium, and a terminal. The method comprises: acquiring a signal to be determined; determining a remote signal Xn(k), a proximal signal Dn(k), and a filter coefficient Wn(k) of said signal; determining a filter update degree Cefupdate at least on the basis of the remote signal Xn(k), of the proximal signal Dn(k), and of the filter coefficient Wn(k); and determining, at least on the basis of the filter update degree Cefupdate being greater than an update degree threshold Thrdupdate, whether an echo sound field state of said signal is an echo path change state. The present invention effectively increases the accuracy of an echo path change state determination, provides an opportunity to employ more parameters in determining more echo sound field states, effectively implements multifeatured detection, and increases the comprehensiveness of an echo sound field state determination.

Description

回声声场状态确定方法及装置、存储介质、终端Method and device for determining state of echo sound field, storage medium and terminal
本申请要求于2020年3月26日提交中国专利局、申请号为202010223647.6、发明名称为“回声声场状态确定方法及装置、存储介质、终端”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on March 26, 2020, the application number is 202010223647.6, and the invention title is "Method and device for determining the state of the echo sound field, storage medium, and terminal", the entire content of which is incorporated by reference Incorporated in this application.
技术领域Technical field
本发明涉及声学回声消除技术领域,尤其涉及一种回声声场状态确定方法及装置、存储介质、终端。The present invention relates to the technical field of acoustic echo cancellation, in particular to a method and device for determining the state of an echo sound field, a storage medium, and a terminal.
背景技术Background technique
在实时语音通信与基于IP的语音传输(Voice over Internet Protocol,VOIP)过程中,通信终端扬声器发出的声音,总会被该终端的麦克风拾取到,若是不处理就发送出去,对方总能听到自己说话的声音,体验不佳。在人机交互领域,由于交互终端发出的声音又被麦克风拾取回去,同时拾取了控制者的说话声,若在麦克风拾取信号中不消除交互终端发出的声音,那么交互终端在识别控制者说话声音时将引入很强的干扰,降低了识别的成功率,最终造成交互困难。采用回声消除(Acoustic Echo Canceler,AEC)对回声进行消除是公知的方法,通常的AEC系统包括针对线性回声处理的自适应滤波AF以及针对残留回声处理的非线性部分。In the process of real-time voice communication and Voice over Internet Protocol (VOIP), the sound emitted by the speaker of the communication terminal will always be picked up by the microphone of the terminal. If it is not processed, it will be sent out and the other party can always hear it. The voice of self-talk is not good. In the field of human-computer interaction, since the sound emitted by the interactive terminal is picked up by the microphone and the controller's speech is picked up at the same time, if the microphone pickup signal does not eliminate the sound from the interactive terminal, the interactive terminal is recognizing the controller's speech At this time, strong interference will be introduced, which will reduce the success rate of recognition and eventually cause interaction difficulties. It is a well-known method to use Acoustic Echo Canceler (AEC) to cancel the echo. A typical AEC system includes an adaptive filter AF for linear echo processing and a nonlinear part for residual echo processing.
由于回声声场状态的多样性与多变性特点,使得AEC相应技术的鲁棒性与稳定性受到很大的挑战。例如自适应滤波的更新在双讲与无语音场景下若不加以控制,则会面临发散与失调的风险,同时当回声路径发生改变时,若不提高更新速度则会导致收敛速度过慢,造成回声的 残留;同样的,在非线性或残留回声处理中,若不对单讲与双进状态加以区分,则往往会导致有效语音的损伤,使得双讲性能下降。Due to the diversity and variability of the echo sound field, the robustness and stability of the corresponding AEC technology are greatly challenged. For example, if the update of adaptive filtering is not controlled in dual-talk and no-speech scenarios, it will face the risk of divergence and misalignment.At the same time, when the echo path changes, if the update speed is not increased, the convergence speed will be too slow, resulting in The residual echo; similarly, in the non-linear or residual echo processing, if the single talk and the dual state are not distinguished, it will often lead to the damage of the effective speech and reduce the performance of the dual talk.
回声声场状态中双讲状态(Double Talk State,DTS)的检测显得尤为重要,常规的双讲检测(Double Talk Detection,DTD)方法大致分为三类:基于能量的检测、基于相关性的检测以及基于回声路径的检测。其中基于能量的检测最为简单,极度依赖于回声信号强度、近端语音信号强度与背景噪声强度的稳定性,误判率非常高;基于相关性的检测受限于器件的特性,当扬声器非线性失真较大时,该方法的性能急剧下降;基于回声路径的检测,如估计喇叭冲激响应、可变冲击响应等,当回声路径变化时性能变差。The detection of Double Talk State (DTS) in the echo sound field state is particularly important. Conventional Double Talk Detection (DTD) methods can be roughly divided into three categories: energy-based detection, correlation-based detection, and Detection based on echo path. Among them, the energy-based detection is the simplest, which is extremely dependent on the stability of the echo signal strength, the near-end speech signal strength and the background noise strength, and the misjudgment rate is very high; the correlation-based detection is limited by the characteristics of the device, when the speaker is nonlinear When the distortion is large, the performance of this method drops sharply; based on the detection of the echo path, such as estimating the horn impulse response, variable impulse response, etc., the performance becomes worse when the echo path changes.
然而,在现有技术中,回声声场状态确定的准确性较低,进而影响回声消除效果。However, in the prior art, the accuracy of determining the state of the echo sound field is low, which in turn affects the effect of echo cancellation.
发明内容Summary of the invention
本发明解决的技术问题是提供一种回声声场状态确定方法及装置、存储介质、终端,可以有效提高对回声路径变化状态判断的准确性。The technical problem solved by the present invention is to provide a method and device for determining the state of the echo sound field, a storage medium, and a terminal, which can effectively improve the accuracy of determining the state of the echo path change.
为解决上述技术问题,本发明实施例提供一种回声声场状态确定方法,包括以下步骤:获取待确定信号;确定所述待确定信号的远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k);至少根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定滤波器更新度Cef update;至少根据滤波器更新度Cef update大于预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态是否为回声路径变化状态。 In order to solve the above technical problem, an embodiment of the present invention provides a method for determining the state of an echo sound field, which includes the following steps: acquiring a signal to be determined; determining the far-end signal X n (k) and the near-end signal D n ( k) and the filter coefficient W n (k); at least according to the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k), the filter update degree Cef update is determined At least according to the filter update degree Cef update being greater than the preset update degree threshold Thrd update , it is determined whether the echo sound field state of the signal to be determined is the echo path change state.
可选的,所述的回声声场状态确定方法还包括:至少根据所述滤波器更新度Cef update小于等于所述预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态是否为远端单讲状态。 Optionally, the method for determining the echo sound field state further includes: determining whether the echo sound field state of the signal to be determined is far, at least according to the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update Single talk status.
可选的,至少根据所述远端信号X n(k)、近端信号D n(k)以及滤波 器系数W n(k),确定滤波器更新度Cef update包括:根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定残差信号E n(k);根据所述残差信号E n(k),确定更新后的滤波器系数W n+1(k);根据所述滤波器系数W n(k)以及更新后的滤波器系数W n+1(k),确定所述滤波器更新度Cef updateOptionally, determining the filter update degree Cef update at least according to the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k) includes: according to the far-end signal X n (k), near-end signal D n (k) and the filter coefficients W n (k), determining the residual signal E n (k); according to the residual signal E n (k), to determine the updated Filter coefficient W n+1 (k); determine the filter update degree Cef update according to the filter coefficient W n (k) and the updated filter coefficient W n+1 (k).
可选的,满足以下一项或多项:采用下述公式,确定残差信号E n(k): Alternatively, one or more of the following: using the following equation to determine the residual signal E n (k):
Figure PCTCN2021079181-appb-000001
Figure PCTCN2021079181-appb-000001
采用下述公式,确定更新后的滤波器系数W n+1(k),其中,更新步长μ n(k)用于指示所述滤波器系数W n(k)更新的步长: The following formula is used to determine the updated filter coefficient W n+1 (k), where the update step size μ n (k) is used to indicate the update step size of the filter coefficient W n (k):
Figure PCTCN2021079181-appb-000002
Figure PCTCN2021079181-appb-000002
采用下述公式,确定滤波器更新度Cef updateUse the following formula to determine the filter update degree Cef update :
Figure PCTCN2021079181-appb-000003
Figure PCTCN2021079181-appb-000003
可选的,在确定所述待确定信号的回声声场状态是否为回声路径变化状态之前,所述的回声声场状态确定方法还包括:对所述近端信号D n(k)进行语音激活检测,以得到近端语音激活标志DVflag;如果所述近端语音激活标志DVflag不等于1,则判断所述待确定信号的回声声场状态为空闲状态。 Optionally, before determining whether the echo sound field state of the signal to be determined is an echo path change state, the method for determining the echo sound field state further includes: performing voice activation detection on the near-end signal D n (k), To obtain the near-end voice activation flag DVflag; if the near-end voice activation flag DVflag is not equal to 1, it is determined that the echo sound field state of the signal to be determined is an idle state.
可选的,在确定所述待确定信号的回声声场状态是否为回声路径变化状态之前,所述的回声声场状态确定方法还包括:对所述远端信号X n(k)进行语音激活检测,以得到远端语音激活标志XVflag;如果所述远端语音激活标志XVflag不等于1,则判断所述待确定信号的回声声场状态为近端单讲状态。 Optionally, before determining whether the echo sound field state of the signal to be determined is an echo path change state, the method for determining the echo sound field state further includes: performing voice activation detection on the far-end signal X n (k), To obtain the far-end voice activation flag XVflag; if the far-end voice activation flag XVflag is not equal to 1, it is determined that the echo sound field state of the signal to be determined is the near-end single talk state.
可选的,在确定所述待确定信号的回声声场状态是否为回声路径变化状态之前,所述的回声声场状态确定方法还包括:确定所述待确定信号的回波抑制比Err;如果所述回波抑制比Err大于预设回波阈值Thrd err,则判断所述待确定信号的回声声场状态为远端单讲状态。 Optionally, before determining whether the echo sound field state of the signal to be determined is an echo path change state, the method for determining the echo sound field state further includes: determining the echo suppression ratio Err of the signal to be determined; if said If the echo suppression ratio Err is greater than the preset echo threshold Thrd err , it is determined that the echo sound field state of the signal to be determined is the remote single talk state.
可选的,确定所述待确定信号的回波抑制比Err包括:根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定残差信号E n(k);根据所述近端信号D n(k)与残差信号E n(k),确定信号的回波抑制比Err。 Optionally, determining the echo suppression ratio Err of the signal to be determined includes: determining the residual signal according to the far-end signal X n (k), the near-end signal D n (k), and the filter coefficient W n (k). the difference signal E n (k); according to the end signal D n (k) and the residual signal E n (k), the echo signal suppression ratio determined Err.
可选的,满足以下一项或多项:采用下述公式,确定残差信号E n(k): Alternatively, one or more of the following: using the following equation to determine the residual signal E n (k):
Figure PCTCN2021079181-appb-000004
Figure PCTCN2021079181-appb-000004
采用下述公式,确定信号的回波抑制比Err:Use the following formula to determine the signal echo suppression ratio Err:
Figure PCTCN2021079181-appb-000005
Figure PCTCN2021079181-appb-000005
其中,k为所述待确定信号的频率索引。Wherein, k is the frequency index of the signal to be determined.
可选的,在确定所述待确定信号的回声声场状态是否为回声路径变化状态之前,所述的回声声场状态确定方法还包括:确定归一化互相关值C YE与C DE;如果C DE大于第一预设互相关阈值Thrd1 coh,且C YE小于第二预设互相关阈值Thrd2 coh,则判断所述待确定信号的回声声场状态为双讲状态;其中,所述第一预设互相关阈值Thrd1 coh大于等于所述第二预设互相关阈值Thrd2 cohOptionally, before determining whether the echo sound field state of the signal to be determined is the echo path change state, the echo sound field state determination method further includes: determining the normalized cross-correlation values C YE and C DE ; if C DE Is greater than the first preset cross-correlation threshold Thrd1 coh and C YE is less than the second preset cross-correlation threshold Thrd2 coh , then it is determined that the echo sound field state of the signal to be determined is a dual-talk state; wherein, the first preset cross-correlation threshold The correlation threshold Thrd1 coh is greater than or equal to the second preset cross-correlation threshold Thrd2 coh .
可选的,还包括以下一项或多项:如果滤波器更新度Cef update大于预设更新度阈值Thrd update,则确定所述待确定信号的回声声场状态为回声路径变化状态;如果所述滤波器更新度Cef update小于等于所述预设 更新度阈值Thrd update,则确定所述待确定信号的回声声场状态为远端单讲状态。 Optionally, it further includes one or more of the following: if the filter update degree Cef update is greater than the preset update degree threshold Thrd update , determining that the echo sound field state of the signal to be determined is the echo path change state; If the update degree Cef update is less than or equal to the preset update degree threshold Thrd update , it is determined that the echo sound field state of the signal to be determined is the remote single talk state.
可选的,采用下述公式,确定归一化互相关值C YE与C DEOptionally, use the following formula to determine the normalized cross-correlation values C YE and C DE :
Figure PCTCN2021079181-appb-000006
Figure PCTCN2021079181-appb-000006
Figure PCTCN2021079181-appb-000007
Figure PCTCN2021079181-appb-000007
其中,M与L为所述待确定信号的频段索引。Wherein, M and L are the frequency band indexes of the signal to be determined.
可选的,所述归一化互相关值C YE与C DE为线性区归一化互相关值;其中,M与L为线性区的频段索引。 Optionally, the normalized cross-correlation values C YE and C DE are normalized cross-correlation values in the linear region; where M and L are frequency band indexes of the linear region.
可选的,所述的回声声场状态确定方法还包括:根据所述待确定信号的回声声场状态,调整所述待确定信号的更新步长μ n(k);其中,更新步长μ n(k)用于指示所述滤波器系数W n(k)更新的步长。 Optionally, the method for determining the echo sound field state further includes: adjusting the update step size μ n (k) of the signal to be determined according to the echo sound field state of the signal to be determined; wherein the update step size μ n ( k) is used to indicate the update step size of the filter coefficient W n (k).
可选的,调整更新步长μ n(k)包括以下一项或多项:如果确定所述待确定信号的回声声场状态为回声路径变化状态,则增加更新步长μ n(k);如果确定所述待确定信号的回声声场状态为双讲状态,则调整μ n(k)放慢更新;如果确定所述待确定信号的回声声场状态为空闲状态或近端单讲状态,则调整μ n(k)=0。 Optionally, adjusting the update step size μ n (k) includes one or more of the following: if it is determined that the echo sound field state of the signal to be determined is the echo path change state, increase the update step size μ n (k); if If it is determined that the echo sound field state of the signal to be determined is the dual talk state, adjust μ n (k) to slow down the update; if it is determined that the echo sound field state of the signal to be determined is the idle state or the near-end single talk state, adjust μ n (k)=0.
可选的,采用回声自适应滤波器调整所述待确定信号的更新步长μ n(k)。 Optionally, an echo adaptive filter is used to adjust the update step size μ n (k) of the signal to be determined.
可选的,所述的回声声场状态确定方法还包括:根据所述待确定信号的回声声场状态,确定是否对所述待确定信号进行非线性处理。Optionally, the method for determining the echo sound field state further includes: determining whether to perform non-linear processing on the signal to be determined according to the echo sound field state of the signal to be determined.
可选的,确定是否对所述待确定信号进行非线性处理包括以下一 项或多项:如果确定所述待确定信号的回声声场状态为双讲状态,则减少非线性处理程度;如果确定所述待确定信号的回声声场状态为回声路径变化状态,则增强对所述待确定信号的非线性处理;如果确定所述待确定信号的回声声场状态为近端单讲状态,则停止对所述待确定信号的非线性处理;如果确定所述待确定信号的回声声场状态为空闲状态,则停止对所述待确定信号的非线性处理。Optionally, determining whether to perform nonlinear processing on the signal to be determined includes one or more of the following: if it is determined that the echo sound field state of the signal to be determined is a dual-talk state, reducing the degree of nonlinear processing; If the echo sound field state of the signal to be determined is the echo path change state, then the nonlinear processing of the signal to be determined is enhanced; if it is determined that the echo sound field state of the signal to be determined is the near-end single talk state, stop talking to the Non-linear processing of the signal to be determined; if it is determined that the echo sound field state of the signal to be determined is an idle state, then the non-linear processing of the signal to be determined is stopped.
可选的,采用后处理非线性处理单元对所述待确定信号进行非线性处理。Optionally, a post-processing non-linear processing unit is used to perform non-linear processing on the signal to be determined.
可选的,所述的回声声场状态确定方法还包括:根据所述待确定信号的回声声场状态,确定降低所述待确定信号的噪声更新速度或者提高所述待确定信号的非平稳噪声抑制能力。Optionally, the method for determining the echo sound field state further includes: determining, according to the echo sound field state of the signal to be determined, to reduce the noise update speed of the signal to be determined or to increase the non-stationary noise suppression capability of the signal to be determined .
可选的,确定降低噪声更新速度或者提高非平稳噪声抑制能力包括以下一项或多项:如果确定所述待确定信号的回声声场状态为近端单讲状态,则降低所述待确定信号的噪声更新速度;如果确定所述待确定信号的回声声场状态为双讲状态,则降低所述待确定信号的噪声更新速度;如果确定所述待确定信号的回声声场状态为远端单讲状态,则提高所述待确定信号的非平稳噪声抑制能力;如果确定所述待确定信号的回声声场状态为回声路径变化状态,则提高所述待确定信号的非平稳噪声抑制能力。Optionally, determining to reduce the noise update speed or to improve the non-stationary noise suppression capability includes one or more of the following: if it is determined that the echo sound field state of the signal to be determined is the near-end single talk state, reducing the signal to be determined Noise update speed; if it is determined that the echo sound field state of the signal to be determined is a dual-talk state, reduce the noise update speed of the signal to be determined; if it is determined that the echo sound field state of the signal to be determined is a remote single talk state, The non-stationary noise suppression capability of the signal to be determined is improved; if it is determined that the echo sound field state of the signal to be determined is an echo path change state, the non-stationary noise suppression capability of the signal to be determined is improved.
可选的,采用后处理噪声抑制单元降低所述待确定信号的噪声更新速度或者提高所述待确定信号的非平稳噪声抑制能力。Optionally, a post-processing noise suppression unit is used to reduce the noise update speed of the signal to be determined or to improve the non-stationary noise suppression capability of the signal to be determined.
可选的,所述的回声声场状态确定方法还包括:确定所述待确定信号的临时声场状态;根据所述待确定信号的回声声场状态以及临时声场状态,确定对所述待确定信号保持双讲状态输出或者对所述待确定信号暂缓回声路径改变的输出。Optionally, the method for determining the state of the echo sound field further includes: determining the temporary sound field state of the signal to be determined; and determining to maintain the dual sound field state of the signal to be determined according to the echo sound field state and the temporary sound field state of the signal to be determined. Talking status output or output of delaying echo path change for the signal to be determined.
可选的,确定对所述待确定信号保持双讲状态输出或者对所述待确定信号暂缓回声路径改变的输出包括以下一项或多项:如果所述待 确定信号的回声声场状态为双讲状态,所述临时声场状态为远端单讲状态,则通过保持时间对所述待确定信号保持双讲状态输出;如果所述待确定信号的回声声场状态为双讲状态,所述临时声场状态为回声路径变化状态,则通过开始时间对所述待确定信号暂缓回声路径改变的输出。Optionally, the output determined to maintain the dual-talk state output for the signal to be determined or to suspend the echo path change for the signal to be determined includes one or more of the following: if the echo sound field state of the signal to be determined is dual-talk State, the temporary sound field state is the remote single-talk state, the signal to be determined is maintained in the dual-talk state output through the hold time; if the echo sound field state of the signal to be determined is the dual-talk state, the temporary sound field state If it is the echo path change state, the output of the echo path change is suspended for the signal to be determined through the start time.
为解决上述技术问题,本发明实施例提供一种回声声场状态确定装置,包括:获取模块,用于获取待确定信号;信号确定模块,用于确定所述待确定信号的远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k);更新度确定模块,用于至少根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定滤波器更新度Cef update;状态确定模块,用于至少根据滤波器更新度Cef update大于预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态是否为回声路径变化状态。 In order to solve the above technical problem, an embodiment of the present invention provides an echo sound field state determination device, which includes: an acquisition module for acquiring a signal to be determined; a signal determination module for determining the far-end signal X n ( k), the near-end signal D n (k) and the filter coefficient W n (k); the update degree determination module is used to determine at least the far-end signal X n (k), the near-end signal D n (k) and the The filter coefficient W n (k) determines the filter update degree Cef update ; the state determination module is used to determine the echo sound field state of the signal to be determined at least according to the filter update degree Cef update being greater than the preset update degree threshold Thrd update Whether it is the state of echo path change.
为解决上述技术问题,本发明实施例提供一种存储介质,其上存储有计算机指令,所述计算机指令运行时执行上述回声声场状态确定方法的步骤。In order to solve the above technical problem, an embodiment of the present invention provides a storage medium on which computer instructions are stored, and the computer instructions execute the steps of the method for determining the state of the echo sound field when the computer instructions are executed.
为解决上述技术问题,本发明实施例提供一种终端,包括存储器和处理器,所述存储器上存储有能够在所述处理器上运行的计算机指令,所述处理器运行所述计算机指令时执行上述回声声场状态确定方法的步骤。In order to solve the above technical problems, an embodiment of the present invention provides a terminal, including a memory and a processor, the memory stores computer instructions that can run on the processor, and the processor executes the computer instructions when the computer instructions are run. The steps of the method for determining the state of the echo sound field.
与现有技术相比,本发明实施例的技术方案具有以下有益效果:Compared with the prior art, the technical solution of the embodiment of the present invention has the following beneficial effects:
在本发明实施例中,通过设置至少根据滤波器更新度Cef update大于预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态是否为回声路径变化状态,可以设置适当的参数,对所述待确定信号实际上为回声路径变化状态的情况进行准确判断,相比于现有技术中只是将回声声场状态简单的划分为单讲状态与双讲状态加以检测,容易将回声路径变化状态误判为双讲状态,采用本发明实施例的方案,可以有效提高对回声路径变化状态判断的准确性,并且在后续步骤中,有 机会采用更多参数对更多回声声场状态进行判断,更有效地实现多特征检测,提高对回声声场状态判断的完整性。 In the embodiment of the present invention, by setting the filter update degree Cef update to be greater than the preset update degree threshold Thrd update to determine whether the echo sound field state of the signal to be determined is the echo path change state, appropriate parameters can be set, The signal to be determined is actually the state of the echo path change. Compared with the prior art, the echo sound field state is simply divided into a single-talk state and a dual-talk state for detection, which makes it easier to detect the echo path change state. The misjudgment is the dual-talk state. The solution of the embodiment of the present invention can effectively improve the accuracy of the judgment of the echo path change state, and in the subsequent steps, there is an opportunity to use more parameters to judge more echo sound field states, and more Effectively realize multi-feature detection and improve the completeness of the judgment of the echo sound field state.
进一步,至少根据所述滤波器更新度Cef update小于等于所述预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态为远端单讲状态,相比于现有技术中容易将回声路径变化状态误判为双讲状态,采用本发明实施例的方案,可以进一步有效提高对回声路径变化状态判断的准确性。 Further, at least according to the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update , it is determined that the echo sound field state of the signal to be determined is the remote single talk state, which is easier to change than in the prior art The change state of the echo path is misjudged as the dual-talk state, and the solution of the embodiment of the present invention can further effectively improve the accuracy of the determination of the change state of the echo path.
进一步,通过判断近端语音激活标志DVflag不等于1时,所述待确定信号的回声声场状态为空闲状态,可以在近端语音激活标志DVflag不为1时,认为近端无语音,否则说明近端存在语音,需要对所述待确定信号进一步进行判断。Further, by judging that the near-end voice activation flag DVflag is not equal to 1, the echo sound field state of the signal to be determined is idle. When the near-end voice activation flag DVflag is not 1, it can be considered that there is no voice at the near end, otherwise it means that the near end There is voice at the end, and the signal to be determined needs to be further judged.
进一步,通过判断远端语音激活标志XVflag不等于1时,所述待确定信号的回声声场状态为近端单讲状态,可以在远端语音激活标志XVflag不为1时,认为远端无信号,近端信号中无回声信号,当前状态为近端单讲状态,否则说明近端信号中有回声的存在,需要对所述待确定信号进一步进行判断。Further, by judging that the far-end voice activation flag XVflag is not equal to 1, the echo sound field state of the signal to be determined is the near-end single talk state, and when the far-end voice activation flag XVflag is not 1, it can be considered that there is no signal at the far end, There is no echo signal in the near-end signal, and the current state is the near-end single-talk state. Otherwise, it indicates that there is echo in the near-end signal, and further judgment on the signal to be determined is required.
进一步,通过判断回波抑制比Err大于预设回波阈值Thrd err,说明残差信号相对幅度很小,近端信号成分中大部分确定为回声信号,已被自适应滤波器AF消除,当前状态为远端单讲状态,否则说明残差信号相对幅度仍较高且近端信号中成分不确定,需要对所述待确定信号进一步进行判断。 Furthermore, by judging that the echo suppression ratio Err is greater than the preset echo threshold Thrd err , it indicates that the relative amplitude of the residual signal is very small, and most of the near-end signal components are determined to be echo signals, which have been eliminated by the adaptive filter AF, and the current state It is the far-end single-talk state, otherwise it indicates that the relative amplitude of the residual signal is still high and the component in the near-end signal is uncertain, and further judgments on the signal to be determined need to be made.
进一步,通过归一化互相关值对近端信号与残差信号成分做进一步确定,在滤波器收敛情况下,残差数据E n(k)相当于已与回声信号去相关,此时若C DE大于门限Thrd1 coh,说明近端信号中含有很多与回声不相关的成分,但若滤波器未收敛,则残差信号中还会含有大量回声成分,该结论则不成立;故采用C YE做进一步确认,若C YE小于门限Thrd2 coh说明残差信号中回声成分已很少,结合C DE大于门限Thrd1 coh的条件可确认近端信号中含有与回声不相关的成分,此时当前状态为双 讲状态,否则说明信号成分无法确定,需要对所述待确定信号进一步进行判断。 Further, the normalized cross-correlation value to make the near-end signal and the residual signal component further determines, in the filter converges, the residual data E n (k) corresponding to the de-correlated echo signal, at this time if the C DE is greater than the threshold Thrd1 coh , indicating that the near-end signal contains many components that are not related to echo, but if the filter does not converge, the residual signal will also contain a large amount of echo components. This conclusion is not valid; therefore, C YE is used to further Confirm that if C YE is less than the threshold Thrd2 coh, it means that there are few echo components in the residual signal. Combined with the condition that C DE is greater than the threshold Thrd1 coh , it can be confirmed that the near-end signal contains components that are not related to echo. At this time, the current state is dual-talk. Status, otherwise it means that the signal component cannot be determined, and further judgment on the signal to be determined is required.
进一步,在判断并排除双讲状态之后,根据滤波器收敛度Cef update大于门限Thrd update,判断为回声路径改变状态,根据滤波器更新度Cef update小于等于所述预设更新度阈值Thrd update,判断为远端单讲状态,可以说明滤波器处于快速更新状态,由于之前判决已排除确定性双讲状态,近端语音信号对滤波器的干扰已不会太高,此时快速更新只能是未收敛或回声路径改变造成,当前状态为回声路径改变状态,否则当前特征暂无明显区分度,视为不确定状态,在本发明实施例中,确定为远端单讲状态。 Further, after judging and excluding the dual-talk state, it is judged that the echo path change state according to the filter convergence Cef update being greater than the threshold Thrd update , and judging according to the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update It is the far-end single-talk state, which can indicate that the filter is in a fast update state. Since the previous judgment has ruled out the deterministic dual-talk state, the interference of the near-end voice signal to the filter is not too high. Due to convergence or echo path change, the current state is the echo path change state. Otherwise, the current feature has no obvious distinction and is regarded as an uncertain state. In the embodiment of the present invention, it is determined as the remote single talk state.
进一步,所述归一化互相关值C YE与C DE为线性区归一化互相关值;其中,M与L为线性区的频段索引,通过在线性区取值,可以提高判断的准确性。 Further, the normalized cross-correlation values C YE and C DE are the normalized cross-correlation values in the linear region; where M and L are the frequency band indexes of the linear region, and the accuracy of judgment can be improved by taking the value in the linear region .
进一步,可以在所述待确定信号为回声路径改变状态时,增加更新步长μ n(k)取值,加快更新,快速收敛;在所述待确定信号为双讲状态DTS时,调整μ n(k)放慢更新,保证滤波器的稳健性;在所述待确定信号为远端单讲状态FSTS时,μ n(k)取正常值,不做特殊调整;在所述待确定信号为空闲状态IDS或近端单讲状态NSTS时,μ n(k)取0,停止更新,防止发散,从而提高信号传输质量。 Further, when the signal to be determined is the echo path change state, the update step size μ n (k) can be increased to speed up the update and fast convergence; when the signal to be determined is the dual-talk state DTS, adjust μ n (k) Slow down the update to ensure the robustness of the filter; when the signal to be determined is the remote single talk state FSTS, μ n (k) takes the normal value without special adjustment; when the signal to be determined is In the idle state IDS or the near-end single talk state NSTS, μ n (k) is taken as 0, and the update is stopped to prevent divergence, thereby improving the signal transmission quality.
进一步,可以在所述待确定信号为双讲状态时减少非线性处理程度,使有效语音不受损伤,保证双讲性能;在所述待确定信号为回声路径改变状态PCS时增强非线性处理程度,防止残留回声的泄漏;在所述待确定信号为近端单讲NSTS与空闲状态IDS时,停止非线性处理,避免引起近端语音与环境音的失真;在所述待确定信号为远端单讲状态FSTS时不做特殊处理,正常抑制残留回声,从而提高信号传输质量。Further, the degree of nonlinear processing can be reduced when the signal to be determined is in the dual-talk state, so that effective speech is not damaged, and dual-talk performance is ensured; when the signal to be determined is the echo path change state PCS, the degree of nonlinear processing can be enhanced , To prevent the leakage of residual echo; when the signal to be determined is near-end single talk NSTS and idle state IDS, stop non-linear processing to avoid causing near-end voice and environmental sound distortion; when the signal to be determined is far-end No special processing is done in the single-talk state FSTS, and the residual echo is normally suppressed, thereby improving the signal transmission quality.
进一步,可以在所述待确定信号为近端单讲状态与双讲状态时,放慢噪声更新速度,保证有效语音的可懂度;在所述待确定信号为远 端单讲与回声路径改变时,提高非平稳噪声抑制能力,起到对残留回声的抑制作用;在所述待确定信号为空闲状态,即背景噪声IDS状态时,不做特殊处理,正常跟踪背景噪声,从而提高信号传输质量。Further, when the signal to be determined is in the near-end single-talk state and the dual-talk state, the noise update speed can be slowed down to ensure the intelligibility of the effective voice; when the signal to be determined is the far-end single-talk and echo path changes When the non-stationary noise suppression ability is improved, the residual echo is suppressed; when the signal to be determined is in the idle state, that is, the background noise IDS state, no special processing is performed, and the background noise is normally tracked, thereby improving the signal transmission quality .
附图说明Description of the drawings
图1是现有技术中一种AEC系统的结构示意图;Figure 1 is a schematic diagram of the structure of an AEC system in the prior art;
图2是本发明实施例中一种回声声场状态确定方法的流程图;2 is a flowchart of a method for determining the state of an echo sound field in an embodiment of the present invention;
图3是本发明实施例中另一种回声声场状态确定方法的流程图;FIG. 3 is a flowchart of another method for determining the state of an echo sound field in an embodiment of the present invention;
图4是本发明实施例中一种AEC系统的结构示意图;Figure 4 is a schematic structural diagram of an AEC system in an embodiment of the present invention;
图5是本发明实施例中一种回声声场状态确定装置的结构示意图。Fig. 5 is a schematic structural diagram of a device for determining an echo sound field state in an embodiment of the present invention.
具体实施方式Detailed ways
如前所述,在实时语音通信与基于IP的语音传输过程中,通信终端扬声器发出的声音,总会被该终端的麦克风拾取到,若是不处理就发送出去,对方总能听到自己说话的声音,体验不佳。采用回声消除对回声进行消除是公知的方法,通常的AEC系统包括针对线性回声处理的自适应滤波AF以及针对残留回声处理的非线性部分。As mentioned above, in the process of real-time voice communication and IP-based voice transmission, the sound emitted by the speaker of the communication terminal will always be picked up by the microphone of the terminal. If it is not processed and sent out, the other party can always hear the voice. Sound, poor experience. It is a well-known method to use echo cancellation to cancel the echo. A typical AEC system includes an adaptive filter AF for linear echo processing and a nonlinear part for residual echo processing.
参照图1,图1是现有技术中一种AEC系统的结构示意图。Referring to Fig. 1, Fig. 1 is a schematic structural diagram of an AEC system in the prior art.
如图1所示,信号x(n)经过扬声器(statistical process control,SPK)之后得到信号h(n),该信号具有回声(echo),与语音信号(voice)以及噪声信号(noise)经由麦克风(MIC)之后输出信号d(n)。As shown in Figure 1, the signal x(n) passes through the speaker (statistical process control, SPK) to obtain the signal h(n). After (MIC), the signal d(n) is output.
对所述信号d(n)以及信号x(n)分别进行短时傅里叶变换(short-time Fourier transform,或short-term Fourier transform,STFT)得到近端信号D n(k)以及远端信号X n(k),自适应滤波器(Adaptive Filters,AF)可以根据远端信号X n(k)与滤波器系数W n(k)计算出回声 估计信号Y n(k),并与近端信号D n(k)相减得到残差信号E n(k)。 Short-time Fourier transform (short-time Fourier transform, or short-term Fourier transform, STFT) is performed on the signal d(n) and signal x(n) respectively to obtain a near-end signal D n (k) and a far-end signal signal X n (k), the adaptive filter (adaptive filters, AF) can be calculated far-end signal X n (k) with the filter coefficients W n (k) the echo estimation signal Y n (k), and the near end of the signal D n (k) obtained by subtracting the residual signal E n (k).
在具体实施中,可以根据滤波器系数W n(k)进行滤波器系数更新,得到W n+1(k)。 In a specific implementation, the filter coefficient can be updated according to the filter coefficient W n (k) to obtain W n+1 (k).
进而可以将残差信号E n(k)输入后处理非线性处理单元(Non-linear programming,NLP)以及后处理噪声抑制单元(Noise suppression,NS)。 Further the residual signal E n (k) may be nonlinear input processing unit (Non-linear programming, NLP) and post-processing noise suppression unit (Noise suppression, NS).
回声声场状态中双讲状态的检测显得尤为重要,常规的双讲检测方法大致分为三类:基于能量的检测、基于相关性的检测以及基于回声路径的检测。其中基于能量的检测最为简单,极度依赖于回声信号强度、近端语音信号强度与背景噪声强度的稳定性,误判率非常高;基于相关性的检测受限于器件的特性,当扬声器非线性失真较大时,该方法的性能急剧下降;基于回声路径的检测,如估计喇叭冲激响应、可变冲击响应等,当回声路径变化时性能变差。然而,在现有技术中,回声声场状态确定的准确性较低,进而影响回声消除效果。The detection of the dual-talk state in the echo sound field state is particularly important. Conventional dual-talk detection methods can be roughly divided into three categories: energy-based detection, correlation-based detection, and echo path-based detection. Among them, the energy-based detection is the simplest, which is extremely dependent on the stability of the echo signal strength, the near-end speech signal strength and the background noise strength, and the misjudgment rate is very high; the correlation-based detection is limited by the characteristics of the device, when the speaker is nonlinear When the distortion is large, the performance of this method drops sharply; based on the detection of the echo path, such as estimating the horn impulse response, variable impulse response, etc., the performance becomes worse when the echo path changes. However, in the prior art, the accuracy of determining the state of the echo sound field is low, which in turn affects the effect of echo cancellation.
本发明的发明人经过研究发现,现有的回声声场状态的确定方法都只是将回声声场状态简单的划分为单讲状态(Single Talk State,STS)与双讲状态(Double Talk State,DTS)加以检测,但在实际情况中用于表示回声路径的变化的回声路径变化状态(Path Change State,PCS)却缺乏有效的检测方法,往往被误判为DTS,使得最需要处理的回声反而被最大限度的保留了下来,导致回声声场状态确定有误,进而影响回声消除效果。The inventors of the present invention have discovered through research that the existing methods for determining the state of the echo sound field simply divide the state of the echo sound field into a single talk state (Single Talk State, STS) and a double talk state (Double Talk State, DTS). However, in actual situations, the path change state (Path Change State, PCS) used to indicate the change of the echo path lacks an effective detection method, and is often misjudged as DTS, so that the echo that needs to be processed most is maximized The retention of the echo sound field status is incorrect, which affects the echo cancellation effect.
在本发明实施例中,通过设置至少根据滤波器更新度Cef update大于预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态是否为回声路径变化状态,可以设置适当的参数,对所述待确定信号实际上为回声路径变化状态的情况进行准确判断,相比于现有技术中只是将回声声场状态简单的划分为单讲状态与双讲状态加以检测,容易将回声路径变化状态误判为双讲状态,采用本发明实施例的方案,可以有效提高对回声路径变化状态判断的准确性。 In the embodiment of the present invention, by setting the filter update degree Cef update to be greater than the preset update degree threshold Thrd update to determine whether the echo sound field state of the signal to be determined is the echo path change state, appropriate parameters can be set, The signal to be determined is actually the state of the echo path change. Compared with the prior art, the echo sound field state is simply divided into a single-talk state and a dual-talk state for detection, which makes it easier to detect the echo path change state. The misjudgment is the dual-talk state, and the solution of the embodiment of the present invention can effectively improve the accuracy of the judgment of the echo path change state.
为使本发明的上述目的、特征和有益效果能够更为明显易懂,下面结合附图对本发明的具体实施例做详细的说明。In order to make the above objectives, features and beneficial effects of the present invention more obvious and understandable, specific embodiments of the present invention will be described in detail below with reference to the accompanying drawings.
参照图2,图2是本发明实施例中一种回声声场状态确定方法的流程图。实施回声声场状态确定方法包括步骤S21至步骤S24:Referring to FIG. 2, FIG. 2 is a flowchart of a method for determining the state of an echo sound field in an embodiment of the present invention. The method for determining the state of the echo sound field includes steps S21 to S24:
步骤S21:获取待确定信号;Step S21: Obtain a signal to be determined;
步骤S22:确定所述待确定信号的远端信号、近端信号以及滤波器系数;Step S22: Determine the far-end signal, the near-end signal, and filter coefficients of the signal to be determined;
步骤S23:至少根据所述远端信号、近端信号以及滤波器系数,确定滤波器更新度;Step S23: Determine the filter update degree at least according to the far-end signal, the near-end signal and the filter coefficient;
步骤S24:至少根据滤波器更新度大于预设更新度阈值,确定所述待确定信号的回声声场状态是否为回声路径变化状态。Step S24: Determine whether the echo sound field state of the signal to be determined is the echo path change state at least according to the filter update degree being greater than the preset update degree threshold.
可以理解的是,在具体实施中,所述方法可以采用软件程序的方式实现,该软件程序运行于芯片或芯片模组内部集成的处理器中。It can be understood that, in specific implementation, the method can be implemented in the form of a software program that runs on a processor integrated inside a chip or a chip module.
在步骤S21的具体实施中,具有不同的回声声场状态的待确定信号可以包含不同的信号,例如可以包含通信终端扬声器发出的声音被该终端的麦克风拾取后得到的信号,还可以仅包含远端信号。在本发明实施例中,通过准确确定待确定信号的回声声场状态,可以更有效地实现对回声的消除。In the specific implementation of step S21, the to-be-determined signals with different echo sound field states may include different signals, for example, may include the signal obtained after the sound emitted by the speaker of the communication terminal is picked up by the microphone of the terminal, and may also include only the remote Signal. In the embodiment of the present invention, by accurately determining the echo sound field state of the signal to be determined, the echo cancellation can be achieved more effectively.
在步骤S22的具体实施中,确定所述待确定信号的远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k)。 In the specific implementation of step S22, the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k) of the signal to be determined are determined.
具体地,可以采用常规的技术,确定所述待确定信号的远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k)。例如对图1示出的信号d(n)以及信号x(n)分别进行短时傅里叶变换,以得到近端信号D n(k)以及远端信号X n(k),还可以采用适当的方法,确定滤波器系数W n(k)。 Specifically, conventional techniques may be used to determine the far-end signal X n (k), the near-end signal D n (k), and the filter coefficient W n (k) of the signal to be determined. For example, short-time Fourier transform is performed on the signal d(n) and signal x(n) shown in FIG. 1 to obtain the near-end signal D n (k) and the far-end signal X n (k). Determine the filter coefficient W n (k) by an appropriate method.
在步骤S23的具体实施中,确定滤波器更新度Cef updateIn the specific implementation of step S23, the filter update degree Cef update is determined.
进一步地,至少根据所述远端信号X n(k)、近端信号D n(k)以及滤 波器系数W n(k),确定滤波器更新度Cef update的步骤可以包括:根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定残差信号E n(k);根据所述残差信号E n(k),确定更新后的滤波器系数W n+1(k);根据所述滤波器系数W n(k)以及更新后的滤波器系数W n+1(k),确定所述滤波器更新度Cef updateFurther, according to at least the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k), the step of determining the filter update degree Cef update may include: end signal X n (k), near-end signal D n (k) and the filter coefficients W n (k), determining the residual signal E n (k); according to the residual signal E n (k), determine the update After the filter coefficient W n+1 (k); according to the filter coefficient W n (k) and the updated filter coefficient W n+1 (k), determine the filter update degree Cef update .
更进一步地,可以采用下述公式确定残差信号E n(k): Still further, the following formula may be used to determine the residual signal E n (k):
Figure PCTCN2021079181-appb-000008
Figure PCTCN2021079181-appb-000008
更进一步地,可以采用下述公式确定更新后的滤波器系数W n+1(k),其中,更新步长μ n(k)用于指示所述滤波器系数W n(k)更新的步长: Furthermore, the following formula can be used to determine the updated filter coefficient W n+1 (k), where the update step size μ n (k) is used to indicate the update step of the filter coefficient W n (k) long:
Figure PCTCN2021079181-appb-000009
Figure PCTCN2021079181-appb-000009
更进一步地,可以采用下述公式确定滤波器更新度Cef updateFurthermore, the following formula can be used to determine the filter update degree Cef update :
Figure PCTCN2021079181-appb-000010
Figure PCTCN2021079181-appb-000010
需要指出的是,在本发明实施例中,还可以采用其他适当的方法确定上述参数,本发明实施例对此不做限制。It should be pointed out that in the embodiment of the present invention, other appropriate methods may also be used to determine the above-mentioned parameters, which is not limited in the embodiment of the present invention.
在步骤S24的具体实施中,可以至少根据滤波器更新度Cef update大于预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态是否为回声路径变化状态。 In the specific implementation of step S24, it may be determined whether the echo sound field state of the signal to be determined is the echo path change state at least according to the filter update degree Cef update being greater than the preset update degree threshold Thrd update.
进一步地,在本发明实施例的一种具体实施方式中,如果滤波器更新度Cef update大于预设更新度阈值Thrd update,则可以判断为所述待确定信号的回声声场状态为回声路径变化状态。 Further, in a specific implementation manner of the embodiment of the present invention, if the filter update degree Cef update is greater than the preset update degree threshold Thrd update , it can be determined that the echo sound field state of the signal to be determined is the echo path change state .
在本发明实施例中,通过设置至少根据滤波器更新度Cef update大于预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态是否为回声路径变化状态,可以设置适当的参数,对所述待确定信号实际 上为回声路径变化状态的情况进行准确判断,相比于现有技术中只是将回声声场状态简单的划分为单讲状态与双讲状态加以检测,容易将回声路径变化状态误判为双讲状态,采用本发明实施例的方案,可以有效提高对回声路径变化状态判断的准确性。 In the embodiment of the present invention, by setting the filter update degree Cef update to be greater than the preset update degree threshold Thrd update to determine whether the echo sound field state of the signal to be determined is the echo path change state, appropriate parameters can be set, The signal to be determined is actually the state of the echo path change. Compared with the prior art, the echo sound field state is simply divided into a single-talk state and a dual-talk state for detection, which makes it easier to detect the echo path change state. The misjudgment is the dual-talk state, and the solution of the embodiment of the present invention can effectively improve the accuracy of the judgment of the echo path change state.
进一步地,所述的回声声场状态确定方法还可以包括:至少根据所述滤波器更新度Cef update小于等于所述预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态是否为远端单讲状态。 Further, the method for determining the echo sound field state may further include: determining whether the echo sound field state of the signal to be determined is far, at least according to the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update Single talk status.
在本发明实施例的一种具体实施方式中,如果滤波器更新度Cef update小于等于所述预设更新度阈值Thrd update,则可以判断为所述待确定信号的回声声场状态为远端单讲状态。 In a specific implementation of the embodiment of the present invention, if the filter update degree Cef update is less than or equal to the preset update degree threshold Thrd update , it can be determined that the echo sound field state of the signal to be determined is remote single talk state.
在本发明实施例中,至少根据所述滤波器更新度Cef update小于等于所述预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态为远端单讲状态,相比于现有技术中容易将回声路径变化状态误判为双讲状态,采用本发明实施例的方案,可以进一步有效提高对回声路径变化状态判断的准确性。 In the embodiment of the present invention, at least according to the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update , it is determined that the echo sound field state of the signal to be determined is the remote single talk state, which is compared with the current state. In some technologies, it is easy to misjudge the change state of the echo path as the dual-talk state. The solution of the embodiment of the present invention can further effectively improve the accuracy of the judgment of the change state of the echo path.
进一步地,在确定所述待确定信号的回声声场状态是否为回声路径变化状态之前,所述的回声声场状态确定方法还可以包括:对所述近端信号D n(k)进行语音激活检测,以得到近端语音激活标志DVflag;如果所述近端语音激活标志DVflag不等于1,则判断所述待确定信号的回声声场状态为空闲状态。 Further, before determining whether the echo sound field state of the signal to be determined is an echo path change state, the method for determining the echo sound field state may further include: performing voice activation detection on the near-end signal D n (k), To obtain the near-end voice activation flag DVflag; if the near-end voice activation flag DVflag is not equal to 1, it is determined that the echo sound field state of the signal to be determined is an idle state.
需要指出的是,在本发明实施例中,对所述近端信号D n(k)进行语音激活检测,并根据所述近端语音激活标志DVflag判断所述待确定信号的回声声场状态为空闲状态的步骤还可以设置在步骤S24之后执行。本发明实施例对于判断近端语音激活标志DVflag的步骤与步骤S24的先后顺序不做限制。 It should be pointed out that, in the embodiment of the present invention, the near-end signal D n (k) is subjected to voice activation detection, and the echo sound field state of the signal to be determined is determined to be idle according to the near-end voice activation flag DVflag The state step can also be set to be executed after step S24. The embodiment of the present invention does not limit the sequence of the step of judging the near-end voice activation flag DVflag and the step S24.
在本发明实施例中,通过判断近端语音激活标志DVflag不等于1时,所述待确定信号的回声声场状态为空闲状态,可以在近端语音 激活标志DVflag不为1时,认为近端无语音,否则说明近端存在语音,需要对所述待确定信号进一步进行判断。In the embodiment of the present invention, by judging that the near-end voice activation flag DVflag is not equal to 1, the echo sound field state of the signal to be determined is idle. When the near-end voice activation flag DVflag is not 1, it can be considered that the near-end has no Voice, otherwise it means that there is voice at the near end, and further judgment on the to-be-determined signal is needed.
进一步地,在确定所述待确定信号的回声声场状态是否为回声路径变化状态之前,所述的回声声场状态确定方法还可以包括:对所述远端信号X n(k)进行语音激活检测,以得到远端语音激活标志XVflag;如果所述远端语音激活标志XVflag不等于1,则判断所述待确定信号的回声声场状态为近端单讲状态。 Further, before determining whether the echo sound field state of the signal to be determined is an echo path change state, the method for determining the echo sound field state may further include: performing voice activation detection on the far-end signal X n (k), To obtain the far-end voice activation flag XVflag; if the far-end voice activation flag XVflag is not equal to 1, it is determined that the echo sound field state of the signal to be determined is the near-end single talk state.
需要指出的是,在本发明实施例中,对所述远端信号X n(k)进行语音激活检测,并根据所述远端语音激活标志XVflag判断所述待确定信号的回声声场状态为近端单讲状态的步骤还可以设置在步骤S24之后执行。本发明实施例对于判断远端语音激活标志XVflag的步骤与步骤S24的先后顺序不做限制。 It should be pointed out that, in the embodiment of the present invention, the voice activation detection is performed on the far-end signal X n (k), and the echo sound field state of the signal to be determined is judged to be close according to the far-end voice activation flag XVflag. The step of the single-talk state can also be set to be executed after step S24. The embodiment of the present invention does not limit the sequence of the step of determining the remote voice activation flag XVflag and the step S24.
需要指出的是,语音激活检测技术可以采用公知技术,常见有能量检测、过零率检测、谱熵检测与基音检测等等,本发明实施例对此不做具体限制。It should be pointed out that the voice activation detection technology can adopt well-known technologies, such as energy detection, zero-crossing rate detection, spectral entropy detection, pitch detection, etc., which are not specifically limited in the embodiment of the present invention.
在本发明实施例中,通过判断远端语音激活标志XVflag不等于1时,所述待确定信号的回声声场状态为近端单讲状态,可以在远端语音激活标志XVflag不为1时,认为远端无信号,近端信号中无回声信号,当前状态为近端单讲状态,否则说明近端信号中有回声的存在,需要对所述待确定信号进一步进行判断。In the embodiment of the present invention, by judging that the far-end voice activation flag XVflag is not equal to 1, the echo sound field state of the signal to be determined is the near-end single talk state, and it can be considered that when the far-end voice activation flag XVflag is not 1, There is no signal at the far end, no echo signal in the near-end signal, and the current state is the near-end single talk state. Otherwise, it indicates that there is echo in the near-end signal, and further judgment on the to-be-determined signal is required.
进一步地,在确定所述待确定信号的回声声场状态是否为回声路径变化状态之前,所述的回声声场状态确定方法还可以包括:确定所述待确定信号的回波抑制比Err;如果所述回波抑制比Err大于预设回波阈值Thrd err,则判断所述待确定信号的回声声场状态为远端单讲状态。 Further, before determining whether the echo sound field state of the signal to be determined is an echo path change state, the method for determining the echo sound field state may further include: determining the echo suppression ratio Err of the signal to be determined; if said If the echo suppression ratio Err is greater than the preset echo threshold Thrd err , it is determined that the echo sound field state of the signal to be determined is the remote single talk state.
需要指出的是,在本发明实施例中,确定所述待确定信号的回波抑制比Err,并判断所述待确定信号的回声声场状态为远端单讲状态 的步骤还可以设置在步骤S24之后执行。本发明实施例对于判断待确定信号的回波抑制比Err的步骤与步骤S24的先后顺序不做限制。It should be pointed out that, in the embodiment of the present invention, the step of determining the echo suppression ratio Err of the signal to be determined and determining that the echo sound field state of the signal to be determined is the remote single talk state can also be set in step S24. Execute afterwards. The embodiment of the present invention does not limit the sequence of the step of determining the echo suppression ratio Err of the signal to be determined and the step S24.
更进一步地,确定所述待确定信号的回波抑制比Err的步骤可以包括:根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定残差信号E n(k);根据所述近端信号D n(k)与残差信号E n(k),确定信号的回波抑制比Err。 Furthermore, the step of determining the echo suppression ratio Err of the signal to be determined may include: according to the far-end signal X n (k), the near-end signal D n (k), and the filter coefficient W n (k) determining residual signal E n (k); according to the end signal D n (k) and the residual signal E n (k), the echo signal suppression ratio determined Err.
更进一步地,可以采用下述公式,确定残差信号E n(k): Still further, the following formula may be used to determine the residual signal E n (k):
Figure PCTCN2021079181-appb-000011
Figure PCTCN2021079181-appb-000011
更进一步地,可以采用下述公式,确定信号的回波抑制比Err:Furthermore, the following formula can be used to determine the signal echo suppression ratio Err:
Figure PCTCN2021079181-appb-000012
Figure PCTCN2021079181-appb-000012
其中,k为所述待确定信号的频率索引。Wherein, k is the frequency index of the signal to be determined.
在本发明实施例中,通过判断回波抑制比Err大于预设回波阈值Thrd err,说明残差信号相对幅度很小,近端信号成分中大部分确定为回声信号,已被自适应滤波器AF消除,当前状态为远端单讲状态,否则说明残差信号相对幅度仍较高且近端信号中成分不确定,需要对所述待确定信号进一步进行判断。 In the embodiment of the present invention, by judging that the echo suppression ratio Err is greater than the preset echo threshold Thrd err , it indicates that the relative amplitude of the residual signal is very small, and most of the near-end signal components are determined to be echo signals, which have been determined by the adaptive filter. AF is eliminated, and the current state is the far-end single-talk state. Otherwise, it indicates that the relative amplitude of the residual signal is still high and the component in the near-end signal is uncertain, and further judgment on the signal to be determined is required.
作为一个非限制性的例子,门限值Thrd err参考值可以为12至20dB。 As a non-limiting example, the threshold Thrd err reference value may be 12 to 20 dB.
进一步,在确定所述待确定信号的回声声场状态是否为回声路径变化状态之前,所述的回声声场状态确定方法还可以包括:确定归一化互相关值C YE与C DE;如果C DE大于第一预设互相关阈值Thrd1 coh,且C YE小于第二预设互相关阈值Thrd2 coh,则判断所述待确定信号的回声声场状态为双讲状态;其中,所述第一预设互相关阈值Thrd1 coh大于 等于所述第二预设互相关阈值Thrd2 cohFurther, before determining whether the echo sound field state of the signal to be determined is the echo path change state, the echo sound field state determination method may further include: determining the normalized cross-correlation values C YE and C DE ; if C DE is greater than If the first preset cross-correlation threshold Thrd1 coh and C YE is less than the second preset cross-correlation threshold Thrd2 coh , it is determined that the echo sound field state of the signal to be determined is a dual-talk state; wherein, the first preset cross-correlation The threshold Thrd1 coh is greater than or equal to the second preset cross-correlation threshold Thrd2 coh .
更进一步地,可以采用下述公式,确定归一化互相关值C YE与C DEFurthermore, the following formula can be used to determine the normalized cross-correlation values C YE and C DE :
Figure PCTCN2021079181-appb-000013
Figure PCTCN2021079181-appb-000013
Figure PCTCN2021079181-appb-000014
Figure PCTCN2021079181-appb-000014
其中,M与L为所述待确定信号的频段索引。Wherein, M and L are the frequency band indexes of the signal to be determined.
在本发明实施例中,通过归一化互相关值对近端信号与残差信号成分做进一步确定,在滤波器收敛情况下,残差数据E n(k)相当于已与回声信号去相关,此时若C DE大于门限Thrd1 coh,说明近端信号中含有很多与回声不相关的成分,但若滤波器未收敛,则残差信号中还会含有大量回声成分,该结论则不成立;故采用C YE做进一步确认,若C YE小于门限Thrd2 coh说明残差信号中回声成分已很少,结合C DE大于门限Thrd1 coh的条件可确认近端信号中含有与回声不相关的成分,此时当前状态为双讲状态,否则说明信号成分无法确定,需要对所述待确定信号进一步进行判断。 In an embodiment of the present invention, the residual signal of the near-end signal component further determined by normalizing the cross-correlation value, at the convergence of the filter, the residual data E n (k) corresponding to the echo signal is decorrelated At this time, if C DE is greater than the threshold Thrd1 coh , it means that the near-end signal contains many components that are not related to echo. However, if the filter does not converge, the residual signal will also contain a large amount of echo components, and this conclusion is not valid; C YE is used for further confirmation. If C YE is less than the threshold Thrd2 coh, it means that there are few echo components in the residual signal. Combined with the condition that C DE is greater than the threshold Thrd1 coh , it can be confirmed that the near-end signal contains components that are not related to echo. The current state is a dual-talk state, otherwise it means that the signal component cannot be determined, and the signal to be determined needs to be further judged.
更进一步地,所述归一化互相关值C YE与C DE为线性区归一化互相关值;其中,M与L为线性区的频段索引。 Furthermore, the normalized cross-correlation values C YE and C DE are normalized cross-correlation values in the linear region; wherein M and L are frequency band indexes of the linear region.
在本发明实施例中,所述归一化互相关值C YE与C DE为线性区归一化互相关值;其中,M与L为线性区的频段索引,通过在线性区取值,可以提高判断的准确性。 In the embodiment of the present invention, the normalized cross-correlation values C YE and C DE are the normalized cross-correlation values of the linear region; where M and L are the frequency band indexes of the linear region. By taking the value in the linear region, you can Improve the accuracy of judgment.
需要指出的是,通过设置M与L为线性区对应的频段索引,由于器件非线性失真具有谐波特征,常分布在中高频,故本发明给出参考频率范围,M对应低频段在100~300Hz区间,L对应高频段在 2500~3000Hz区间,该范围仅为参考值,实际使用不受此限制。It should be pointed out that by setting M and L as the frequency band index corresponding to the linear region, since the nonlinear distortion of the device has harmonic characteristics and is often distributed in the middle and high frequencies, the present invention gives the reference frequency range, and M corresponds to the low frequency band in 100~ In the 300Hz interval, L corresponds to the high frequency band in the 2500~3000Hz interval. This range is only a reference value, and the actual use is not limited by this.
更进一步地,如果滤波器更新度Cef update大于预设更新度阈值Thrd update,则确定所述待确定信号的回声声场状态为回声路径变化状态;如果所述滤波器更新度Cef update小于等于所述预设更新度阈值Thrd update,则确定所述待确定信号的回声声场状态为远端单讲状态。 Further, if the filter update degree Cef update is greater than the preset update degree threshold Thrd update , it is determined that the echo sound field state of the signal to be determined is the echo path change state; if the filter update degree Cef update is less than or equal to the The preset update threshold Thrd update determines that the echo sound field state of the signal to be determined is the remote single talk state.
也即在本发明实施例中,可以设置判断滤波器更新度Cef update大于预设更新度阈值Thrd update的步骤在判断双讲状态之后。 That is, in the embodiment of the present invention, the step of judging that the filter update degree Cef update is greater than the preset update degree threshold Thrd update may be set after judging the dual-talk state.
需要指出的是,回波抑制比Err为回声信号相对消除量,避免了回声信号强度的影响;归一化互相关量C YE与C DE,采用了归一化处理,与远近端信号强度无关,同时采用线性区计算减少了器件失真的影响;滤波器更新度Cef update利用AF本身一定程度的稳健性,反映了回声路径的变化强度。所以这些特征的综合使用能有效解决回声信号强度变化、远近端信号强度变化、器件失真以及回声路径变化等不确定因素对检测准确度的影响。 It should be pointed out that the echo suppression ratio Err is the relative cancellation amount of the echo signal, which avoids the influence of the echo signal strength; the normalized cross-correlation quantities C YE and C DE are normalized and have nothing to do with the signal strength of the far and near ends. At the same time, the linear region calculation is used to reduce the influence of device distortion; the filter update degree Cef update uses a certain degree of robustness of the AF itself to reflect the change intensity of the echo path. Therefore, the comprehensive use of these features can effectively solve the influence of uncertain factors such as echo signal strength changes, far and near-end signal strength changes, device distortion, and echo path changes on the detection accuracy.
在本发明实施例中,在判断并排除双讲状态之后,根据滤波器收敛度Cef update大于门限Thrd update,判断为回声路径改变状态,根据滤波器更新度Cef update小于等于所述预设更新度阈值Thrd update,判断为远端单讲状态,可以说明滤波器处于快速更新状态,由于之前判决已排除确定性双讲状态,近端语音信号对滤波器的干扰已不会太高,此时快速更新只能是未收敛或回声路径改变造成,当前状态为回声路径改变状态,否则当前特征暂无明显区分度,视为不确定状态,在本发明实施例中,可以确定为远端单讲状态。 In the embodiment of the present invention, after the dual-talk state is judged and excluded, it is judged that the echo path change state according to the filter convergence Cef update being greater than the threshold Thrd update , and according to the filter update degree Cef update being less than or equal to the preset update degree Threshold Thrd update , judged as the far-end single-talk state, can indicate that the filter is in the fast update state. Since the previous judgment has ruled out the deterministic dual-talk state, the interference of the near-end voice signal to the filter is not too high. The update can only be caused by non-convergence or echo path change. The current state is the echo path change state. Otherwise, the current feature has no obvious distinction and is regarded as an uncertain state. In the embodiment of the present invention, it can be determined as the remote single talk state .
需要指出的是,在滤波器更新度Cef update小于等于所述预设更新度阈值Thrd update时,当前特征暂无明显区分度,可以视为不确定状态,本发明的发明人经过研究和实践,选择以远端单讲状态FSTS处理。 It should be pointed out that when the filter update degree Cef update is less than or equal to the preset update degree threshold Thrd update , the current feature has no obvious distinguishability for the time being and can be regarded as an uncertain state. The inventor of the present invention has studied and practiced, Select FSTS processing in remote single talk state.
作为一个非限制性的例子,Thrd1 coh参考取值可以为0.3至0.5,Thrd2 coh参考取值可以为0.1至0.3。 As a non-limiting example, the reference value of Thrd1 coh may be 0.3 to 0.5, and the reference value of Thrd2 coh may be 0.1 to 0.3.
参照图3,图3是本发明实施例中另一种回声声场状态确定方法的流程图。所述另一种回声声场状态确定方法可以包括步骤S301至步骤S311,以下对各个步骤进行说明。Referring to FIG. 3, FIG. 3 is a flowchart of another method for determining the state of an echo sound field in an embodiment of the present invention. The another method for determining the state of the echo sound field may include step S301 to step S311, and each step will be described below.
在步骤S301中,判断DVflag是否等于1;当判断结果为是时,可以执行步骤S302;反之,则可以执行步骤S303。In step S301, it is judged whether DVflag is equal to 1; when the judgment result is yes, step S302 can be executed; otherwise, step S303 can be executed.
在步骤S302中,判断XVflag是否等于1;当判断结果为是时,可以执行步骤S304;反之,则可以执行步骤S305。In step S302, it is judged whether XVflag is equal to 1; when the judgment result is yes, step S304 can be executed; otherwise, step S305 can be executed.
在步骤S303中,判断回声声场状态为空闲状态(IDS)。In step S303, it is determined that the state of the echo sound field is the idle state (IDS).
在步骤S304中,判断Err是否大于Thrd err;当判断结果为是时,可以执行步骤S306;反之,则可以执行步骤S307。 In step S304, it is judged whether Err is greater than Thrd err ; when the judgment result is yes, step S306 can be executed; otherwise, step S307 can be executed.
在步骤S305中,判断回声声场状态为近端单讲状态(NSTS)。In step S305, it is determined that the state of the echo sound field is the near-end single talk state (NSTS).
在步骤S306中,判断回声声场状态为远端单讲状态(FSTS)。In step S306, it is determined that the state of the echo sound field is the far-end single talk state (FSTS).
在步骤S307中,判断C DE是否大于Thrd1 coh,且C YE小于Thrd2 coh;当判断结果为是时,可以执行步骤S308;反之,则可以执行步骤S309。 In step S307, it is judged whether C DE is greater than Thrd1 coh and C YE is less than Thrd2 coh ; when the judgment result is yes, step S308 can be executed; otherwise, step S309 can be executed.
在步骤S308中,判断回声声场状态为双讲状态(DTS)。In step S308, it is determined that the state of the echo sound field is a dual talk state (DTS).
在步骤S309中,判断Cef update大于Thrd update;当判断结果为是时,可以执行步骤S310;反之,则可以执行步骤S311。 In step S309, it is determined that Cef update is greater than Thrd update ; when the determination result is yes, step S310 can be executed; otherwise, step S311 can be executed.
在步骤S310中,判断回声声场状态为回声路径变化状态(PCS)。In step S310, it is determined that the echo sound field state is the echo path change state (PCS).
在步骤S311中,判断回声声场状态为远端单讲状态(FSTS)。In step S311, it is determined that the state of the echo sound field is the far-end single talk state (FSTS).
需要指出的是,本实施例中各个步骤的序号并不代表对各个步骤的执行顺序的限定。例如,不限制步骤S301、S302、S304、S307、S309之间的步骤顺序。It should be pointed out that the sequence number of each step in this embodiment does not represent a limitation on the execution order of each step. For example, the order of steps between steps S301, S302, S304, S307, and S309 is not limited.
在本发明实施例的一种具体实施方式中,步骤S309可以设置在S307之后,以提高对回声路径变化状态进行判断的准确性。In a specific implementation manner of the embodiment of the present invention, step S309 may be set after S307 to improve the accuracy of judging the change state of the echo path.
在本发明实施例中,选用的特征与判决方法对信号强度变化(远 近端、回声信号)、器件失真与回声路径改变等不确定因素具有很强的鲁棒性,且多种特征的联合运用使得检测精度更高,性能更可靠。In the embodiment of the present invention, the selected features and decision methods are robust against uncertain factors such as signal strength changes (far and near ends, echo signals), device distortion and echo path changes, and the combined use of multiple features Makes the detection accuracy higher and the performance more reliable.
进一步地,所述的回声声场状态确定方法还可以包括根据所述待确定信号的回声声场状态,调整所述待确定信号的更新步长μ n(k);其中,更新步长μ n(k)用于指示所述滤波器系数W n(k)更新的步长。 Further, the method for determining the echo sound field state may further include adjusting the update step size μ n (k) of the signal to be determined according to the echo sound field state of the signal to be determined; wherein the update step size μ n (k) ) Is used to indicate the update step size of the filter coefficient W n (k).
更进一步地,调整更新步长μ n(k)包括以下一项或多项:如果确定所述待确定信号的回声声场状态为回声路径变化状态,则增加更新步长μ n(k);如果确定所述待确定信号的回声声场状态为双讲状态,则调整μ n(k)放慢更新;如果确定所述待确定信号的回声声场状态为空闲状态或近端单讲状态,则调整μ n(k)=0。 Furthermore, adjusting the update step size μ n (k) includes one or more of the following: if it is determined that the echo sound field state of the signal to be determined is the echo path change state, then the update step size μ n (k) is increased; if If it is determined that the echo sound field state of the signal to be determined is the dual talk state, adjust μ n (k) to slow down the update; if it is determined that the echo sound field state of the signal to be determined is the idle state or the near-end single talk state, adjust μ n (k)=0.
更进一步地,可以采用回声自适应滤波器调整所述待确定信号的更新步长μ n(k)。 Furthermore, an echo adaptive filter may be used to adjust the update step size μ n (k) of the signal to be determined.
在本发明实施例中,可以在所述待确定信号为回声路径改变状态时,增加更新步长μ n(k)取值,加快更新,快速收敛;在所述待确定信号为双讲状态DTS时,调整μ n(k)放慢更新,保证滤波器的稳健性;在所述待确定信号为远端单讲状态FSTS时,μ n(k)取正常值,不做特殊调整;在所述待确定信号为空闲状态IDS或近端单讲状态NSTS时,μ n(k)取0,停止更新,防止发散,从而提高信号传输质量。 In the embodiment of the present invention, when the signal to be determined is the echo path change state, the value of the update step μ n (k) can be increased to speed up the update and fast convergence; when the signal to be determined is in the dual-talk state DTS When, adjust μ n (k) to slow down the update to ensure the robustness of the filter; when the signal to be determined is the remote single talk state FSTS, μ n (k) takes the normal value without special adjustment; When the signal to be determined is in the idle state IDS or the near-end single talk state NSTS, μ n (k) is set to 0, and the update is stopped to prevent divergence, thereby improving the signal transmission quality.
进一步地,所述的回声声场状态确定方法还可以包括:根据所述待确定信号的回声声场状态,确定是否对所述待确定信号进行非线性处理。Further, the method for determining the state of the echo sound field may further include: determining whether to perform nonlinear processing on the signal to be determined according to the state of the echo sound field of the signal to be determined.
更进一步地,确定是否对所述待确定信号进行非线性处理的步骤可以包括以下一项或多项:如果确定所述待确定信号的回声声场状态为双讲状态,则减少非线性处理程度;如果确定所述待确定信号的回声声场状态为回声路径变化状态,则增强对所述待确定信号的非线性处理;如果确定所述待确定信号的回声声场状态为近端单讲状态,则停止对所述待确定信号的非线性处理;如果确定所述待确定信号的回 声声场状态为空闲状态,则停止对所述待确定信号的非线性处理。Furthermore, the step of determining whether to perform nonlinear processing on the signal to be determined may include one or more of the following: if it is determined that the echo sound field state of the signal to be determined is a dual-talk state, reducing the degree of nonlinear processing; If it is determined that the echo sound field state of the signal to be determined is the echo path change state, the nonlinear processing of the signal to be determined is enhanced; if it is determined that the echo sound field state of the signal to be determined is the near-end single talk state, stop Non-linear processing of the signal to be determined; if it is determined that the echo sound field state of the signal to be determined is an idle state, the non-linear processing of the signal to be determined is stopped.
更进一步地,可以采用后处理非线性处理单元对所述待确定信号进行非线性处理。Furthermore, a post-processing non-linear processing unit can be used to perform non-linear processing on the signal to be determined.
在本发明实施例中,可以在所述待确定信号为双讲状态时减少非线性处理程度,使有效语音不受损伤,保证双讲性能;在所述待确定信号为回声路径改变状态PCS时增强非线性处理程度,防止残留回声的泄漏;在所述待确定信号为近端单讲NSTS与空闲状态IDS时,停止非线性处理,避免引起近端语音与环境音的失真;在所述待确定信号为远端单讲状态FSTS时不做特殊处理,正常抑制残留回声,从而提高信号传输质量。In the embodiment of the present invention, the degree of non-linear processing can be reduced when the signal to be determined is in the dual-talk state, so that the effective voice is not damaged, and the dual-talk performance is ensured; when the signal to be determined is the echo path change state PCS Enhance the degree of non-linear processing to prevent leakage of residual echo; when the signal to be determined is near-end single talk NSTS and idle state IDS, stop non-linear processing to avoid causing near-end voice and environmental sound distortion; When it is determined that the signal is in the far-end single talk state FSTS, no special processing is performed, and the residual echo is normally suppressed, thereby improving the signal transmission quality.
进一步地,所述的回声声场状态确定方法还可以包括:根据所述待确定信号的回声声场状态,确定降低所述待确定信号的噪声更新速度或者提高所述待确定信号的非平稳噪声抑制能力。Further, the method for determining the state of the echo sound field may further include: according to the state of the echo sound field of the signal to be determined, determining to reduce the noise update speed of the signal to be determined or to increase the non-stationary noise suppression capability of the signal to be determined .
更进一步地,确定降低噪声更新速度或者提高非平稳噪声抑制能力的步骤可以包括以下一项或多项:如果确定所述待确定信号的回声声场状态为近端单讲状态,则降低所述待确定信号的噪声更新速度;如果确定所述待确定信号的回声声场状态为双讲状态,则降低所述待确定信号的噪声更新速度;如果确定所述待确定信号的回声声场状态为远端单讲状态,则提高所述待确定信号的非平稳噪声抑制能力;如果确定所述待确定信号的回声声场状态为回声路径变化状态,则提高所述待确定信号的非平稳噪声抑制能力。Furthermore, the step of determining to reduce the noise update speed or to improve the non-stationary noise suppression capability may include one or more of the following: if it is determined that the echo sound field state of the signal to be determined is the near-end single talk state, then the standby state is reduced. Determine the noise update speed of the signal; if it is determined that the echo sound field state of the signal to be determined is the dual-talk state, reduce the noise update speed of the signal to be determined; if it is determined that the echo sound field state of the signal to be determined is the remote single In terms of state, the non-stationary noise suppression capability of the signal to be determined is improved; if it is determined that the echo sound field state of the signal to be determined is the echo path change state, the non-stationary noise suppression capability of the signal to be determined is improved.
更进一步地,采用后处理噪声抑制单元降低所述待确定信号的噪声更新速度或者提高所述待确定信号的非平稳噪声抑制能力。Furthermore, a post-processing noise suppression unit is used to reduce the noise update speed of the signal to be determined or to improve the non-stationary noise suppression capability of the signal to be determined.
在本发明实施例中,可以在所述待确定信号为近端单讲状态与双讲状态时,放慢噪声更新速度,保证有效语音的可懂度;在所述待确定信号为远端单讲与回声路径改变时,提高非平稳噪声抑制能力,起到对残留回声的抑制作用;在所述待确定信号为空闲状态,即背景噪 声IDS状态时,不做特殊处理,正常跟踪背景噪声,从而提高信号传输质量。In the embodiment of the present invention, when the signal to be determined is in the near-end single-talk state and the dual-talk state, the noise update speed can be slowed down to ensure the intelligibility of effective speech; when the signal to be determined is the far-end single-talk state When the echo path is changed, the non-stationary noise suppression capability is improved, and the residual echo is suppressed; when the signal to be determined is in the idle state, that is, the background noise IDS state, no special processing is performed, and the background noise is normally tracked. Thereby improving the quality of signal transmission.
参照图4,图4是本发明实施例中一种AEC系统的结构示意图。Referring to Fig. 4, Fig. 4 is a schematic structural diagram of an AEC system in an embodiment of the present invention.
如图4所示,信号x(n)经过扬声器(SPK)之后得到信号h(n),该信号具有回声(echo),与语音信号(voice)以及噪声信号(noise)经由麦克风(MIC)之后输出信号d(n)。As shown in Figure 4, the signal x(n) passes through the loudspeaker (SPK) to obtain the signal h(n), which has echo, and the voice signal (voice) and noise signal (noise) after passing through the microphone (MIC) Output signal d(n).
对所述信号d(n)以及信号x(n)分别进行短时傅里叶变换(STFT)得到近端信号D n(k)以及远端信号X n(k),自适应滤波器(AF)可以根据远端信号X n(k)与滤波器系数W n(k)计算出回声估计信号Y n(k),并与近端信号D n(k)相减得到残差信号E n(k)。 The short-time Fourier transform (STFT) is performed on the signal d(n) and signal x(n) to obtain the near-end signal D n (k) and the far-end signal X n (k). The adaptive filter (AF ) can be calculated far-end signal X n (k) with the filter coefficients W n (k) the echo estimation signal Y n (k), and the near-end signal D n (k) obtained by subtracting the residual signal E n ( k).
在具体实施中,可以根据滤波器系数W n(k)进行滤波器系数更新,得到W n+1(k)。 In a specific implementation, the filter coefficient can be updated according to the filter coefficient W n (k) to obtain W n+1 (k).
进而可以将远端信号X n(k)、近端信号D n(k)、回声估计信号Y n(k)、残差信号E n(k)与滤波器系数W n(k)输入回声声场状态检测单元ESD,以进行信号特征计算,并根据计算结果做回声声场状态判决,得到具体的回声声场状态。 Further far-end signal may be X n (k), near-end signal D n (k), the echo estimation signal Y n (k), the residual signal E n (k) with the filter coefficients W n (k) back to the input sound field The state detection unit ESD performs signal feature calculation, and makes the echo sound field state judgment based on the calculation result, and obtains the specific echo sound field state.
如前所述,在本发明实施例中,可以将回声声状态细分为五种声场状态:远端单讲状态FSTS,近端单讲状态NSTS,双讲状态DTS,回声路径改变状态PCS以及空闲状态IDS(即为背景噪声)。As mentioned above, in the embodiment of the present invention, the echo state can be subdivided into five sound field states: far-end single-talk state FSTS, near-end single-talk state NSTS, dual-talk state DTS, echo path change state PCS, and IDS in idle state (ie, background noise).
进而可以设置自适应滤波器AF与后处理非线性处理单元(NLP)以及后处理噪声抑制单元(NS)通过ESD获取具体的声场状态,并做相应的处理。Furthermore, an adaptive filter AF and a post-processing non-linear processing unit (NLP) and a post-processing noise suppression unit (NS) can be set to obtain a specific sound field state through ESD, and perform corresponding processing.
进一步地,所述的回声声场状态确定方法还可以包括:确定所述待确定信号的临时声场状态;根据所述待确定信号的回声声场状态以及临时声场状态,确定对所述待确定信号保持双讲状态输出或者对所述待确定信号暂缓回声路径改变的输出。Further, the method for determining the state of the echo sound field may further include: determining the temporary sound field state of the signal to be determined; and determining to maintain the dual sound field state of the signal to be determined according to the echo sound field state and the temporary sound field state of the signal to be determined. Talking status output or output of delaying echo path change for the signal to be determined.
在本发明实施例中,若历史状态为双讲DTS,EStemp为远端单讲FSTS则通过保持时间Thold保持DTS输出,最大程度的保护近端语音。In the embodiment of the present invention, if the historical state is dual-talk DTS and EStemp is the remote single-talk FSTS, the DTS output is maintained through the holding time Thold to protect the near-end voice to the greatest extent.
更进一步地,确定对所述待确定信号保持双讲状态输出或者对所述待确定信号暂缓回声路径改变的输出包括以下一项或多项:如果所述待确定信号的回声声场状态为双讲状态,所述临时声场状态为远端单讲状态,则通过保持时间对所述待确定信号保持双讲状态输出;如果所述待确定信号的回声声场状态为双讲状态,所述临时声场状态为回声路径变化状态,则通过开始时间对所述待确定信号暂缓回声路径改变的输出。Further, the output determined to maintain the dual-talk state output for the signal to be determined or to suspend the echo path change for the signal to be determined includes one or more of the following: if the echo sound field state of the signal to be determined is dual-talk State, the temporary sound field state is the remote single-talk state, the signal to be determined is maintained in the dual-talk state output through the hold time; if the echo sound field state of the signal to be determined is the dual-talk state, the temporary sound field state If it is the echo path change state, the output of the echo path change is suspended for the signal to be determined through the start time.
在本发明实施例中,若历史状态为双讲DTS,EStemp为回声路径改变PCS则通过开始时间Tstart暂缓PCS的输出,此时强制状态输出为远端单讲FSTS,起到减少滤波器发散风险与抑制回声残留的折中效果。In the embodiment of the present invention, if the historical state is dual talk DTS and EStemp is the echo path change PCS, the output of the PCS will be suspended through the start time Tstart. At this time, the state output is forced to be the remote single talk FSTS to reduce the risk of filter divergence A compromise effect with suppressing echo residue.
作为一个非限制性的例子,Thold与Tstart的取值,可以设置在20至100ms之间。As a non-limiting example, the value of Thold and Tstart can be set between 20 and 100 ms.
参照图5,图5是本发明实施例中一种回声声场状态确定装置的结构示意图。所述回声声场状态确定装置可以包括:Referring to FIG. 5, FIG. 5 is a schematic structural diagram of a device for determining an echo sound field state in an embodiment of the present invention. The apparatus for determining the state of the echo sound field may include:
获取模块51,用于获取待确定信号;The obtaining module 51 is used to obtain the signal to be determined;
信号确定模块52,用于确定所述待确定信号的远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k); The signal determining module 52 is configured to determine the far-end signal X n (k), the near-end signal D n (k), and the filter coefficient W n (k) of the signal to be determined;
更新度确定模块53,用于至少根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定滤波器更新度Cef updateThe update degree determination module 53 is configured to determine the filter update degree Cef update at least according to the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k);
状态确定模块54,用于至少根据滤波器更新度Cef update大于预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态是否为回声路径变化状态。 The state determination module 54 is configured to determine whether the echo sound field state of the signal to be determined is the echo path change state at least according to the filter update degree Cef update being greater than the preset update degree threshold Thrd update.
在具体实施中,上述装置可以对应于用户设备中具有数据处理功能的芯片,如基带芯片;或者对应于用户设备中包括具有数据处理功能芯片的芯片模组,或者对应于用户设备。In a specific implementation, the foregoing device may correspond to a chip with data processing function in user equipment, such as a baseband chip; or a chip module including a chip with data processing function in user equipment, or a user equipment.
关于该回声声场状态确定装置的原理、具体实现和有益效果请参照前文及图2至图4示出的关于回声声场状态确定方法的相关描述,此处不再赘述。For the principle, specific implementation and beneficial effects of the device for determining the state of the echo sound field, please refer to the foregoing and the related description of the method for determining the state of the echo sound field shown in FIGS. 2 to 4, which will not be repeated here.
本发明实施例还提供了一种存储介质,其上存储有计算机指令,所述计算机指令运行时执行上述方法的步骤。所述存储介质可以是计算机可读存储介质,例如可以包括非挥发性存储器(non-volatile)或者非瞬态(non-transitory)存储器,还可以包括光盘、机械硬盘、固态硬盘等。The embodiment of the present invention also provides a storage medium on which computer instructions are stored, and the computer instructions execute the steps of the foregoing method when the computer instructions are executed. The storage medium may be a computer-readable storage medium, for example, it may include non-volatile memory (non-volatile) or non-transitory (non-transitory) memory, and may also include optical disks, mechanical hard drives, solid state hard drives, and the like.
本发明实施例还提供了一种终端,包括存储器和处理器,所述存储器上存储有能够在所述处理器上运行的计算机指令,所述处理器运行所述计算机指令时执行上述方法的步骤。所述终端包括但不限于手机、计算机、平板电脑等终端设备。An embodiment of the present invention also provides a terminal, including a memory and a processor, the memory stores computer instructions that can run on the processor, and the processor executes the steps of the above method when the computer instructions are executed. . The terminal includes, but is not limited to, terminal devices such as mobile phones, computers, and tablets.
关于上述实施例中描述的各个装置、产品包含的各个模块/单元,其可以是软件模块/单元,也可以是硬件模块/单元,或者也可以部分是软件模块/单元,部分是硬件模块/单元。例如,对于应用于或集成于芯片的各个装置、产品,其包含的各个模块/单元可以都采用电路等硬件的方式实现,或者,至少部分模块/单元可以采用软件程序的方式实现,该软件程序运行于芯片内部集成的处理器,剩余的(如果有)部分模块/单元可以采用电路等硬件方式实现;对于应用于或集成于芯片模组的各个装置、产品,其包含的各个模块/单元可以都采用电路等硬件的方式实现,不同的模块/单元可以位于芯片模组的同一组件(例如芯片、电路模块等)或者不同组件中,或者,至少部分模块/单元可以采用软件程序的方式实现,该软件程序运行于芯片模组内部集成的处理器,剩余的(如果有)部分模块/单元可以采用电路等硬件方式实现;对于应用于或集成于终端的各个装置、产品,其 包含的各个模块/单元可以都采用电路等硬件的方式实现,不同的模块/单元可以位于终端内同一组件(例如,芯片、电路模块等)或者不同组件中,或者,至少部分模块/单元可以采用软件程序的方式实现,该软件程序运行于终端内部集成的处理器,剩余的(如果有)部分模块/单元可以采用电路等硬件方式实现。Regarding the various modules/units contained in the various devices and products described in the above embodiments, they may be software modules/units, hardware modules/units, or part software modules/units and part hardware modules/units. . For example, for various devices and products that are applied to or integrated in a chip, the various modules/units contained therein can be implemented in the form of hardware such as circuits, or at least part of the modules/units can be implemented in the form of software programs. Runs on the integrated processor inside the chip, and the remaining (if any) part of the modules/units can be implemented by hardware methods such as circuits; for each device and product applied to or integrated in the chip module, the modules/units contained therein can be All are implemented by hardware such as circuits. Different modules/units can be located in the same component (such as a chip, circuit module, etc.) or different components of the chip module, or at least part of the modules/units can be implemented by software programs. The software program runs on the processor integrated inside the chip module, and the remaining (if any) part of the modules/units can be implemented by hardware methods such as circuits; for each device and product applied to or integrated in the terminal, the modules contained therein The modules/units can all be implemented by hardware such as circuits, and different modules/units can be located in the same component (for example, chip, circuit module, etc.) or different components in the terminal, or at least part of the modules/units can be implemented in the form of software programs Implementation, the software program runs on the processor integrated inside the terminal, and the remaining (if any) part of the modules/units can be implemented by hardware such as circuits.
虽然本发明披露如上,但本发明并非限定于此。任何本领域技术人员,在不脱离本发明的精神和范围内,均可作各种更动与修改,因此本发明的保护范围应当以权利要求所限定的范围为准。Although the present invention is disclosed as above, the present invention is not limited to this. Any person skilled in the art can make various changes and modifications without departing from the spirit and scope of the present invention. Therefore, the protection scope of the present invention should be subject to the scope defined by the claims.

Claims (27)

  1. 一种回声声场状态确定方法,其特征在于,包括以下步骤:A method for determining the state of an echo sound field is characterized in that it comprises the following steps:
    获取待确定信号;Obtain the signal to be determined;
    确定所述待确定信号的远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k); Determine the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k) of the signal to be determined;
    至少根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定滤波器更新度Cef updateDetermine the filter update degree Cef update at least according to the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k);
    至少根据滤波器更新度Cef update大于预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态是否为回声路径变化状态。 At least according to the filter update degree Cef update being greater than the preset update degree threshold Thrd update , it is determined whether the echo sound field state of the signal to be determined is the echo path change state.
  2. 根据权利要求1所述的回声声场状态确定方法,其特征在于,还包括:The method for determining the state of the echo sound field according to claim 1, further comprising:
    至少根据所述滤波器更新度Cef update小于等于所述预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态是否为远端单讲状态。 Determine whether the echo sound field state of the signal to be determined is a remote single talk state at least according to the filter update degree Cef update being less than or equal to the preset update degree threshold Thrd update.
  3. 根据权利要求1所述的回声声场状态确定方法,其特征在于,至少根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定滤波器更新度Cef update包括: The method for determining the state of the echo sound field according to claim 1, wherein the filter is determined based on at least the far-end signal X n (k), the near-end signal D n (k), and the filter coefficient W n (k). Cef update includes:
    根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定残差信号E n(k); Based on the far-end signal X n (k), near-end signal D n (k) and the filter coefficients W n (k), determining the residual signal E n (k);
    根据所述残差信号E n(k),确定更新后的滤波器系数W n+1(k); According to the residual signal E n (k), determines the filter coefficient W updated n + 1 (k);
    根据所述滤波器系数W n(k)以及更新后的滤波器系数W n+1(k),确定所述滤波器更新度Cef updateDetermine the filter update degree Cef update according to the filter coefficient W n (k) and the updated filter coefficient W n+1 (k).
  4. 根据权利要求3所述的回声声场状态确定方法,其特征在于,满足以下一项或多项:The method for determining the state of the echo sound field according to claim 3, wherein one or more of the following is satisfied:
    采用下述公式,确定残差信号E n(k): Using the following equation to determine the residual signal E n (k):
    Figure PCTCN2021079181-appb-100001
    Figure PCTCN2021079181-appb-100001
    采用下述公式,确定更新后的滤波器系数W n+1(k),其中,更新步长μ n(k)用于指示所述滤波器系数W n(k)更新的步长: The following formula is used to determine the updated filter coefficient W n+1 (k), where the update step size μ n (k) is used to indicate the update step size of the filter coefficient W n (k):
    Figure PCTCN2021079181-appb-100002
    Figure PCTCN2021079181-appb-100002
    采用下述公式,确定滤波器更新度Cef updateUse the following formula to determine the filter update degree Cef update :
    Figure PCTCN2021079181-appb-100003
    Figure PCTCN2021079181-appb-100003
  5. 根据权利要求1所述的回声声场状态确定方法,其特征在于,在确定所述待确定信号的回声声场状态是否为回声路径变化状态之前,还包括:The method for determining an echo sound field state according to claim 1, wherein before determining whether the echo sound field state of the signal to be determined is an echo path change state, the method further comprises:
    对所述近端信号D n(k)进行语音激活检测,以得到近端语音激活标志DVflag; Performing voice activation detection on the near-end signal D n (k) to obtain a near-end voice activation flag DVflag;
    如果所述近端语音激活标志DVflag不等于1,则判断所述待确定信号的回声声场状态为空闲状态。If the near-end voice activation flag DVflag is not equal to 1, it is determined that the echo sound field state of the signal to be determined is an idle state.
  6. 根据权利要求1所述的回声声场状态确定方法,其特征在于,在确定所述待确定信号的回声声场状态是否为回声路径变化状态之前,还包括:The method for determining an echo sound field state according to claim 1, wherein before determining whether the echo sound field state of the signal to be determined is an echo path change state, the method further comprises:
    对所述远端信号X n(k)进行语音激活检测,以得到远端语音激活标志XVflag; Perform voice activation detection on the far-end signal X n (k) to obtain a far-end voice activation flag XVflag;
    如果所述远端语音激活标志XVflag不等于1,则判断所述待确定信号的回声声场状态为近端单讲状态。If the far-end voice activation flag XVflag is not equal to 1, it is determined that the echo sound field state of the signal to be determined is the near-end single talk state.
  7. 根据权利要求1所述的回声声场状态确定方法,其特征在于,在确定所述待确定信号的回声声场状态是否为回声路径变化状态之 前,还包括:The method for determining an echo sound field state according to claim 1, wherein before determining whether the echo sound field state of the signal to be determined is an echo path change state, the method further comprises:
    确定所述待确定信号的回波抑制比Err;Determining the echo suppression ratio Err of the signal to be determined;
    如果所述回波抑制比Err大于预设回波阈值Thrd err,则判断所述待确定信号的回声声场状态为远端单讲状态。 If the echo suppression ratio Err is greater than the preset echo threshold Thrd err , it is determined that the echo sound field state of the signal to be determined is the remote single talk state.
  8. 根据权利要求7所述的回声声场状态确定方法,其特征在于,确定所述待确定信号的回波抑制比Err包括:The method for determining an echo sound field state according to claim 7, wherein determining the echo suppression ratio Err of the signal to be determined comprises:
    根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定残差信号E n(k); Based on the far-end signal X n (k), near-end signal D n (k) and the filter coefficients W n (k), determining the residual signal E n (k);
    根据所述近端信号D n(k)与残差信号E n(k),确定信号的回波抑制比Err。 The proximal end of the signal D n (k) and the residual signal E n (k), the echo signal suppression ratio determined Err.
  9. 根据权利要求8所述的回声声场状态确定方法,其特征在于,满足以下一项或多项:The method for determining the state of the echo sound field according to claim 8, wherein one or more of the following is satisfied:
    采用下述公式,确定残差信号E n(k): Using the following equation to determine the residual signal E n (k):
    Figure PCTCN2021079181-appb-100004
    Figure PCTCN2021079181-appb-100004
    采用下述公式,确定信号的回波抑制比Err:Use the following formula to determine the signal echo suppression ratio Err:
    Figure PCTCN2021079181-appb-100005
    Figure PCTCN2021079181-appb-100005
    其中,k为所述待确定信号的频率索引。Wherein, k is the frequency index of the signal to be determined.
  10. 根据权利要求1所述的回声声场状态确定方法,其特征在于,在确定所述待确定信号的回声声场状态是否为回声路径变化状态之前,还包括:The method for determining an echo sound field state according to claim 1, wherein before determining whether the echo sound field state of the signal to be determined is an echo path change state, the method further comprises:
    确定归一化互相关值C YE与C DEDetermine the normalized cross-correlation values C YE and C DE ;
    如果C DE大于第一预设互相关阈值Thrd1 coh,且C YE小于第二预设互相关阈值Thrd2 coh,则判断所述待确定信号的回声声场状态为双讲状态; If C DE is greater than the first preset cross-correlation threshold Thrd1 coh and C YE is less than the second preset cross-correlation threshold Thrd2 coh , determining that the echo sound field state of the signal to be determined is a dual-talk state;
    其中,所述第一预设互相关阈值Thrd1 coh大于等于所述第二预设互相关阈值Thrd2 cohWherein, the first preset cross-correlation threshold Thrd1 coh is greater than or equal to the second preset cross-correlation threshold Thrd2 coh .
  11. 根据权利要求10所述的回声声场状态确定方法,其特征在于,还包括以下一项或多项:The method for determining the state of the echo sound field according to claim 10, further comprising one or more of the following:
    如果滤波器更新度Cef update大于预设更新度阈值Thrd update,则确定所述待确定信号的回声声场状态为回声路径变化状态; If the filter update degree Cef update is greater than the preset update degree threshold Thrd update , determining that the echo sound field state of the signal to be determined is the echo path change state;
    如果所述滤波器更新度Cef update小于等于所述预设更新度阈值Thrd update,则确定所述待确定信号的回声声场状态为远端单讲状态。 If the filter update degree Cef update is less than or equal to the preset update degree threshold Thrd update , it is determined that the echo sound field state of the signal to be determined is the remote single talk state.
  12. 根据权利要求10所述的回声声场状态确定方法,其特征在于,采用下述公式,确定归一化互相关值C YE与C DEThe method for determining the state of the echo sound field according to claim 10, wherein the following formula is used to determine the normalized cross-correlation values C YE and C DE :
    Figure PCTCN2021079181-appb-100006
    Figure PCTCN2021079181-appb-100006
    Figure PCTCN2021079181-appb-100007
    Figure PCTCN2021079181-appb-100007
    其中,M与L为所述待确定信号的频段索引。Wherein, M and L are the frequency band indexes of the signal to be determined.
  13. 根据权利要求12所述的回声声场状态确定方法,其特征在于,The method for determining the state of the echo sound field according to claim 12, wherein:
    所述归一化互相关值C YE与C DE为线性区归一化互相关值; The normalized cross-correlation values C YE and C DE are normalized cross-correlation values in the linear region;
    其中,M与L为线性区的频段索引。Among them, M and L are the frequency band indexes of the linear region.
  14. 根据权利要求1所述的回声声场状态确定方法,其特征在于,还 包括:The method for determining the state of the echo sound field according to claim 1, further comprising:
    根据所述待确定信号的回声声场状态,调整所述待确定信号的更新步长μ n(k); Adjusting the update step size μ n (k) of the signal to be determined according to the echo sound field state of the signal to be determined;
    其中,更新步长μ n(k)用于指示所述滤波器系数W n(k)更新的步长。 Wherein, the update step μ n (k) is used to indicate the update step of the filter coefficient W n (k).
  15. 根据权利要求14所述的回声声场状态确定方法,其特征在于,调整更新步长μ n(k)包括以下一项或多项: The method for determining the state of the echo sound field according to claim 14, wherein the adjusting and updating step size μ n (k) includes one or more of the following:
    如果确定所述待确定信号的回声声场状态为回声路径变化状态,则增加更新步长μ n(k); If it is determined that the echo sound field state of the signal to be determined is the echo path change state, increase the update step size μ n (k);
    如果确定所述待确定信号的回声声场状态为双讲状态,则调整μ n(k)放慢更新; If it is determined that the echo sound field state of the signal to be determined is a dual-talk state, adjust μ n (k) to slow down the update;
    如果确定所述待确定信号的回声声场状态为空闲状态或近端单讲状态,则调整μ n(k)=0。 If it is determined that the echo sound field state of the signal to be determined is the idle state or the near-end single talk state, adjust μ n (k)=0.
  16. 根据权利要求14所述的回声声场状态确定方法,其特征在于,采用回声自适应滤波器调整所述待确定信号的更新步长μ n(k)。 The method for determining the state of the echo sound field according to claim 14, wherein an echo adaptive filter is used to adjust the update step size μ n (k) of the signal to be determined.
  17. 根据权利要求1所述的回声声场状态确定方法,其特征在于,还包括:The method for determining the state of the echo sound field according to claim 1, further comprising:
    根据所述待确定信号的回声声场状态,确定是否对所述待确定信号进行非线性处理。According to the echo sound field state of the signal to be determined, it is determined whether to perform nonlinear processing on the signal to be determined.
  18. 根据权利要求17所述的回声声场状态确定方法,其特征在于,确定是否对所述待确定信号进行非线性处理包括以下一项或多项:The method for determining the state of the echo sound field according to claim 17, wherein determining whether to perform nonlinear processing on the signal to be determined comprises one or more of the following:
    如果确定所述待确定信号的回声声场状态为双讲状态,则减少非线性处理程度;If it is determined that the echo sound field state of the signal to be determined is a dual-talk state, reduce the degree of non-linear processing;
    如果确定所述待确定信号的回声声场状态为回声路径变化状态,则增强对所述待确定信号的非线性处理;If it is determined that the echo sound field state of the signal to be determined is an echo path change state, the nonlinear processing of the signal to be determined is enhanced;
    如果确定所述待确定信号的回声声场状态为近端单讲状态,则停 止对所述待确定信号的非线性处理;If it is determined that the echo sound field state of the signal to be determined is the near-end single talk state, stop the non-linear processing of the signal to be determined;
    如果确定所述待确定信号的回声声场状态为空闲状态,则停止对所述待确定信号的非线性处理。If it is determined that the echo sound field state of the signal to be determined is an idle state, the non-linear processing of the signal to be determined is stopped.
  19. 根据权利要求17所述的回声声场状态确定方法,其特征在于,采用后处理非线性处理单元对所述待确定信号进行非线性处理。The method for determining the state of the echo sound field according to claim 17, wherein a post-processing non-linear processing unit is used to perform non-linear processing on the signal to be determined.
  20. 根据权利要求1所述的回声声场状态确定方法,其特征在于,还包括:The method for determining the state of the echo sound field according to claim 1, further comprising:
    根据所述待确定信号的回声声场状态,确定降低所述待确定信号的噪声更新速度或者提高所述待确定信号的非平稳噪声抑制能力。According to the echo sound field state of the signal to be determined, it is determined to reduce the noise update speed of the signal to be determined or to increase the non-stationary noise suppression capability of the signal to be determined.
  21. 根据权利要求20所述的回声声场状态确定方法,其特征在于,确定降低噪声更新速度或者提高非平稳噪声抑制能力包括以下一项或多项:The method for determining the state of the echo sound field according to claim 20, wherein the determining to reduce the noise update speed or to improve the non-stationary noise suppression capability includes one or more of the following:
    如果确定所述待确定信号的回声声场状态为近端单讲状态,则降低所述待确定信号的噪声更新速度;If it is determined that the echo sound field state of the signal to be determined is the near-end single talk state, reducing the noise update speed of the signal to be determined;
    如果确定所述待确定信号的回声声场状态为双讲状态,则降低所述待确定信号的噪声更新速度;If it is determined that the echo sound field state of the signal to be determined is a dual-talk state, reducing the noise update speed of the signal to be determined;
    如果确定所述待确定信号的回声声场状态为远端单讲状态,则提高所述待确定信号的非平稳噪声抑制能力;If it is determined that the echo sound field state of the signal to be determined is the far-end single talk state, improving the non-stationary noise suppression capability of the signal to be determined;
    如果确定所述待确定信号的回声声场状态为回声路径变化状态,则提高所述待确定信号的非平稳噪声抑制能力。If it is determined that the echo sound field state of the signal to be determined is an echo path change state, the non-stationary noise suppression capability of the signal to be determined is improved.
  22. 根据权利要求20所述的回声声场状态确定方法,其特征在于,采用后处理噪声抑制单元降低所述待确定信号的噪声更新速度或者提高所述待确定信号的非平稳噪声抑制能力。The method for determining the state of the echo sound field according to claim 20, wherein a post-processing noise suppression unit is used to reduce the noise update speed of the signal to be determined or to improve the non-stationary noise suppression capability of the signal to be determined.
  23. 根据权利要求1所述的回声声场状态确定方法,其特征在于,还包括:The method for determining the state of the echo sound field according to claim 1, further comprising:
    确定所述待确定信号的临时声场状态;Determining the temporary sound field state of the signal to be determined;
    根据所述待确定信号的回声声场状态以及临时声场状态,确定对所述待确定信号保持双讲状态输出或者对所述待确定信号暂缓回声路径改变的输出。According to the echo sound field state and the temporary sound field state of the signal to be determined, it is determined that the signal to be determined is kept in a dual-talk state output or the output of the echo path change of the signal to be determined is suspended.
  24. 根据权利要求23所述的回声声场状态确定方法,其特征在于,确定对所述待确定信号保持双讲状态输出或者对所述待确定信号暂缓回声路径改变的输出包括以下一项或多项:The method for determining the state of the echo sound field according to claim 23, wherein the output determined to maintain the dual-talk state output for the signal to be determined or to suspend the echo path change of the signal to be determined includes one or more of the following:
    如果所述待确定信号的回声声场状态为双讲状态,所述临时声场状态为远端单讲状态,则通过保持时间对所述待确定信号保持双讲状态输出;If the echo sound field state of the signal to be determined is a dual-talk state, and the temporary sound field state is a remote single-talk state, then the signal to be determined is kept in a dual-talk state output through the holding time;
    如果所述待确定信号的回声声场状态为双讲状态,所述临时声场状态为回声路径变化状态,则通过开始时间对所述待确定信号暂缓回声路径改变的输出。If the echo sound field state of the signal to be determined is a dual-talk state, and the temporary sound field state is an echo path change state, the output of the echo path change for the signal to be determined is temporarily suspended based on the start time.
  25. 一种回声声场状态确定装置,其特征在于,包括:A device for determining the state of an echo sound field, characterized in that it comprises:
    获取模块,用于获取待确定信号;The acquisition module is used to acquire the signal to be determined;
    信号确定模块,用于确定所述待确定信号的远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k); A signal determining module for determining the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k) of the signal to be determined;
    更新度确定模块,用于至少根据所述远端信号X n(k)、近端信号D n(k)以及滤波器系数W n(k),确定滤波器更新度Cef updateThe update degree determination module is configured to determine the filter update degree Cef update at least according to the far-end signal X n (k), the near-end signal D n (k) and the filter coefficient W n (k);
    状态确定模块,用于至少根据滤波器更新度Cef update大于预设更新度阈值Thrd update,确定所述待确定信号的回声声场状态是否为回声路径变化状态。 The state determination module is configured to determine whether the echo sound field state of the signal to be determined is the echo path change state at least according to the filter update degree Cef update being greater than the preset update degree threshold Thrd update.
  26. 一种存储介质,其上存储有计算机指令,其特征在于,所述计算机指令运行时执行权利要求1至24任一项所述回声声场状态确定方法的步骤。A storage medium having computer instructions stored thereon, wherein the computer instructions execute the steps of the method for determining the state of the echo sound field according to any one of claims 1 to 24 when the computer instructions are run.
  27. 一种终端,包括存储器和处理器,所述存储器上存储有能够在所 述处理器上运行的计算机指令,其特征在于,所述处理器运行所述计算机指令时执行权利要求1至24任一项所述回声声场状态确定方法的步骤。A terminal, comprising a memory and a processor, and computer instructions that can run on the processor are stored on the memory, wherein the processor executes any one of claims 1 to 24 when the computer instructions are executed. The steps of the method for determining the state of the echo sound field described in the item.
PCT/CN2021/079181 2020-03-26 2021-03-05 Method and device for determining state of echo sound field, storage medium, and terminal WO2021190274A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010223647.6A CN111654585B (en) 2020-03-26 2020-03-26 Echo sound field state determination method and device, storage medium and terminal
CN202010223647.6 2020-03-26

Publications (1)

Publication Number Publication Date
WO2021190274A1 true WO2021190274A1 (en) 2021-09-30

Family

ID=72346411

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/079181 WO2021190274A1 (en) 2020-03-26 2021-03-05 Method and device for determining state of echo sound field, storage medium, and terminal

Country Status (2)

Country Link
CN (1) CN111654585B (en)
WO (1) WO2021190274A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111654585B (en) * 2020-03-26 2021-08-03 紫光展锐(重庆)科技有限公司 Echo sound field state determination method and device, storage medium and terminal

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160006880A1 (en) * 2014-07-02 2016-01-07 Youhong Lu Variable step size echo cancellation with accounting for instantaneous interference
CN108986837A (en) * 2018-09-05 2018-12-11 科大讯飞股份有限公司 A kind of filter update method and device
CN109348072A (en) * 2018-08-30 2019-02-15 湖北工业大学 A kind of double talk detection method applied to acoustic echo cancellation system
CN109524018A (en) * 2017-09-19 2019-03-26 华为技术有限公司 A kind of echo processing method and equipment
CN109712636A (en) * 2019-03-07 2019-05-03 出门问问信息科技有限公司 Near-end speech restorative procedure and system in a kind of echo cancellation process
CN111654585A (en) * 2020-03-26 2020-09-11 紫光展锐(重庆)科技有限公司 Echo sound field state determination method and device, storage medium and terminal

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6563803B1 (en) * 1997-11-26 2003-05-13 Qualcomm Incorporated Acoustic echo canceller
US6434110B1 (en) * 1998-03-20 2002-08-13 Cirrus Logic, Inc. Full-duplex speakerphone circuit including a double-talk detector
DE19935587A1 (en) * 1998-08-04 2000-02-17 Motorola Inc Detection of echo state in duplex transmission e.g. when using mobile phone system, is achieved by monitoring adaptive filter coefficient update, which reveals phantom signal presence on exceeding threshold
JP3492315B2 (en) * 2000-12-15 2004-02-03 沖電気工業株式会社 Echo canceller with automatic volume adjustment
JP3917116B2 (en) * 2003-08-01 2007-05-23 日本電信電話株式会社 Echo canceling apparatus, method, echo canceling program, and recording medium recording the program
JP4678349B2 (en) * 2006-08-31 2011-04-27 ヤマハ株式会社 Call determination device
CN102739286B (en) * 2011-04-01 2014-06-11 中国科学院声学研究所 Echo cancellation method used in communication system
US9088336B2 (en) * 2012-09-06 2015-07-21 Imagination Technologies Limited Systems and methods of echo and noise cancellation in voice communication
US9191493B2 (en) * 2013-12-09 2015-11-17 Captioncall, Llc Methods and devices for updating an adaptive filter for echo cancellation
CN107332591B (en) * 2016-04-29 2021-01-05 北京紫光展锐通信技术有限公司 Repeater and echo interference elimination method and device thereof
CN108630219B (en) * 2018-05-08 2021-05-11 北京小鱼在家科技有限公司 Processing system, method and device for echo suppression audio signal feature tracking
CN110634496B (en) * 2019-10-22 2021-12-24 广州视源电子科技股份有限公司 Double-talk detection method and device, computer equipment and storage medium
CN110838300B (en) * 2019-11-18 2022-03-25 紫光展锐(重庆)科技有限公司 Echo cancellation processing method and processing system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160006880A1 (en) * 2014-07-02 2016-01-07 Youhong Lu Variable step size echo cancellation with accounting for instantaneous interference
CN109524018A (en) * 2017-09-19 2019-03-26 华为技术有限公司 A kind of echo processing method and equipment
CN109348072A (en) * 2018-08-30 2019-02-15 湖北工业大学 A kind of double talk detection method applied to acoustic echo cancellation system
CN108986837A (en) * 2018-09-05 2018-12-11 科大讯飞股份有限公司 A kind of filter update method and device
CN109712636A (en) * 2019-03-07 2019-05-03 出门问问信息科技有限公司 Near-end speech restorative procedure and system in a kind of echo cancellation process
CN111654585A (en) * 2020-03-26 2020-09-11 紫光展锐(重庆)科技有限公司 Echo sound field state determination method and device, storage medium and terminal

Also Published As

Publication number Publication date
CN111654585A (en) 2020-09-11
CN111654585B (en) 2021-08-03

Similar Documents

Publication Publication Date Title
US11601554B2 (en) Detection of acoustic echo cancellation
US9088336B2 (en) Systems and methods of echo and noise cancellation in voice communication
CN104980601B (en) Gain control system and method for dynamic tuning echo canceller
US6792107B2 (en) Double-talk detector suitable for a telephone-enabled PC
US5598468A (en) Method and apparatus for echo removal in a communication system
US9516159B2 (en) System and method of double talk detection with acoustic echo and noise control
CN103748865B (en) Utilize the clock deskew of the acoustic echo arrester of not audible tone
JP4282260B2 (en) Echo canceller
CN111768796B (en) Acoustic echo cancellation and dereverberation method and device
TWI392322B (en) Double talk detection method based on spectral acoustic properties
US20220301577A1 (en) Echo cancellation method and apparatus
CN109273019B (en) Method for double-talk detection for echo suppression and echo suppression
CN111199748B (en) Echo cancellation method, device, equipment and storage medium
CN110211602B (en) Intelligent voice enhanced communication method and device
CN110995951B (en) Echo cancellation method, device and system based on double-end sounding detection
US8831210B2 (en) Method and system for detection of onset of near-end signal in an echo cancellation system
CN106571147A (en) Method for suppressing acoustic echo of network telephone
TWI594234B (en) A method and device for detecting near-end voice signal
CN109215672B (en) Method, device and equipment for processing sound information
WO2021190274A1 (en) Method and device for determining state of echo sound field, storage medium, and terminal
US9083783B2 (en) Detecting double talk in acoustic echo cancellation using zero-crossing rate
CN111756906B (en) Echo suppression method and device for voice signal and computer readable medium
CN111355855B (en) Echo processing method, device, equipment and storage medium
US20080152156A1 (en) Robust Method of Echo Suppressor
CN111970410B (en) Echo cancellation method and device, storage medium and terminal

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21774667

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21774667

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 21774667

Country of ref document: EP

Kind code of ref document: A1