WO2020063798A1 - Echo cancellation method, device and intelligent loudspeaker box - Google Patents

Echo cancellation method, device and intelligent loudspeaker box Download PDF

Info

Publication number
WO2020063798A1
WO2020063798A1 PCT/CN2019/108343 CN2019108343W WO2020063798A1 WO 2020063798 A1 WO2020063798 A1 WO 2020063798A1 CN 2019108343 W CN2019108343 W CN 2019108343W WO 2020063798 A1 WO2020063798 A1 WO 2020063798A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio signal
audio
echo
signal
echo cancellation
Prior art date
Application number
PCT/CN2019/108343
Other languages
French (fr)
Chinese (zh)
Inventor
韩中波
夏萌
吴海全
迟欣
张恩勤
曹磊
师瑞文
Original Assignee
深圳市冠旭电子股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CN201811130274.7A external-priority patent/CN110956973A/en
Priority claimed from CN201811561782.0A external-priority patent/CN111356058B/en
Application filed by 深圳市冠旭电子股份有限公司 filed Critical 深圳市冠旭电子股份有限公司
Publication of WO2020063798A1 publication Critical patent/WO2020063798A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M9/00Arrangements for interconnection not involving centralised switching
    • H04M9/08Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic

Definitions

  • the present application relates to the field of signal processing technologies, and in particular, to an echo cancellation method, device, and smart speaker.
  • the purpose of the embodiments of the present application is to provide an echo cancellation method, device, and smart speaker, which are intended to solve the problem that the existing echo interference technology cannot satisfy audio playback in multiple audio channels.
  • an echo cancellation method is provided, which is applied to a smart speaker.
  • the method includes:
  • an echo cancellation device where the device includes:
  • an acquisition module configured to acquire N first audio signals corresponding to N audio channels connected to the speaker input end; wherein Ng2 is an integer;
  • a synthesizing module configured to linearly transform the N first audio signals to synthesize a second audio signal, and use the second audio signal as a reference signal for echo cancellation;
  • a canceling module configured to acquire a third audio signal collected by a microphone, and perform echo cancellation on the third audio signal according to the reference signal to generate a fourth audio signal.
  • a smart speaker including a memory, a processor, and a computer program stored in the memory and executable on the processor.
  • the processor implements the computer program when the processor executes the computer program. Steps of the first aspect of the method.
  • a computer-readable storage medium stores a computer program, and the computer program is executed by a processor to implement the steps of the method of the first aspect.
  • the beneficial effects of the echo cancellation method provided in the embodiments of the present application are: obtaining N first audio signals corresponding to N audio channels connected to the speaker input end; wherein, Ng2 is an integer; and N first audio signals are linearly transformed into a second audio signal, and the second audio signal is used as a reference signal for echo cancellation; a third audio signal collected by a microphone is obtained, and the third audio signal is collected according to the reference signal.
  • the audio signal is subjected to echo cancellation to generate a fourth audio signal.
  • the N first audio signals in the N audio channels are synthesized into a second audio signal as the reference signal for echo cancellation.
  • the audio signals of multiple audio channels can be synthesized and used as the reference signal for echo cancellation, so that multiple audio channels can be processed.
  • the audio signals in the channels are unified for echo cancellation, eliminating the need to perform multiple echo cancellations on the audio signals in multiple audio channels separately, which improves the efficiency of echo cancellation, and because the echo audio signals collected by the microphones are audio in multiple audio channels
  • the audio signal synthesized by the signal, and the audio signals in multiple audio channels are combined into an audio signal as a reference signal for echo cancellation, which can more accurately simulate the echo audio signal , Can improve the sound quality of the loudspeaker output after echo cancellation.
  • FIG. 1 is a schematic flowchart of an echo cancellation method provided in Embodiment 1 of the present application.
  • FIG. 2 is a schematic flowchart of an echo cancellation method provided in Embodiment 2 of the present application.
  • FIG. 3 is a schematic flowchart of an echo cancellation method provided in Embodiment 3 of the present application.
  • FIG. 4 is a schematic flowchart of an echo cancellation method provided in Embodiment 4 of the present application.
  • Embodiment 5 is a schematic flowchart of an echo cancellation method provided in Embodiment 5 of the present application.
  • FIG. 6 is a schematic flowchart of an echo cancellation method provided in Embodiment 6 of the present application.
  • FIG. 7 is a schematic diagram of an echo cancellation device provided in Embodiment 7 of the present application.
  • Embodiment 8 is a schematic structural diagram of a smart speaker provided in Embodiment 8 of the present application.
  • the echo cancellation method provided in the embodiment of the present application may be applied to an audio playback device or system such as a smart speaker including a speaker and a microphone.
  • the echo cancellation method provided in Embodiment 1 of the application includes: [0032] Step S101 obtains N first audio signals corresponding to N audio channels connected to a speaker input end; wherein Ng2 is an integer;
  • the current mainstream speaker or audio playback system is to play high-quality sound effects such as 5.1 or 7.1 channels.
  • the speaker or audio playback system capable of playing multiple channels includes multiple audio channels, and transmits multiple audio channels. Audio signals for each channel.
  • the speaker may be one or more speakers, and the N audio channels may be connected to one or more speakers. When N first audio signals transmitted from the N audio channels are transmitted to the speakers, the N audio channels are obtained. First audio signal.
  • the speaker is a transducing device capable of converting an electric signal into an acoustic signal.
  • Step S102 Linearly transform the N first audio signals into a second audio signal, and use the second audio signal as a reference signal for echo cancellation;
  • the N first audio signals are played through a speaker.
  • the microphone collects audio emitted by the N first audio signals played by the speaker, an acoustic echo phenomenon is caused.
  • the acoustic echo phenomenon It is generated from the N first audio signals, linearly transforms the N first audio signals, and synthesizes a second audio signal, and uses the second audio signal as a reference signal for echo cancellation.
  • performing linear transformation on the N first audio signals to synthesize a second audio signal includes: obtaining gain values for gain processing in the N audio channels, respectively; according to the N audios A corresponding weight is assigned to the N first audio signals by a channel corresponding gain value; the amplitudes of the N first audio signals are respectively multiplied by the corresponding weights and then accumulated to generate the second audio signal.
  • the above obtaining the gain values of the N audio channels for gain processing can be understood as: performing gain amplification processing on audio signals by the gain amplifier in the N audio channels, and obtaining the gain values of the N audio channels through the gain amplifier. The coefficient of gain amplification.
  • the above-mentioned coefficient of gain amplification may be a preset gain amplification parameter in a gain amplifier corresponding to each audio channel.
  • the above allocating corresponding weights to the N first audio signals according to the gain values corresponding to the N audio channels can be understood as: Assigning corresponding weights according to the size of the gain values corresponding to the N audio channels, and different gain values can be established in advance A mapping table with the corresponding weights, and the corresponding weights are allocated according to the size of the gain values corresponding to the N audio channels.
  • the above-mentioned second audio signal may be understood as an audio signal collected by the microphone and synthesized from the N first audio signals.
  • Step S103 Acquire a third audio signal collected by a microphone, and align the third audio signal according to the reference signal.
  • the frequency signal is subjected to echo cancellation to generate a fourth audio signal.
  • the third audio signal collected by the microphone includes a useful audio signal and a noise audio signal
  • the noise audio signal includes collecting an echo audio signal synthesized by N first audio signals sent from a speaker.
  • the fourth audio signal may be understood as an audio signal after the echo signal is eliminated from the third audio signal.
  • the fourth audio signal may be generated by performing echo cancellation on the third audio signal according to the reference signal.
  • the reference signal may be echoed as a reference signal in an echo canceller designed according to an Acoustic Echo Canceller technology. After cancellation, a fourth audio signal is generated.
  • acquiring a third audio signal collected by a microphone, and performing echo cancellation on the third audio signal according to the reference signal to generate a fourth audio signal includes: acquiring an echo canceller according to the reference An echo estimation signal generated by the signal; acquiring a third audio signal collected by a microphone, and subtracting the echo estimation signal from the third audio signal to generate the fourth audio signal.
  • the reference signal may be passed through an adaptive filter in an acoustic echo canceller to generate an echo estimation signal, and the third audio signal including the useful audio signal and the echo audio signal collected by the microphone may be subjected to echo cancellation by generating the echo estimation signal,
  • the fourth audio signal may be generated by subtracting the echo estimation signal from the third audio signal.
  • the N first audio signals in the N audio channels are combined into a second audio signal as a reference signal for echo cancellation, and audio signals of multiple audio channels can be processed.
  • Synthetic processing is used as a reference signal for echo cancellation, thereby performing unified echo cancellation on audio signals in multiple audio channels, eliminating the need for multiple echo cancellations for audio signals in multiple audio channels, which improves the efficiency of echo cancellation, and because
  • the echo audio signal collected by the microphone from the noise audio signal is an audio signal synthesized from audio signals in multiple audio channels.
  • the audio signals in multiple audio channels are combined into an audio signal as a reference signal for echo cancellation, which can be more accurate. Analog echo audio signal can improve the sound quality of the speaker output after echo cancellation.
  • the echo cancellation method provided in Embodiment 2 of the present application includes:
  • Step S201 Obtain N first audio signals corresponding to the N audio channels connected to the speaker input end, where Ng2 is an integer;
  • Step S202 linearly transform the N first audio signals into a second audio signal, and use the second audio signal as a reference signal for echo cancellation;
  • Step S203 Acquire a third audio signal collected by the microphone, and perform echo cancellation on the third audio signal according to the reference signal to generate a fourth audio signal.
  • the method includes: according to the fourth audio The signal and the preset standard audio signal calculate the audio signal difference value through the audio quality perception evaluation algorithm PEAQ, and determine whether the audio signal difference value is within a preset audio signal difference range; if the audio signal difference value is not in a preset Within the audio signal difference range, return the audio signal difference value to the echo canceller, so that the echo canceller adjusts a filter coefficient according to the audio signal difference value.
  • the audio quality perception evaluation algorithm PEAQ Perceptual Evaluation of Audio Quality
  • PEAQ can imitate the hearing system of the human ear, analyze and compare the reference signal and the test signal to obtain an objective evaluation difference in audio quality, and store the standard audio signal of the speaker as
  • the reference signal in PEAQ and the above-mentioned echo-cancelled fourth audio signal are used as test signals in PEAQ, and the audio signal difference and equivalent value can be calculated by PEAQ according to the fourth audio signal and a preset standard audio signal.
  • the echo canceller receives the difference value of the audio signal, it can adjust the filter coefficient (increase or decrease the filter coefficient) according to the difference value of the audio signal until the difference value of the audio signal is within a preset difference range of the audio signal.
  • the above steps S201, S202, and S203 are the same or similar to the above steps S101, S102, and S103, respectively.
  • the above steps S101 to S103 are not described herein again.
  • Step S204 Frequency-divide the fourth audio signal and input the corresponding N audio channels respectively, and then input the fourth audio signal to the speakers connected to the N audio channels after gain processing. Instruct the speaker to play the fourth audio signal that has been processed by gain.
  • the fourth audio signal is a useful audio signal after echo cancellation.
  • the fourth audio signal is frequency-divided to generate corresponding N audio signals, and the corresponding N audio signals are input. After the channels are subjected to gain amplification processing, playback is performed by one or more speakers connected to the N audio channels.
  • the N first audio signals in the N audio channels are combined into one.
  • Two second audio signals are used as reference signals for echo cancellation, and audio signals of multiple audio channels can be synthesized as reference signals for echo cancellation, thereby performing unified echo cancellation on audio signals in multiple audio channels, without the need to separately
  • the audio signals in each audio channel are subjected to multiple echo cancellations, which improves the efficiency of echo cancellation, and because the echo audio signals in the noise audio signals collected by the microphone are audio signals synthesized from the audio signals in multiple audio channels, multiple The audio signals in each audio channel are combined into an audio signal as a reference signal for echo cancellation, which can more accurately simulate the echo audio signal, and can improve the sound quality of the loudspeaker output after echo cancellation.
  • the echo cancellation method provided in Embodiment 3 of the present application includes:
  • Step S301 Obtain N first audio signals corresponding to the N audio channels connected to the speaker input end, where Ng2 is an integer.
  • Step S302 linearly transform the N first audio signals into a second audio signal, and use the second audio signal as a reference signal for echo cancellation.
  • the above steps S301 and S302 are the same or similar to the above steps S101 and S102, respectively.
  • Step S303 Detect the working mode of the smart speaker.
  • the current working mode of the smart speaker is detected.
  • the specific working modes mentioned above include a voice working mode and a music playing mode.
  • the voice working mode includes use scenarios such as voice playing and telephone calling.
  • the music playing mode Including usage scenarios such as playing music.
  • Step S304 Acquire a third audio signal collected by a microphone and an echo estimation signal generated by the echo canceller according to the reference signal and the working mode, and subtract the echo estimation signal from the third audio signal to generate the Fourth audio signal.
  • the collected audio signal can be echo-cancelled by the echo canceller according to the working mode of the smart speaker, the characteristics of different modes can be targeted in different modes of the smart speaker. Performing corresponding echo cancellation can effectively reduce the error of echo cancellation.
  • the echo canceller includes the first embodiment.
  • Step S401 if the working mode of the smart speaker is a first preset working mode, obtain the echo estimation signal generated by the first adaptive filter according to the reference signal;
  • the smart speaker included in the first preset working mode when the first use scenario of the smart speaker included in the first preset working mode is detected, it means that the smart speaker is in the first preset working mode, Acquire a preset echo estimation signal generated by a first adaptive filter that is preset corresponding to a preset in the first preset working mode, and perform echo cancellation on the third audio signal according to the echo estimation signal.
  • the first preset working mode may be a voice working mode, and a first use scenario corresponding to the first preset working mode when the first working mode is the voice working mode, such as a smart speaker in voice playback and a phone call. scenes to be used.
  • Step S402 if the working mode of the smart speaker is a second preset working mode, obtain the echo estimation signal generated by the second adaptive filter according to the reference signal.
  • the second preset working mode includes a second use scenario of the smart speaker.
  • the smart speaker When it is detected that the smart speaker is in the second use scenario, it means that the smart speaker is in the second preset working mode.
  • a predetermined echo estimation signal generated by a preset second adaptive filter corresponding to a preset in the second preset working mode, and performing echo cancellation on the third audio signal according to the echo estimation signal.
  • the above-mentioned second preset working mode may be a music playback working mode, and a second usage scenario corresponding to the second preset working mode when the second preset working mode is a music mode.
  • the smart speaker is used in music playback.
  • Step S401 includes:
  • Step S501 If the working mode of the smart speaker is a voice working mode, determine it by using a minimum mean square algorithm. Determining coefficients of a first adaptive filter corresponding to the voice working mode;
  • a third audio signal is collected by a microphone and the working mode of the smart speaker is a voice working mode
  • it is determined to work with the voice by using a Least Mean Squares (LMS) algorithm.
  • LMS Least Mean Squares
  • LMS recursive least squares
  • Step S502 the echo estimation signal generated according to the coefficient of the first adaptive filter and the reference signal.
  • Step S402 includes:
  • Step S601 if the working mode of the smart speaker is a music playback mode, determine a coefficient of a second adaptive filter corresponding to the music playback mode by recursive least squares algorithm;
  • a second adaptive filter corresponding to the music playback mode is determined by an RLS algorithm. Because the music has multiple frequency components, because the RLS algorithm has better adaptability to non-stationary signals than the LMS, its filtering performance is significantly better than the LMS algorithm, and the second adaptive filtering corresponding to the music playback mode is determined using RLS Coefficients of the filter will cause the second adaptive filter to process the third speech signal. Echo cancellation is more adaptive.
  • Step S602 the echo estimation signal generated according to the coefficient of the second adaptive filter and the reference signal.
  • echo cancellation is performed on a voice signal collected by a microphone through a second adaptive filter when the smart speaker is in a music playback mode, and echo cancellation is performed according to the characteristics of this mode, which can effectively reduce echo cancellation. error.
  • an embodiment of the present application provides an echo cancellation device, which can be integrated into an audio playback device or system such as a smart speaker including a speaker and a microphone, and is configured to execute the method steps in Embodiments 1 to 6.
  • an audio playback device or system such as a smart speaker including a speaker and a microphone
  • the echo cancellation device 700 includes:
  • the acquisition module 701 is configured to acquire N first audio signals corresponding to the N audio channels connected to the speaker input end, where Ng2 is an integer;
  • a synthesizing module 702 configured to linearly transform the N first audio signals into a second audio signal, and use the second audio signal as a reference signal for echo cancellation;
  • composition module 702 includes:
  • a first acquisition unit configured to acquire gain values for gain processing in the N audio channels, respectively;
  • an assigning unit configured to assign corresponding weights to the N first audio signals according to the gain values corresponding to the N audio channels
  • an accumulating unit configured to multiply the amplitudes of the N first audio signals by the corresponding weights, and then accumulate to generate the second audio signal.
  • the cancellation module 703 is configured to acquire a third audio signal collected by a microphone, and perform echo cancellation on the third audio signal according to the reference signal to generate a fourth audio signal.
  • the elimination module 702 includes:
  • a second obtaining unit configured to obtain an echo estimate generated by the adaptive filter according to the reference signal Signal
  • a generating unit configured to obtain a third audio signal collected by a microphone, and subtract the echo estimation signal from the third audio signal to generate the fourth audio signal.
  • the echo cancellation device 700 further includes:
  • a frequency division processing module configured to divide the fourth audio signal into the corresponding N audio channels after frequency division processing, and input the signals to all the channels connected to the N audio channels after gain processing; And speaking the speaker to instruct the speaker to play the fourth audio signal that has been processed by gain.
  • the echo cancellation device 700 further includes:
  • a judging module configured to calculate an audio signal difference value through an audio quality perception evaluation algorithm PEAQ according to the fourth audio signal and a preset standard audio signal, and determine whether the audio signal difference value is within a preset audio signal Within the difference range; if the audio signal difference value is not within a preset audio signal difference range, returning the audio signal difference value to the adaptive filter, so that the adaptive filter is based on the audio signal The difference value adjusts the filter coefficient.
  • the echo cancellation device 700 further includes a detection module for detecting a working mode of the smart speaker;
  • the second obtaining unit is specifically configured to:
  • the echo canceller includes a first adaptive filter and a second adaptive filter
  • the second obtaining unit is further specifically configured to:
  • the working mode of the smart speaker is a first preset working mode, acquiring the echo estimation signal generated by the first adaptive filter according to the reference signal;
  • the working mode of the smart speaker is a second preset working mode, acquiring the echo estimation signal generated by the second adaptive filter according to the reference signal.
  • the first preset working mode is a voice working mode.
  • the second obtaining unit is specifically configured to: determine a coefficient of a first adaptive filter corresponding to the voice working mode by using a minimum mean square algorithm;
  • the echo estimation signal generated according to the coefficients of the first adaptive filter and the reference signal is generated according to the coefficients of the first adaptive filter and the reference signal.
  • the second preset working mode is a music playback mode.
  • the second obtaining unit is specifically configured to determine a coefficient of a second adaptive filter corresponding to the music playback mode by a recursive least square algorithm
  • the N first audio signals in the N audio channels are combined into a second audio signal as a reference signal for echo cancellation, and audio signals of multiple audio channels can be processed.
  • Synthetic processing is used as a reference signal for echo cancellation, thereby performing unified echo cancellation on audio signals in multiple audio channels, eliminating the need for multiple echo cancellations for audio signals in multiple audio channels, which improves the efficiency of echo cancellation, and because
  • the echo audio signal collected by the microphone from the noise audio signal is an audio signal synthesized from audio signals in multiple audio channels.
  • the audio signals in multiple audio channels are combined into an audio signal as a reference signal for echo cancellation, which can be more accurate. Analog echo audio signal can improve the sound quality of the speaker output after echo cancellation.
  • FIG. 8 is a schematic structural diagram of a smart speaker according to an embodiment of the present application.
  • the smart speaker 800 includes: a processor 801, a memory 802, and a computer program 803 stored in the memory 802 and executable on the processor 801.
  • the processor 801 executes the computer program 803 the steps in the embodiment of the echo cancellation method are implemented, for example, the method steps in the foregoing embodiment.
  • the computer program 803 may be divided into one or more units / modules, and the one or more units / modules are stored in the memory 802 and executed by the processor 801 to complete the present invention.
  • the one or more units / modules may be a series of computer program instruction segments capable of performing specific functions, and the instruction segments are used to describe the execution process of the computer program 803 in the smart speaker 800 described above.
  • the processor 801 may be a central processing unit (CPU), or may be other general-purpose processors, digital signal processors (DSPs), and application specific integrated circuits (Application Specific Integrated Circuits, ASIC), off-the-shelf programmable gate array
  • CPU central processing unit
  • DSP digital signal processors
  • ASIC Application Specific Integrated Circuits
  • the memory 802 may be an internal storage unit of the smart speaker 800, such as a hard disk or a memory of the smart speaker 800.
  • the memory 802 may also be an external storage device of the smart speaker 800, for example, a plug-in hard disk, a smart media card (SMC), a secure digital (SD) card, and a flash memory provided on the smart speaker 800. Card (Flash Card), etc.
  • the memory 802 may include both the internal storage unit of the smart speaker 800 and an external storage device.
  • the memory 802 is configured to store the computer program and other programs and data required by the smart speaker 800.
  • the memory 802 may also be used to temporarily store data that has been output or is to be output.
  • FIG. 8 is only an example of the smart speaker 800, and does not constitute a limitation on the smart speaker 800.
  • the smart speaker 800 may include more or fewer components than shown, or some components may be combined, or Different components, for example, the above-mentioned smart speaker 800 may further include an input-output device, a network access device, a bus, and the like.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, the indirect coupling or communication connection of the device or unit, and may be electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. on. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions in the embodiments of the present application.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist separately physically, or two or more units may be integrated into one unit.
  • the above integrated unit may be implemented in the form of hardware or in the form of software functional unit.
  • the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on such an understanding, this application implements all or part of the processes in the method in the foregoing embodiment, and may also be completed by a computer program instructing related hardware.
  • the computer program may be stored in a computer-readable storage medium.
  • the computer program When executed by a processor, the steps of the foregoing method embodiments may be implemented.
  • the computer program includes computer program code, and the computer program code may be in a source code form, an object code form, an executable file, or some intermediate form.
  • the above computer-readable medium may include: any entity or device capable of carrying the above computer program code, a recording medium, a U disk, a mobile hard disk, a magnetic disk, an optical disk, a computer memory, a read-only memory (ROM, Read-Only Memory), a random Access memory (RAM, Random Access Memory), electric carrier signals, telecommunication signals, and software distribution media.
  • a recording medium a U disk, a mobile hard disk, a magnetic disk, an optical disk, a computer memory, a read-only memory (ROM, Read-Only Memory), a random Access memory (RAM, Random Access Memory), electric carrier signals, telecommunication signals, and software distribution media.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
  • Telephone Function (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The present application discloses an echo cancellation method and device and an intelligent loudspeaker box. The method comprises: acquiring N first audio signals corresponding to N audio channels connected with the input end of a loudspeaker; wherein N is an integer greater than or equal to 2; performing linear transformation on the N first audio signals to synthesize a second audio signal, and using the second audio signal as a reference signal of echo cancellation; and acquiring a third audio signal collected by a microphone, and performing echo cancellation on the third audio signal according to the reference signal to generate a fourth audio signal. The embodiment of the present application improves the echo cancellation efficiency by getting rid of the need to perform multiple echo cancellation on the audio signals in a plurality of audio channels respectively, realizes more accurate simulation of the echo audio signal by using an audio signal synthesized from audio signals in the plurality of audio channels as a reference signal of echo cancellation, and improves the output tone quality of the loudspeaker after echo cancellation.

Description

说明书 发明名称:一种回声消除方法、 装置及智能音箱  Specification Invention Name: Echo cancellation method, device and smart speaker
[0001] 本申请要求于 2018年 09月 27日在中国专利局提交的、 申请号为 201811130274.7 、 发明名称为“一种回声消除方法、 装置及智能终端”的中国专利申请, 以及于 20 18年 12月 20日在中国专利局提交的、 申请号为 201811561782.0、 发明名称为“一 种回声消除方法、 装置及智能音箱”的中国专利申请的优先权, 其全部内容通过 引用结合在本申请中。  [0001] This application requires a Chinese patent application filed in the Chinese Patent Office on September 27, 2018, with an application number of 201811130274.7, and an invention name of "a method, device and smart terminal for echo cancellation", and in 2018 The priority of a Chinese patent application filed at the Chinese Patent Office on December 20 with application number 201811561782.0 and the invention name is "An Echo Cancellation Method, Device and Smart Speaker", the entire contents of which are incorporated herein by reference.
技术领域  Technical field
[0002] 本申请涉及信号处理技术领域, 具体涉及一种回声消除方法、 装置及智能音箱 背景技术  [0002] The present application relates to the field of signal processing technologies, and in particular, to an echo cancellation method, device, and smart speaker.
[0003] 随着人们对视听享受的不断追求, 各种智能音箱系统从单声道不断的发展到立 体声多声道音频进行播放, 在播放音频的过程中会存在噪声的干扰, 如音频播 放设备 (扬声器) 和音频采集设备 (麦克风) 都是音箱系统的附属产品。  [0003] With the continuous pursuit of audiovisual enjoyment, various smart speaker systems have continuously developed from mono to stereo multichannel audio for playback, and there will be noise interference during audio playback, such as audio playback equipment (Speakers) and audio capture devices (microphones) are accessories to the speaker system.
[0004] 当扬声器播放的音频通过麦克风采集到系统中从而会产生回声干扰, 使音箱系 统无法识别或播放真正有用的语音信号, 然而目前针对此类回声干扰技术一般 只支持单声道, 无法满足当前主流的多个音频通道中的音频播放 (如 5.1声道或 7 .1声道音频播放) 。  [0004] When the audio played by a speaker is collected into the system through a microphone, echo interference occurs, making the speaker system unable to recognize or play a truly useful voice signal. However, currently such echo interference technologies generally only support mono, which cannot meet the requirements. Audio playback in multiple mainstream audio channels (such as 5.1-channel or 7.1-channel audio playback).
发明概述  Summary of invention
技术问题  technical problem
[0005] 本申请实施例的目的在于: 提供一种回声消除方法、 装置及智能音箱, 旨在解 决现有的回声干扰技术无法满足多个音频通道中音频播放的问题。  [0005] The purpose of the embodiments of the present application is to provide an echo cancellation method, device, and smart speaker, which are intended to solve the problem that the existing echo interference technology cannot satisfy audio playback in multiple audio channels.
问题的解决方案  Problem solution
技术解决方案  Technical solutions
[0006] 为解决上述技术问题, 本申请实施例采用的技术方案是:  [0006] In order to solve the above technical problems, the technical solutions adopted in the embodiments of the present application are:
[0007] 第一方面, 提供了一种回声消除方法, 应用于智能音箱, 所述方法包括:  [0007] In a first aspect, an echo cancellation method is provided, which is applied to a smart speaker. The method includes:
[0008] 获取与扬声器输入端连接的 N个音频通道中对应的 N个第一音频信号; 其中, 所述 Ng2且为整数; [0008] acquiring N first audio signals corresponding to N audio channels connected to a speaker input end; The Ng2 is an integer;
[0009] 将所述 N个第一音频信号进行线性变换后合成一个第二音频信号, 将所述第二 音频信号作为回声消除的参考信号;  [0009] linearly transform the N first audio signals into a second audio signal, and use the second audio signal as a reference signal for echo cancellation;
[0010] 获取麦克风采集的第三音频信号, 根据所述参考信号对所述第三音频信号进行 回声消除后生成第四音频信号。  [0010] acquiring a third audio signal collected by a microphone, and performing echo cancellation on the third audio signal according to the reference signal to generate a fourth audio signal.
[0011] 第二方面, 提供了一种回声消除装置, 所述装置包括:  [0011] In a second aspect, an echo cancellation device is provided, where the device includes:
[0012] 获取模块, 用于获取与扬声器输入端连接的 N个音频通道中对应的 N个第一音 频信号; 其中, 所述 Ng2且为整数;  [0012] an acquisition module, configured to acquire N first audio signals corresponding to N audio channels connected to the speaker input end; wherein Ng2 is an integer;
[0013] 合成模块, 用于将所述 N个第一音频信号进行线性变换后合成一个第二音频信 号, 将所述第二音频信号作为回声消除的参考信号;  [0013] a synthesizing module, configured to linearly transform the N first audio signals to synthesize a second audio signal, and use the second audio signal as a reference signal for echo cancellation;
[0014] 消除模块, 用于获取麦克风采集的第三音频信号, 根据所述参考信号对所述第 三音频信号进行回声消除后生成第四音频信号。  A canceling module, configured to acquire a third audio signal collected by a microphone, and perform echo cancellation on the third audio signal according to the reference signal to generate a fourth audio signal.
[0015] 第三方面, 提供一种智能音箱, 包括存储器、 处理器以及存储在所述存储器中 并可在所述处理器上运行的计算机程序, 所述处理器执行所述计算机程序时实 现上述第一方面的方法的步骤。  [0015] According to a third aspect, a smart speaker is provided, including a memory, a processor, and a computer program stored in the memory and executable on the processor. The processor implements the computer program when the processor executes the computer program. Steps of the first aspect of the method.
[0016] 第四方面, 提供一种计算机可读存储介质, 所述计算机可读存储介质存储有计 算机程序, 所述计算机程序被处理器执行时实现上述第一方面的方法的步骤。  [0016] In a fourth aspect, a computer-readable storage medium is provided. The computer-readable storage medium stores a computer program, and the computer program is executed by a processor to implement the steps of the method of the first aspect.
[0017] 本申请实施例提供的回声消除方法的有益效果在于: 获取与扬声器输入端连接 的 N个音频通道中对应的 N个第一音频信号; 其中, 所述 Ng2且为整数; 将所述 N个第一音频信号进行线性变换后合成一个第二音频信号, 将所述第二音频信号 作为回声消除的参考信号; 获取麦克风采集的第三音频信号, 根据所述参考信 号对所述第三音频信号进行回声消除后生成第四音频信号。 将 N个音频通道中的 N个第一音频信号合成一个第二音频信号作为回声消除的参考信号, 可对多个音 频通道的音频信号进行合成处理作为回声消除的参考信号, 从而对多个音频通 道中的音频信号统一进行回声消除, 无需分别对多个音频通道中的音频信号进 行多次回声消除, 提高了回声消除的效率, 且由于麦克风采集的回声音频信号 是多个音频通道中的音频信号所合成得音频信号, 将多个音频通道中的音频信 号合成一个音频信号作为回声消除的参考信号, 能更准确的模拟回声音频信号 , 可提高消除回声后扬声器输出的音质。 [0017] The beneficial effects of the echo cancellation method provided in the embodiments of the present application are: obtaining N first audio signals corresponding to N audio channels connected to the speaker input end; wherein, Ng2 is an integer; and N first audio signals are linearly transformed into a second audio signal, and the second audio signal is used as a reference signal for echo cancellation; a third audio signal collected by a microphone is obtained, and the third audio signal is collected according to the reference signal. The audio signal is subjected to echo cancellation to generate a fourth audio signal. The N first audio signals in the N audio channels are synthesized into a second audio signal as the reference signal for echo cancellation. The audio signals of multiple audio channels can be synthesized and used as the reference signal for echo cancellation, so that multiple audio channels can be processed. The audio signals in the channels are unified for echo cancellation, eliminating the need to perform multiple echo cancellations on the audio signals in multiple audio channels separately, which improves the efficiency of echo cancellation, and because the echo audio signals collected by the microphones are audio in multiple audio channels The audio signal synthesized by the signal, and the audio signals in multiple audio channels are combined into an audio signal as a reference signal for echo cancellation, which can more accurately simulate the echo audio signal , Can improve the sound quality of the loudspeaker output after echo cancellation.
发明的有益效果  The beneficial effects of the invention
对附图的简要说明  Brief description of the drawings
附图说明  BRIEF DESCRIPTION OF THE DRAWINGS
[0018] 为了更清楚地说明本申请实施例中的技术方案, 下面将对实施例或示范性技术 描述中所需要使用的附图作简单地介绍。  [0018] In order to more clearly explain the technical solutions in the embodiments of the present application, the drawings needed to be used in the embodiments or exemplary technical descriptions will be briefly introduced below.
[0019] 图 1是本申请实施例一提供的回声消除方法的流程示意图;  [0019] FIG. 1 is a schematic flowchart of an echo cancellation method provided in Embodiment 1 of the present application;
[0020] 图 2是本申请实施例二提供的回声消除方法的流程示意图;  [0020] FIG. 2 is a schematic flowchart of an echo cancellation method provided in Embodiment 2 of the present application;
[0021] 图 3是本申请实施例三提供的回声消除方法的流程示意图;  [0021] FIG. 3 is a schematic flowchart of an echo cancellation method provided in Embodiment 3 of the present application;
[0022] 图 4是本申请实施例四提供的回声消除方法的流程示意图;  [0022] FIG. 4 is a schematic flowchart of an echo cancellation method provided in Embodiment 4 of the present application;
[0023] 图 5是本申请实施例五提供的回声消除方法的流程示意图;  5 is a schematic flowchart of an echo cancellation method provided in Embodiment 5 of the present application;
[0024] 图 6是本申请实施例六提供的回声消除方法的流程示意图;  [0024] FIG. 6 is a schematic flowchart of an echo cancellation method provided in Embodiment 6 of the present application;
[0025] 图 7是本申请实施例七提供的回声消除装置示意图;  [0025] FIG. 7 is a schematic diagram of an echo cancellation device provided in Embodiment 7 of the present application;
[0026] 图 8是本申请实施例八提供的智能音箱的结构示意图。  8 is a schematic structural diagram of a smart speaker provided in Embodiment 8 of the present application.
发明实施例  Invention Examples
本发明的实施方式  Embodiments of the invention
[0027] 为了使本申请的目的、 技术方案及优点更加清楚明白, 以下结合附图及实施例 , 对本申请进行进一步详细说明。 应当理解, 此处所描述的具体实施例仅用以 解释本申请, 并不用于限定本申请。  [0027] In order to make the purpose, technical solution, and advantages of the present application clearer and clearer, the present application is further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the application, and are not used to limit the application.
[0028] 需说明的是, 术语“第一”、 “第二”仅用于便于描述目的, 而不能理解为指示或 暗示相对重要性或者隐含指明技术特征的数量。 “多个”的含义是两个或两个以上 , 除非另有明确具体的限定。  [0028] It should be noted that the terms "first" and "second" are only used for descriptive purposes, and cannot be understood as indicating or implying relative importance or implicitly indicating the number of technical features. The meaning of "multiple" is two or more, unless it is specifically and specifically defined otherwise.
[0029] 为了说明本申请所述的技术方案, 以下结合具体附图及实施例进行详细说明。  [0029] In order to explain the technical solution described in this application, detailed descriptions are given below with reference to specific drawings and embodiments.
[0030] 实施例一  Embodiment 1
[0031] 本申请实施例提供的回声消除方法, 可应用于包括扬声器和麦克风的智能音箱 等音频播放设备或系统中, 如图 1所示, 本申请实施例一提供的回声消除方法包 括: [0032] 步骤 S101 获取与扬声器输入端连接的 N个音频通道中对应的 N个第一音频信 号; 其中, 所述 Ng2且为整数; [0031] The echo cancellation method provided in the embodiment of the present application may be applied to an audio playback device or system such as a smart speaker including a speaker and a microphone. As shown in FIG. 1, the echo cancellation method provided in Embodiment 1 of the application includes: [0032] Step S101 obtains N first audio signals corresponding to N audio channels connected to a speaker input end; wherein Ng2 is an integer;
[0033] 在本申请实施例中, 当前主流音箱或音频播放系统为播放如 5.1或 7.1声道的高 质量音效, 可播放多声道的音箱或音频播放系统中包括多个音频通道, 传输多 个声道的音频信号。 上述扬声器可以是一个或多个扬声器, 上述 N个音频通道可 与一个或多个扬声器连接, 当分别从 N个音频通道进行传输的 N个第一音频信号 传输至扬声器时, 获取所述 N个第一音频信号。 上述扬声器是能把电信号转变为 声信号的换能器件。  [0033] In the embodiments of the present application, the current mainstream speaker or audio playback system is to play high-quality sound effects such as 5.1 or 7.1 channels. The speaker or audio playback system capable of playing multiple channels includes multiple audio channels, and transmits multiple audio channels. Audio signals for each channel. The speaker may be one or more speakers, and the N audio channels may be connected to one or more speakers. When N first audio signals transmitted from the N audio channels are transmitted to the speakers, the N audio channels are obtained. First audio signal. The speaker is a transducing device capable of converting an electric signal into an acoustic signal.
[0034] 步骤 S 102, 将所述 N个第一音频信号进行线性变换后合成一个第二音频信号, 将所述第二音频信号作为回声消除的参考信号;  [0034] Step S102: Linearly transform the N first audio signals into a second audio signal, and use the second audio signal as a reference signal for echo cancellation;
[0035] 在本申请实施例中, 上述 N个第一音频信号通过扬声器进行播放, 当麦克风采 集到由上述扬声器播放 N个第一音频信号发出的音频时会造成声学回声现象, 该 声学回声现象是由上述 N个第一音频信号产生, 将上述 N个第一音频信号进行线 性变换后合成一个第二音频信号, 将上述第二音频信号作为用于回声消除的参 考信号。  [0035] In the embodiment of the present application, the N first audio signals are played through a speaker. When the microphone collects audio emitted by the N first audio signals played by the speaker, an acoustic echo phenomenon is caused. The acoustic echo phenomenon It is generated from the N first audio signals, linearly transforms the N first audio signals, and synthesizes a second audio signal, and uses the second audio signal as a reference signal for echo cancellation.
[0036] 在一个实施例中, 将所述 N个第一音频信号进行线性变换后合成第二音频信号 , 包括: 分别获取所述 N个音频通道中进行增益处理的增益值; 根据 N个音频通 道对应的增益值对所述 N个第一音频信号分配对应的权重; 将所述 N个第一音频 信号的幅值分别乘以对应的所述权重后进行累加生成所述第二音频信号。 上述 分别获取所述 N个音频通道中进行增益处理的增益值可以理解为: 在上述 N个音 频通道中由增益放大器对音频信号进行进增益放大处理, 获取上述 N个音频通道 中经增益放大器进行增益放大的系数, 上述增益放大的系数可以是各个音频通 道对应增益放大器中预设的增益放大参数。 上述根据 N个音频通道对应的增益值 对所述 N个第一音频信号分配对应的权重可理解为: 根据 N个音频通道对应的增 益值的大小分配对应的权重, 可预先建立增益值不同大小与对应权重的关系映 射表, 再根据 N个音频通道对应的增益值的大小分配对应的权重。 上述第二音频 信号可理解为麦克风采集的由 N个第一音频信号合成的音频信号。  [0036] In one embodiment, performing linear transformation on the N first audio signals to synthesize a second audio signal includes: obtaining gain values for gain processing in the N audio channels, respectively; according to the N audios A corresponding weight is assigned to the N first audio signals by a channel corresponding gain value; the amplitudes of the N first audio signals are respectively multiplied by the corresponding weights and then accumulated to generate the second audio signal. The above obtaining the gain values of the N audio channels for gain processing can be understood as: performing gain amplification processing on audio signals by the gain amplifier in the N audio channels, and obtaining the gain values of the N audio channels through the gain amplifier. The coefficient of gain amplification. The above-mentioned coefficient of gain amplification may be a preset gain amplification parameter in a gain amplifier corresponding to each audio channel. The above allocating corresponding weights to the N first audio signals according to the gain values corresponding to the N audio channels can be understood as: Assigning corresponding weights according to the size of the gain values corresponding to the N audio channels, and different gain values can be established in advance A mapping table with the corresponding weights, and the corresponding weights are allocated according to the size of the gain values corresponding to the N audio channels. The above-mentioned second audio signal may be understood as an audio signal collected by the microphone and synthesized from the N first audio signals.
[0037] 步骤 S 103 , 获取麦克风采集的第三音频信号, 根据所述参考信号对所述第三音 频信号进行回声消除后生成第四音频信号。 [0037] Step S103: Acquire a third audio signal collected by a microphone, and align the third audio signal according to the reference signal. The frequency signal is subjected to echo cancellation to generate a fourth audio signal.
[0038] 在本申请实施例中, 上述麦克风采集的第三音频信号包括有用音频信号和噪声 音频信号, 上述噪声音频信号包括采集由扬声器发出的 N个第一音频信号合成的 回声音频信号。 上述第四音频信号可理解为对上述第三音频信号消除回声信号 后的音频信号。 可根据上述参考信号对所述第三音频信号进行回声消除后生成 第四音频信号, 具体可将上述参考信号作为根据声学回声消除 (Acoustic Echo Canceller) 技术设计的回声消除器中的参考信号进行回声消除后, 并生成第四音 频信号。  [0038] In the embodiment of the present application, the third audio signal collected by the microphone includes a useful audio signal and a noise audio signal, and the noise audio signal includes collecting an echo audio signal synthesized by N first audio signals sent from a speaker. The fourth audio signal may be understood as an audio signal after the echo signal is eliminated from the third audio signal. The fourth audio signal may be generated by performing echo cancellation on the third audio signal according to the reference signal. Specifically, the reference signal may be echoed as a reference signal in an echo canceller designed according to an Acoustic Echo Canceller technology. After cancellation, a fourth audio signal is generated.
[0039] 在一个实施例中, 获取麦克风采集的第三音频信号, 根据所述参考信号将所述 第三音频信号进行回声消除后生成第四音频信号, 包括: 获取回声消除器根据 所述参考信号生成的回声估计信号; 获取麦克风采集的第三音频信号, 将所述 第三音频信号减去所述回声估计信号后生成所述第四音频信号。 可将上述参考 信号通过声学回声消除器中的自适应滤波器后, 生成回声估计信号, 并通过生 成回声估计信号将麦克风采集的包括有用音频信号和回声音频信号的第三音频 信号进行回声消除, 具体地可将所述第三音频信号减去所述回声估计信号后生 成所述第四音频信号。  [0039] In an embodiment, acquiring a third audio signal collected by a microphone, and performing echo cancellation on the third audio signal according to the reference signal to generate a fourth audio signal includes: acquiring an echo canceller according to the reference An echo estimation signal generated by the signal; acquiring a third audio signal collected by a microphone, and subtracting the echo estimation signal from the third audio signal to generate the fourth audio signal. The reference signal may be passed through an adaptive filter in an acoustic echo canceller to generate an echo estimation signal, and the third audio signal including the useful audio signal and the echo audio signal collected by the microphone may be subjected to echo cancellation by generating the echo estimation signal, Specifically, the fourth audio signal may be generated by subtracting the echo estimation signal from the third audio signal.
[0040] 由此可见, 在本申请实施例中, 将 N个音频通道中的 N个第一音频信号合成一 个第二音频信号作为回声消除的参考信号, 可对多个音频通道的音频信号进行 合成处理作为回声消除的参考信号, 从而对多个音频通道中的音频信号统一进 行回声消除, 无需分别对多个音频通道中的音频信号进行多次回声消除, 提高 了回声消除的效率, 且由于麦克风采集到噪声音频信号中的回声音频信号是多 个音频通道中的音频信号所合成得音频信号, 将多个音频通道中的音频信号合 成一个音频信号作为回声消除的参考信号, 能更准确的模拟回声音频信号, 可 提高消除回声后扬声器输出的音质。  [0040] It can be seen that, in the embodiment of the present application, the N first audio signals in the N audio channels are combined into a second audio signal as a reference signal for echo cancellation, and audio signals of multiple audio channels can be processed. Synthetic processing is used as a reference signal for echo cancellation, thereby performing unified echo cancellation on audio signals in multiple audio channels, eliminating the need for multiple echo cancellations for audio signals in multiple audio channels, which improves the efficiency of echo cancellation, and because The echo audio signal collected by the microphone from the noise audio signal is an audio signal synthesized from audio signals in multiple audio channels. The audio signals in multiple audio channels are combined into an audio signal as a reference signal for echo cancellation, which can be more accurate. Analog echo audio signal can improve the sound quality of the speaker output after echo cancellation.
[0041] 实施例二  [0041] Embodiment Two
[0042] 如图 2所示, 本申请实施例二提供的回声消除方法包括:  [0042] As shown in FIG. 2, the echo cancellation method provided in Embodiment 2 of the present application includes:
[0043] 步骤 S201, 获取与扬声器输入端连接的 N个音频通道中对应的 N个第一音频信 号; 其中, 所述 Ng2且为整数; [0044] 步骤 S202, 将所述 N个第一音频信号进行线性变换后合成一个第二音频信号, 将所述第二音频信号作为回声消除的参考信号; [0043] Step S201: Obtain N first audio signals corresponding to the N audio channels connected to the speaker input end, where Ng2 is an integer; [0044] Step S202: linearly transform the N first audio signals into a second audio signal, and use the second audio signal as a reference signal for echo cancellation;
[0045] 步骤 S203, 获取麦克风采集的第三音频信号, 根据所述参考信号将对所述第三 音频信号进行回声消除后生成第四音频信号。  [0045] Step S203: Acquire a third audio signal collected by the microphone, and perform echo cancellation on the third audio signal according to the reference signal to generate a fourth audio signal.
[0046] 在一个实施例中, 在获取麦克风采集的第三音频信号, 根据所述参考信号将所 述第三音频信号进行回声消除后生成第四音频信号之后, 包括: 根据所述第四 音频信号和预设的标准音频信号通过音频质量感知评价算法 PEAQ计算音频信号 差异值, 并判断所述音频信号差异值是否在预设的音频信号差异范围内; 若所 述音频信号差异值不在预设的音频信号差异范围内, 则将所述音频信号差异值 返回至所述回声消除器, 使所述回声消除器根据所述音频信号差异值调节滤波 系数。 音频质量感知评价算法 PEAQ(Perceptual Evaluation of Audio Quality)可通 过模仿人耳的听觉系统, 对参考信号和测试信号进行分析对比得出音频质量的 客观评价差异值, 可预先存储扬声器的标准音频信号作为 PEAQ中的参考信号, 上述消除回声的第四音频信号作为 PEAQ中的测试信号, 可根据第四音频信号和 预设的标准音频信号通过 PEAQ计算音频信号差异等值。 上述回声消除器接收到 根据音频信号差异值时, 可根据音频信号差异值调节滤波系数 (增大或减小滤 波系数) 直至音频信号差异值在预设的音频信号差异范围内。  [0046] In one embodiment, after acquiring a third audio signal collected by a microphone, and performing echo cancellation on the third audio signal according to the reference signal to generate a fourth audio signal, the method includes: according to the fourth audio The signal and the preset standard audio signal calculate the audio signal difference value through the audio quality perception evaluation algorithm PEAQ, and determine whether the audio signal difference value is within a preset audio signal difference range; if the audio signal difference value is not in a preset Within the audio signal difference range, return the audio signal difference value to the echo canceller, so that the echo canceller adjusts a filter coefficient according to the audio signal difference value. The audio quality perception evaluation algorithm PEAQ (Perceptual Evaluation of Audio Quality) can imitate the hearing system of the human ear, analyze and compare the reference signal and the test signal to obtain an objective evaluation difference in audio quality, and store the standard audio signal of the speaker as The reference signal in PEAQ and the above-mentioned echo-cancelled fourth audio signal are used as test signals in PEAQ, and the audio signal difference and equivalent value can be calculated by PEAQ according to the fourth audio signal and a preset standard audio signal. When the echo canceller receives the difference value of the audio signal, it can adjust the filter coefficient (increase or decrease the filter coefficient) according to the difference value of the audio signal until the difference value of the audio signal is within a preset difference range of the audio signal.
[0047] 在本申请实施例中, 上述步骤 S201、 S202和 S203分别与上述步骤 S101、 S102 和 S103有相同或相似的地方, 具体可参见上述步骤 S101至 S103的相关描述, 对 此不在赘述。  [0047] In the embodiment of the present application, the above steps S201, S202, and S203 are the same or similar to the above steps S101, S102, and S103, respectively. For details, refer to the related description of the above steps S101 to S103, and details are not described herein again.
[0048] 步骤 S204, 将所述第四音频信号进行分频处理后分别输入对应的所述 N个音频 通道, 并通过增益处理后输入至与所述 N个音频通道连接的所述扬声器, 以指示 所述扬声器播放通过增益处理后的所述第四音频信号。  [0048] Step S204: Frequency-divide the fourth audio signal and input the corresponding N audio channels respectively, and then input the fourth audio signal to the speakers connected to the N audio channels after gain processing. Instruct the speaker to play the fourth audio signal that has been processed by gain.
[0049] 在本申请实施例中, 上述第四音频信号为消除回声后的有用音频信号, 将上述 第四音频信号进行分频处理后生成对应的 N个音频信号, 并输入对应的 N个音频 通道进行增益放大处理后, 由与所述 N个音频通道连接的一个或多个扬声器进行 播放。  [0049] In the embodiment of the present application, the fourth audio signal is a useful audio signal after echo cancellation. The fourth audio signal is frequency-divided to generate corresponding N audio signals, and the corresponding N audio signals are input. After the channels are subjected to gain amplification processing, playback is performed by one or more speakers connected to the N audio channels.
[0050] 由此可见, 在本申请实施例中, 将 N个音频通道中的 N个第一音频信号合成一 个第二音频信号作为回声消除的参考信号, 可对多个音频通道的音频信号进行 合成处理作为回声消除的参考信号, 从而对多个音频通道中的音频信号统一进 行回声消除, 无需分别对多个音频通道中的音频信号进行多次回声消除, 提高 了回声消除的效率, 且由于麦克风采集到噪声音频信号中的回声音频信号是多 个音频通道中的音频信号所合成得音频信号, 将多个音频通道中的音频信号合 成一个音频信号作为回声消除的参考信号, 能更准确的模拟回声音频信号, 可 提高消除回声后扬声器输出的音质。 [0050] It can be seen that, in the embodiment of the present application, the N first audio signals in the N audio channels are combined into one. Two second audio signals are used as reference signals for echo cancellation, and audio signals of multiple audio channels can be synthesized as reference signals for echo cancellation, thereby performing unified echo cancellation on audio signals in multiple audio channels, without the need to separately The audio signals in each audio channel are subjected to multiple echo cancellations, which improves the efficiency of echo cancellation, and because the echo audio signals in the noise audio signals collected by the microphone are audio signals synthesized from the audio signals in multiple audio channels, multiple The audio signals in each audio channel are combined into an audio signal as a reference signal for echo cancellation, which can more accurately simulate the echo audio signal, and can improve the sound quality of the loudspeaker output after echo cancellation.
[0051] 实施例三  Embodiment 3
[0052] 如图 3所示, 本申请实施例三提供的回声消除方法包括:  [0052] As shown in FIG. 3, the echo cancellation method provided in Embodiment 3 of the present application includes:
[0053] 步骤 S301, 获取与扬声器输入端连接的 N个音频通道中对应的 N个第一音频信 号; 其中, 所述 Ng2且为整数。  [0053] Step S301: Obtain N first audio signals corresponding to the N audio channels connected to the speaker input end, where Ng2 is an integer.
[0054] 步骤 S302, 将所述 N个第一音频信号进行线性变换后合成一个第二音频信号, 将所述第二音频信号作为回声消除的参考信号。  [0054] Step S302, linearly transform the N first audio signals into a second audio signal, and use the second audio signal as a reference signal for echo cancellation.
[0055] 在本申请实施例中, 上述步骤 S301和 S302分别与上述步骤 S101和 S102有相同或 相似的地方, 具体可参见上述步骤 S101至 S102的相关描述, 对此不在赘述。  [0055] In the embodiment of the present application, the above steps S301 and S302 are the same or similar to the above steps S101 and S102, respectively. For details, refer to the related descriptions of the above steps S101 to S102, and details are not described herein.
[0056] 步骤 S303: 检测智能音箱的工作模式。  [0056] Step S303: Detect the working mode of the smart speaker.
[0057] 在本申请实施例中, 检测智能音箱当前的工作模式, 具体的上述工作模式有语 音工作模式和音乐播放模式, 语音工作模式包括语音播放和电话通话等使用场 景, 所述音乐播放模式包括播放音乐等使用场景。  [0057] In the embodiment of the present application, the current working mode of the smart speaker is detected. The specific working modes mentioned above include a voice working mode and a music playing mode. The voice working mode includes use scenarios such as voice playing and telephone calling. The music playing mode Including usage scenarios such as playing music.
[0058] 步骤 S304, 获取麦克风采集的第三音频信号和回声消除器根据所述参考信号和 所述工作模式生成的回声估计信号, 将所述第三音频信号减去所述回声估计信 号后生成第四音频信号。  [0058] Step S304: Acquire a third audio signal collected by a microphone and an echo estimation signal generated by the echo canceller according to the reference signal and the working mode, and subtract the echo estimation signal from the third audio signal to generate the Fourth audio signal.
[0059] 具体的, 当麦克风采集到第三音频信号, 根据检测到的智能音箱当前工作模式 , 选择与当前工作模式相对应的回声消除器中自适应滤波系数, 并基于参考信 号生成回声估计信号, 将所述第三音频信号减去所述回声估计信号后生成第四 音频信号, 以进行回声消除。  [0059] Specifically, when the microphone collects a third audio signal, according to the detected current working mode of the smart speaker, an adaptive filter coefficient in the echo canceller corresponding to the current working mode is selected, and an echo estimation signal is generated based on the reference signal. And subtracting the echo estimation signal from the third audio signal to generate a fourth audio signal to perform echo cancellation.
[0060] 本申请实施例中, 由于可根据智能音箱的工作模式通过回声消除器对采集的音 频信号进行回声消除, 可以在智能音箱的的不同模式下, 针对不同模式的特点 进行对应的回声消除, 可有效减少回声消除的误差。 [0060] In the embodiment of the present application, since the collected audio signal can be echo-cancelled by the echo canceller according to the working mode of the smart speaker, the characteristics of different modes can be targeted in different modes of the smart speaker. Performing corresponding echo cancellation can effectively reduce the error of echo cancellation.
[0061] 实施例四  Embodiment 4
[0062] 本实施例是对实施例三的进一步说明, 本实施例与实施例三相同或相似的地方 具体可参见实施例三的相关描述, 此处不再赘述, 所述回声消除器包括第一自 适应滤波器和第二自适应滤波器, 如图 4所示, 上述步骤 S304包括:  [0062] This embodiment is a further description of the third embodiment. For the same or similar parts of this embodiment as those of the third embodiment, reference may be made to the related description of the third embodiment, which will not be repeated here. The echo canceller includes the first embodiment. An adaptive filter and a second adaptive filter. As shown in FIG. 4, the above step S304 includes:
[0063] 步骤 S401 : 若所述智能音箱的工作模式为第一预设工作模式, 获取所述第一自 适应滤波器根据所述参考信号生成的所述回声估计信号;  [0063] Step S401: if the working mode of the smart speaker is a first preset working mode, obtain the echo estimation signal generated by the first adaptive filter according to the reference signal;
[0064] 在本申请实施例中, 第一预设工作模式包括的智能音箱的第一使用场景, 当检 测到智能音箱处于第一使用场景, 即表示智能音箱处于第一预设工作模式下, 获取预设的与第一预设工作模式下对应预设的第一自适应滤波器生成的回声估 计信号, 根据回声估计信号对第三音频信号进行回声消除。  [0064] In the embodiment of the present application, when the first use scenario of the smart speaker included in the first preset working mode is detected, it means that the smart speaker is in the first preset working mode, Acquire a preset echo estimation signal generated by a first adaptive filter that is preset corresponding to a preset in the first preset working mode, and perform echo cancellation on the third audio signal according to the echo estimation signal.
[0065] 在一个实施例中, 上述第一预设工作模式可以是语音工作模式, 第一预设工作 模式为语音工作模式时对应的第一使用场景, 如智能音箱处于语音播放和电话 通话等使用场景。  [0065] In an embodiment, the first preset working mode may be a voice working mode, and a first use scenario corresponding to the first preset working mode when the first working mode is the voice working mode, such as a smart speaker in voice playback and a phone call. scenes to be used.
[0066] 步骤 S402: 若所述智能音箱的工作模式为第二预设工作模式, 获取所述第二自 适应滤波器根据所述参考信号生成的所述回声估计信号。  [0066] Step S402: if the working mode of the smart speaker is a second preset working mode, obtain the echo estimation signal generated by the second adaptive filter according to the reference signal.
[0067] 在本申请实施例中, 第二预设工作模式包括智能音箱的第二使用场景, 当检测 到智能音箱处于第二使用场景, 即表示智能音箱处于第二预设工作模式下, 通 过预设的与第二预设工作模式下对应预设的第二自适应滤波器生成的回声估计 信号, 根据回声估计信号对第三音频信号进行回声消除。  [0067] In the embodiment of the present application, the second preset working mode includes a second use scenario of the smart speaker. When it is detected that the smart speaker is in the second use scenario, it means that the smart speaker is in the second preset working mode. A predetermined echo estimation signal generated by a preset second adaptive filter corresponding to a preset in the second preset working mode, and performing echo cancellation on the third audio signal according to the echo estimation signal.
[0068] 在一个实施例中, 上述第二预设工作模式可以是音乐播放工作模式, 第二预设 工作模式为音乐模式时对应的第二使用场景。 如智能音箱处于音乐播放等使用 场景。  [0068] In one embodiment, the above-mentioned second preset working mode may be a music playback working mode, and a second usage scenario corresponding to the second preset working mode when the second preset working mode is a music mode. For example, the smart speaker is used in music playback.
[0069] 实施例五  [0069] Embodiment Five
[0070] 本实施例是对实施例四的进一步说明, 本实施例与实施例四相同或相似的地方 具体可参见实施例四的相关描述, 此处不再赘述, 如图 5所示, 上述步骤 S401包 括:  [0070] This embodiment is a further description of the fourth embodiment. For the same or similar places of this embodiment and the fourth embodiment, reference may be made to the related description of the fourth embodiment, and details are not described herein again, as shown in FIG. 5. Step S401 includes:
[0071] 步骤 S501 : 若所述智能音箱的工作模式为语音工作模式, 通过最小均方算法确 定与所述语音工作模式对应的第一自适应滤波器的系数; [0071] Step S501: If the working mode of the smart speaker is a voice working mode, determine it by using a minimum mean square algorithm. Determining coefficients of a first adaptive filter corresponding to the voice working mode;
[0072] 在本申请实施例中, 当麦克风采集到第三音频信号且所述智能音箱的工作模式 为语音工作模式时, 通过最小均方 (LMS, Least Mean Squares)算法确定与所述语 音工作模式对应的第一自适应滤波器的系数; 由于 RLS算法具有良好的收敛性能 , 除收敛速度快于递推最小二乘 (RLS, Recursive Least Squares)算法以及稳定性强 夕卜, 而且具有更高的起始收敛速率、 更小的权噪声和更大的抑噪能力。 因此在 检测到是语音信号时, 采用 LMS确定与所述语音工作模式对应的第一自适应滤 波器的系数会使得第一自适应滤波器对第三语音信号进行回声消除的抑噪能力 更好。  [0072] In the embodiment of the present application, when a third audio signal is collected by a microphone and the working mode of the smart speaker is a voice working mode, it is determined to work with the voice by using a Least Mean Squares (LMS) algorithm. Coefficients of the first adaptive filter corresponding to the mode; because the RLS algorithm has good convergence performance, except that the convergence speed is faster than the recursive least squares (RLS, Recursive Least Squares) algorithm and the stability is strong, and has higher stability Initial convergence rate, smaller weight noise, and greater noise suppression. Therefore, when a voice signal is detected, determining the coefficient of the first adaptive filter corresponding to the voice working mode by using LMS will make the first adaptive filter perform better noise suppression on the third voice signal. .
[0073] 步骤 S502: 根据所述第一自适应滤波器的系数和所述参考信号生成的所述回声 估计信号。  [0073] Step S502: the echo estimation signal generated according to the coefficient of the first adaptive filter and the reference signal.
[0074] 具体的, 调节第一自适应滤波器的系数, 将参考信号通过第一自适应滤波器, 生成回声估计信号, 将所述第三音频信号减去回声估计信号后生成所述第四音 频信号。  [0074] Specifically, adjusting a coefficient of the first adaptive filter, passing a reference signal through the first adaptive filter to generate an echo estimation signal, and subtracting the third audio signal from the echo estimation signal to generate the fourth audio signal.
[0075] 在本申请实施例中, 在智能音箱处于语音工作模式下通过第一自适应滤波器对 麦克风采集的语音信号进行回声消除, 针对该模式的特点进行回声消除, 可有 效减少回声消除的误差。  [0075] In the embodiment of the present application, when a smart speaker is in a voice working mode, echo cancellation is performed on a voice signal collected by a microphone through a first adaptive filter, and echo cancellation is performed according to characteristics of this mode, which can effectively reduce echo cancellation. error.
[0076] 实施例六  Embodiment 6
[0077] 本实施例是对实施例四的进一步说明, 本实施例与实施例四相同或相似的地方 具体可参见实施例四的相关描述, 此处不再赘述, 如图 6所示, 上述步骤 S402包 括:  [0077] This embodiment is a further description of the fourth embodiment. For the same or similar places of this embodiment and the fourth embodiment, reference may be made to the related description of the fourth embodiment, which will not be repeated here. As shown in FIG. Step S402 includes:
[0078] 步骤 S601 : 若所述智能音箱的工作模式为音乐播放模式, 通过递推最小二乘算 法确定与所述音乐播放模式对应的第二自适应滤波器的系数;  [0078] Step S601: if the working mode of the smart speaker is a music playback mode, determine a coefficient of a second adaptive filter corresponding to the music playback mode by recursive least squares algorithm;
[0079] 在本申请实施例中, 当麦克风采集到第三音频信号且所述智能音箱的工作模式 为音乐播放模式时, 通过 RLS算法确定与所述音乐播放模式对应的第二自适应滤 波器的系数; 由于音乐具有多种频率分量, 因为 RLS算法具有比 LMS对非平稳信 号适应性强, 其滤波性能明显好于 LMS算法, 采用 RLS确定与所述音乐播放模式 对应的第二自适应滤波器的系数会使得第二自适应滤波器对第三语音信号进行 回声消除的适应能力更强。 [0079] In the embodiment of the present application, when a third audio signal is collected by a microphone and the working mode of the smart speaker is a music playback mode, a second adaptive filter corresponding to the music playback mode is determined by an RLS algorithm. Because the music has multiple frequency components, because the RLS algorithm has better adaptability to non-stationary signals than the LMS, its filtering performance is significantly better than the LMS algorithm, and the second adaptive filtering corresponding to the music playback mode is determined using RLS Coefficients of the filter will cause the second adaptive filter to process the third speech signal. Echo cancellation is more adaptive.
[0080] 步骤 S602: 根据所述第二自适应滤波器的系数和所述参考信号生成的所述回声 估计信号。  [0080] Step S602: the echo estimation signal generated according to the coefficient of the second adaptive filter and the reference signal.
[0081] 具体的, 调节第二自适应滤波器的系数, 将参考信号通过第二自适应滤波器, 生成回声估计信号, 将所述第三音频信号减去回声估计信号后生成所述第四音 频信号。  [0081] Specifically, adjusting the coefficients of the second adaptive filter, passing the reference signal through the second adaptive filter to generate an echo estimation signal, and subtracting the third audio signal from the echo estimation signal to generate the fourth audio signal.
[0082] 在本申请实施例中, 在智能音箱处于音乐播放模式下通过第二自适应滤波器对 麦克风采集的语音信号进行回声消除, 针对该模式的特点进行回声消除, 可有 效减少回声消除的误差。  [0082] In the embodiment of the present application, echo cancellation is performed on a voice signal collected by a microphone through a second adaptive filter when the smart speaker is in a music playback mode, and echo cancellation is performed according to the characteristics of this mode, which can effectively reduce echo cancellation. error.
[0083] 实施例七  Embodiment 7
[0084] 本申请实施例提供一种回声消除装置, 可集成于包括扬声器和麦克风的智能音 箱等音频播放设备或系统中, 用于执行实施例一至实施例六中的方法步骤, 为 了便于说明, 仅示出于本申请相关的部分, 如图 7所示, 所述回声消除装置 700 包括:  [0084] An embodiment of the present application provides an echo cancellation device, which can be integrated into an audio playback device or system such as a smart speaker including a speaker and a microphone, and is configured to execute the method steps in Embodiments 1 to 6. For ease of description, Only shown in relevant parts of this application. As shown in FIG. 7, the echo cancellation device 700 includes:
[0085] 获取模块 701 用于获取与扬声器输入端连接的 N个音频通道中对应的 N个第一 音频信号; 其中, 所述 Ng2且为整数;  [0085] The acquisition module 701 is configured to acquire N first audio signals corresponding to the N audio channels connected to the speaker input end, where Ng2 is an integer;
[0086] 合成模块 702, 用于将所述 N个第一音频信号进行线性变换后合成一个第二音频 信号, 将所述第二音频信号作为回声消除的参考信号;  A synthesizing module 702, configured to linearly transform the N first audio signals into a second audio signal, and use the second audio signal as a reference signal for echo cancellation;
[0087] 在一个实施例中, 所述合成模块 702包括:  [0087] In an embodiment, the composition module 702 includes:
[0088] 第一获取单元, 用于分别获取所述 N个音频通道中进行增益处理的增益值; [0088] a first acquisition unit, configured to acquire gain values for gain processing in the N audio channels, respectively;
[0089] 分配单元, 用于根据 N个音频通道对应的增益值对所述 N个第一音频信号分配 对应的权重; [0089] an assigning unit, configured to assign corresponding weights to the N first audio signals according to the gain values corresponding to the N audio channels;
[0090] 累加单元, 用于将所述 N个第一音频信号的幅值分别乘以对应的所述权重后进 行累加生成所述第二音频信号。  [0090] an accumulating unit, configured to multiply the amplitudes of the N first audio signals by the corresponding weights, and then accumulate to generate the second audio signal.
[0091] 消除模块 703 , 用于获取麦克风采集的第三音频信号, 根据所述参考信号对所 述第三音频信号进行回声消除后生成第四音频信号。  [0091] The cancellation module 703 is configured to acquire a third audio signal collected by a microphone, and perform echo cancellation on the third audio signal according to the reference signal to generate a fourth audio signal.
[0092] 在一个实施例中, 所述消除模块 702包括:  [0092] In one embodiment, the elimination module 702 includes:
[0093] 第二获取单元, 用于获取通过自适应滤波器根据所述参考信号生成的回声估计 信号; [0093] a second obtaining unit, configured to obtain an echo estimate generated by the adaptive filter according to the reference signal Signal
[0094] 生成单元, 用于获取麦克风采集的第三音频信号, 将所述第三音频信号减去所 述回声估计信号后生成所述第四音频信号。  [0094] a generating unit, configured to obtain a third audio signal collected by a microphone, and subtract the echo estimation signal from the third audio signal to generate the fourth audio signal.
[0095] 在一个实施例中, 所述回声消除装置 700还包括:  [0095] In an embodiment, the echo cancellation device 700 further includes:
[0096] 分频处理模块, 用于将所述第四音频信号进行分频处理后分别输入对应的所述 N个音频通道, 并通过增益处理后输入至与所述 N个音频通道连接的所述扬声器 , 以指示所述扬声器播放通过增益处理后的所述第四音频信号。  [0096] a frequency division processing module, configured to divide the fourth audio signal into the corresponding N audio channels after frequency division processing, and input the signals to all the channels connected to the N audio channels after gain processing; And speaking the speaker to instruct the speaker to play the fourth audio signal that has been processed by gain.
[0097] 在一个实施例中, 所述回声消除装置 700还包括:  [0097] In an embodiment, the echo cancellation device 700 further includes:
[0098] 判断模块, 用于根据所述第四音频信号和预设的标准音频信号通过音频质量感 知评价算法 PEAQ计算音频信号差异值, 并判断所述音频信号差异值是否在预设 的音频信号差异范围内; 若所述音频信号差异值不在预设的音频信号差异范围 内, 则将所述音频信号差异值返回至所述自适应滤波器, 使所述自适应滤波器 根据所述音频信号差异值调节滤波系数。  [0098] a judging module, configured to calculate an audio signal difference value through an audio quality perception evaluation algorithm PEAQ according to the fourth audio signal and a preset standard audio signal, and determine whether the audio signal difference value is within a preset audio signal Within the difference range; if the audio signal difference value is not within a preset audio signal difference range, returning the audio signal difference value to the adaptive filter, so that the adaptive filter is based on the audio signal The difference value adjusts the filter coefficient.
[0099] 在一个实施例中, 所述回声消除装置 700还包括检测模块, 用于检测智能音箱 的工作模式;  [0099] In one embodiment, the echo cancellation device 700 further includes a detection module for detecting a working mode of the smart speaker;
[0100] 对应地, 所述第二获取单元具体用于:  [0100] Correspondingly, the second obtaining unit is specifically configured to:
[0101] 获取回声消除器根据所述参考信号和所述工作模式生成的回声估计信号。  [0101] acquiring an echo estimation signal generated by the echo canceller according to the reference signal and the operating mode.
[0102] 在一个实施例中, 所述回声消除器包括第一自适应滤波器和第二自适应滤波器 , 所述第二获取单元还具体用于:  [0102] In one embodiment, the echo canceller includes a first adaptive filter and a second adaptive filter, and the second obtaining unit is further specifically configured to:
[0103] 若所述智能音箱的工作模式为第一预设工作模式, 获取所述第一自适应滤波器 根据所述参考信号生成的所述回声估计信号;  [0103] if the working mode of the smart speaker is a first preset working mode, acquiring the echo estimation signal generated by the first adaptive filter according to the reference signal;
[0104] 若所述智能音箱的工作模式为第二预设工作模式, 获取所述第二自适应滤波器 根据所述参考信号生成的所述回声估计信号。  [0104] if the working mode of the smart speaker is a second preset working mode, acquiring the echo estimation signal generated by the second adaptive filter according to the reference signal.
[0105] 在一个实施例中, 所述第一预设工作模式为语音工作模式。  [0105] In one embodiment, the first preset working mode is a voice working mode.
[0106] 在一个实施例中, 所述第二获取单元具体用于: 通过最小均方算法确定与所述 语音工作模式对应的第一自适应滤波器的系数;  [0106] In one embodiment, the second obtaining unit is specifically configured to: determine a coefficient of a first adaptive filter corresponding to the voice working mode by using a minimum mean square algorithm;
[0107] 根据所述第一自适应滤波器的系数和所述参考信号生成的所述回声估计信号。  [0107] the echo estimation signal generated according to the coefficients of the first adaptive filter and the reference signal.
[0108] 根据所述第一自适应滤波器的系数和所述参考信号生成的所述回声估计信号。 [0109] 在一个实施例中, 所述第二预设工作模式为音乐播放模式。 [0108] the echo estimation signal generated according to the coefficients of the first adaptive filter and the reference signal. [0109] In one embodiment, the second preset working mode is a music playback mode.
[0110] 在一个实施例中, 所述第二获取单元具体用于: 通过递推最小二乘算法确定与 所述音乐播放模式对应的第二自适应滤波器的系数;  [0110] In an embodiment, the second obtaining unit is specifically configured to determine a coefficient of a second adaptive filter corresponding to the music playback mode by a recursive least square algorithm;
[0111] 根据所述第二自适应滤波器的系数和所述参考信号生成的所述回声估计信号。  [0111] the echo estimation signal generated according to a coefficient of the second adaptive filter and the reference signal.
[0112] 由此可见, 在本申请实施例中, 将 N个音频通道中的 N个第一音频信号合成一 个第二音频信号作为回声消除的参考信号, 可对多个音频通道的音频信号进行 合成处理作为回声消除的参考信号, 从而对多个音频通道中的音频信号统一进 行回声消除, 无需分别对多个音频通道中的音频信号进行多次回声消除, 提高 了回声消除的效率, 且由于麦克风采集到噪声音频信号中的回声音频信号是多 个音频通道中的音频信号所合成得音频信号, 将多个音频通道中的音频信号合 成一个音频信号作为回声消除的参考信号, 能更准确的模拟回声音频信号, 可 提高消除回声后扬声器输出的音质。  [0112] It can be seen that, in the embodiment of the present application, the N first audio signals in the N audio channels are combined into a second audio signal as a reference signal for echo cancellation, and audio signals of multiple audio channels can be processed. Synthetic processing is used as a reference signal for echo cancellation, thereby performing unified echo cancellation on audio signals in multiple audio channels, eliminating the need for multiple echo cancellations for audio signals in multiple audio channels, which improves the efficiency of echo cancellation, and because The echo audio signal collected by the microphone from the noise audio signal is an audio signal synthesized from audio signals in multiple audio channels. The audio signals in multiple audio channels are combined into an audio signal as a reference signal for echo cancellation, which can be more accurate. Analog echo audio signal can improve the sound quality of the speaker output after echo cancellation.
[0113] 实施例八  Embodiment 8
[0114] 如图 8所示, 是本申请实施例提供的智能音箱的结构示意图。 所述智能音箱 800 包括: 处理器 801、 存储器 802以及存储在上述存储器 802中并可在上述处理器 80 1上运行的计算机程序 803。 上述处理器 801执行上述计算机程序 803时实现上述 回声消除方法实施例中的步骤, 例如上述实施例中的方法步骤。  [0114] FIG. 8 is a schematic structural diagram of a smart speaker according to an embodiment of the present application. The smart speaker 800 includes: a processor 801, a memory 802, and a computer program 803 stored in the memory 802 and executable on the processor 801. When the processor 801 executes the computer program 803, the steps in the embodiment of the echo cancellation method are implemented, for example, the method steps in the foregoing embodiment.
[0115] 示例性的, 上述计算机程序 803可以被分割成一个或多个单元 /模块, 上述一个 或者多个单元 /模块被存储在上述存储器 802中, 并由上述处理器 801执行, 以完 成本申请。 上述一个或多个单元 /模块可以是能够完成特定功能的一系列计算机 程序指令段, 该指令段用于描述上述计算机程序 803在上述智能音箱 800中的执 行过程。  [0115] Exemplarily, the computer program 803 may be divided into one or more units / modules, and the one or more units / modules are stored in the memory 802 and executed by the processor 801 to complete the present invention. Application. The one or more units / modules may be a series of computer program instruction segments capable of performing specific functions, and the instruction segments are used to describe the execution process of the computer program 803 in the smart speaker 800 described above.
[0116] 所述处理器 801可以是中央处理单元 (Central Processing Unit, CPU) , 还可以是 其它通用处理器、 数字信号处理器 (Digital Signal Processor, DSP)、 专用集成电 路 (Application Specific Integrated Circuit, ASIC)、 现成可编程门阵列  [0116] The processor 801 may be a central processing unit (CPU), or may be other general-purpose processors, digital signal processors (DSPs), and application specific integrated circuits (Application Specific Integrated Circuits, ASIC), off-the-shelf programmable gate array
(Field-Programmable Gate Array, FPGA)或者其它可编程逻辑器件、 分立门或者 晶体管逻辑器件、 分立硬件组件等。 通用处理器可以是微处理器或者该处理器 也可以是任何常规的处理器等。 [0117] 所述存储器 802可以是智能音箱 800的内部存储单元, 例如智能音箱 800的硬盘 或内存。 上述存储器 802也可以是上述智能音箱 800的外部存储设备, 例如上述 智能音箱 800上配备的插接式硬盘, 智能存储卡 (Smart Media Card, SMC) , 安 全数字 (Secure Digital, SD) 卡, 闪存卡 (Flash Card) 等。 进一步地, 上述存储 器 802还可以既包括上述智能音箱 800的内部存储单元也包括外部存储设备。 上 述存储器 802用于存储上述计算机程序以及上述智能音箱 800所需的其它程序和 数据。 上述存储器 802还可以用于暂时地存储已经输出或者将要输出的数据。 (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. A general-purpose processor may be a microprocessor or the processor may be any conventional processor or the like. [0117] The memory 802 may be an internal storage unit of the smart speaker 800, such as a hard disk or a memory of the smart speaker 800. The memory 802 may also be an external storage device of the smart speaker 800, for example, a plug-in hard disk, a smart media card (SMC), a secure digital (SD) card, and a flash memory provided on the smart speaker 800. Card (Flash Card), etc. Further, the memory 802 may include both the internal storage unit of the smart speaker 800 and an external storage device. The memory 802 is configured to store the computer program and other programs and data required by the smart speaker 800. The memory 802 may also be used to temporarily store data that has been output or is to be output.
[0118] 本领域技术人员可以理解, 图 8仅仅是智能音箱 800的示例, 并不构成对智能音 箱 800的限定, 可以包括比图示更多或更少的部件, 或者组合某些部件, 或者不 同的部件, 例如上述智能音箱 800还可以包括输入输出设备、 网络接入设备、 总 线等。  [0118] Those skilled in the art can understand that FIG. 8 is only an example of the smart speaker 800, and does not constitute a limitation on the smart speaker 800. The smart speaker 800 may include more or fewer components than shown, or some components may be combined, or Different components, for example, the above-mentioned smart speaker 800 may further include an input-output device, a network access device, a bus, and the like.
[0119] 所属领域的技术人员可以清楚地了解到, 为了描述的方便和简洁, 仅以上述各 功能单元、 模块的划分进行举例说明, 实际应用中, 可以根据需要而将上述功 能分配由不同的功能单元、 模块完成, 即将上述装置的内部结构划分成不同的 功能单元或模块, 以完成以上描述的全部或者部分功能。 实施例中的各功能单 元、 模块可以集成在一个处理单元中, 也可以是各个单元单独物理存在, 也可 以两个或两个以上单元集成在一个单元中, 上述集成的单元既可以采用硬件的 形式实现, 也可以采用软件功能单元的形式实现。 另外, 各功能单元、 模块的 具体名称也只是为了便于相互区分, 并不用于限制本申请的保护范围。 上述智 能音箱中单元、 模块的具体工作过程, 可以参考前述方法实施例中的对应过程 , 在此不再赘述。  [0119] Those skilled in the art can clearly understand that, for the convenience and brevity of the description, only the above-mentioned division of functional units and modules is used as an example. In actual applications, the foregoing functions may be allocated by different functions according to requirements. The functional units and modules are completed, that is, the internal structure of the device is divided into different functional units or modules to complete all or part of the functions described above. Each functional unit and module in the embodiment may be integrated into one processing unit, or each unit may exist separately physically, or two or more units may be integrated into one unit, and the integrated unit may use hardware. It can be implemented in the form of software functional units. In addition, the specific names of the functional units and modules are only for the convenience of distinguishing from each other, and are not used to limit the protection scope of this application. For the specific working process of the unit and module in the smart speaker, reference may be made to the corresponding process in the foregoing method embodiment, and details are not described herein again.
[0120] 在上述实施例中, 对各个实施例的描述都各有侧重, 某个实施例中没有详述或 记载的部分, 可以参见其它实施例的相关描述。  [0120] In the foregoing embodiments, the description of each embodiment has its own emphasis. For a part that is not detailed or recorded in an embodiment, reference may be made to related descriptions of other embodiments.
[0121] 本领域普通技术人员可以意识到, 结合本文中所公开的实施例描述的各示例的 单元及算法步骤, 能够以电子硬件、 或者计算机软件和电子硬件的结合来实现 。 这些功能究竟以硬件还是软件方式来执行, 取决于技术方案的特定应用和设 计约束条件。 专业技术人员可以对每个特定的应用来使用不同方法来实现所描 述的功能, 但是这种实现不应认为超出本申请的范围。 [0122] 在本申请所提供的实施例中, 应该理解到, 所揭露的装置和方法, 可以通过其 它的方式实现。 例如, 以上所描述的装置实施例仅仅是示意性的, 例如, 上述 模块或单元的划分, 仅仅为一种逻辑功能划分, 实际实现时可以有另外的划分 方式, 例如多个单元或组件可以结合或者可以集成到另一个系统, 或一些特征 可以忽略, 或不执行。 另一点, 所显示或讨论的相互之间的耦合或直接耦合或 通讯连接可以是通过一些接口, 装置或单元的间接耦合或通讯连接, 可以是电 性, 机械或其它的形式。 [0121] Those of ordinary skill in the art may realize that the units and algorithm steps of the examples described in combination with the embodiments disclosed herein can be implemented by electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are performed by hardware or software depends on the specific application and design constraints of the technical solution. Professional technicians can use different methods to implement the described functions for each specific application, but such implementation should not be considered beyond the scope of this application. [0122] In the embodiments provided in this application, it should be understood that the disclosed apparatus and method may be implemented in other manners. For example, the device embodiments described above are only schematic. For example, the division of the foregoing modules or units is only a logical function division. In actual implementation, there may be another division manner. For example, multiple units or components may be combined. Or it can be integrated into another system, or some features can be ignored, or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, the indirect coupling or communication connection of the device or unit, and may be electrical, mechanical or other forms.
[0123] 上述作为分离部件说明的单元可以是或者也可以不是物理上分开的, 作为单元 显示的部件可以是或者也可以不是物理单元, 即可以位于一个地方, 或者也可 以分布到多个网络单元上。 可以根据实际的需要选择其中的部分或者全部单元 来实现本申请实施例方案的目的。  [0123] The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. on. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions in the embodiments of the present application.
[0124] 另外, 在本申请各个实施例中的各功能单元可以集成在一个处理单元中, 也可 以是各个单元单独物理存在, 也可以两个或两个以上单元集成在一个单元中。 上述集成的单元既可以采用硬件的形式实现, 也可以采用软件功能单元的形式 实现。  [0124] In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist separately physically, or two or more units may be integrated into one unit. The above integrated unit may be implemented in the form of hardware or in the form of software functional unit.
[0125] 上述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用 时, 可以存储在一个计算机可读取存储介质中。 基于这样的理解, 本申请实现 上述实施例方法中的全部或部分流程, 也可以通过计算机程序来指令相关的硬 件来完成, 上述的计算机程序可存储于一计算机可读存储介质中, 该计算机程 序在被处理器执行时, 可实现上述各个方法实施例的步骤。 其中, 上述计算机 程序包括计算机程序代码, 上述计算机程序代码可以为源代码形式、 对象代码 形式、 可执行文件或某些中间形式等。 上述计算机可读介质可以包括: 能够携 带上述计算机程序代码的任何实体或装置、 记录介质、 U盘、 移动硬盘、 磁碟、 光盘、 计算机存储器、 只读存储器 (ROM, Read-Only Memory) 、 随机存取存 储器 (RAM, Random Access Memory) 、 电载波信号、 电信信号以及软件分发 介质等。  [0125] If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on such an understanding, this application implements all or part of the processes in the method in the foregoing embodiment, and may also be completed by a computer program instructing related hardware. The computer program may be stored in a computer-readable storage medium. The computer program When executed by a processor, the steps of the foregoing method embodiments may be implemented. The computer program includes computer program code, and the computer program code may be in a source code form, an object code form, an executable file, or some intermediate form. The above computer-readable medium may include: any entity or device capable of carrying the above computer program code, a recording medium, a U disk, a mobile hard disk, a magnetic disk, an optical disk, a computer memory, a read-only memory (ROM, Read-Only Memory), a random Access memory (RAM, Random Access Memory), electric carrier signals, telecommunication signals, and software distribution media.
[0126] 以上仅为本申请的可选实施例而已, 并不用于限制本申请。 对于本领域的技术 人员来说, 本申请可以有各种更改和变化。 凡在本申请的精神和原则之内, 所 作的任何修改、 等同替换、 改进等, 均应包含在本申请的权利要求范围之内。 [0126] The above are only optional embodiments of the present application, and are not used to limit the present application. For those skilled in the art, this application may have various modifications and changes. All within the spirit and principles of this application Any modification, equivalent replacement, improvement, etc. shall be included in the scope of the claims of this application.

Claims

权利要求书 Claim
[权利要求 1] 一种回声消除方法, 应用于智能音箱, 其特征在于, 所述方法包括: 获取与扬声器输入端连接的 N个音频通道中对应的 N个第一音频信号 ; 其中, 所述 Ng2且为整数;  [Claim 1] An echo cancellation method applied to a smart speaker, characterized in that the method includes: acquiring N first audio signals corresponding to N audio channels connected to a speaker input end; wherein, the Ng2 is an integer;
将所述 N个第一音频信号进行线性变换后合成一个第二音频信号, 将 所述第二音频信号作为回声消除的参考信号;  Linearly transform the N first audio signals into a second audio signal, and use the second audio signal as a reference signal for echo cancellation;
获取麦克风采集的第三音频信号, 根据所述参考信号对所述第三音频 信号进行回声消除后生成第四音频信号。  A third audio signal collected by a microphone is acquired, and an echo cancellation is performed on the third audio signal according to the reference signal to generate a fourth audio signal.
[权利要求 2] 如权利要求 1所述的回声消除方法, 其特征在于, 将所述 N个第一音 频信号进行线性变换后合成第二音频信号, 包括: 分别获取所述 N个音频通道中进行增益处理的增益值;  [Claim 2] The echo cancellation method according to claim 1, wherein linearly transforming the N first audio signals to synthesize a second audio signal comprises: obtaining the N audio channels respectively. Gain value for gain processing;
根据 N个音频通道对应的增益值对所述 N个第一音频信号分配对应的 权重;  Assign corresponding weights to the N first audio signals according to the gain values corresponding to the N audio channels;
将所述 N个第一音频信号的幅值分别乘以对应的所述权重后进行累加 生成所述第二音频信号。  Multiplying the amplitudes of the N first audio signals by the corresponding weights and accumulating them to generate the second audio signal.
[权利要求 3] 如权利要求 1所述的回声消除方法, 其特征在于, 获取麦克风采集的 第三音频信号, 根据所述参考信号将所述第三音频信号进行回声消除 后生成第四音频信号, 包括:  [Claim 3] The echo cancellation method according to claim 1, wherein a third audio signal collected by a microphone is acquired, and the third audio signal is subjected to echo cancellation according to the reference signal to generate a fourth audio signal. Including:
获取回声消除器根据所述参考信号生成的回声估计信号;  Obtaining an echo estimation signal generated by the echo canceller according to the reference signal;
获取麦克风采集的第三音频信号, 将所述第三音频信号减去所述回声 估计信号后生成所述第四音频信号。  Acquiring a third audio signal collected by a microphone, and subtracting the echo estimation signal from the third audio signal to generate the fourth audio signal.
[权利要求 4] 如权利要求 1至 3任一项所述的回声消除方法, 其特征在于, 在根据所 述参考信号将所述第三音频信号进行回声消除后生成第四音频信号之 后, 还包括:  [Claim 4] The echo cancellation method according to any one of claims 1 to 3, wherein after the third audio signal is subjected to echo cancellation according to the reference signal, a fourth audio signal is generated, and Including:
将所述第四音频信号进行分频处理后分别输入对应的所述 N个音频通 道, 并通过增益处理后输入至与所述 N个音频通道连接的所述扬声器 , 以指示所述扬声器播放通过增益处理后的所述第四音频信号。  Frequency-dividing the fourth audio signal into the corresponding N audio channels and inputting the fourth audio signal to the speakers connected to the N audio channels after gain processing to instruct the speakers to pass through The fourth audio signal after gain processing.
[权利要求 5] 如权利要求 3所述的回声消除方法, 其特征在于, 在获取麦克风采集 的第三音频信号, 根据所述参考信号将所述第三音频信号进行回声消 除后生成第四音频信号之后, 还包括: [Claim 5] The echo cancellation method according to claim 3, characterized in that: After the third audio signal is echo-cancelled according to the reference signal to generate a fourth audio signal, the method further includes:
根据所述第四音频信号和预设的标准音频信号通过音频质量感知评价 算法 PEAQ计算音频信号差异值, 并判断所述音频信号差异值是否在 预设的音频信号差异范围内;  Calculating an audio signal difference value through an audio quality perception evaluation algorithm PEAQ according to the fourth audio signal and a preset standard audio signal, and determining whether the audio signal difference value is within a preset audio signal difference range;
若所述音频信号差异值不在预设的音频信号差异范围内, 则将所述音 频信号差异值返回至所述回声消除器, 使所述回声消除器根据所述音 频信号差异值调节滤波系数。  If the audio signal difference value is not within a preset audio signal difference range, returning the audio signal difference value to the echo canceller, so that the echo canceller adjusts a filter coefficient according to the audio signal difference value.
[权利要求 6] 如权利要求 3所述的回声消除方法, 其特征在于, 所述回声消除方法 还包括: 检测智能音箱的工作模式;  [Claim 6] The echo cancellation method according to claim 3, wherein the echo cancellation method further comprises: detecting a working mode of the smart speaker;
对应地, 所述获取回声消除器根据所述参考信号生成的回声估计信号 , 具体包括:  Correspondingly, the acquiring the echo estimation signal generated by the echo canceller according to the reference signal specifically includes:
获取回声消除器根据所述参考信号和所述工作模式生成的回声估计信 号。  Acquire an echo estimation signal generated by the echo canceller according to the reference signal and the operating mode.
[权利要求 7] 如权利要求 6所述的回声消除方法, 其特征在于, 所述回声消除器包 括第一自适应滤波器和第二自适应滤波器, 获取回声消除器根据所述 参考信号和所述工作模式生成的回声估计信号, 包括:  [Claim 7] The echo canceling method according to claim 6, wherein the echo canceller includes a first adaptive filter and a second adaptive filter, and the echo canceler is obtained according to the reference signal and The echo estimation signal generated by the working mode includes:
若所述智能音箱的工作模式为第一预设工作模式, 获取所述第一自适 应滤波器根据所述参考信号生成的所述回声估计信号;  If the working mode of the smart speaker is a first preset working mode, obtaining the echo estimation signal generated by the first adaptive filter according to the reference signal;
若所述智能音箱的工作模式为第二预设工作模式, 获取所述第二自适 应滤波器根据所述参考信号生成的所述回声估计信号。  If the working mode of the smart speaker is a second preset working mode, obtain the echo estimation signal generated by the second adaptive filter according to the reference signal.
[权利要求 8] 如权利要求 7所述的回声消除方法, 其特征在于, 所述第一预设工作 模式为语音工作模式。  [Claim 8] The echo cancellation method according to claim 7, wherein the first preset working mode is a voice working mode.
[权利要求 9] 如权利要求 8所述的回声消除方法, 其特征在于, 获取所述第一自适 应滤波器根据所述参考信号生成的所述回声估计信号, 包括: 通过最小均方算法确定与所述语音工作模式对应的第一自适应滤波器 的系数; [Claim 9] The echo cancellation method according to claim 8, wherein the acquiring the echo estimation signal generated by the first adaptive filter according to the reference signal comprises: determining by a least mean square algorithm Coefficients of a first adaptive filter corresponding to the voice working mode;
根据所述第一自适应滤波器的系数和所述参考信号生成的所述回声估 计信号。 The echo estimate generated according to the coefficients of the first adaptive filter and the reference signal 计 信号。 Meter signal.
[权利要求 10] 如权利要求 7所述的回声消除方法, 其特征在于, 所述第二预设工作 模式为音乐播放模式。  [Claim 10] The echo cancellation method according to claim 7, wherein the second preset working mode is a music playback mode.
[权利要求 11] 如权利要求 10所述的回声消除方法, 其特征在于, 获取所述第二自适 应滤波器根据所述参考信号生成的所述回声估计信号, 包括: 通过递推最小二乘算法确定与所述音乐播放模式对应的第二自适应滤 波器的系数; [Claim 11] The echo cancellation method according to claim 10, wherein acquiring the echo estimation signal generated by the second adaptive filter according to the reference signal comprises: recursive least squares An algorithm determining a coefficient of a second adaptive filter corresponding to the music playback mode;
根据所述第二自适应滤波器的系数和所述参考信号生成的所述回声估 计信号。  The echo estimation signal generated based on the coefficients of the second adaptive filter and the reference signal.
[权利要求 12] 一种回声消除装置, 其特征在于, 所述装置包括:  [Claim 12] An echo cancellation device, characterized in that the device includes:
获取模块, 用于获取与扬声器输入端连接的 N个音频通道中对应的 N 个第一音频信号; 其中, 所述 Ng2且为整数;  An acquisition module, configured to acquire N first audio signals corresponding to the N audio channels connected to the speaker input end; wherein Ng2 is an integer;
合成模块, 用于将所述 N个第一音频信号进行线性变换后合成一个第 二音频信号, 将所述第二音频信号作为回声消除的参考信号; 消除模块, 用于获取麦克风采集的第三音频信号, 根据所述参考信号 对所述第三音频信号进行回声消除后生成第四音频信号。  A synthesis module, configured to linearly transform the N first audio signals into a second audio signal, and using the second audio signal as a reference signal for echo cancellation; a cancellation module configured to obtain a third signal collected by a microphone An audio signal, and performing echo cancellation on the third audio signal according to the reference signal to generate a fourth audio signal.
[权利要求 13] 如权利要求 12所述的回声消除装置, 其特征在于, 所述合成模块包括 第一获取单元, 用于分别获取所述 N个音频通道中进行增益处理的增 益值;  [Claim 13] The echo cancellation device according to claim 12, wherein the synthesis module includes a first acquisition unit configured to acquire gain values of gain processing in the N audio channels, respectively;
分配单元, 用于根据 N个音频通道对应的增益值对所述 N个第一音频 信号分配对应的权重;  An allocation unit, configured to allocate corresponding weights to the N first audio signals according to the gain values corresponding to the N audio channels;
累加单元, 用于将所述 N个第一音频信号的幅值分别乘以对应的所述 权重后进行累加生成所述第二音频信号。  And an accumulating unit, configured to multiply the amplitudes of the N first audio signals by corresponding weights, and then accumulate to generate the second audio signal.
[权利要求 14] 如权利要求 12所述的回声消除装置, 其特征在于, 所述消除模块包括 第二获取单元, 用于获取回声消除器根据所述参考信号生成的回声估 计信号; 生成单元, 用于获取麦克风采集的第三音频信号, 将所述第三音频信 号减去所述回声估计信号后生成所述第四音频信号。 [Claim 14] The echo cancellation device according to claim 12, wherein the cancellation module includes a second acquisition unit, configured to acquire an echo estimation signal generated by the echo canceller according to the reference signal; A generating unit is configured to obtain a third audio signal collected by a microphone, and subtract the echo estimation signal from the third audio signal to generate the fourth audio signal.
[权利要求 15] 如权利要求 12至 14任一项所述的回声消除装置, 其特征在于, 所述装 置还包括:  [Claim 15] The echo cancellation device according to any one of claims 12 to 14, wherein the device further comprises:
分频处理模块, 用于将所述第四音频信号进行分频处理后分别输入对 应的所述 N个音频通道, 并通过增益处理后输入至与所述 N个音频通 道连接的所述扬声器, 以指示所述扬声器播放通过增益处理后的所述 第四音频信号。  A frequency division processing module, configured to frequency-divide the fourth audio signal and input the corresponding N audio channels respectively, and input the signals to the speakers connected to the N audio channels after gain processing, To instruct the speaker to play the fourth audio signal that has been processed by gain.
[权利要求 16] 如权利要求 14所述的回声消除装置, 其特征在于, 所述装置还包括: 判断模块, 用于根据所述第四音频信号和预设的标准音频信号通过音 频质量感知评价算法 PEAQ计算音频信号差异值, 并判断所述音频信 号差异值是否在预设的音频信号差异范围内;  [Claim 16] The echo cancellation device according to claim 14, wherein the device further comprises: a judging module, configured to perform an audio quality perception evaluation according to the fourth audio signal and a preset standard audio signal. The algorithm PEAQ calculates an audio signal difference value, and determines whether the audio signal difference value is within a preset audio signal difference range;
若所述音频信号差异值不在预设的音频信号差异范围内, 则将所述音 频信号差异值返回至所述回声消除器, 使所述回声消除器根据所述音 频信号差异值调节滤波系数。  If the audio signal difference value is not within a preset audio signal difference range, returning the audio signal difference value to the echo canceller, so that the echo canceller adjusts a filter coefficient according to the audio signal difference value.
[权利要求 17] 如权利要求 14所述的回声消除装置, 其特征在于, 所述装置还包括检 测模块, 用于检测智能音箱的工作模式;  [Claim 17] The echo cancellation device according to claim 14, wherein the device further comprises a detection module for detecting a working mode of the smart speaker;
对应地, 所述第二获取单元具体用于:  Correspondingly, the second obtaining unit is specifically configured to:
获取回声消除器根据所述参考信号和所述工作模式生成的回声估计信 号。  Acquire an echo estimation signal generated by the echo canceller according to the reference signal and the operating mode.
[权利要求 18] 如权利要求 17所述的回声消除装置, 其特征在于, 所述回声消除器包 括第一自适应滤波器和第二自适应滤波器, 所述第二获取单元还具体 用于:  [Claim 18] The echo cancellation device according to claim 17, wherein the echo canceller includes a first adaptive filter and a second adaptive filter, and the second obtaining unit is further specifically configured to: :
若所述智能音箱的工作模式为第一预设工作模式, 获取所述第一自适 应滤波器根据所述参考信号生成的所述回声估计信号;  If the working mode of the smart speaker is a first preset working mode, obtaining the echo estimation signal generated by the first adaptive filter according to the reference signal;
若所述智能音箱的工作模式为第二预设工作模式, 获取所述第二自适 应滤波器根据所述参考信号生成的所述回声估计信号。  If the working mode of the smart speaker is a second preset working mode, obtain the echo estimation signal generated by the second adaptive filter according to the reference signal.
[权利要求 19] 一种智能音箱, 包括存储器、 处理器以及存储在所述存储器中并可在 所述处理器上运行的计算机程序, 其特征在于, 所述处理器执行所述 计算机程序时实现如权利要求 1至 11任一项所述方法的步骤。 [Claim 19] A smart speaker, comprising a memory, a processor, and stored in the memory and accessible in The computer program running on the processor is characterized in that, when the processor executes the computer program, the steps of the method according to any one of claims 1 to 11 are implemented.
[权利要求 20] 一种计算机可读存储介质, 所述计算机可读存储介质存储有计算机程 序, 其特征在于, 所述计算机程序被处理器执行时实现如权利要求 1 至 11任一项所述方法的步骤。  [Claim 20] A computer-readable storage medium storing a computer program, wherein when the computer program is executed by a processor, the computer program is implemented according to any one of claims 1 to 11. Method steps.
PCT/CN2019/108343 2018-09-27 2019-09-27 Echo cancellation method, device and intelligent loudspeaker box WO2020063798A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN201811130274.7A CN110956973A (en) 2018-09-27 2018-09-27 Echo cancellation method and device and intelligent terminal
CN201811130274.7 2018-09-27
CN201811561782.0A CN111356058B (en) 2018-12-20 2018-12-20 Echo cancellation method and device and intelligent sound box
CN201811561782.0 2018-12-20

Publications (1)

Publication Number Publication Date
WO2020063798A1 true WO2020063798A1 (en) 2020-04-02

Family

ID=69953399

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/108343 WO2020063798A1 (en) 2018-09-27 2019-09-27 Echo cancellation method, device and intelligent loudspeaker box

Country Status (1)

Country Link
WO (1) WO2020063798A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022083502A1 (en) * 2020-10-22 2022-04-28 广东美的白色家电技术创新中心有限公司 Voice interaction method and related apparatus, and method for establishing correspondence

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1996026592A1 (en) * 1995-02-24 1996-08-29 Ericsson Inc. Apparatus and method for canceling acoustic echoes including non-linear distortions in loudspeaker telephones
US20040174991A1 (en) * 2001-07-11 2004-09-09 Yamaha Corporation Multi-channel echo cancel method, multi-channel sound transfer method, stereo echo canceller, stereo sound transfer apparatus and transfer function calculation apparatus
CN107105366A (en) * 2017-06-15 2017-08-29 歌尔股份有限公司 A kind of multi-channel echo eliminates circuit, method and smart machine
CN107123430A (en) * 2017-04-12 2017-09-01 广州视源电子科技股份有限公司 Echo cancel method, device, meeting flat board and computer-readable storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1996026592A1 (en) * 1995-02-24 1996-08-29 Ericsson Inc. Apparatus and method for canceling acoustic echoes including non-linear distortions in loudspeaker telephones
US20040174991A1 (en) * 2001-07-11 2004-09-09 Yamaha Corporation Multi-channel echo cancel method, multi-channel sound transfer method, stereo echo canceller, stereo sound transfer apparatus and transfer function calculation apparatus
CN107123430A (en) * 2017-04-12 2017-09-01 广州视源电子科技股份有限公司 Echo cancel method, device, meeting flat board and computer-readable storage medium
CN107105366A (en) * 2017-06-15 2017-08-29 歌尔股份有限公司 A kind of multi-channel echo eliminates circuit, method and smart machine

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022083502A1 (en) * 2020-10-22 2022-04-28 广东美的白色家电技术创新中心有限公司 Voice interaction method and related apparatus, and method for establishing correspondence

Similar Documents

Publication Publication Date Title
CN106664473B (en) Information processing apparatus, information processing method, and program
KR101250124B1 (en) Apparatus and Method for Computing Control Information for an Echo Suppression Filter and Apparatus and Method for Computing a Delay Value
JP6389232B2 (en) Short latency multi-driver adaptive noise cancellation (ANC) system for personal audio devices
US9210504B2 (en) Processing audio signals
CN105144674B (en) Multi-channel echo is eliminated and noise suppressed
JP5762956B2 (en) System and method for providing noise suppression utilizing nulling denoising
JP4780119B2 (en) Head-related transfer function measurement method, head-related transfer function convolution method, and head-related transfer function convolution device
JP5533248B2 (en) Audio signal processing apparatus and audio signal processing method
US20090046866A1 (en) Apparatus capable of performing acoustic echo cancellation and a method thereof
CN110956973A (en) Echo cancellation method and device and intelligent terminal
US11711061B2 (en) Customized automated audio tuning
CN102968999B (en) Audio signal processing
US20140349638A1 (en) Signal processing control in an audio device
EP3671740B1 (en) Method of compensating a processed audio signal
GB2550457A (en) Method and apparatus for acoustic crosstalk cancellation
CN111356058B (en) Echo cancellation method and device and intelligent sound box
JP6873549B2 (en) Audio equipment and computer readable programs
WO2020063798A1 (en) Echo cancellation method, device and intelligent loudspeaker box
JP5163685B2 (en) Head-related transfer function measurement method, head-related transfer function convolution method, and head-related transfer function convolution device
US11228837B2 (en) Processing device, processing method, reproduction method, and program
TW202331701A (en) Echo cancelling method for dual-microphone array, echo cancelling device for dual-microphone array, electronic equipment, and computer-readable medium
JP5698110B2 (en) Multi-channel echo cancellation method, multi-channel echo cancellation apparatus, and program
JP2021097293A (en) Echo canceling device, echo canceling method, and echo canceling program
JP5249633B2 (en) Sound collecting / reproducing apparatus with characteristic difference function between channels and method thereof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19866721

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19866721

Country of ref document: EP

Kind code of ref document: A1