RU2485607C2

RU2485607C2 - Apparatus and method for computing filter coefficients for echo suppression

Info

Publication number: RU2485607C2
Application number: RU2010132161A
Authority: RU
Inventors: Fabian KÜCH; Markus KALLINGER; Christof FALLER; Alexis FAVROT
Original assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten
Priority date: 2008-01-31
Filing date: 2009-01-16
Publication date: 2013-06-20

Abstract

FIELD: information technology.

SUBSTANCE: an apparatus (200) for computing filter coefficients for an adaptive filter (210) that filters a microphone signal so as to suppress an echo due to a loudspeaker signal includes an extractor (250) for extracting a stationary component signal or a non-stationary component signal from the loudspeaker signal or from a signal derived from the loudspeaker signal, and a computer (270) for computing the filter coefficients for the adaptive filter (210) on the basis of the extracted stationary component signal or the extracted non-stationary component signal.

EFFECT: improved sound quality in systems for suppressing or compensating for echo.

24 cl, 20 dwg

Description

Embodiments of the present invention relate to an apparatus and methods for computing filter coefficients for an adaptive filter that suppresses, in a microphone signal, an echo caused by a loudspeaker signal. They may be employed, for example, in conferencing systems. Conferencing systems here include telephone conferencing, video conferencing, and other interactive bidirectional communication systems.

Acoustic echoes arise when tones, sounds, and noises from a loudspeaker are picked up by a microphone located in the same room or in the same acoustic environment. In telecommunication systems, this acoustic feedback signal is transmitted back to the far-end subscriber, who hears a delayed version of their own speech. In such situations echo signals are distracting and annoying and can even prevent interactive full-duplex communication. In addition, acoustic echoes can cause howling and other instabilities of the acoustic feedback loop.

WO 2006/111370 A1 describes an apparatus and a method for removing echo from a multichannel audio signal. Acoustic echo control and noise suppression are an essential part of any hands-free telecommunication system, such as telephone, audio, or video conferencing systems. The method described there for processing multichannel loudspeaker signals and at least one microphone signal comprises the following steps: transforming the microphone input signal into short-time spectra; computing a combined loudspeaker short-time spectrum from the loudspeaker signals; computing a combined microphone short-time spectrum from the microphone input signal; estimating the magnitude spectrum or the power spectrum of the echo in the combined microphone short-time spectrum; computing a gain filter that modifies the magnitude of the microphone input short-time spectrum; applying the gain filter to at least one microphone input spectrum; and converting the filtered microphone input spectrum back to the time domain.

The echo suppression and echo cancellation systems used today, also referred to collectively as echo removal systems, often perform poorly for many kinds of sounds, tones, and noises despite their use of adaptive filters. For example, if one component predominates over another in a communication system, the loudspeaker echo in the microphone signal may be suppressed non-optimally. Conversely, when the composition of components from different sources deviates, the use of echo suppression or echo cancellation can give rise to tonal artifacts, which are likewise perceived as extremely annoying.

Starting from this prior art, the object of the present invention is therefore to improve the sound quality of echo suppression or echo cancellation systems.

This object is achieved by an apparatus according to claim 1, a method according to claim 23 or 25, or a program according to claim 26.

In one embodiment, an apparatus for computing filter coefficients for an adaptive filter that filters a microphone signal so as to suppress an echo caused by a loudspeaker signal comprises an extractor for extracting a stationary component signal or a non-stationary component signal from the loudspeaker signal or from a signal derived from the loudspeaker signal. The apparatus further comprises a computer for computing the filter coefficients for the adaptive filter on the basis of the extracted stationary or non-stationary component signal.

Correspondingly, an embodiment of a method for computing filter coefficients for an adaptive filter that filters a microphone signal so as to suppress an echo caused by a loudspeaker signal comprises extracting a stationary component signal or a non-stationary component signal from the loudspeaker signal or from a signal derived from the loudspeaker signal, and computing the filter coefficients for the adaptive filter on the basis of the extracted stationary or non-stationary component signal.

Embodiments of the present invention are based on the finding that sound quality can be improved by better exploiting the statistical properties of the loudspeaker signal, or of a signal derived from it, which are analyzed when the filter coefficients for echo suppression are computed. To this end, the loudspeaker signal or the derived signal is analyzed in order to extract one or more stationary and/or non-stationary component signals. The filter coefficients of the adaptive filter are then computed on the basis of the extracted stationary or non-stationary component signal.

A stationary component of a signal, for example of the loudspeaker signal or of a signal derived from it, may exhibit, for instance in the frequency domain, an energy-related value that varies only slightly over time, or may form a corresponding stationary component signal. The stationary component can therefore be determined, for example, in the frequency domain by computing an energy-related value for the respective bandpass signal and averaging it over time. The averaging may be a moving average and may use different computational parameters. It can be computed recursively using an IIR-type (infinite impulse response) filter structure or, equally, using an FIR-type (finite impulse response) filter structure.
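The temporal averaging described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names, the smoothing factor `alpha`, and the window `length` are assumptions introduced here, and the input is assumed to be an array of per-band energy values with one row per time frame.

```python
import numpy as np


def stationary_power_iir(band_power, alpha=0.9):
    """First-order recursive (IIR-type) average of per-band power over time.

    band_power: array of shape (frames, bands) with energy-related values.
    alpha: smoothing factor in [0, 1); an assumed, illustrative parameter.
    """
    band_power = np.asarray(band_power, dtype=float)
    smoothed = np.empty_like(band_power)
    state = band_power[0].copy()  # initialise with the first frame
    for m, frame in enumerate(band_power):
        state = alpha * state + (1.0 - alpha) * frame
        smoothed[m] = state
    return smoothed


def stationary_power_fir(band_power, length=4):
    """Moving (FIR-type) average over the most recent `length` frames."""
    band_power = np.asarray(band_power, dtype=float)
    smoothed = np.empty_like(band_power)
    for m in range(len(band_power)):
        start = max(0, m - length + 1)  # shorter window at the beginning
        smoothed[m] = band_power[start:m + 1].mean(axis=0)
    return smoothed
```

In both variants a short burst of energy in one band is damped in the output, so the result follows only the slowly varying part of the band energy.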

Correspondingly, a non-stationary component of the loudspeaker signal or of the derived signal can be determined on the basis of the corresponding bandpass signal. In embodiments of the present invention, the non-stationary component can be determined from the stationary component signal and a gain filter. The gain filter may depend on at least one control parameter, which in embodiments of the invention is determined, for example, from a coherence function that takes into account the loudspeaker signal and the microphone signal or signals derived from them.
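One way to picture how a gain filter can separate the non-stationary part from a band signal is sketched below. The passage above does not state the gain rule, so the subtraction-style gain used here is an assumption chosen for illustration only. The control parameter `beta` is simply a fixed number in this sketch, whereas the text says it may be derived from a coherence function. All names are hypothetical.

```python
import numpy as np


def nonstationary_power(inst_power, stat_power, beta=1.0, floor=0.0):
    """Estimate the non-stationary part of the per-band power.

    inst_power: instantaneous per-band power of the loudspeaker signal.
    stat_power: stationary per-band power (e.g. a temporal average).
    beta: control parameter scaling how much stationary power is removed
          (assumed rule, not taken from the source).
    floor: lower bound of the gain, keeping it non-negative.
    """
    inst = np.asarray(inst_power, dtype=float)
    stat = np.asarray(stat_power, dtype=float)
    eps = 1e-12  # avoids division by zero in silent bands
    gain = np.maximum(1.0 - beta * stat / (inst + eps), floor)
    return gain * inst
```

With `beta = 1`, a band whose instantaneous power equals its stationary power yields a non-stationary estimate close to zero, while a band with a sudden energy increase keeps most of that increase.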

In embodiments of the invention, first filter coefficients are computed from the stationary component signal and second filter coefficients from the non-stationary component signal, and the filter coefficients for the adaptive filter are ultimately determined on the basis of these. The filter coefficients of the adaptive filter may correspond to a series connection of a first filter based on the first filter coefficients and a second filter based on the second filter coefficients. Embodiments also allow the filter coefficients to be determined on the basis of the first filter coefficients as well as on the basis of the second filter coefficients.
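The series connection mentioned above can be illustrated with a short sketch. It assumes that both filters are real-valued per-band gains applied in the frequency domain, which this passage does not state explicitly. Under that assumption, cascading two filters is equivalent to multiplying their gains band by band. The function names are hypothetical.

```python
import numpy as np


def combine_series(h_first, h_second):
    """Per-band gain of two gain filters connected in series."""
    return np.asarray(h_first, dtype=float) * np.asarray(h_second, dtype=float)


def apply_gain(spectrum, gain):
    """Apply a per-band gain to one short-time spectrum."""
    return np.asarray(gain, dtype=float) * np.asarray(spectrum)


# Filtering with the combined gain equals filtering twice in sequence.
mic_spectrum = np.array([1.0 + 1.0j, 2.0 - 1.0j, 0.5j])
h_stat = np.array([1.0, 0.5, 0.8])     # illustrative first filter
h_nonstat = np.array([0.5, 0.5, 1.0])  # illustrative second filter
combined = apply_gain(mic_spectrum, combine_series(h_stat, h_nonstat))
cascaded = apply_gain(apply_gain(mic_spectrum, h_stat), h_nonstat)
```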

Further embodiments of the present invention and their operation are described below. Depending on the embodiment, both the stationary and the non-stationary component signals may be estimated from the corresponding signals. Furthermore, embodiments of the present invention may include a filtering device that filters the microphone signal on the basis of the filter coefficients.

Embodiments of the present invention are explained in more detail below with reference to the accompanying drawings, in which:

Fig. 1 shows a typical situation in which acoustic echo removal is required;

Fig. 2 is a schematic block diagram of an apparatus for computing filter coefficients according to an embodiment of the present invention;

Figs. 3A to 3C are schematic block diagrams of variants of the extractor according to embodiments of the present invention;

Figs. 4A and 4B are schematic block diagrams of an optional echo estimation filter according to embodiments of the present invention;

Figs. 5A to 5E are schematic block diagrams of variants of the computer in embodiments of the present invention;

Fig. 6 is a schematic block diagram of a further embodiment of the present invention;

Fig. 7 is a schematic block diagram of an embodiment of the invention in the form of an apparatus for computing filter coefficients;

Figs. 8a to 8c illustrate the separation of the stationary and non-stationary components of a loudspeaker signal;

Fig. 9a shows the echo suppression filter for the non-stationary component of the signal at a frequency of 1 kHz;

Fig. 9b shows the associated echo suppression filter for the non-stationary component of that signal;

Figs. 10a to 10c show the relation between the levels of the stationary and non-stationary components, the prediction gain, and the voice activity of the loudspeaker channel;

Fig. 11 is a schematic block diagram of a further embodiment of the present invention;

Fig. 12 is a schematic block diagram of a further embodiment of the present invention;

Fig. 13 is a schematic block diagram of a multichannel embodiment of the present invention;

Fig. 14 shows an example of grouping the spectrum of a uniform short-time Fourier transform to mimic the non-uniform frequency resolution of human hearing;

Fig. 15a shows the Hann interpolation filters used for smoothing the gain filter over frequency;

Fig. 15b shows the interpolated gain filter coefficients.

Before embodiments of the present invention, which suppress acoustic echo by separating stationary and non-stationary signal components, are described in detail with reference to Figs. 2 to 15, Fig. 1 illustrates a typical situation in which acoustic echo removal is required.

The following reference numerals are used in the drawings:

100 loudspeaker;

110 microphone;

120 acoustic environment;

130 loudspeaker signal;

140 microphone signal;

150 echo removal unit;

160 echo-suppressed signal;

170 direct path;

180 indirect path;

200 apparatus (for computing filter coefficients);

210 adaptive filter;

220 input;

230 time/frequency converter;

240 echo estimation filter;

250 extractor;

260 echo estimation filter;

270 computer;

280 input;

290 time/frequency converter;

300 frequency/time converter;

310 output;

320 averager;

330 gain filter;

340 parameter calculator;

350 distributor;

360 filter stage;

370 filter parameter calculator;

380 combiner;

390 selector;

400 parameter determiner;

410 distributor;

420 curve;

430 curve;

440 brace;

450 brace;

460 arrow;

470 echo estimation filter;

480 delay device;

490 energy value calculator;

500 energy value calculator;

510 energy value calculator;

520 additional computer;

530 grouping unit;

540 additional grouping unit.

Acoustic echoes arise whenever a microphone picks up tones, sounds, or noises emitted by a loudspeaker located in the same room or in the same acoustic environment. In telecommunication systems, these acoustic feedback signals are transmitted back to the far-end talker, who perceives them as an echo of their own speech. In such situations echo signals can be very distracting and can even disrupt interactive full-duplex communication. In addition, acoustic echoes can cause howling and other instabilities of the acoustic feedback loop. Hands-free telecommunication systems that provide full-duplex communication therefore require echo control to remove the coupling between loudspeaker and microphone. Fig. 1 illustrates the acoustic echo problem.

Fig. 1 shows a loudspeaker 100 and a microphone 110 located in a common acoustic environment 120, which may be, for example, a room. The acoustic environment 120 may equally be the interior of a vehicle.

In Fig. 1, a loudspeaker signal 130, denoted x[n], where the time index n is an integer, is fed to the loudspeaker 100. The microphone 110 picks up the noises, sounds, and tones of the acoustic environment 120 and generates a microphone signal 140, denoted y[n]. Both the loudspeaker signal 130 and the microphone signal 140 are provided as inputs to the echo removal unit 150, which derives from the microphone signal 140 an echo-suppressed signal 160, denoted e[n], at its output.

Fig. 1 thus illustrates the acoustic echo problem as it arises in bidirectional communication systems. The far-end signal of the telecommunication link, converted into sound by the loudspeaker, reaches the microphone along a direct path 170 and along reflected paths 180-1 and 180-2, which are also called indirect paths. As a result, the microphone 110 picks up not only the local near-end speech but also the echo, which is then transmitted back to the far end.

In other words, the loudspeaker signal x[n] is fed back into the microphone signal y[n]. Ideally, the echo removal unit 150 removes this echo completely while letting the near-end speech signal pass through.

The conventional way to deal with this echo is to place an acoustic echo canceller (AEC) in parallel with the echo path, as described in [1]. Such an acoustic echo canceller estimates a digital replica of the echo signal and subtracts it from the measured microphone signal. Standard approaches to acoustic echo cancellation rely on the assumption that the echo path can be modeled by an FIR (finite impulse response) filter, and acoustic echo cancellers are implemented accordingly, as also described in [C. Breining, P. Dreiseitel, E. Hänsler, A. Mader, B. Nitsch, H. Puder, T. Schertler, G. Schmidt, and J. Tilp. Acoustic echo control. IEEE Signal Processing Magazine, 16(4): 42-69, July 1999]. Because the echo path is generally unknown and may, moreover, change during operation, the linear filter of such an acoustic echo canceller is usually implemented adaptively. To model typical echo paths, FIR filters with lengths of up to several hundred milliseconds are used, which, at the respective sampling rate, implies a high level of computational complexity.
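To make the conventional approach concrete, the sketch below shows an adaptive FIR echo canceller that subtracts an echo replica from the microphone signal, together with the tap count implied by a given filter duration. The text does not name an adaptation rule. The normalised LMS (NLMS) update used here is a common textbook choice and is an assumption, as are the function names and the step size `mu`.

```python
import numpy as np


def fir_taps(duration_s, sample_rate_hz):
    """Number of FIR taps needed to span `duration_s` at the given rate."""
    return int(round(duration_s * sample_rate_hz))


def nlms_echo_canceller(x, y, num_taps, mu=0.5, eps=1e-8):
    """Adaptive FIR echo canceller with an NLMS update (assumed rule).

    x: loudspeaker signal x[n]; y: microphone signal y[n].
    Returns the echo-reduced signal e[n] and the final filter taps.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    w = np.zeros(num_taps)    # current estimate of the echo path
    buf = np.zeros(num_taps)  # most recent loudspeaker samples, newest first
    e = np.empty_like(y)
    for n in range(len(y)):
        buf = np.roll(buf, 1)
        buf[0] = x[n]
        echo_replica = w @ buf
        e[n] = y[n] - echo_replica  # subtract the replica from the mic signal
        w += mu * e[n] * buf / (buf @ buf + eps)
    return e, w
```

For example, `fir_taps(0.25, 16000)` gives 4000 taps. Every output sample then costs on the order of 4000 multiply-adds for filtering and as many again for the update, which illustrates why long echo paths make this approach computationally expensive.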

For various reasons, the echo attenuation achieved by these conventional approaches in practice is often insufficient. The reasons include long reverberation times (the echo tail effect), which lead to undermodeling of the echo path; nonlinear echo components caused, for example, by vibration effects or by the nonlinear behavior of low-cost audio hardware; and convergence problems when the echo path varies rapidly over time, as discussed in [A. N. Birkett and R. A. Goubran. Limitations of handsfree acoustic echo cancellers due to nonlinear loudspeaker distortion and enclosure vibration effects. In Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 13-16, New Paltz, Oct. 1995]. To remove the residual echo and the echo components that the acoustic echo canceller could not eliminate, echo cancellers are therefore combined with nonlinear post-processors, as described in [G. Schmidt and E. Hänsler. Acoustic echo and noise control: a practical approach. Hoboken: Wiley, 2004]. In most cases the residual echo is suppressed in a frequency-selective manner, as described in [W. L. B. Jeannes, P. Scalart, G. Faucon, and C. Beaugeant. Combined noise and echo reduction in hands-free systems: a survey. IEEE Transactions on Speech and Audio Processing, 9(8): 808-820, Nov. 2001]. In fact, virtually all acoustic echo cancellers are fitted with such post-processors, because on their own they too often fail to reduce the echo to the point where it becomes inaudible.

Recently, several acoustic echo suppressors operating in the subband domain were proposed in [C. Faller and J. Chen. Suppressing acoustic echo in a sampled auditory envelope space. IEEE Trans. on Speech and Audio Proc., 13(5): 1048-1062, Sept. 2005] and [C. Faller and C. Tournery. Estimating the delay and coloration effect of the acoustic echo path for low complexity echo suppression. In Proc. Intl. Works. on Acoust. Echo and Noise Control (IWAENC), Sept. 2005]. They resemble the nonlinear post-processors mentioned above but need neither an acoustic echo canceller nor an estimate of the echo path impulse response. These systems are reported to have low computational complexity, to be robust, and to achieve a high degree of duplexity.

In [C. Faller and C. Tournery. Estimating the delay and coloration effect of the acoustic echo path for low complexity echo suppression. In Proc. Intl. Works. on Acoust. Echo and Noise Control (IWAENC), Sept. 2005], an acoustic echo suppression algorithm is proposed that uses the short-time Fourier transform (STFT) to compute the spectra of the loudspeaker and microphone signals. A delay d between the two STFT-domain signals is applied to the loudspeaker signal; d is chosen so that the dominant part of the echo path impulse response is accounted for.

A real-valued echo estimation filter is then estimated, which mimics the effect of the initial echo path. To obtain an estimate of the echo magnitude spectrum, the estimated delay and the echo estimation filter are applied to the loudspeaker spectrum. From this estimated echo magnitude spectrum a real-valued echo suppression filter is computed and applied to the microphone spectrum to suppress the echo.
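The processing chain just described (delay, echo estimation filter, echo magnitude estimate, suppression filter) can be sketched roughly as follows. This is only an illustration under stated assumptions: the cited paper's exact gain rule is not reproduced in this description, so a generic spectral-subtraction gain is substituted, and the names `suppress_echo`, `echo_gain`, `beta` and `floor` are hypothetical.

```python
import math


def suppress_echo(mic_frames, spk_frames, delay, echo_gain, beta=1.0, floor=0.1):
    """Suppress echo in STFT-domain microphone frames (illustrative sketch).

    mic_frames, spk_frames: lists of frames, each a list of complex
        spectral coefficients (one per frequency band).
    delay: assumed delay d between loudspeaker and microphone, in frames.
    echo_gain: real-valued echo estimation filter, one gain per band.
    beta, floor: hypothetical over-subtraction factor and minimum gain.
    """
    num_bands = len(echo_gain)
    output = []
    for k, mic in enumerate(mic_frames):
        # Delayed loudspeaker frame; silence before the first delayed frame.
        spk = spk_frames[k - delay] if k >= delay else [0j] * num_bands
        frame = []
        for m in range(num_bands):
            # Estimated echo power: echo estimation filter applied to the
            # magnitude of the delayed loudspeaker spectrum.
            echo_power = (echo_gain[m] * abs(spk[m])) ** 2
            mic_power = abs(mic[m]) ** 2
            if mic_power > 0.0:
                # Assumed spectral-subtraction gain, real-valued.
                gain = math.sqrt(max(mic_power - beta * echo_power, 0.0) / mic_power)
            else:
                gain = 0.0
            # The floor limits how far a band can be attenuated.
            frame.append(max(gain, floor) * mic[m])
        output.append(frame)
    return output
```

Because the gain is real-valued, only the magnitude of each microphone coefficient changes; its phase is left untouched.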

A drawback of the acoustic echo suppression systems mentioned above is that they do not perform well for loudspeaker signals that mix stationary and non-stationary components. This occurs, for example, when far-end speech is recorded in a noisy environment. The loudspeaker signal and the echo signal then contain the non-stationary far-end speech together with the stationary far-end background noise.

Current acoustic echo suppression systems use only a single echo suppression filter for the loudspeaker signal. These approaches therefore ignore the fact that echo components with different properties cause different kinds of disturbance in the near-end signal, and that these disturbances call for different treatment.

Fig. 2 shows a first embodiment of an apparatus 200 for computing filter coefficients for an adaptive filter 210. Like the associated method, it separates stationary and non-stationary signals in order to improve echo suppression and hence the perceived audio quality. Embodiments of the present invention thus allow signals to be suppressed in different ways according to their statistical properties, which gives more effective echo suppression that is less prone to artifacts.

Before the embodiments shown in Figs. 3-5 are described in detail, the block diagram of the apparatus 200 is discussed first. Note that the illustrations and descriptions of the block diagrams of the apparatuses proposed in the invention also serve as flowcharts of the corresponding methods. In other words, each block diagram of an apparatus corresponds to a flowchart of the proposed method that shows the sequence of operations carried out by the individual components.

Within this description, identical or similar reference numerals denote devices, blocks and circuits that are identical or similar in function, purpose or design. Conversely, devices, blocks and circuits bearing identical or similar reference numerals have identical or similar structural and functional features. This keeps the description concise: unless explicitly stated otherwise, the description of one embodiment also serves to explain another.

In addition, summarizing reference numerals are used in this description for devices, blocks and circuits that occur more than once in a figure. For example, the two indirect paths 180-1 and 180-2 in Fig. 1 have different reference numerals, but when the indirect paths are referred to as such, or when their common features are discussed, only the summarizing reference numeral 180 is used. This, too, keeps the description concise and easier to follow.

The apparatus 200 of Fig. 2 has an input 220 for the signal of a loudspeaker that is not shown in Fig. 2. Through input 220 the signal reaches a time-frequency converter 230, drawn with dashed lines in Fig. 2 because it is an optional component of the apparatus 200. From the time-frequency converter 230 the signal may pass to a first echo estimation filter 240, which is likewise optional. The output of the first echo estimation filter 240 is coupled to an input 250a of the extractor 250, whose first output 250c and second output 250d may in turn be coupled to a first input 260a and a second input 260b of an optional second echo estimation filter 260. Whether this second filter is present depends on the specific implementation. The apparatus 200 may thus be built with both echo estimation filters 240 and 260, with only one of them, or with neither. Implementations involving further components are also possible.

If the second echo estimation filter 260 is present, its first output 260c and second output 260d are coupled to a first input 270a and a second input 270b of the computer 270, which computes the filter coefficients for the adaptive filter 210. The computer 270 is coupled via an output 270d to an input of the adaptive filter 210.

A microphone signal may be supplied from an input 280 to a further input of the adaptive filter 210 via an optional time-frequency converter 290. The output of the adaptive filter 210 may be coupled via an optional frequency-time converter 300 to an output 310 for the microphone signal. In addition, input 280 may optionally be coupled via the time-frequency converter 290 to a second input 250b of the extractor 250 and to a third input 270c of the computer 270. Both inputs, 250b of the extractor 250 and 270c of the computer 270, are optional and may be implemented independently of each other in different embodiments of the present invention.

For example, the apparatus 200 may be included in the echo removal unit 150 shown in Fig. 1.

Before the operation of the apparatus 200 of Fig. 2 is discussed in more detail, it should be noted that embodiments of the invention may in general be built from discrete circuits, from integrated circuits, or from other, more complex circuits. In particular, the invention may be implemented in data processing devices such as processors, integrated systems (SOC = system on chip), application-specific integrated circuits (ASICs), other integrated circuits, and special-purpose processors. In such configurations, identical parts of the data processing device may be used in temporal succession by different devices. For example, the same logic gate in the arithmetic logic unit (ALU) of a microprocessor may be used first for the function of the extractor 250 and then for the function of the computer 270. The devices nevertheless differ considerably from one another: in the case just mentioned, for instance, they require different control commands, which taken together define each device. A partial or complete overlap of the circuitry that implements different devices is therefore permissible.

Largely for this reason, devices, blocks and circuits described here as coupled are to be understood as coupled either directly or indirectly. In an implementation based on a data processing device, for example, the coupling may take place through a memory location in which an intermediate result is latched as a signal.

Embodiments of the present invention are not, however, limited to digital implementations, even though mainly digital implementations are described below. An analog implementation or a mixed analog/digital implementation is possible in principle. In such cases, additional ADCs or DACs (analog-to-digital or digital-to-analog converters) are used to convert signals from one form to the other.

Depending on how the apparatus 200 shown in Fig. 2 is used, the loudspeaker signal at input 220 may be converted to the frequency domain by the optional time-frequency converter 230. The converter outputs a spectral representation of each data block (frame) of the time-domain signal. Depending on the implementation, the time-frequency converter 230 may be a Fourier-transform-based converter, a subband-based converter, or a QMF-based converter (QMF = quadrature mirror filter). Whatever its implementation, the time-frequency converter 230 splits the time-domain signal it receives into a plurality of band signals. Each band signal has a characteristic frequency, which may be the center frequency, the lower cutoff frequency, or the upper cutoff frequency of the band concerned. Depending on the implementation, a band signal may have more than one characteristic frequency or may be characterized by further parameters.
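As a rough illustration of one of the options named above, a Fourier-based time-frequency converter can be sketched as a plain discrete Fourier transform of a single frame. The function name `frame_to_bands`, the absence of windowing and the choice of the bin center as characteristic frequency are assumptions made for this sketch; a practical converter would typically use a windowed FFT or a filter bank.

```python
import cmath


def frame_to_bands(frame, sample_rate):
    """Convert one time-domain frame into band signals (illustrative sketch).

    Returns a list of (characteristic_frequency_hz, coefficient) pairs for
    the non-negative frequency bins of a length-N DFT. The characteristic
    frequency chosen here is the center frequency of each bin.
    """
    n = len(frame)
    bands = []
    for m in range(n // 2 + 1):
        # Direct DFT sum for bin m (O(N^2) overall, kept simple on purpose).
        coeff = sum(
            x * cmath.exp(-2j * cmath.pi * m * i / n)
            for i, x in enumerate(frame)
        )
        bands.append((m * sample_rate / n, coeff))
    return bands
```

For a frame of 8 samples at 8000 Hz this yields 5 band signals spaced 1000 Hz apart.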

The first echo estimation filter 240 offers a way of modeling the acoustic environment 120 (Fig. 1), so that its output ideally carries a signal with an estimated magnitude spectrum corresponding to the signal that the loudspeaker signal produces in the microphone signal. As explained above, however, the first echo estimation filter 240 is optional and need not be implemented.

The loudspeaker signal, or a signal derived from it by conversion and filtering in the optional components 230 and 240, is then supplied to the first input of the extractor 250. The extractor 250 extracts a stationary component and a non-stationary component from the loudspeaker signal or from the derived signal. The stationary component may be computed, for example, by averaging the input signal, as described below.
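One simple way to average the input signal is a recursive (exponential) average of the power in each band over successive frames. The sketch below assumes this particular form of averaging, which is not specified in this passage; the name `stationary_power` and the smoothing constant `alpha` are hypothetical.

```python
def stationary_power(power_frames, alpha=0.9):
    """Estimate the stationary component by averaging (illustrative sketch).

    power_frames: list of frames, each a list of per-band power values.
    alpha: hypothetical smoothing constant in [0, 1); larger values mean
        slower tracking and a smoother estimate.
    Returns one averaged frame per input frame.
    """
    # Start the average from the first frame.
    average = list(power_frames[0])
    estimates = [list(average)]
    for frame in power_frames[1:]:
        # First-order recursive average, computed separately for each band.
        average = [alpha * a + (1.0 - alpha) * p for a, p in zip(average, frame)]
        estimates.append(list(average))
    return estimates
```

A short burst in one frame moves the estimate only slightly, whereas a constant background level is reproduced unchanged, which is the behavior expected of a stationary component estimate.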

Depending on the implementation, this signal may be an estimate that deviates from the "true" stationary component. The non-stationary component may accordingly be determined from the stationary component, for example by means of a gain filter, which is not shown in Fig. 2.
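A gain filter of this kind might, for instance, scale the input power in each band by the fraction that is not explained by the stationary estimate. The rule below is an assumption chosen for illustration, not the gain filter defined in the patent; `nonstationary_power` and `floor` are hypothetical names.

```python
def nonstationary_power(input_power, stationary_power, floor=0.0):
    """Derive the non-stationary component with a gain filter (sketch).

    input_power, stationary_power: per-band power values of one frame.
    floor: hypothetical lower bound on the gain.
    """
    result = []
    for p_in, p_stat in zip(input_power, stationary_power):
        if p_in <= 0.0:
            # Nothing to extract from an empty band.
            result.append(0.0)
            continue
        # Assumed gain: share of the band power above the stationary level.
        gain = max(1.0 - p_stat / p_in, floor)
        result.append(gain * p_in)
    return result
```

With `floor=0.0` the result is simply the input power minus the stationary power, clipped at zero.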

In other implementations, the extractor 250 may use a different measure of stationarity.

The non-stationary component may also be determined, for example, by evaluating how the input signal changes over time. In addition, if a voice codec is used in the apparatus 200 or in its environment, the extractor 250 may rely on a prediction measure to extract at least one of the two signals. Such a prediction measure may be, for example, the error signal of an LPC codec (LPC = linear predictive coding).

The extractor 250 has the two outputs mentioned above, which may carry different signals depending on the implementation. As a rule, at least one of the stationary and non-stationary components is provided at the first output of the extractor 250. The second output carries either the other component or a signal containing information about the signal at the first output. This may be, for example, parameters for the further processing of that signal in the computer 270, or simply a control signal indicating which of the two components is being transmitted.

In function, the optional second echo estimation filter 260 generally corresponds to the first echo estimation filter 240. If it is designed to perform the echo estimation, the second echo estimation filter 260 estimates, from the loudspeaker signal at input 220, the signal that the microphone would pick up if no other sound sources were present. Either echo estimation filter, 240 or 260, may optionally include a delay device that accounts for the delay with which the loudspeaker echo reaches the microphone. In other words, filters 240 and 260 may also delay the loudspeaker signal, or the signal derived from it, either through an additional delay device or through their internal structure. The echo estimation and the delay may also be split between the filters, for example by using the first echo estimation filter 240 only to delay the signal and the second echo estimation filter 260 to perform the actual echo estimation.

The signals from the second echo estimation filter 260 then pass to the computer 270, which computes or determines the filter coefficients for the adaptive filter 210 from the extracted stationary component or the extracted non-stationary component. Depending on the implementation, the computer 270 may also use the microphone signal from input 280 or the microphone signal converted to the frequency domain. This signal may also be made available to the extractor 250, as explained in more detail below.

The adaptive filter 210, which receives the filter coefficients from the computer 270, then modifies the spectrum of the microphone signal and outputs a version of it in which the echo is at least partly suppressed, ready for further processing. Depending on the implementation, the echo-suppressed, spectrally modified microphone signal may be converted back to the time domain by the frequency-time converter 300 or passed directly to output 310. Conversion back to the time domain may be unnecessary if, for example, the microphone signal is encoded in the frequency domain or in a related domain.

Before the components of the block diagram of the apparatus 200 in Fig. 2 are discussed in more detail with reference to Figs. 3A-5E, it should be noted that the processing of the loudspeaker signal, or of the signal derived from it, may largely take place in the frequency domain, operating on a single associated band signal, on several band signals, on many band signals, or on all band signals.

It should also be noted that, depending on the specific implementation, individual devices and filters may operate on energy-related values. An energy-related value is the result of raising a real base value to an even power, or of raising a magnitude (absolute value) to any power. For example, where short-time spectra are processed in individual filters or devices, these may operate on energy-related values, in particular on energy values formed as the squared magnitudes of the corresponding spectral coefficients. Likewise, magnitude spectra, that is, the absolute values of the corresponding spectral coefficients, may be used with exponent 1. In other words, values proportional to |z|^m, where m is a positive number, for example a natural number, may be used as energy-related values for any real or complex value z. If z is real, values proportional to z^(2m) may additionally be used as energy-related values.
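As a small worked illustration of this definition, the sketch below computes values of the form |z|^m for a few spectral coefficients. The function name `energy_related_values` and the sample coefficients are hypothetical and serve only as an example.

```python
def energy_related_values(coeffs, m=2):
    """Return |z|**m for each spectral coefficient z (real or complex)."""
    if m <= 0:
        raise ValueError("m must be positive")
    return [abs(z) ** m for z in coeffs]


spectrum = [3 + 4j, -2.0, 1j]  # hypothetical short-time spectrum
energies = energy_related_values(spectrum, m=2)    # squared magnitudes
magnitudes = energy_related_values(spectrum, m=1)  # magnitude spectrum
```

With m = 2 the function yields energy values (squared magnitudes); with m = 1 it yields the magnitude spectrum.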

FIG. 3A is a block diagram of an extractor 250 according to an embodiment of the present invention, as may be used in the device 200. The extractor 250 has only a first input 250a, which may be connected, for example, to the output of the optional first echo pre-analysis filter 240 shown in FIG. 2. The extractor 250 of FIG. 3A has no second input (input 250b in FIG. 2).

An averager 320 is connected to the first input 250a of the extractor 250 and determines the mean value of the signal present at the input 250a. The term "signal" here covers not only signals in the time domain (time signals) but also signals in the frequency domain, where the respective signals are spectral representations of time-domain signals. Signals may likewise comprise and convey information derived from the above signals, such as magnitude values in the frequency domain (energy spectrum), energy values (squared magnitudes), spectra, and other derived values and quantities.

Within the extractor 250 of FIG. 3A, the signal passed from the input 250a to the averager 320 is output by the averager as the stationary component signal at the first output 250c. As shown in FIG. 2, the first output 250c is connected to the optional second echo pre-analysis filter 260 and/or to the calculator 270.

Within the extractor 250, the signal received at the first input 250a is also fed, together with the stationary component signal from the averager 320, to a gain filter 330, which produces the non-stationary component signal and passes it to the second output 250d. The gain filter 330 determines the non-stationary component signal from the loudspeaker signal, or the signal derived from it, received at the first input 250a and from the stationary component signal. The functions of the averager 320 and the gain filter 330 are discussed in more detail below in the context of the description of FIG. 2.
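The interplay of the averager 320 and the gain filter 330 can be sketched as follows. This passage does not specify the averaging rule or the gain rule, so the sketch assumes a first-order recursive average per band and a gain equal to the share of the current power that exceeds that average. The function name `extract_components` and the parameters `alpha` and `floor` are hypothetical.

```python
def extract_components(power_frames, alpha=0.9, floor=0.0):
    """Split per-band power values into stationary and non-stationary parts.

    power_frames: list of frames, each a list of per-band power values.
    alpha: smoothing factor of the recursive average (assumption).
    floor: lower limit applied to the gain (assumption).
    """
    if not power_frames:
        return [], []
    stationary, nonstationary = [], []
    avg = list(power_frames[0])  # initialise the average with the first frame
    for frame in power_frames:
        # Averager 320 (sketch): first-order recursive mean per band.
        avg = [alpha * a + (1.0 - alpha) * p for a, p in zip(avg, frame)]
        # Gain filter 330 (sketch): keep the share of power above the mean.
        gains = [max((p - a) / p, floor) if p > 0 else floor
                 for p, a in zip(frame, avg)]
        stationary.append(list(avg))
        nonstationary.append([g * p for g, p in zip(gains, frame)])
    return stationary, nonstationary


# Hypothetical single-band input: constant power followed by a sudden rise.
stat, nonstat = extract_components([[4.0], [4.0], [12.0]], alpha=0.5)
```

In this example the constant part of the input ends up in the stationary output, while the sudden rise in the last frame appears in the non-stationary output.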

FIG. 3B shows a possible variant of the extractor 250 for use in the device 200. The extractor 250 of FIG. 3B differs from that of FIG. 3A in that it includes a parameter calculator 340, whose input is likewise connected to the first input 250a. The control parameters generated by the parameter calculator 340 are passed from its output to the gain filter 330 for computing the non-stationary component signal. Details of its operation are discussed below.

The extractor 250 shown in FIG. 3B optionally also has the second input 250b already shown in FIG. 2. This input is connected to a further input of the parameter calculator 340 on the one hand and, indirectly, to the input terminal 280 for the microphone signal on the other, as is also shown in FIG. 2. The indirect connection may be made via the time-frequency converter 290. The operation of the parameter calculator 340 is also discussed below.

FIG. 3C shows a further possible embodiment of the extractor 250 for use in the device 200 of FIG. 2. The extractor 250 of FIG. 3C is based on the design of FIG. 3B, with the parameter calculator 340 and its connections included as an option. Unlike the extractor 250 of FIG. 3B, the extractor 250 of FIG. 3C contains a distributor 350 with two inputs, one connected to the output of the averager 320 and the other to the output of the gain filter 330. The distributor 350 thus receives the stationary component from the averager 320 and the non-stationary component from the gain filter 330.

The two outputs of the distributor 350 are connected to the first output 250c and to the second output 250d of the extractor 250. The distributor 350 decides which of the two components it receives is passed on through the output 250c for further processing. Depending on which of the two components is chosen, the distributor 350 generates a control signal and outputs it at the second output 250d of the extractor 250. The control signal may contain, for example, information about which of the two component signals is present at the first output 250c, or parameters needed for further processing of that component. The parameters that the control signal may contain are discussed below.

Depending on the specific implementation, the distributor 350 may pass to the first output 250c the component signal that has the higher loudness, the higher energy, or the larger energy value compared with the other component signal. Where appropriate, different components may be output for different bandpass signals.
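One way to picture this per-band choice is the sketch below, which forwards, for each band, the component with the larger energy value and records the choice as a control signal. The function name `distribute` and the control codes `STATIONARY` and `NON_STATIONARY` are hypothetical; the tie-breaking rule in favour of the stationary component is an assumption.

```python
STATIONARY, NON_STATIONARY = 0, 1  # hypothetical control-signal codes


def distribute(stationary, nonstationary):
    """Per band, forward the component with the larger energy value.

    Returns (selected, control): `selected` models the first output 250c,
    `control` models the control signal at the second output 250d.
    """
    selected, control = [], []
    for s, n in zip(stationary, nonstationary):
        if n > s:
            selected.append(n)
            control.append(NON_STATIONARY)
        else:  # ties go to the stationary component (assumption)
            selected.append(s)
            control.append(STATIONARY)
    return selected, control


# Hypothetical two-band example.
selected, control = distribute([2.0, 1.0], [0.5, 3.0])
```

Here the first band forwards the stationary component and the second band the non-stationary one, so different components are output for different bands.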

The extractor 250 of FIG. 3C therefore differs from the versions shown in FIGS. 3A and 3B mainly in that it provides only one of the two component signals at the first output 250c. As already explained in connection with FIG. 2, the extractor 250 of FIG. 3C otherwise outputs, at the second output 250d, only a control signal that carries information about the component signal present at the first output 250c.

FIG. 4A shows a first embodiment of the second echo pre-analysis filter 260, which is likewise shown as optional in FIG. 2. The optional second echo pre-analysis filter 260 comprises two filter stages 360-1, 360-2, whose inputs are connected separately to the inputs 260a and 260b respectively. The outputs of the two filter stages 360-1, 360-2 are connected separately to the two outputs 260c and 260d, respectively, of the second echo pre-analysis filter 260.

The optional second echo pre-analysis filter 260 shown in FIG. 4A may be used together with the extractor 250 shown in FIGS. 3A and 3B. More specifically, the second echo pre-analysis filter 260 of FIG. 4A allows the stationary component signal to be processed by the filter stage 360-1 and, in parallel, the non-stationary component signal to be processed by the filter stage 360-2. The two filter stages 360-1, 360-2 may be identical or different, depending on whether the same or different echo pre-analysis filters are used for modelling the acoustic environment 120 (FIG. 1) with respect to the stationary and non-stationary signal components. Both filter stages 360-1, 360-2 may also be implemented on the same filter elements, in which case one of the signals is latched or buffered.

As already noted in the discussion of FIG. 2 with regard to the two echo pre-analysis filters 240, 260, the filter stages 360 may likewise be used, for example, merely to introduce a delay. Naturally, filter stages other than those described above may also be used in implementing the second echo pre-analysis filter 260. For example, a filter stage 360 may have an additional input for a control signal that influences the filtering.

FIG. 4B shows a further embodiment of the second echo pre-analysis filter 260. It differs from the version of FIG. 4A in that it contains only one filter stage 360, connected between the first input 260a and the first output 260c. In the embodiment shown in FIG. 4B, the signal received at the second input 260b is passed directly to the second output 260d.

The second echo pre-analysis filter 260 shown in FIG. 4B can therefore be combined, for example, with the extractor 250 shown in FIG. 3C. In this case the control signal, which carries information about the component signal present at the first input 260a, is not modified by the echo pre-analysis filter 260.

Naturally, the echo pre-analysis filter of FIG. 4B may also be combined with the extractor 250 of FIGS. 3A and 3B when, for example, only one of the two component signals is to be modified by the filter stage 360. A mirrored version of the echo pre-analysis filter 260, which filters the signal received at the second input 260b, may also be used.

FIG. 5A shows a calculator 270 according to an embodiment of the present invention, as may be used in the device 200 of FIG. 2. The calculator 270 again has a first input 270a and a second input 270b. It further comprises first and second filter calculators 370-1, 370-2, whose inputs are connected to the inputs 270a and 270b respectively. More specifically, the input of the first filter calculator 370-1 is connected to the first input 270a to receive, for example, the stationary component signal. Correspondingly, the second filter calculator 370-2 is connected to the second input 270b to receive the non-stationary component signal, for example from the extractor 250 shown in FIG. 3A or 3B. If the second echo pre-analysis filter 260 is connected between the extractor 250 and the calculator 270, signals derived from the respective component signals are supplied to the two filter calculators 370.

The outputs of the two filter calculators 370 are connected to a combiner 380, whose output is in turn connected to the output 270d. The calculator 270 shown in FIG. 5A optionally has a third input 270c, which within the calculator 270 is connected to both filter calculators 370 and which, as also shown in FIG. 2, is connected directly or indirectly to the input terminal 280 for the microphone signal.

As regards the operation of the calculator 270, the two filter calculators 370 compute the respective filter coefficients from the component signals they receive, optionally taking into account the microphone signal from the input terminal 280, and then pass these coefficients to the combiner 380. If the component signals have been modified by the second echo pre-analysis filter 260, the two filter calculators 370 perform the corresponding computations on the signals derived from the respective component signals. In either case, the filter calculators 370 compute first and second filter coefficients, respectively, on the basis of the component signals provided by the extractor 250.

The first and second filter coefficients computed in this way are then combined by the combiner 380 into one set of filter coefficients, which is supplied as input data to the adaptive filter 210 through the output 270d of the calculator 270. This combination may be performed using a variety of different operations. The possible ways of combining the first and second filter coefficients into the filter coefficients of the adaptive filter 210 depend on the specific filter implementation and, to a large extent, on the time-frequency converters 230, 290 and the associated frequency-time converter 300 that are used. Corresponding examples are given below.

FIG. 5B shows a second calculator 270, similar to the calculator of FIG. 5A. It differs from the calculator 270 of FIG. 5A in that the combiner 380 of FIG. 5A is replaced by a selector 390. The selector 390 outputs at the output 270d a set of filter coefficients that is based either on the first filter coefficients from the first filter calculator 370-1 or on the second filter coefficients from the second filter calculator 370-2. In other words, the selector 390 determines the filter coefficients of the adaptive filter 210 on the basis of either the stationary or the non-stationary component signal.

In this arrangement the selector 390 may well apply more complex mathematical relationships to the respective set of filter coefficients from one of the filter calculators 370. It differs from the combiner 380 of the calculator 270 of FIG. 5A, however, in that it takes into account only one of the two sets of filter coefficients output by the filter calculators 370.
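The difference between the combiner 380 of FIG. 5A and the selector 390 of FIG. 5B can be sketched as follows. This passage does not fix the combination rule, so the per-band minimum used here is only one conceivable choice and is an assumption, as are the function names `combine` and `select` and the boolean flag `use_nonstationary`.

```python
def combine(h_stat, h_nonstat):
    """Combiner 380 (sketch): use both sets, here via the per-band minimum."""
    return [min(a, b) for a, b in zip(h_stat, h_nonstat)]


def select(h_stat, h_nonstat, use_nonstationary):
    """Selector 390 (sketch): base the output on exactly one of the two sets."""
    return list(h_nonstat if use_nonstationary else h_stat)


# Hypothetical per-band coefficients from filter calculators 370-1 and 370-2.
h1 = [0.8, 0.3, 1.0]
h2 = [0.5, 0.6, 0.9]
combined = combine(h1, h2)
chosen = select(h1, h2, use_nonstationary=True)
```

In this sketch the combined set draws on both inputs, whereas the selected set is a copy of one input only.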

FIG. 5C shows a further embodiment of the calculator 270. It differs from the calculator 270 of FIG. 5A in that it contains only a single filter calculator 370, which is connected to the first input 270a. In addition, the filter calculator 370 of the calculator 270 of FIG. 5C is connected to the second input 270b in order to receive through it the parameters needed to determine the filter coefficients. Optionally, the filter calculator 370 of FIG. 5C may also be connected to the third input 270c so that the filter coefficients can be computed taking the microphone signal into account.

The calculator 270 of FIG. 5C can therefore operate together with the extractor 250 shown in FIG. 3C and the second echo pre-analysis filter 260 shown in FIG. 4B. The parameters that the filter calculator 370 needs for its computation are supplied through the second output 250d of the extractor 250 and passed directly to the filter calculator through the second input 270b of the calculator 270. For this purpose the second input 270b is connected to a parameter input of the filter calculator 370, through which auxiliary parameters for computing the filter coefficients can be supplied.

Because the calculator 270 contains only a single filter calculator 370, neither a combiner nor a selector is needed.

FIG. 5D shows a version of the calculator 270 that is similar in structure and function to the one shown in FIG. 5C. Unlike the calculator 270 of FIG. 5C, the calculator of FIG. 5D additionally includes a parameter determiner 400 connected between the second input 270b and the parameter input of the filter calculator 370.

Unlike the calculator 270 of FIG. 5C, the calculator 270 of FIG. 5D can be used in a device 200 in which the extractor 250 outputs, through the second output 250d, a control signal indicating which of the two component signals it passes on through the first output 250c. If the parameters that the filter calculator 370 needs for computing the filter coefficients differ for the two component signals, or for the signals derived from them, then in a calculator 270 as shown in FIG. 5D the parameter determiner 400 can provide the appropriate parameters depending on which component signal is being passed on. The parameter determiner 400 may therefore be implemented, for example, as a memory or as a computing circuit. As a memory, it may take the form of read-only memory (ROM), non-volatile memory (NVM), or random access memory (RAM).
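A parameter determiner realised as a memory can be pictured as a simple lookup table indexed by the control signal. In the sketch below, the control-signal values, the parameter names `overestimation` and `gain_floor`, and all numbers are hypothetical placeholders chosen only for illustration.

```python
# Hypothetical parameter sets; the names and values are illustrative only.
PARAMETER_TABLE = {
    "stationary": {"overestimation": 2.0, "gain_floor": 0.1},
    "non_stationary": {"overestimation": 4.0, "gain_floor": 0.05},
}


def determine_parameters(control_signal):
    """Map the control signal to the parameter set for the filter calculator."""
    try:
        return PARAMETER_TABLE[control_signal]
    except KeyError:
        raise ValueError(f"unknown control signal: {control_signal!r}") from None


# The filter calculator would receive these at its parameter input.
params = determine_parameters("non_stationary")
```

A fixed table of this kind corresponds to the memory variant; a computing circuit would instead derive the parameters from the control signal at run time.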

FIG. 5E shows a further embodiment of the calculator 270, which contains two filter calculators 370-1 and 370-2; which of them is used depends on the component signal on which the filter coefficients of the adaptive filter 210 are to be based. Here the inputs of both filter calculators 370 are connected to the first input 270a. In addition, each of the two filter calculators 370 may optionally be connected to the third input 270c, and each is connected to an input of a distributor 410, whose output is coupled to the output of the calculator 270. The distributor 410 has an additional input for a control signal, which is connected to the second input 270b of the calculator 270.

The calculator 270 of FIG. 5E can thus compute a first set of filter coefficients with the first filter parameter calculator 370-1 and a second set with the second filter parameter calculator 370-2, both from the signals received at the first input 270a. Which of the two sets of filter coefficients is finally passed to the output 270d depends on the control signal that reaches the control input of the distributor 410 via the second input 270b. Depending on this control signal, the distributor 410 connects one of its two inputs to the output 270d.
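The selection logic of FIG. 5E can be illustrated with a minimal sketch. It is not taken from the patent: the function `select_coefficients`, the control values `"stationary"` and `"non_stationary"`, and the toy calculators are hypothetical names introduced only for illustration.

```python
def select_coefficients(component_power, control, calculators):
    """Run every filter parameter calculator, then let the control signal pick one result.

    component_power: per-bin power values received at the first input (270a).
    control: key naming the component signal currently delivered (second input 270b).
    calculators: dict mapping a control value to a function returning filter coefficients.
    """
    # Both calculators (370-1 and 370-2) work on the same input signal.
    candidates = {name: calc(component_power) for name, calc in calculators.items()}
    # The distributor (410) forwards exactly one candidate set to the output (270d).
    return candidates[control]


# Toy calculators with constant gains; these values are assumptions, not patent data.
calculators = {
    "stationary": lambda p: [0.5 for _ in p],
    "non_stationary": lambda p: [0.1 for _ in p],
}
coeffs = select_coefficients([1.0, 2.0], "non_stationary", calculators)
```

Computing both candidate sets and selecting afterwards mirrors the figure; a software implementation could just as well evaluate only the calculator that the control signal selects.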

The calculator 270 of FIG. 5E can therefore operate, for example, together with an extractor 250 as shown in FIG. 3C, whose second output 250d supplies a control signal identifying the component signal delivered on the first output 250c. The calculator 270 of FIG. 5E is thus suited, for example, to cases where the computations that the filter parameter calculators 370 perform for the two component signals differ so much that one cannot be efficiently converted into the other merely by changing parameters.

It should be noted that the variants of the extractor 250 shown in FIGS. 3A to 3C, the variants of the echo estimation filter 260 shown in FIGS. 4A and 4B, and the variants of the calculator 270 shown in FIGS. 5A to 5E can be combined with one another as the application requires. For example, if the device that selects which filter parameter calculator 370 feeds the output 270d also performs further operations on the filter coefficients, such as additional computations, it can be used in the calculator of FIG. 5E in place of the distributor 410.

The embodiments of the invention described above with reference to FIGS. 2 to 5 provide a novel technique that suppresses the stationary and non-stationary components of the acoustic echo separately. This is achieved by estimating the echo separately for the non-stationary and the stationary components of the loudspeaker signal. Two corresponding echo suppression filters are then computed, one for each type of signal. The echo suppression filters can be optimized individually to maximize echo suppression while minimizing artifacts and distortion of the signal at the near (transmitting) end.

The remainder of the description is organized as follows. First, a model of the loudspeaker signal is introduced. Based on this model, the stationary and non-stationary components are separated, which can be done on the basis of an estimate of the stationary components. The power spectra of the stationary and non-stationary echo components are then estimated with the echo estimation filters. Accordingly, some embodiments of the invention compute two echo suppression filters. Finally, the separation of stationary and non-stationary components can be adjusted according to how the echo suppression filters perform in practice.

As regards signal modeling, the estimate of the echo spectrum, or of the echo power spectral density, obtained with an echo estimation filter is usually not very accurate in practice, because only part of the true echo path length can be taken into account. To prevent residual echo caused by these inaccuracies, the echo suppression filters are tuned to suppress echo aggressively, so that residual echo is removed completely. This is achieved by overestimating the echo power spectral density and by smoothing over time, which favors small gain filter values.

When the loudspeaker signal contains stationary noise, the echo suppressor attempts to suppress the echo of that noise. With the aggressive echo suppression filters described above, this often attenuates not only the stationary noise echo but also the stationary noise and the speech at the near end.

The approach proposed here mitigates this problem by using two different echo suppression paths for stationary and non-stationary signals, as illustrated in FIG. 6.

FIG. 6 is a schematic block diagram of a device 200 according to the invention, which includes a loudspeaker 100 and a microphone 110. The signal x[n] is fed to the loudspeaker 100 and also to the extractor 250, which is also referred to as a stationarity discriminator. As explained with reference to FIG. 2, the extractor 250 has two outputs connected to the calculator 270. The calculator 270 additionally receives the signal y[n] of the microphone 110.

As already shown in FIG. 5A, the calculator 270 comprises a first filter parameter calculator 370-1 for the stationary component signal and a second filter parameter calculator 370-2 for the non-stationary component signal, both of which are provided at the outputs of the extractor 250. Both filter parameter calculators 370 additionally receive the microphone signal.

From the signals they receive, the filter parameter calculators 370 compute the filter coefficients H_w and H_s and pass them to the combiner 380, to which the output of each filter parameter calculator 370 is connected. The combiner 380 in turn supplies the adaptive filter 210 with filter coefficients computed or determined from the two coefficient sets H_w and H_s.

To obtain the echo-suppressed signal e[n] from the microphone signal y[n], the adaptive filter 210 also receives the microphone signal directly at its input and delivers the echo-suppressed signal e[n] at its output.

The adaptive filter 210 thus performs the echo suppression: each of the two filter parameter calculators 370 computes an echo suppression filter in the form of a set of filter coefficients, and the combiner 380 then merges the two into the echo suppression filter that is actually applied.

Note that FIG. 6 is a simplified block diagram; it omits, for example, any time-frequency converters and echo estimation filters that may be present.

Non-stationary (speech) echo should be suppressed aggressively to avoid disturbing residual echo. Stationary echo, which may originate from stationary noise in the loudspeaker signal, is usually suppressed less aggressively in order to avoid artifacts such as tonal distortion. To obtain a suitable model, the loudspeaker signal x[n] can be decomposed as

$$x[n] = x_s[n] + x_w[n] \qquad (1)$$

where x_s[n] models the non-stationary speech component and x_w[n] models the stationary noise. The discrete time index is denoted by n.

Applying the short-time Fourier transform (STFT) to both sides of the model (1) yields

$$X[k,m] = X_s[k,m] + X_w[k,m] \qquad (2)$$

where the frequency index m and the frame (data block) time index k are integers. The non-stationary and stationary contributions to the power spectral density |X[k,m]|² of the loudspeaker signal are denoted |X_s[k,m]|² and |X_w[k,m]|², respectively.

It is reasonable to assume that x_s[n] and x_w[n] are uncorrelated and have approximately zero mean. It follows that |X[k,m]|² is given by

$$|X[k,m]|^2 \approx |X_s[k,m]|^2 + |X_w[k,m]|^2 \qquad (3)$$
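The reason the power spectra add can be made explicit. Expanding the squared magnitude of the sum X[k,m] = X_s[k,m] + X_w[k,m] is a standard identity, shown here only as an illustration of the assumption above:

$$|X[k,m]|^2 = |X_s[k,m]|^2 + |X_w[k,m]|^2 + 2\,\mathrm{Re}\{X_s[k,m]\,X_w^{*}[k,m]\}$$

For uncorrelated, zero-mean components the expected value of the cross term is zero, so it is neglected and only the two power terms remain.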

The instantaneous power spectral density |X̂_s[k,m]|² of the non-stationary component x_s[n] of the loudspeaker signal can therefore be recovered by subtracting an estimate |X̂_w[k,m]|² of the power spectrum of the stationary component from the power spectrum |X[k,m]|² of the loudspeaker signal:

$$|\hat{X}_s[k,m]|^2 = |X[k,m]|^2 - |\hat{X}_w[k,m]|^2 \qquad (4)$$

In practice, |X̂_s[k,m]|² is estimated by filtering the power spectral density |X[k,m]|² of the loudspeaker signal:

$$|\hat{X}_s[k,m]|^2 = F_x[k,m]^2\,|X[k,m]|^2 \qquad (5)$$

The filter F_x[k,m], also called the gain filter, can be written in its generic form according to [W. Etter and G. S. Moschytz. Noise reduction by noise-adaptive spectral magnitude expansion. J. Audio Eng. Soc., 42: 341-349, May 1994] as

$$F_x[k,m] = \left[\frac{|X[k,m]|^{\gamma_x} - \beta_x\,|\hat{X}_w[k,m]|^{\gamma_x}}{|X[k,m]|^{\gamma_x}}\right]^{1/\gamma_x} \qquad (6)$$

where γ_x is an exponent and β_x is a control parameter that sets how strongly the stationary signal component is suppressed, to compensate for an estimate of that component that is too low or too high. For γ_x = 2 and β_x = 1, equations (5) and (6) reduce to the subtraction in equation (4). The separation of stationary and non-stationary components is illustrated in FIG. 8 for a frequency of 1 kHz.
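A minimal sketch of such a gain filter for a single time-frequency bin is given below. It assumes a spectral-subtraction form in which β_x times the γ_x-th power of the estimated noise magnitude is subtracted from the γ_x-th power of the loudspeaker magnitude, normalized, and raised to 1/γ_x. The function name `gain_filter`, the default values, and the clamping of negative differences to zero are assumptions made for illustration, not details taken from the source.

```python
def gain_filter(x_pow, noise_pow, beta=2.0, gamma=2.0):
    """Gain F_x for one bin, given |X|^2 and the stationary-noise estimate |X_w_hat|^2."""
    if x_pow <= 0.0:
        return 0.0  # nothing to scale in an empty bin
    x_mag_g = x_pow ** (gamma / 2.0)      # |X|^gamma
    n_mag_g = noise_pow ** (gamma / 2.0)  # |X_w_hat|^gamma
    # Assumption: negative differences are clamped to zero so the root stays real.
    ratio = max(x_mag_g - beta * n_mag_g, 0.0) / x_mag_g
    return ratio ** (1.0 / gamma)


# With gamma = 2 and beta = 1 the squared gain times |X|^2 equals |X|^2 - |X_w_hat|^2.
f = gain_filter(4.0, 1.0, beta=1.0, gamma=2.0)
xs_pow = f * f * 4.0
```

Raising β_x above 1 subtracts more than the estimated noise power, which trades a cleaner non-stationary estimate against the risk of removing part of the speech component.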

Equations (5) and (6) describe the functionality of the gain filter 330 within the extractor 250, as presented in connection with FIGS. 2 to 6.

The stationary noise can be estimated by updating the estimated short-time noise power spectral density |X̂_w[k,m]|² over time. In each frame k, the noise power spectrum is updated by single-pole averaging with two time constants in order to discriminate between speech and noise. A short attack time constant reflects that the current frame contains noise.

A long release time constant reflects that the current frame contains speech.

In practice, this is implemented according to

$$|\hat{X}_w[k,m]|^2 = \begin{cases} \mu_1\,|X[k,m]|^2 + (1-\mu_1)\,|\hat{X}_w[k-1,m]|^2, & \text{if } |X[k,m]|^2 > |\hat{X}_w[k-1,m]|^2 \\ \mu_2\,|X[k,m]|^2 + (1-\mu_2)\,|\hat{X}_w[k-1,m]|^2, & \text{otherwise} \end{cases} \qquad (7)$$

where µ₁ is the attack time constant and µ₂ is the release time constant. Strictly speaking, µ₁ and µ₂ in equation (7) are dimensionless parameters satisfying µ₁ < µ₂. Taking the sampling rate into account, however, they can be interpreted as, and are referred to as, the time constants mentioned above. As the proportionality relation (16) below confirms, the actual time constants are inversely proportional to these parameters. The attack time constant corresponding to µ₁ may be, for example, 10,000 ms = 10 s, whereas the release time constant, taking the sampling period into account, may be 10 ms.
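The two-constant single-pole averaging can be sketched as follows for one frequency bin. The sketch assumes that the attack parameter µ₁ is used whenever the current power exceeds the previous noise estimate and the release parameter µ₂ otherwise; the function names and the numeric values are illustrative assumptions.

```python
def update_noise_psd(prev_noise, x_pow, mu_attack, mu_release):
    """One single-pole update of the stationary-noise power estimate for one bin."""
    # Rising input (possible speech onset): follow slowly with the small attack parameter.
    # Falling input: follow quickly with the larger release parameter.
    mu = mu_attack if x_pow > prev_noise else mu_release
    return mu * x_pow + (1.0 - mu) * prev_noise


def track_noise(powers, initial, mu_attack, mu_release):
    """Run the update over a sequence of frame powers and return all estimates."""
    estimate = initial
    history = []
    for p in powers:
        estimate = update_noise_psd(estimate, p, mu_attack, mu_release)
        history.append(estimate)
    return history


# A short loud burst (speech-like) on top of a noise floor of power 1.0.
frames = [100.0] * 3 + [1.0] * 20
track = track_noise(frames, initial=1.0, mu_attack=0.01, mu_release=0.5)
```

Because µ₁ < µ₂, the estimate barely reacts to the burst and returns to the noise floor within a few frames afterwards, which is the behavior expected of a stationary-noise tracker.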

In the embodiments shown in FIGS. 2 to 6, the function described by equation (7) is performed by the averager 320 of the extractor 250.

For echo power estimation, an estimate of the echo spectrum can be obtained by applying the echo estimation filter G[k,m] to a delayed version of the loudspeaker power spectrum:

$$|\hat{Y}[k,m]|^2 = G[k,m]^2\,|X[k-d,m]|^2 \qquad (8)$$

where |Ŷ[k,m]|² denotes the estimate of the echo power spectral density in the microphone signal. Using equation (3), the echo originating from the non-stationary component of the loudspeaker signal is given by

$$|\hat{Y}_s[k,m]|^2 = G[k,m]^2\,|X_s[k-d,m]|^2 \qquad (9)$$

and the echo originating from the stationary component of the loudspeaker signal is given by

$$|\hat{Y}_w[k,m]|^2 = G[k,m]^2\,|X_w[k-d,m]|^2 \qquad (10)$$

Depending on the embodiment of the invention, for example those shown in FIGS. 2 to 5, the function described by equation (8) can be performed, for instance, by the first echo estimation filter 240. The functions expressed by equations (9) and (10) can be performed by the second echo estimation filter 260 with its two filter stages 360-1 and 360-2.
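The echo power estimate for one frequency bin can be sketched as below, assuming a real-valued echo estimation gain that is squared and applied to the power value delayed by d frames. The function name `estimate_echo_power` and the zero returned before enough history exists are assumptions for illustration.

```python
def estimate_echo_power(g, power_history, k, d):
    """Echo power estimate at frame k from the power value d frames earlier.

    g: echo estimation filter gain for this bin (assumed real-valued).
    power_history: list of power values for this bin, indexed by frame.
    """
    if k - d < 0:
        return 0.0  # assumption: no estimate before the delayed frame exists
    return g * g * power_history[k - d]


# The same function serves the total, non-stationary and stationary power spectra.
total_pow = [4.0, 8.0, 16.0]
echo_pow = estimate_echo_power(0.5, total_pow, k=2, d=1)
```

Passing the non-stationary or the stationary power history instead of the total one yields the two component echo estimates with the same gain and delay.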

As mentioned above, the delay of the signals by d used in equations (8) to (10) can likewise be implemented by the echo estimation filters 240, 260. Alternatively, the delay can be provided by the time-frequency converter 230, unless a separate dedicated delay unit is used for this purpose.

For echo suppression, the associated echo suppression filters H_s[k,m] and H_w[k,m] are computed and applied to the microphone signal on the basis of the non-stationary and stationary echo estimates |Ŷ_s[k,m]|² and |Ŷ_w[k,m]|².

An embodiment that includes, in particular, the calculator 270 shown in FIG. 5A can be implemented in practice according to

$$E[k,m] = H_s[k,m]\,H_w[k,m]\,Y[k,m] \qquad (11)$$

where Y[k,m] and E[k,m] denote the short-time spectra of the microphone signal y[n] and of the echo-suppressed signal e[n]. Here the combiner 380 multiplies the coefficients of the echo suppression filters H_s[k,m] and H_w[k,m], which is equivalent to connecting the two filters in series. Multiplying filter coefficients in the frequency domain corresponds to convolving the associated impulse responses in the time domain.
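The combination step can be sketched for one frame as a per-bin multiplication. The function names `combine_filters` and `apply_filter` and the example values are assumptions introduced for illustration.

```python
def combine_filters(h_s, h_w):
    """Per-bin product of the non-stationary and stationary echo suppression gains."""
    return [a * b for a, b in zip(h_s, h_w)]


def apply_filter(h, y_spectrum):
    """Scale each (possibly complex) microphone spectrum bin by its combined gain."""
    return [gain * y for gain, y in zip(h, y_spectrum)]


h = combine_filters([0.5, 1.0], [0.5, 0.25])
e_spectrum = apply_filter(h, [2.0 + 2.0j, 4.0 + 0.0j])
```

Since both gains are real and non-negative, the product changes only the magnitude of each bin and leaves the phase of the microphone spectrum untouched.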

Implementing the echo suppression filters in factorized form according to equation (11) makes it possible to use different gain settings for the different echo components. The non-stationary echo suppression filter can be computed, for example, according to

$$H_s[k,m] = \max\left\{\left[\frac{|Y[k,m]|^{\gamma_s} - \beta_s\,|\hat{Y}_s[k,m]|^{\gamma_s}}{|Y[k,m]|^{\gamma_s}}\right]^{1/\gamma_s},\ 10^{L_s/20}\right\} \qquad (12)$$

and the stationary echo suppression filter according to

$$H_w[k,m] = \max\left\{\left[\frac{|Y[k,m]|^{\gamma_w} - \beta_w\,|\hat{Y}_w[k,m]|^{\gamma_w}}{|Y[k,m]|^{\gamma_w}}\right]^{1/\gamma_w},\ 10^{L_w/20}\right\} \qquad (13)$$

The design parameters β_s, γ_s, β_w and γ_w can be used to control the intended behavior of each echo suppression filter. Depending on the implementation, these parameters may be fixed, or may be adaptable, programmable or otherwise modifiable. A typical choice for the exponents is, for example, γ_s = γ_w = 2.

The so-called overestimation factors β_s and β_w control how aggressively the echo is attenuated. For instance, an echo suppression filter can be made more aggressive by increasing its overestimation factor. The stationary echo suppression filter H_w[k,m] is therefore typically designed with β_w = 2, which yields only moderate echo attenuation.

The echo suppression filter for the non-stationary echo components, on the other hand, must be aggressive in order to attenuate disturbing speech components in the echo signal effectively. The overestimation factor β_s is therefore often chosen larger than β_w, so that β_s > β_w. For example, with β_w = 2, β_s may lie in the range 20 > β_s > 2 = β_w (for instance, β_s = 4). In most cases β_w and β_s are of the same order of magnitude.

The limit values L_s and L_w set the maximum permitted echo attenuation in decibels (dB). A typical value for the stationary echo suppression filter is L_w = -10 dB or -15 dB, which limits the attenuation of stationary echo and thereby reduces the likelihood of artifacts. When non-stationary speech is present at the far end, the disturbing echo components must be removed completely, which is achieved by setting the limit L_s to about -60 dB for the non-stationary component signal.
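The interplay of the overestimation factor, the exponent and the attenuation limit can be sketched for one time-frequency bin as follows. The sketch assumes that a limit of L dB corresponds to a minimum amplitude gain of 10^(L/20) and that the gain has a spectral-subtraction form; the function name `echo_suppression_filter` and the example powers are illustrative assumptions.

```python
def echo_suppression_filter(y_pow, echo_pow, beta, gamma=2.0, limit_db=-60.0):
    """Echo suppression gain for one bin, floored at the attenuation limit.

    y_pow: microphone power |Y|^2; echo_pow: estimated echo power for this bin.
    """
    floor = 10.0 ** (limit_db / 20.0)  # assumption: limit in dB maps to amplitude gain
    if y_pow <= 0.0:
        return floor
    y_g = y_pow ** (gamma / 2.0)
    e_g = echo_pow ** (gamma / 2.0)
    ratio = max(y_g - beta * e_g, 0.0) / y_g
    return max(ratio ** (1.0 / gamma), floor)


# Same microphone and echo powers, two settings taken from the typical values above.
h_w = echo_suppression_filter(1.0, 0.6, beta=2.0, limit_db=-10.0)  # moderate, stationary
h_s = echo_suppression_filter(1.0, 0.6, beta=4.0, limit_db=-60.0)  # aggressive, speech
```

In this example both filters hit their floor, so the stationary path attenuates by at most 10 dB while the non-stationary path attenuates by up to 60 dB.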

The functions described by equations (12) and (13) can be implemented in the filter parameter calculators 370 of the embodiments described with reference to FIGS. 2 to 6.

In some practical implementations the echo suppression filters are not applied directly as given by equations (12) and (13); instead, echo suppression is based on corresponding time-smoothed versions of them. Like the design parameters discussed above, the temporal smoothing parameters are usually tuned manually and optimized separately for non-stationary and stationary echo suppression. This further improves perceived audio quality, because the requirements for suppressing stationary noise components differ from those for non-stationary speech components.
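The source does not specify the smoothing rule. One common choice, shown here purely as an assumption, is a first-order recursive average of the gain over frames with a separate smoothing constant for each filter. The function name `smooth_gain` and the constants are hypothetical.

```python
def smooth_gain(prev_gain, current_gain, alpha):
    """First-order recursive smoothing; a smaller alpha gives heavier smoothing."""
    return alpha * current_gain + (1.0 - alpha) * prev_gain


# Assumed constants: heavy smoothing for the stationary path, light for the speech path.
ALPHA_W = 0.25
ALPHA_S = 0.75

h_w_smoothed = smooth_gain(1.0, 0.0, ALPHA_W)
h_s_smoothed = smooth_gain(1.0, 0.0, ALPHA_S)
```

With these values the stationary gain moves only a quarter of the way toward its new target per frame, which suppresses rapid gain fluctuations, while the non-stationary gain follows changes in echo level more closely.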

This functionality can be implemented, for example, in the filter parameter calculators 370 themselves or in any unit downstream of them, such as the combiner 380, the selector 390 or the distributor 410. If required, the temporal smoothing can also be performed directly by the adaptive filter 210.

With regard to audio quality, stronger smoothing should be applied when stationary signal components are suppressed, in order to avoid so-called musical tones, as described in [O. Cappé. Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor. IEEE Trans. Speech and Audio Processing, 2(2): 345-349, April 1994]. At the same time, the non-stationary echo removal filter should take low values so that the echo is attenuated sufficiently, including the echo contribution caused by reverberation of the echo path. This must not, however, reduce the ability of the adaptive filter 210 to adapt when echo levels change rapidly. These quality considerations show clearly that the echo removal filters defined by equations (12) and (13) need to be tuned and optimized individually.
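The separate smoothing of the two echo removal filters can be sketched as follows. This is a minimal illustration only: it assumes a first-order recursive smoother, and the function name `smooth_gains` and the constants 0.9 and 0.2 are hypothetical, not values from the patent.

```python
def smooth_gains(gains, gamma, initial=1.0):
    """First-order recursive smoothing over time for one frequency band.

    s[k] = gamma * s[k-1] + (1 - gamma) * g[k]
    A larger gamma means stronger smoothing (assumed form, not from the patent).
    """
    if not 0.0 <= gamma < 1.0:
        raise ValueError("gamma must lie in [0, 1)")
    state = initial
    smoothed = []
    for g in gains:
        state = gamma * state + (1.0 - gamma) * g
        smoothed.append(state)
    return smoothed


# Hypothetical raw filter gains in one band over six blocks.
raw = [1.0, 0.2, 0.9, 0.1, 1.0, 0.15]

# Strong smoothing for the stationary path (guards against musical tones),
# weak smoothing for the non-stationary path (keeps it responsive).
h_w_smoothed = smooth_gains(raw, gamma=0.9)
h_s_smoothed = smooth_gains(raw, gamma=0.2)
```

With the larger constant the stationary-path gain fluctuates far less from block to block, while the non-stationary-path gain still follows rapid changes in the echo level.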

Fig. 7 shows a more complete block diagram, or flow chart, of the acoustic echo attenuation algorithm, which is discussed below. The embodiment shown in Fig. 7 is largely similar to that of Fig. 2. Here too, the apparatus 200 includes a time/frequency converter 230 in the form of a short-time Fourier transform (STFT) unit, which receives the loudspeaker signal x[n].

By way of example, the upper part of Fig. 7 shows a curve 420 of the loudspeaker signal x[n] as a function of the time index n.

In addition to converting the signal x[n] from the time domain to the frequency domain, the time/frequency converter 230 applies the delay d mentioned above. The output of the time/frequency converter 230 is therefore the spectrum X[k-d,m], which is generally complex-valued. This spectrum X[k-d,m] is passed to the extractor 250, labelled SD (stationary discrimination) in Fig. 7. As already explained for the embodiments of Figs. 2 to 5, the extractor 250 provides, as shown in Fig. 7, a stationary component signal X_w[k,m] and a non-stationary component signal X_s[k,m] in the frequency domain. These component signals are supplied to the computer 270.

In addition, the apparatus 200 of Fig. 7 includes a time/frequency converter 290, likewise implemented as a short-time Fourier transform (STFT) unit. As shown by way of example in the upper part of Fig. 7, the time/frequency converter 290 receives at its input the microphone signal y[n], represented by the curve 430. The time/frequency converter 290 converts the microphone signal y[n] into the corresponding frequency-domain representation Y[k,m], where the index k again denotes the data block and the index m denotes the frequency band, frequency value or spectral coefficient. The spectrum Y[k,m] is also generally complex-valued.

Unlike the time/frequency converter 230, however, the time/frequency converter 290 does not apply an additional delay. This is mainly because no such delay is needed: sound waves propagate far more slowly than electrical signals travel through circuits and components, so the signal y[n] picked up by the microphone is already delayed relative to the corresponding loudspeaker signal x[n].

To illustrate this, a first brace 440 marks a segment of the loudspeaker signal x[n] in the curves 420 and 430 at the top of Fig. 7. In the same part of Fig. 7, a second brace 450 marks the portion of the microphone signal y[n] that corresponds to the segment of the loudspeaker signal x[n] marked by the brace 440. The loudspeaker signal x[n] and the microphone signal y[n] are thus offset from each other by the delay d, as indicated by the arrow 460 in Fig. 7.
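How the delay d between the two signals might be found can be sketched with a simple cross-correlation search. This is an illustrative assumption only: this passage does not state how d is determined, and the function name `estimate_delay` and its parameters are hypothetical.

```python
def estimate_delay(x, y, max_delay):
    """Return the lag d in 0..max_delay that maximizes sum_n x[n-d] * y[n].

    x: loudspeaker samples, y: microphone samples (time domain).
    A plain cross-correlation search, shown for illustration only.
    """
    best_delay = 0
    best_score = float("-inf")
    for d in range(max_delay + 1):
        stop = min(len(y), len(x) + d)
        score = sum(x[n - d] * y[n] for n in range(d, stop))
        if score > best_score:
            best_score = score
            best_delay = d
    return best_delay


# Toy example: the microphone picks up the loudspeaker signal
# three samples later and at half the amplitude.
x = [0.0, 1.0, 0.0, -1.0, 2.0, 0.0, 0.0, 0.0]
y = [0.0, 0.0, 0.0] + [0.5 * v for v in x[:5]]
d = estimate_delay(x, y, max_delay=5)
```

In this toy case the search recovers the three-sample offset; a practical system would typically work on longer signal segments and normalize the correlation.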

In accordance with the algorithm of Fig. 7, the spectra of the loudspeaker and microphone signals are supplied to an echo estimation filter 470, which determines from these signals the coefficients of the echo estimation filter or of its components. These filter coefficients are also passed to the computer 270.

The computer 270 in Fig. 7 in turn comprises two filter computers, 370-1 and 370-2, for the stationary and non-stationary signal components, each of which receives at its input the spectrum of the microphone signal and the coefficients of the echo estimation filter. The two filter computers 370 thus not only compute the filter coefficients, as described with reference to Figs. 2 to 5, but also perform the function of the second echo estimation filter 260.

Both filter computers 370, labelled ERF (echo removal filter) in Fig. 7, are coupled, as in Fig. 5A, to the combiner 380, labelled FC (filter combination) in Fig. 7. The combiner 380 combines the filter coefficients obtained from the two filter computers to produce the filter coefficients for the adaptive filter 210.

Further, as already described with reference to Figs. 2 and 5, the combiner 380 is coupled to the adaptive filter 210, labelled SM (spectral modification) in Fig. 7. The adaptive filter 210 modifies the spectral representation Y[k,m] of the microphone signal y[n] that it receives in order to attenuate or suppress the echo component of the microphone signal.

Finally, the adaptive filter 210 is coupled to a frequency/time converter 300 in the form of an inverse short-time Fourier transform (ISTFT) unit. This converter outputs the echo-suppressed signal e[n] in the time domain. Embodiments of the present invention in the form of corresponding methods or apparatuses 200, as shown for example in Fig. 7, make it possible to avoid artifacts introduced when the adaptive filter 210 modifies the spectrum. Put differently, the embodiments according to the invention provide adaptive power control. When only far-end speech is present, echo suppression should be aggressive enough that no signal passes, since in that situation separating non-stationary and stationary signals and signal components may not be desirable. For this reason, once such a situation is detected, it may be necessary to adapt the control parameter β_x from equation (6), which controls, or at least influences, the amplitude of the stationary component signal subtracted from the loudspeaker signal.

To detect a situation in which the loudspeaker output contains only far-end speech, two different parameters are computed. The first is the so-called prediction gain, which corresponds to a full-band average of the coherence functions between the loudspeaker channel and the microphone channel. The second is the voice activity in the loudspeaker channel, which may be obtained, for example, by comparing loudspeaker signal levels at different times, or taken from dedicated parameters of the speech codec used for voice transmission. The term "codec" is a contraction of "coder" and "decoder"; such codecs may be based, for example, on LPC (linear predictive coding) or CELP (code-excited linear prediction).

The prediction gain, or echo prediction gain, ω[k] describes the degree of similarity between the microphone signal and the delayed loudspeaker signal. The prediction gain ω[k] is computed from the squared coherence function between the delayed power spectrum of the loudspeaker signal |X_d[k,m]|² and the power spectrum of the microphone signal |Y[k,m]|² according to

$$\Gamma_d[k,m] = \frac{\left(E\{|X_d[k,m]|^2\,|Y[k,m]|^2\}\right)^2}{E\{|X_d[k,m]|^2\,|X_d[k,m]|^2\}\;E\{|Y[k,m]|^2\,|Y[k,m]|^2\}} \tag{14}$$

where E{…} denotes the mathematical expectation. Within a short-time estimate of the coherence function Γ_d[k,m], this expectation can be computed or approximated according to

$$E\{|X_d[k,m]|^2\,|Y[k,m]|^2\} \approx \alpha\,|X_d[k,m]|^2\,|Y[k,m]|^2 + (1-\alpha)\,E\{|X_d[k-1,m]|^2\,|Y[k-1,m]|^2\} \tag{15}$$

The factor α determines the degree of smoothing of the estimate over time. It can be related to a time constant, since equation (15) corresponds approximately to an exponential decay.

The time constant T_α of the exponential decay, in seconds, is approximately

$$T_\alpha \approx \frac{1}{\alpha f_s} \tag{16}$$

where f_s denotes the sampling frequency. In other words, the proportionality relation (16) shows how factors that are actually dimensionless (here α) can, with reference to the sampling frequency f_s, be expressed as a time constant (here T_α).
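As a worked illustration of relation (16), the sketch below assumes the form T_α ≈ 1/(α·f_s), with the rate taken as the rate at which the recursion is updated. The function names and the numbers 0.01 and 100 Hz are hypothetical. It also checks the exponential-decay reading: after roughly 1/α updates, the weight of an old value has fallen to about 1/e.

```python
import math


def time_constant_seconds(alpha: float, rate_hz: float) -> float:
    """Time constant implied by smoothing factor alpha (assumed form 1 / (alpha * rate))."""
    return 1.0 / (alpha * rate_hz)


def remaining_weight(alpha: float, num_updates: int) -> float:
    """Weight an old sample still carries after num_updates recursive updates."""
    return (1.0 - alpha) ** num_updates


alpha = 0.01      # hypothetical smoothing factor
rate_hz = 100.0   # hypothetical update rate of the recursion
t_alpha = time_constant_seconds(alpha, rate_hz)
updates_in_t_alpha = round(t_alpha * rate_hz)
weight = remaining_weight(alpha, updates_in_t_alpha)

print(f"T_alpha = {t_alpha:.3f} s, remaining weight = {weight:.3f}, 1/e = {math.exp(-1):.3f}")
```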

The prediction gain ω[k] is then computed as the mean of the coherence functions Γ_d[k,m] over the frequencies indexed by m = 0, …, M-1 according to

$$\omega[k] = \frac{1}{M}\sum_{m=0}^{M-1}\Gamma_d[k,m] \tag{17}$$

where M denotes the number of frequency bands. An echo prediction gain close to 1 indicates that the microphone signal can be (almost) completely predicted from the delayed loudspeaker signal. The probability that the microphone signal contains only far-end speech then tends to 1. The control parameter β_x can therefore be set as a function of the prediction gain ω. A high prediction gain indicates that only far-end speech is present, and the echo attenuation should be aggressive enough to remove all (echo) signals. The interference is thus kept within the non-stationary path and removed with the lowest limit value L_s in decibels (dB), since the control parameter is chosen as β_x = β_w = 0. A low prediction gain indicates that speech may be present at both the near end and the far end, and echo suppression should be less aggressive to avoid artifacts. In this case the interference is handled through the stationary path and removed with the limit value L_w in decibels (dB). Here the parameter β_x = β_w is used.
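The computation of the prediction gain described above can be sketched as follows. This is a minimal illustration under stated assumptions: the expectations are approximated by first-order recursive averages with factor α, the coherence is the usual squared coherence of the two power spectra, and the class name `PredictionGain` and the small constant `eps` are hypothetical.

```python
class PredictionGain:
    """Running estimate of the echo prediction gain over M frequency bands."""

    def __init__(self, num_bands, alpha, eps=1e-12):
        self.num_bands = num_bands
        self.alpha = alpha
        self.eps = eps  # guards against division by zero (assumption)
        self.cross = [0.0] * num_bands  # approximates E{|X_d|^2 |Y|^2}
        self.xx = [0.0] * num_bands     # approximates E{|X_d|^2 |X_d|^2}
        self.yy = [0.0] * num_bands     # approximates E{|Y|^2 |Y|^2}

    def update(self, x_pow, y_pow):
        """x_pow, y_pow: power spectra of the current block, one value per band.

        Returns the mean squared coherence over all bands.
        """
        a = self.alpha
        total = 0.0
        for m in range(self.num_bands):
            x, y = x_pow[m], y_pow[m]
            self.cross[m] = a * x * y + (1.0 - a) * self.cross[m]
            self.xx[m] = a * x * x + (1.0 - a) * self.xx[m]
            self.yy[m] = a * y * y + (1.0 - a) * self.yy[m]
            total += self.cross[m] ** 2 / (self.xx[m] * self.yy[m] + self.eps)
        return total / self.num_bands


# Microphone power is a scaled copy of the delayed loudspeaker power:
# the microphone signal is fully predictable, so the gain stays near 1.
echo_only = PredictionGain(num_bands=2, alpha=0.1)
for frame in ([1.0, 2.0], [3.0, 0.5], [2.0, 4.0]):
    omega_echo = echo_only.update(frame, [0.5 * p for p in frame])

# Microphone power varies independently of the loudspeaker power:
# the gain drops well below 1.
mixed = PredictionGain(num_bands=1, alpha=0.5)
for x_p, y_p in ((1.0, 4.0), (4.0, 1.0), (1.0, 4.0), (4.0, 1.0)):
    omega_mixed = mixed.update([x_p], [y_p])
```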

It should be noted, however, that the prediction gain can also be high when the loudspeaker signal contains only noise, which is picked up by the microphone in the absence of speech. To avoid choosing too large a value of the control parameter β_x, which could lead to excessive suppression, a second control quantity is used: the voice activity in the loudspeaker channel. In practice, the rules given above for computing the control parameter β_x as a function of the prediction gain ω are therefore applied only when voice activity is present in the loudspeaker channel.
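The control rule described in the last two paragraphs can be sketched as a small decision function. The threshold of 0.8 and the names `select_beta_x`, `far_end_voice_active` and `omega_threshold` are hypothetical; the text specifies only that the dependence on ω is applied when voice activity is present in the loudspeaker channel.

```python
def select_beta_x(omega, far_end_voice_active, beta_w, omega_threshold=0.8):
    """Choose the control parameter beta_x for the current block.

    omega: echo prediction gain in [0, 1]
    far_end_voice_active: True if speech is detected in the loudspeaker channel
    beta_w: value used when suppression should be less aggressive
    omega_threshold: hypothetical boundary between "high" and "low" gain
    """
    if far_end_voice_active and omega >= omega_threshold:
        # Far-end speech only: route everything through the non-stationary
        # path and suppress aggressively.
        return 0.0
    # Possible double talk, or loudspeaker noise without speech: use the
    # stationary path setting to avoid artifacts and over-suppression.
    return beta_w


beta_far_end_only = select_beta_x(0.95, True, beta_w=1.0)
beta_double_talk = select_beta_x(0.30, True, beta_w=1.0)
beta_noise_only = select_beta_x(0.95, False, beta_w=1.0)
```

The third call shows the safeguard named in the text: a high prediction gain caused by loudspeaker noise alone does not trigger the aggressive setting.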

In the embodiment of Fig. 7, the operations described by equations (14) to (17) are performed by the computer 270, including the two filter computers 370, and the combiner 380. The various embodiments shown in Figs. 2 to 5 allow the computer 270 to use not only the microphone signal received via the input 280, shown as optional in Fig. 2, but also the unmodified loudspeaker signal supplied via the input 220.

The signal processing performed by embodiments of the present invention is illustrated below in more detail with reference to Figs. 8 to 10.

Fig. 8 illustrates the separation, or extraction, of the stationary and non-stationary components of the loudspeaker signal. Panel (a) of Fig. 8 plots the power spectral density of the loudspeaker signal at a frequency of 1 kHz as a function of time over a range of approximately 5 s to 7.5 s. The abscissa shown in panel (c) applies to all three panels (a) to (c). Panel (b) plots the power spectral density of the non-stationary component, and panel (c) the corresponding power spectral density of the stationary component.

The non-stationary component, or the associated non-stationary component signal, shown in Fig. 8(b) exhibits high values wherever the power spectral density in Fig. 8(a) rises. Between these regions the non-stationary component vanishes almost completely.

In contrast to the non-stationary component in Fig. 8(b), the stationary component in Fig. 8(c), which is determined by the floating recursive averaging of equation (7), has clearly smaller amplitudes and, because of the averaging, a clearly smoother curve. In particular, at around 6.4 s the stationary component in Fig. 8(c), or the corresponding stationary component signal, shows an exponential or near-exponential decay, as mentioned in connection with the proportionality relation (16). This decay occurs because the power spectrum in Fig. 8(a) contains no large values, corresponding to speech, in that range. Spectral components that exceed the stationary component are removed accordingly.
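The behaviour seen in Fig. 8 can be imitated with a short sketch. It assumes that the stationary component is a first-order recursive average of the power spectrum and that the non-stationary component is what remains after subtracting β_x times that average, clipped at zero. Equations (3), (6) and (7) are not reproduced in this passage, so these forms, the function name `split_power` and the constants are assumptions.

```python
def split_power(power_frames, smoothing, beta_x):
    """Split one band's power sequence into stationary and non-stationary parts.

    stationary[k]     = smoothing * stationary[k-1] + (1 - smoothing) * p[k]
    non_stationary[k] = max(p[k] - beta_x * stationary[k], 0)
    (assumed forms, for illustration only)
    """
    stationary = []
    non_stationary = []
    average = power_frames[0] if power_frames else 0.0
    for p in power_frames:
        average = smoothing * average + (1.0 - smoothing) * p
        stationary.append(average)
        non_stationary.append(max(p - beta_x * average, 0.0))
    return stationary, non_stationary


# A constant noise floor with one speech-like burst in the fourth block.
frames = [1.0, 1.0, 1.0, 10.0, 1.0, 1.0]
stat, non_stat = split_power(frames, smoothing=0.9, beta_x=1.0)

# With beta_x = 0 nothing is subtracted, so the whole signal is treated
# as non-stationary.
_, everything = split_power(frames, smoothing=0.9, beta_x=0.0)
```

The burst appears almost entirely in the non-stationary part, while the stationary part rises only slightly and then decays slowly, much like the curve in Fig. 8(c).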

Based on the data of Fig. 8, Fig. 9 shows the corresponding echo removal filters. More specifically, Fig. 9 shows the curves of the two associated echo removal filters H_s and H_w at a frequency of 1 kHz, computed from equations (12) and (13). Fig. 9(a) shows the echo removal filter H_s for the non-stationary component at 1 kHz, computed according to equation (12). Panel (b) shows the corresponding echo removal filter H_w for the stationary component, computed according to equation (13).

Fig. 10 shows the same quantities on an extended time scale, which is plotted on the abscissa of Fig. 10(c) and applies equally to Figs. 10(a) and 10(b). Fig. 10(a) shows the control parameter β_x of the stationary/non-stationary separation, Fig. 10(b) shows the prediction gain ω, and Fig. 10(c) shows the voice activity in the loudspeaker channel.

More precisely, Fig. 10 illustrates the relation between the control parameter β_x and the two other control quantities introduced above, the prediction gain ω and the voice activity. The first third of the simulation in Fig. 10 corresponds to a situation with speech only at the far end and a high prediction gain. Here the control parameter is set to β_x = β_w = 0, which corresponds to aggressive suppression of the non-stationary component and complete suppression of the stationary component.

The second third of the simulation corresponds to a situation with speech only at the near end, which can be recognized by a low prediction gain ω and the absence of voice activity in the loudspeaker signal. The control parameter β_x is then chosen so that all stationary components pass through the stationary path and are removed with low aggressiveness, which prevents artifacts. The last third of the simulation corresponds to double talk, in which the control parameter β_x varies between low values when voice activity is present in the loudspeaker channel and higher values when no voice activity is detected.

In the embodiments of the invention described above, including Fig. 6, which likewise shows a general block diagram of the corresponding algorithm, separate suppression of the stationary and non-stationary echo components is achieved not by separating the corresponding loudspeaker signals but by estimating the echo signal as a whole.

In all the embodiments considered, the echo power spectral density was estimated by applying the echo estimation filter G[k,m] or G[k,m]² to the delayed power spectrum of the loudspeaker signal according to equation (8), which yields an estimate of the power spectrum of the echo contained in the microphone signal. When the power spectrum of the loudspeaker signal is split according to equation (3) into the stationary component |X_w[k,m]|² and the non-stationary component |X_s[k,m]|², the echo arising from the non-stationary component of the loudspeaker signal is computed using equation (10), and the echo arising from the stationary component of the loudspeaker signal follows from equation (9).

Using the estimates of the non-stationary echo, |Ŷ_s[k,m]|², and of the stationary echo, |Ŷ_w[k,m]|², the corresponding echo suppression filters H_s[k,m] and H_w[k,m] can be computed. These echo suppression filters are then combined and applied to the microphone signal to suppress the echo in accordance with the equation

where H[k,m] is derived from

One possible way to combine the echo suppression filters H_s[k,m] and H_w[k,m] is to use their product in accordance with equation (11), which corresponds to connecting the two filters in series.

Another possible way is to use the respective minimum of the echo suppression filters in accordance with

H[k,m] = min(H_s[k,m], H_w[k,m]),

where the function min(…) returns the minimum of its arguments. In other words, the combination function in this case is the minimum.
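The two combination rules described above can be illustrated with a short sketch. It assumes, purely for illustration, that each filter is a list of real-valued per-band gains between 0 and 1 for one frame; the function name `combine_gains` and the example values are hypothetical and are not taken from the patent.

```python
def combine_gains(h_s, h_w, mode="product"):
    """Combine two per-band suppression gains for one frame.

    h_s, h_w: lists of floats (hypothetical gains for the non-stationary
    and the stationary echo component).
    mode: "product" cascades the two filters (series connection),
          "min" keeps the stronger attenuation in each band.
    """
    if len(h_s) != len(h_w):
        raise ValueError("gain vectors must have the same number of bands")
    if mode == "product":
        return [a * b for a, b in zip(h_s, h_w)]
    if mode == "min":
        return [min(a, b) for a, b in zip(h_s, h_w)]
    raise ValueError(f"unknown mode: {mode}")


# Illustrative values only.
h_s = [1.0, 0.5, 0.2]
h_w = [0.8, 0.5, 0.9]
product = combine_gains(h_s, h_w, "product")
minimum = combine_gains(h_s, h_w, "min")
```

For gains no greater than 1, the product never attenuates less than the minimum rule, so the series connection is the more aggressive of the two choices.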

As explained above, these computations can be carried out, in particular, by combiner 380, and also by selector 390 or distributor 410. In addition, the individual units can perform more complex combinations and computations of the individual suppression filters, based, for example, on linear combinations or non-linear equations. It is also possible to perform the combination not only per band signal but also for groups of band signals or for all band signals together.

Combining the echo suppression filters makes it possible to introduce different gain factors for the different echo components. The filter for suppressing the non-stationary echo is computed in accordance with equation (12), and the filter for suppressing the stationary echo in accordance with equation (13).

In practice, echo suppression is often based not on direct application of the echo suppression filters of equations (12) and (13) but on time-smoothed versions of them. Like the design parameters described above, the time-smoothing parameters can also be tuned manually and separately for the suppression of the non-stationary and the stationary echo. This can improve the perceived audio quality, since the requirements for suppressing stationary noise components differ from those for suppressing non-stationary speech components.

For example, it is well known that suppressing stationary signal components requires stronger smoothing in order to avoid so-called musical tones. On the other hand, the smoothing applied to the filters for the non-stationary echo should be kept weaker so that echoes caused by long components of the echo path, that is, long echo-path tails, are suppressed sufficiently. At the same time, the tracking of rapidly changing echo levels must not be impaired. This discussion of perceptual quality clearly shows the need to adapt and optimize the two echo suppression filters of equations (12) and (13) individually.
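The effect of separate smoothing parameters can be sketched as follows. The text does not say which smoothing rule is used, so first-order recursive smoothing is an assumption here, and the names `smooth_gain`, `ALPHA_W` and `ALPHA_S` as well as their values are hypothetical.

```python
def smooth_gain(previous, current, alpha):
    """One step of first-order recursive smoothing per band (assumed form).

    previous: smoothed gains of the preceding frame
    current:  newly computed gains of the current frame
    alpha:    smoothing constant in [0, 1); larger means stronger smoothing
    """
    if not 0.0 <= alpha < 1.0:
        raise ValueError("alpha must lie in [0, 1)")
    return [alpha * p + (1.0 - alpha) * c for p, c in zip(previous, current)]


# Hypothetical constants: strong smoothing for the stationary-echo filter,
# weaker smoothing for the non-stationary-echo filter.
ALPHA_W = 0.9
ALPHA_S = 0.5

previous_gains = [1.0, 1.0]
new_gains = [0.0, 0.0]  # the filter suddenly asks for full attenuation
smoothed_w = smooth_gain(previous_gains, new_gains, ALPHA_W)
smoothed_s = smooth_gain(previous_gains, new_gains, ALPHA_S)
```

With these values the stationary-echo gain moves only a tenth of the way toward its new target in one frame, whereas the non-stationary-echo gain moves halfway, which is the qualitative behaviour the paragraph above asks for.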

The approach described below consists in applying the echo suppression filters separately to the stationary and non-stationary signal components.

Fig. 11 shows a block diagram of a device 200 that includes an adaptive filter 210. Because the structures are similar, reference is made below to the embodiments shown in Figs. 2 to 5, 6 and 7.

Device 200 according to the invention includes a loudspeaker 100, or a terminal for connecting a loudspeaker 100, or an input for a loudspeaker signal x[n]. A time/frequency converter 230, labelled DFT (discrete Fourier transform) in the figure, converts the loudspeaker signal x[n] into its spectral representation X[k,m]. This signal is then fed to delay device 480, which produces the delayed signal X[k-d(k,m),m] with delay value d(k,m).

From delay device 480 the delayed signal is passed to the first echo estimation filter 240, which, on the basis of the filter coefficients G[k,m], generates an estimated echo signal Ŷ[k,m]. The estimated echo signal Ŷ[k,m] is sent to extractor 250, which, from the spectral coefficients of this estimated echo signal, generates its non-stationary and stationary power spectra as component signals derived from the loudspeaker signal.

The signals |Ŷ_s[k,m]|² and |Ŷ_w[k,m]|² are then output from extractor 250 to computer 270.

The signal y[n] of microphone 110 is fed to a time/frequency converter 290, also labelled DFT, which converts it from the time domain into the spectral representation Y[k,m]. The converted signal is fed to energy calculator 490, which computes the power spectrum of the microphone signal by squaring the magnitude of each spectral component. The resulting power spectrum is also fed to computer 270, which, from it and the power spectra described above, computes the two echo suppression filters H_s[k,m] and H_w[k,m] and the filter coefficients H[k,m] of the actual adaptive filter, and passes these coefficients to adaptive filter 210.

Adaptive filter 210 is also connected to the output of time/frequency converter 290 and therefore also receives the spectral components Y[k,m] of the microphone signal y[n], from which it generates the echo-suppressed signal in the frequency domain using the filter coefficients H[k,m]. This echo-suppressed signal is then passed to a frequency/time converter 300, labelled IDFT (inverse DFT) in the figure, which finally converts it back to the time domain.

To determine the delay value d(k,m) for delay device 480 and the coefficients of echo estimation filter 240, the spectral representations of the loudspeaker signal X[k,m] and of the microphone signal Y[k,m] are fed to energy calculators 500 and 510, respectively. Energy calculator 500 is connected to the output of time/frequency converter 230, and energy calculator 510 to the output of time/frequency converter 290.

Like energy calculator 490, each of the two energy calculators 500 and 510 computes power spectra by squaring the magnitudes of the respective spectral components and passes them to a further computer 520. From these values, computer 520 estimates the delay value d(k,m) and the coefficients G[k,m] of echo estimation filter 240. Computer 520 is connected to delay device 480 and to echo estimation filter 240, to which it passes the respective values.
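The operation performed by energy calculators 490, 500 and 510 amounts to squaring the magnitude of each complex spectral coefficient. A minimal sketch, assuming one frame is held as a list of Python complex numbers (the function name `power_spectrum` and the sample values are hypothetical):

```python
def power_spectrum(spectrum):
    """Return |c|^2 for every complex coefficient c of one frame."""
    return [c.real * c.real + c.imag * c.imag for c in spectrum]


# Illustrative frame with three frequency bands.
frame = [3 + 4j, 0j, 1j]
frame_power = power_spectrum(frame)
```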

As the embodiment of Fig. 11 shows, the component signals |Ŷ_s[k,m]|² and |Ŷ_w[k,m]|² can thus be separated on the basis of an estimate of the echo spectrum Ŷ[k,m], which is obtained in accordance with the equation

Ŷ[k,m] = G[k,m] X[k-d(k,m),m].

This computation is performed by echo estimation filter 240.

The definition of the two echo suppression filters H_s[k,m] and H_w[k,m] in accordance with equations (12) and (13) remains unchanged. The same applies to the definition of the combined echo suppression filter H[k,m]. The additional method, and the associated device 200 shown in Fig. 11, are therefore based on the assumption that the stationary and non-stationary components of the estimated echo are uncorrelated, so that

|Ŷ[k,m]|² = |Ŷ_s[k,m]|² + |Ŷ_w[k,m]|².

The estimated power spectrum of the non-stationary echo component can then be determined by subtracting the estimate of the stationary echo component, |Ŷ_w[k,m]|², from the power spectrum of the estimated echo signal. Thus,

|Ŷ_s[k,m]|² = |Ŷ[k,m]|² - |Ŷ_w[k,m]|².

In practice, |Ŷ_s[k,m]|² is estimated by filtering the power spectrum of the estimated echo signal in accordance with

|Ŷ_s[k,m]|² = F_y[k,m]² |Ŷ[k,m]|².

Since the gain filter F_y[k,m], or its square F_y[k,m]², is determined in the same way as the gain filter F_x[k,m], or F_x[k,m]², it is not described in detail here. This function is likewise performed by extractor 250 on the basis of the signals it receives.
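The separation of the estimated echo power into its two components can be sketched as follows, assuming that an estimate of the stationary echo power is already available for each band. The floor at zero is an added safeguard that the text does not mention, and the names `split_echo_power` and `squared_gain` are hypothetical.

```python
def split_echo_power(echo_power, stationary_power):
    """Non-stationary echo power as total minus stationary, per band.

    Negative differences are clipped to zero (an assumption made here so
    that the result remains a valid power value).
    """
    return [max(total - stat, 0.0)
            for total, stat in zip(echo_power, stationary_power)]


def squared_gain(nonstationary_power, echo_power):
    """Equivalent squared gain that maps the total echo power to the
    non-stationary part; bands with zero total power get a gain of zero."""
    return [ns / total if total > 0.0 else 0.0
            for ns, total in zip(nonstationary_power, echo_power)]


# Illustrative values only.
echo_power = [4.0, 1.0, 0.5]
stationary_power = [1.0, 1.0, 1.0]
nonstationary_power = split_echo_power(echo_power, stationary_power)
f_y_squared = squared_gain(nonstationary_power, echo_power)
```

Multiplying `echo_power` by `f_y_squared` band by band reproduces `nonstationary_power`, which is the sense in which the subtraction can be viewed as a gain filter.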

It should be noted here that the embodiment shown in Fig. 11 relates to the case in which the estimated echo spectrum Ŷ[k,m] is already known. The same method can of course also be applied when only the estimated echo power spectrum |Ŷ[k,m]|², obtained using equation (8), is known. This variant is described in detail in connection with the embodiment shown in Fig. 12.

The block diagram of Fig. 12 illustrates an approach similar to that of Fig. 11, in which acoustic echo attenuation separates the stationary and non-stationary echo components on the basis of the estimated echo spectrum Ŷ[k,m]. The approach of Fig. 12 differs in that the stationary and non-stationary echo components are separated on the basis of the estimated echo power spectrum |Ŷ[k,m]|².

As the following description makes clear, the embodiments of Figs. 11 and 12 are similar not only in function but also in structure.

More specifically, the main difference between Fig. 12 and Fig. 11 is that energy calculator 500, which receives the loudspeaker signal x[n] after conversion to the frequency domain, is no longer connected only upstream of computer 520 but directly to the output of time/frequency converter 230 (the DFT block). With this arrangement, not only computer 520 but also delay device 480, echo estimation filter 240 and extractor 250 receive the power spectra rather than the spectral components themselves.

Otherwise, the two embodiments of Figs. 11 and 12 differ only in that the same computations are distributed slightly differently among the components. In particular, extractor 250 no longer computes the energy values of the individual spectral components, since energy calculator 500 has already done so.

Fig. 13 shows an embodiment of the invention in which, for example, more than one loudspeaker signal or more than one microphone signal is fed to device 200. In other words, Fig. 13 shows a multichannel implementation.

The embodiments described and discussed so far use separate channels, or a single channel, carrying only one loudspeaker signal and one microphone signal. The invention, however, is not limited to the single-channel case, as explained below. The preceding embodiments can be applied analogously to multichannel acoustic echo suppression systems.

Since the embodiment of device 200 in Fig. 13 is largely similar to that of Fig. 2, reference is made below to the description of Figs. 2 to 5 for the modes of operation, connections and other aspects.

The multichannel variant of device 200 shown in Fig. 13 has any number of inputs 220-1, 220-2, …, through which several loudspeaker signals can be supplied. Accordingly, device 200 may optionally include a corresponding number of time/frequency converters 230-1, 230-2, … that convert the loudspeaker signals from the time domain to the frequency domain, as described in detail in connection with Fig. 2.

The time/frequency converters 230 are connected to a corresponding number of inputs of grouping unit 530, which combines the incoming loudspeaker signals into a derived loudspeaker signal. This derived signal is passed either to the first echo estimation filter 240 or to extractor 250, depending on whether the optional first echo estimation filter 240 is present. As noted in connection with Fig. 2, extractor 250 may be connected to an optional second echo estimation filter 260 or directly to computer 270, which outputs the computed filter coefficients.

Unlike the embodiment of Fig. 2, the multichannel device 200 of Fig. 13 also includes a grouping unit 540, whose inputs are connected to a corresponding number of microphone input terminals 280-1, 280-2, …, optionally via time/frequency converters 290-1, 290-2, …. Like grouping unit 530, grouping unit 540 forms from the incoming microphone signals, in the time or frequency domain, a derived (effective or common) microphone signal, which may optionally be passed to extractor 250 or computer 270.

In addition, the multichannel device 200 of Fig. 13 includes an adaptive filter 210-1, 210-2, … for each microphone signal, that is, for each microphone input terminal 280. The adaptive filters 210-1, 210-2, … are connected to the respective inputs 280-1, 280-2, …, optionally via the time/frequency converters 290-1, 290-2, …. Likewise, they are connected to the respective output terminals 310-1, 310-2, …, where necessary via a corresponding number of frequency/time converters 300-1, 300-2, …. The echo-suppressed, spectrally modified signals output by adaptive filters 210 are thus available at output terminals 310 of device 200.

All adaptive filters 210-1, 210-2, … are connected in parallel to the output of computer 270, from which they receive the filter coefficients. In other words, in the embodiment of Fig. 13 all microphone signals are, functionally, filtered by the same adaptive filter, that is, with the same filter coefficients, to obtain spectrally modified, echo-suppressed versions of the respective microphone signals.

Thus, if x_l[n] denotes the signal of the l-th loudspeaker, where l is an integer from 0 to L - 1 and L is the number of loudspeakers or loudspeaker signals, the same model can be introduced by analogy with equation (1) in accordance with

x_l[n] = x_s,l[n] + x_w,l[n], (25)

where x_s,l[n] models the non-stationary speech component and x_w,l[n] models the stationary noise component contained in the signal of the l-th loudspeaker. In accordance with equation (2), the short-time Fourier transform (STFT) representation of equation (25) is given by

X_l[k,m] = X_s,l[k,m] + X_w,l[k,m]. (26)

Grouping unit 530, shown in Fig. 13, then computes a common, combined power spectrum of all loudspeaker channels by combining the spectra of the individual loudspeaker signals in accordance with

where L denotes the number of loudspeaker channels. The non-stationary and stationary signal components are then separated in accordance with equations (5) and (7), using the combined power spectrum of equation (27).

Analogously, a common, combined power spectrum of the microphone channels is computed in accordance with

where Y_p[k,m] denotes the signal of the p-th microphone 110 and P is the number of microphones. The index p is likewise an integer from 0 to P - 1. In the embodiment of Fig. 13 this computation can be performed by the second grouping unit 540.
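The role of grouping units 530 and 540 can be sketched as follows. The sketch assumes that the combined power spectrum is the plain sum of the per-channel power spectra; the text only says that the individual spectra are combined, so the exact rule (for example, whether a normalization is applied) is an assumption, as is the function name `combined_power_spectrum`.

```python
def combined_power_spectrum(channel_spectra):
    """Sum |X_l|^2 over all channels l for each frequency band of one frame.

    channel_spectra: list of channels, each a list of complex coefficients
    with the same number of bands.
    """
    if not channel_spectra:
        raise ValueError("at least one channel is required")
    n_bands = len(channel_spectra[0])
    if any(len(ch) != n_bands for ch in channel_spectra):
        raise ValueError("all channels must have the same number of bands")
    return [
        sum(ch[band].real ** 2 + ch[band].imag ** 2 for ch in channel_spectra)
        for band in range(n_bands)
    ]


# Two hypothetical loudspeaker channels with two bands each.
loudspeaker_frames = [[1 + 0j, 2j], [3 + 4j, 0j]]
combined = combined_power_spectrum(loudspeaker_frames)
```

The same function would serve for the microphone channels, with P channels in place of L.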

To determine the two echo suppression filters in accordance with equations (12) and (13), the subsequent steps of the algorithm use the loudspeaker power spectrum |X[k,m]|² of equation (27) and the microphone power spectrum |Y[k,m]|² of equation (28), as described above. The setting of the control parameter β_x in accordance with equations (14) to (17), described above in the context of performance control, can likewise be based on the combined spectra of equations (27) and (28).

The actual echo suppression by spectral modification is then performed individually for each microphone signal, but using the same echo suppression filter 210 for every microphone channel, following the equation

for p = 0, 1, …, P-1.

However, as mentioned above, the echo suppression filters 210 may also be implemented differently, for example according to equation (19).
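Applying one common gain filter to every microphone channel can be illustrated as below. This is only a sketch: the two ways of combining the stationary and non-stationary gain filters follow the wording of claims 13 and 15 (series connection, or selection of the stronger attenuation), real-valued gains between 0 and 1 are assumed, and all function and parameter names are hypothetical.

```python
import numpy as np

def combine_gains(h_stationary, h_nonstationary, mode="series"):
    """Combine two real-valued gain filters of shape (num_bins, num_frames).

    mode="series":    product of both gains (series connection of the filters).
    mode="strongest": element-wise minimum, i.e. the gain with higher attenuation.
    """
    h_s = np.asarray(h_stationary, dtype=float)
    h_n = np.asarray(h_nonstationary, dtype=float)
    if mode == "series":
        return h_s * h_n
    if mode == "strongest":
        return np.minimum(h_s, h_n)
    raise ValueError("mode must be 'series' or 'strongest'")

def suppress_echo(mic_spectra, gain):
    """Apply the same gain filter to each of the P microphone spectra.

    mic_spectra: complex array of shape (P, num_bins, num_frames).
    gain:        real array of shape (num_bins, num_frames).
    """
    mic_spectra = np.asarray(mic_spectra)
    gain = np.asarray(gain, dtype=float)
    return mic_spectra * gain[np.newaxis, :, :]

# Toy example with 2 bins, 1 frame and P = 3 microphones.
h_s = np.array([[0.5], [1.0]])
h_n = np.array([[0.8], [0.2]])
Y = np.ones((3, 2, 1), dtype=complex)
E = suppress_echo(Y, combine_gains(h_s, h_n, mode="series"))
```

Because the gain is computed once from the combined spectra and then broadcast over the channel axis, the cost of the filter computation does not grow with the number of microphones.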

It should be noted here that in a multichannel implementation of the device 200, as shown for example in Fig. 13, the number L of loudspeaker signals and the number P of microphone signals may be equal or different. In principle, any number of loudspeaker and microphone input signals may be used. Moreover, it is not necessary to provide both grouping devices 530, 540 for a plurality of loudspeaker and microphone input signals. The invention also allows a plurality of loudspeaker signals to be input via the grouping device 530 without using the grouping device 540 for a plurality of microphone signals. Such a system is applicable when a single microphone signal from one far-end subscriber is reproduced over several loudspeakers, for example in dispatch communication with vehicles.

Naturally, the multichannel grouping device 530 is not needed when only one loudspeaker signal is input, for example a central loudspeaker in a conference system in which each of several participants has a personal microphone. In that situation it is advisable to provide the grouping device 540.

It should be added that the grouping devices 530 and 540 may be designed for a larger number of loudspeaker or microphone signals than are actually supplied to them at a given time. Likewise, the device 200 may provide more inputs 220, 280 than are used in practice. In such cases, upstream circuits, for example the optional time/frequency converters 230, 290 or the grouping devices 530, 540, can determine the number of active channels themselves and select the parameters L and P accordingly. The number of channels and the expected number of microphone and loudspeaker signals may, of course, also be supplied externally.

In addition, the embodiment shown in Fig. 13 can of course also operate with a single loudspeaker signal and a single microphone signal when the corresponding values of L and P are supplied to the grouping device 530. In principle, equations (27) and (28) remain applicable for P = 1 and/or L = 1. The embodiment shown in Fig. 13 is therefore a backward-compatible extension of the embodiment shown in Fig. 2.

The frequency resolution is preferably derived from that of the STFT. The uniform spectral resolution of the STFT is not well matched to human auditory perception. It is therefore advantageous to group the uniformly spaced coefficients |X[k,m]|² and |Y[k,m]|² into a number of non-overlapping partitions or groups, as shown in [C. Faller and F. Baumgarte. Binaural Cue Coding - Part II: Schemes and applications. IEEE Trans. on Speech and Audio Proc., 11(6): 520-531, Nov. 2003], whose bandwidths mimic the frequency resolution of the human auditory system, as presented, in particular, in [10].

For a sampling rate of 16 kHz, a suitable choice for the STFT is a DFT block length of 512 samples and 15 groups, or partitions, each with a bandwidth of approximately twice the equivalent rectangular bandwidth (ERB), as described in [B. R. Glasberg and B. C. J. Moore. Derivation of auditory filter shapes from notched-noise data. Hear. Res., 47: 103-138, 1990]. The bands correspond to the partitions as shown in Fig. 14.

Fig. 14 shows how the coefficients of the uniform STFT spectrum can be grouped, or partitioned, in order to mimic the non-uniform frequency resolution of the human auditory system. As can be seen in Fig. 14, the frequency axis extends from 0 Hz to approximately 8000 Hz, which is the effective bandwidth for a sampling rate of 16 kHz.
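One possible way of building such partitions is sketched below. It is not the partition table of Fig. 14: the sketch assumes that the 15 partitions are spaced uniformly on the ERB-number scale of Glasberg and Moore, which for a band of 0 to 8 kHz gives partitions a little over two ERB wide. The function names and the choice of averaging the power within a partition are hypothetical.

```python
import numpy as np

def hz_to_erb_number(f_hz):
    """ERB-number scale (Glasberg and Moore, 1990)."""
    return 21.4 * np.log10(4.37e-3 * np.asarray(f_hz, dtype=float) + 1.0)

def erb_number_to_hz(erb_number):
    """Inverse of hz_to_erb_number."""
    return (10.0 ** (np.asarray(erb_number, dtype=float) / 21.4) - 1.0) / 4.37e-3

def partition_edges(fs=16000, n_fft=512, n_partitions=15):
    """Bin indices delimiting non-overlapping partitions of the STFT bins."""
    n_bins = n_fft // 2 + 1
    erb_max = hz_to_erb_number(fs / 2)
    edges_hz = erb_number_to_hz(np.linspace(0.0, erb_max, n_partitions + 1))
    edges = np.round(edges_hz / (fs / n_fft)).astype(int)
    edges[0], edges[-1] = 0, n_bins  # cover every bin, including Nyquist
    if np.any(np.diff(edges) <= 0):
        raise ValueError("too many partitions for this DFT length")
    return edges

def group_power(power, edges):
    """Average a (num_bins, num_frames) power spectrum within each partition."""
    power = np.asarray(power, dtype=float)
    sums = np.add.reduceat(power, edges[:-1], axis=0)
    widths = np.diff(edges)
    return sums / widths[:, np.newaxis]

edges = partition_edges()
grouped = group_power(np.ones((257, 3)), edges)
```

The low-frequency partitions contain only a few bins, while the highest ones span several dozen, which reflects the coarser frequency resolution of hearing at high frequencies.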

The gain filters are computed only for the center frequency of each group. This further reduces the computational complexity compared with the full spectral resolution of the uniform STFT. Before the gain filter, which is available at partition resolution, is applied to the uniform STFT signal spectrum, it is interpolated using Hann interpolation filters.

Fig. 15(a) shows Hann interpolation filters that can be used to smooth the gain filters over frequency. Fig. 15(b) shows, as a solid line, the gain filter coefficients interpolated from the gain filter values of the individual partitions, which are in turn marked by bold dots.

Panel (a) of Fig. 15 shows the Hann filters in detail, and panel (b) gives an example of gain filter values before and after interpolation. The dots in Fig. 15(b) denote the values before interpolation, while the solid line corresponds to the values obtained by interpolation. Smoothing the gain filters over frequency yields a smoother variation of the resulting spectrum as a function of frequency and thus reduces musical tones and other artifacts.
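The interpolation from partition resolution back to the uniform bin resolution can be sketched as follows. The exact filter shapes of Fig. 15(a) are not reproduced here; as an assumption, each pair of neighbouring partition centers is connected by a raised-cosine (Hann-shaped) cross-fade whose weights sum to one, and the outermost values are held constant beyond the first and last center. The function name and arguments are hypothetical.

```python
import numpy as np

def hann_interpolate(partition_gains, centers, n_bins):
    """Interpolate per-partition gains onto n_bins uniformly spaced bins.

    partition_gains: gain value at each partition center.
    centers:         strictly increasing center positions in bins (at least two).
    """
    gains = np.asarray(partition_gains, dtype=float)
    centers = np.asarray(centers, dtype=float)
    k = np.arange(n_bins, dtype=float)
    # Index of the partition center at or below each bin, kept in valid range.
    idx = np.clip(np.searchsorted(centers, k, side="right") - 1,
                  0, len(centers) - 2)
    left, right = centers[idx], centers[idx + 1]
    # Relative position between the two neighbouring centers, clipped so that
    # bins outside the outer centers keep the nearest partition gain.
    t = np.clip((k - left) / (right - left), 0.0, 1.0)
    w = 0.5 - 0.5 * np.cos(np.pi * t)  # rising half of a Hann window
    return (1.0 - w) * gains[idx] + w * gains[idx + 1]

smooth = hann_interpolate([1.0, 0.0, 0.5], centers=[2, 6, 12], n_bins=16)
```

The interpolated curve passes through the partition values at the centers and varies smoothly in between, in the manner of the dots and solid line of Fig. 15(b).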

The preceding description of the embodiments has shown that the invention is implemented by functional units that carry out a defined sequence of operations, which together form the algorithm summarized below. An implementation of the invention comprises the following steps: receiving at least one loudspeaker signal; receiving at least one microphone signal; converting the loudspeaker signal and the microphone signal into short-time spectra; computing the power spectral densities of the loudspeaker and microphone signals; extracting, or decomposing the loudspeaker power spectral density into, stationary and non-stationary components; computing an echo suppression gain filter using the stationary loudspeaker power spectrum; computing an echo suppression gain filter using the non-stationary loudspeaker power spectrum; applying the gain filter to the microphone spectrum to suppress the echo; and converting the echo-suppressed microphone spectrum back to the time domain.
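The decomposition step of this sequence can be illustrated with a small sketch. It follows the asymmetric recursive averaging worded in claim 5 (a small weight when the current power exceeds the running average, a larger weight when it falls below it). The weights `alpha_rise` and `alpha_fall`, the function name, and the use of a half-wave-rectified difference as the non-stationary part are assumptions made for illustration and do not reproduce equations (5) and (7).

```python
import numpy as np

def split_stationary(power, alpha_rise=0.02, alpha_fall=0.5):
    """Split a (num_bins, num_frames) power spectrum into two parts.

    The stationary part is a recursive average that follows increases slowly
    (alpha_rise) and decreases quickly (alpha_fall). The non-stationary part
    is assumed here to be the non-negative remainder.
    """
    power = np.asarray(power, dtype=float)
    stationary = np.empty_like(power)
    prev = power[:, 0].copy()
    stationary[:, 0] = prev
    for m in range(1, power.shape[1]):
        cur = power[:, m]
        # Smaller weight when the current power exceeds the running average.
        alpha = np.where(cur > prev, alpha_rise, alpha_fall)
        prev = alpha * cur + (1.0 - alpha) * prev
        stationary[:, m] = prev
    nonstationary = np.maximum(power - stationary, 0.0)
    return stationary, nonstationary

# One frequency bin with constant power 1 and a short burst in frame 5.
p = np.ones((1, 8))
p[0, 5] = 10.0
stat, nonstat = split_stationary(p)
```

In this toy signal the burst barely raises the stationary estimate and is attributed almost entirely to the non-stationary part, while the estimate quickly returns towards the floor once the burst has passed.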

Depending on the circumstances, the method of the present invention may be implemented in hardware or in software. The implementation may use a digital storage medium, in particular a floppy disk, CD or DVD, carrying electronically readable control signals that can interact with a programmable computer system such that the inventive method is performed. In general, the invention thus also consists in a software program product, computer program product or program product with program code stored on a machine-readable carrier for performing the method when the program runs on a computer or microprocessor. In other words, the invention can be realized as a computer program, software program or program having program code for performing the method when the program runs on a processor. The processor may be part of a computer, a chip card (smart card), a system on chip (SOC), an application-specific integrated circuit (ASIC) or any other integrated circuit (IC).

Claims

1. A device (200) for computing filter coefficients for an adaptive filter (210) for filtering a microphone signal so as to suppress an echo caused by a loudspeaker signal, comprising: an echo estimation filter (240) for estimating the spectrum of the echo component, or the power spectral density of the echo, in the microphone signal; an extractor (250) for extracting a stationary component signal and a non-stationary component signal (1) from the loudspeaker signal or (2) from a signal derived from the loudspeaker signal, the derived signal being computed on the basis of the estimated spectrum of the echo component or the power spectral density of the echo in the microphone signal; and a computer (270) for computing the filter coefficients of the adaptive filter (210) either (1) on the basis of the stationary component signal and the non-stationary component signal extracted from the loudspeaker signal, and on the basis of the estimated spectrum of the echo component or the power spectrum of the echo in the microphone signal, or (2) on the basis of the stationary component signal and the non-stationary component signal extracted from the derived signal.

2. The device (200) according to claim 1, in which the extractor (250) extracts the stationary component signal on the basis of averaging an energy-related value of a band signal of the loudspeaker signal or of the signal derived from it.

3. The device (200) according to claim 2, in which the extractor (250) performs the averaging by computing a moving average over the value of the current data block, on which the band signal is based, and the value of at least one data block that precedes the current data block in time.

4. The device (200) according to claim 2, in which the extractor (250) performs the averaging by determining a moving average using a computation rule that depends on a comparison of the energy-related value of the current data block with the energy-related value of a previous data block or with a previously averaged value.

5. The device (200) according to claim 3, in which the extractor (250) performs recursive moving averaging by adding the energy-related value of the current data block to the result of the previous averaging, weighted according to an addition parameter, the addition parameter being smaller when the energy-related value of the current data block is greater than the previously determined averaged value, and larger when the energy-related value of the current data block is less than the previously determined averaged value.

6. The device (200) according to claim 1, in which the extractor (250) extracts the non-stationary component signal on the basis of a band signal of the loudspeaker signal or of a signal derived from it.

7. The device (200) according to claim 1, in which the extractor (250) extracts the non-stationary component signal on the basis of the stationary component signal and a gain filter.

8. The device (200) according to claim 7, in which the gain filter used by the extractor (250) depends on a control parameter that is either variable or fixed.

9. The device (200) according to claim 8, in which the extractor (250) sets the control parameter of the gain filter on the basis of a coherence function that is based on the loudspeaker signal or a signal derived from it, and on the microphone signal or a signal derived from it.

10. The device (200) according to claim 9, in which the extractor (250) sets the control parameter on the basis of the average value of the coherence function over a plurality of band signals of the loudspeaker signal or of a signal derived from it, and over a plurality of band signals of the microphone signal or of a signal derived from it.

11. The device (200) according to claim 1, including a grouping device (540) for combining a plurality of microphone signals to obtain a combined microphone signal or a signal derived from it.

12. The device (200) according to claim 1, in which the extractor (250) outputs the stationary component signal and the non-stationary component signal, and in which the computer (270) computes first filter coefficients on the basis of the stationary component signal, computes second filter coefficients on the basis of the non-stationary component signal and, in addition, determines the filter coefficients on the basis of the first and second filter coefficients.

13. The device (200) according to claim 12, in which the extractor (250) calculates the filter coefficients so that they correspond to a series connection of a first filter, which corresponds to the first filter coefficients, with a second filter, which corresponds to the second filter coefficients.

14. The device (200) according to claim 1, in which the extractor (250) outputs the stationary component signal and the non-stationary component signal, and in which the computer (270) computes first filter coefficients on the basis of the stationary component signal and second filter coefficients on the basis of the non-stationary component signal and, in addition, determines the filter coefficients on the basis of either the first or the second filter coefficients.

15. The device (200) according to claim 14, in which the computer (270) determines the filter coefficients on the basis of whichever of the first and second filter coefficients correspond to the higher attenuation.

16. The device (200) according to claim 1, in which the extractor (250) outputs the stationary component signal or the non-stationary component signal together with a control information signal containing information on the component signal that is output, and in which the computer (270) computes the filter coefficients on the basis of the output signal of the extractor (250), a signal derived from it, and the information contained in the control information signal.

17. The device (200) according to claim 16, in which the extractor (250) outputs the stationary component signal or the non-stationary component signal to the computer (270) depending on the ratio of the energy-related values of the stationary component signal and the non-stationary component signal.

18. The device (200) according to claim 1, in which the extractor (250) provides the stationary component signal or the non-stationary component signal as expected signals.

19. The device (200) according to claim 1, in which the extractor (250) outputs the stationary component signal and the non-stationary component signal.

20. The device (200) according to claim 1, further including an adaptive filter (210) for filtering the microphone signal using the filter coefficients.

21. The device (200) according to claim 1, including a grouping device (540) and a plurality of adaptive filters (210) for filtering at least two of the plurality of microphone signals on the basis of identical filter coefficients received from the computer (270).

22. The device (200) according to claim 1, including a grouping device (530) for combining a plurality of loudspeaker signals to obtain a combined loudspeaker signal or a signal derived from it.

23. A method of computing filter coefficients for an adaptive filter (210) for filtering a microphone signal, including: estimating the spectrum of the echo component, or the power spectral density of the echo, in the microphone signal; extracting a stationary component signal or a non-stationary component signal (1) from the loudspeaker signal or (2) from a signal derived from the loudspeaker signal on the basis of the estimated spectrum of the echo component or the power spectral density of the echo in the microphone signal; and computing the filter coefficients of the adaptive filter,
- (1) on the basis of the stationary component signal or the non-stationary component signal extracted from the loudspeaker signal, and on the basis of the estimated spectrum of the echo component or the power spectrum of the echo in the microphone signal; or
- (2) on the basis of the stationary component signal and the non-stationary component signal extracted from the derived signal.

24. A machine-readable medium containing program code for performing the method according to claim 23 when the program code is executed on a processor.