RU93173U1

RU93173U1 - ANNOUNCER VOICE DISTORTION SYSTEM

Info

Publication number: RU93173U1
Application number: RU2009148684/22U
Authority: RU
Inventors: Михаил Андреевич Котов; Дмитрий Анатольевич Леднов; Максим Николаевич Меркулов; Андрей Валентинович Хацкевич
Original assignee: Общество с ограниченной ответственностью "Стэл-Компьютерные Системы"
Priority date: 2009-12-29
Filing date: 2009-12-29
Publication date: 2010-04-20

Abstract

1. Система искажения голоса диктора, содержащая генератор базисных сигналов, формирователь параметров разбиения диапазона частот, интеграторы, формирователь порогового уровня, блок питания, подключенный к питающим входам составных блоков системы, формирователь параметров смещения позиций значимых интервалов спектра, последовательно соединенные цифровое записывающее устройство, блок дискретизации, блок дискретного преобразования Фурье и блок нормирования спектра Фурье, последовательно соединенные блок формирования огибающей спектра Фурье и первый дополнительный блок нормирования спектра Фурье, последовательно соединенные блок формирования параметров изменения огибающей спектра Фурье и дополнительный блок формирования огибающей спектра Фурье, последовательно соединенные компаратор, сумматор, дополнительный сумматор, блок определения максимумов и соответствующих им аргументов, регистратор максимальных значений и соответствующих им аргументов, блок динамического программирования, дополнительный компаратор, блок принятия решений, блок определения позиций значимых интервалов спектра, блок определения смещенных позиций значимых интервалов спектра, второй дополнительный блок нормирования спектра Фурье, блок обратного преобразования Фурье, блок памяти, блок сглаживания фазы и блок воспроизведения, при этом входы интеграторов соединены с соответствующими выходами блока нормирования спектра Фурье и генератора базисных сигналов, а выходы параллельно подключены к входам компаратора и сумматора, выход формирователя параметров разбиения диапазона частот соединен с одним из входов блока опред 1. A speaker’s voice distortion system, comprising a basic signal generator, a driver for splitting the frequency range, integrators, a threshold driver, a power supply connected to the power inputs of the component blocks of the system, a driver for position offset parameters of significant spectrum intervals, a digital recording device, a block connected in series discretization, a discrete Fourier transform unit and a Fourier spectrum normalization unit, connected in series to a spectral envelope formation unit Fourier and the first additional block for normalizing the Fourier spectrum, a series-connected block for generating parameters for changing the Fourier spectrum envelope and an additional block for generating the envelope for the Fourier spectrum, series-connected comparator, adder, additional adder, block for determining the maxima and their corresponding arguments, the recorder of maximum values and their corresponding arguments, dynamic programming block, additional comparator, decision block, position determination block s the initial intervals of the spectrum, the unit for determining the displaced positions of significant intervals of the spectrum, the second additional unit for normalizing the Fourier spectrum, the inverse Fourier transform unit, the memory unit, the phase smoothing unit and the playback unit, while the inputs of the integrators are connected to the corresponding outputs of the normalizing unit of the Fourier spectrum and the basis signal generator and the outputs are connected in parallel to the inputs of the comparator and adder, the output of the parameterizer for splitting the frequency range is connected to one of the inputs of the block

Description

Полезная модель относится к технике противодействия опознаванию личности по голосу и предназначена для использования в охранных системах, включающих в себя средства формирования спектральных характеристик голоса для его искажения. Кроме того, полезная модель может быть применена в мобильных и стационарных телефонах, а также в средствах трансляции информации по каналам связи.The utility model relates to techniques for countering voice recognition and is intended for use in security systems, which include means of forming the spectral characteristics of the voice to distort it. In addition, the utility model can be applied in mobile and landline telephones, as well as in means of transmitting information through communication channels.

Известна система для выделения частоты основного тона с помощью узкополосного фильтра (Вокодерная телефония. Под ред. А.А.Пирогова. М.; Связь, 1974). Указанным фильтром осуществляется слежение за изменением частоты первой гармоники речевого сигнала. При этом ширина полосы итерационно подстраивается под среднюю частоту основного тона, рассчитываемую на основании выходной функции этого фильтра и передаваемую на фильтр благодаря органу обратной связи. Это предопределяет высокое качество выделения частоты основного тона при условии, что фильтр подстраивается под диктора в течение нескольких минут.A known system for isolating the frequency of the fundamental tone using a narrow-band filter (Vocoder telephony. Edited by A.A. Pirogov. M .; Communication, 1974). The specified filter monitors the change in the frequency of the first harmonic of the speech signal. In this case, the bandwidth is iteratively adjusted to the average frequency of the fundamental tone, calculated on the basis of the output function of this filter and transmitted to the filter due to the feedback organ. This determines the high quality of the allocation of the frequency of the fundamental tone, provided that the filter adjusts to the speaker for several minutes.

Однако известная система является непригодной для выявления частоты основного тона в сообщениях длительностью в несколько секунд, что не позволяет на последующих этапах формировать варианты искажения голоса диктора в соответствии с заданными параметрами.However, the known system is unsuitable for detecting the frequency of the fundamental tone in messages lasting several seconds, which does not allow the subsequent stages to form options for distortion of the voice of the speaker in accordance with the specified parameters.

Известна система для выделения частоты основного тона с предварительной записью речевого сигнала и его последующей обработкой, охватывающая три канала обработки речевого сигнала (М.Е. Hernandez-Diaz Huici and J.V. Lorenzo Ginori Combined algorithm for pitch detection of speech signals // Electronics Letters 5-th January 1995 Vol.31, No. 1, pp.15-16). В первом канале проводится амплитудная селекция по схеме Голда, во втором канале используется аппроксимация первой гармоники основного тона экспоненциальной функцией, а в третьем канале выполняется вычисление корреляционной функции по схеме Медана. В том случае, когда разность между полученными значениями оценок частоты основного тона для различных каналов не превышает заданную величину, считают, что частота основного тона найдена.A known system for isolating the frequency of the fundamental tone with preliminary recording of the speech signal and its subsequent processing, covering three channels of processing the speech signal (M.E. Hernandez-Diaz Huici and JV Lorenzo Ginori Combined algorithm for pitch detection of speech signals // Electronics Letters 5- th January 1995 Vol.31, No. 1, pp. 15-16). In the first channel, amplitude selection is performed according to the Gold scheme, in the second channel, the approximation of the first harmonic of the fundamental tone is used by an exponential function, and in the third channel, the correlation function is calculated according to the Medan scheme. In the case when the difference between the obtained values of the estimates of the frequency of the fundamental tone for different channels does not exceed a predetermined value, it is believed that the frequency of the fundamental tone is found.

Недостаток известной системы состоит в том, что точности каждого из использующихся алгоритмов являются недостаточно высокими, что исключает последующее формирование искажения голоса с заданными параметрами.A disadvantage of the known system is that the accuracy of each of the algorithms used is not high enough, which eliminates the subsequent formation of voice distortion with specified parameters.

Известны различные системы, позволяющие в условиях присутствия шумовой составляющей в акустическом сигнале выделять речевую составляющую сигнала (RU 231830, 27.06.2004; RU 296376, 27.03.2007; RU 2271578, 10.03.2006; RU 2263358, 27.10.2005; RU 2103753, 27.01.1998; RU 2161826, 10.01.2001 и др.).В охранных системах объектов недвижимости и транспортных средств получили распространение системы речевой верификации пользователя, в том числе с анализом текстозависимых параметров и физиологических особенностей личности (RU 95103817 Аl, 20.12.1995; RU 2077999 Сl, 27.04.1997).Various systems are known that make it possible to isolate the speech component of a signal in the presence of a noise component in an acoustic signal (RU 231830, 06.27.2004; RU 296376, 03.27.2007; RU 2271578, 03/10/2006; RU 2263358, 10.27.2005; RU 2103753, 27.01 .1998; RU 2161826, 01/10/2001, etc.). In the security systems of real estate and vehicles, the system of verbal verification of the user, including the analysis of text-dependent parameters and physiological characteristics of the person (RU 95103817 Al, 12.20.1995; RU 2077999 Cl, 04/27/1997).

Однако известные системы не обеспечивают формирование спектрального состава речевой составляющей сигнала в соответствии с заданным алгоритмом искажения голоса.However, the known systems do not provide the formation of the spectral composition of the speech component of the signal in accordance with a given algorithm for voice distortion.

Также известна система для определения параметров линейчатых спектров вокализованных звуков, представленная в RU 2364957, 27.12.2007 г.и содержащая последовательно соединенные цифровое записывающее устройство, блок дискретизации, блок дискретного преобразования Фурье, блок нормирования спектра Фурье, формирователь результирующей матрицы сверток, сумматор, регистратор максимальных значений, формирователь вектора признаков, блок линий задержки, блок формирования мер веса последовательности информативных признаков, блок перебора последовательности компонент информативных признаков и выделения спектров, обладающих линейчатостью и гладкостью динамики частоты основного тона, компаратор, селектор компонентов преобразования Фурье и блок регистрации информативных признаков. Кроме того, в систему входят блок питания, блок памяти, генератор базисных сигналов, формирователь параметров разбиения диапазона частот основного тона голоса, формирователь мер схожести и формирователь порогового уровня.Also known is a system for determining the parameters of the line spectra of voiced sounds, presented in RU 2364957, December 27, 2007 and containing a series-connected digital recording device, a sampling unit, a discrete Fourier transform unit, a Fourier spectrum normalization unit, a resultant convolution matrix generator, an adder, a recorder maximum values, vector of features, block of delay lines, block of forming measures of the weight of a sequence of informative signs, block of enumeration of the sequence nosti component and isolation of informative signs of the spectra having a linear and smooth dynamics of the pitch frequency, the comparator, the selector Fourier transform components, and a registration unit informative signs. In addition, the system includes a power supply unit, a memory unit, a basic signal generator, a shaper of parameters for splitting the frequency range of the fundamental tone of the voice, a shaper of similarity measures, and a threshold level shaper.

Недостатком известной системы является отсутствие возможности искажения голоса диктора в соответствии с заданными параметрами и выбора вариантов изменения спектральных характеристик исходного голоса диктора.A disadvantage of the known system is the inability to distort the voice of the speaker in accordance with the specified parameters and the choice of options for changing the spectral characteristics of the original voice of the speaker.

Наиболее близкой к заявленному техническому решению является система голосовой идентификации (RU 85445, 05.05.2009 г.), содержащая генератор базисных сигналов, формирователь параметров разбиения диапазона частот, интеграторы, блок формирования последовательности векторов признаков, формирователь порогового уровня, блок формирования потенциалов единичных зарядов, блок формирования модуля градиентов, блок памяти идентификационных номеров дикторов, цифровое запоминающее устройство, блок дискретизации, блок дискретного преобразования Фурье и блок нормирования спектра Фурье, компаратор, сумматор, дополнительный сумматор, блок определения максимумов и соответствующих им аргументов, регистратор максимальных значений, блок динамического программирования, дополнительный компаратор, блок принятия решений, переключатель режимов, блок определения условной вероятности, блок умножения, дополнительный блок принятия решений, блок упорядочивания векторов, селектор, блок предварительной кластеризации последовательности векторов признаков, блок определения статистических характеристик кластеров и блок памяти вероятностных характеристик.Closest to the claimed technical solution is a voice identification system (RU 85445, 05/05/2009), containing a base signal generator, a frequency range splitter, integrators, a feature vector sequence generator, a threshold level generator, unit charge potential generation unit, a unit for generating a gradient module, a memory unit for speaker identification numbers, a digital storage device, a sampling unit, a discrete Fourier transform unit, and Fourier spectrum normalization unit, comparator, adder, additional adder, unit for determining the maxima and their corresponding arguments, maximum values recorder, dynamic programming unit, additional comparator, decision making unit, mode switch, conditional probability determination unit, multiplication unit, additional decision making unit , block of ordering vectors, selector, block of preliminary clustering of a sequence of feature vectors, block for determining statistical nature Stick clusters and memory block probability characteristics.

Недостаток известной системы определяется низким значением вероятности неправильного распознавания голоса диктора при разборчивом и естественном звучании речи.A disadvantage of the known system is determined by the low probability of incorrect recognition of the speaker’s voice during legible and natural sounding of speech.

Задачей полезной модели является обеспечение возможности эффективного искажения голоса диктора в условиях сохранения разборчивости и естественности звучания речи.The objective of the utility model is to provide the possibility of effective distortion of the speaker’s voice while preserving the intelligibility and naturalness of the sound of speech.

Технический результат, достигаемый при решении задачи, выражается в снижении вероятности голосовой идентификации личности.The technical result achieved in solving the problem is expressed in reducing the likelihood of voice recognition of a person.

Технический результат обеспечивается системой искажения голоса диктора, содержащей генератор базисных сигналов, формирователь параметров разбиения диапазона частот, интеграторы, формирователь порогового уровня, блок питания, подключенный к питающим входам составных блоков системы, формирователь параметров смещения значимых интервалов спектра, последовательно соединенные цифровое записывающее устройство, блок дискретизации, блок дискретного преобразования Фурье и блок нормирования спектра Фурье, последовательно соединенные блок формирования огибающей спектра Фурье и первый дополнительный блок нормирования спектра Фурье, последовательно соединенные блок формирования параметров изменения огибающей спектра Фурье и дополнительный блок формирования огибающей спектра Фурье, последовательно соединенные компаратор, сумматор, дополнительный сумматор, блок определения максимумов и соответствующих им аргументов, регистратор максимальных значений и соответствующих им аргументов, блок динамического программирования, дополнительный компаратор, блок принятия решений, блок определения позиций значимых интервалов спектра, блок определения смещенных позиций значимых интервалов спектра, второй дополнительный блок нормирования спектра Фурье, блок обратного преобразования Фурье, блок памяти, блок сглаживания фазы и блок воспроизведения, при этомThe technical result is provided by a voice distortion system of a speaker containing a basic signal generator, a driver for splitting the frequency range, integrators, a threshold driver, a power supply connected to the power inputs of the component blocks of the system, a driver for the parameters of the offset of the significant intervals of the spectrum, a digital recording device, a block connected in series discretization, discrete Fourier transform block and the Fourier spectrum normalization block, series-connected block the formation of the envelope of the Fourier spectrum and the first additional unit for normalizing the Fourier spectrum, a series-connected block for generating parameters for changing the envelope of the Fourier spectrum and an additional block for generating the envelope of the Fourier spectrum, series-connected comparator, adder, additional adder, unit for determining the maxima and their corresponding arguments, the recorder of maximum values and corresponding arguments, dynamic programming block, additional comparator, decision block solved i, a unit for determining the positions of significant intervals of the spectrum, a unit for determining the offset positions of significant intervals of the spectrum, a second additional unit for normalizing the Fourier spectrum, an inverse Fourier transform unit, a memory unit, a phase smoothing unit, and a playback unit,

входы интеграторов соединены с соответствующими выходами блока нормирования спектра Фурье и генератора базисных сигналов, а выходы параллельно подключены к входам компаратора и сумматора,the integrator inputs are connected to the corresponding outputs of the normalization block of the Fourier spectrum and the basis signal generator, and the outputs are connected in parallel to the inputs of the comparator and adder,

выход формирователя параметров разбиения диапазона частот соединен с одним из входов блока определения максимумов и соответствующих им аргументов,the output of the generator of parameters for splitting the frequency range is connected to one of the inputs of the unit for determining the maxima and the corresponding arguments,

выход формирователя порогового уровня подключен к задающему входу дополнительного компаратора, выход формирователя параметров смещения позиций значимых интервалов спектра соединен с одним из входов блока определения смещенных позиций значимых интервалов спектра,the output of the threshold level shaper is connected to the input of the additional comparator, the output of the position shifter of the positions of significant intervals of the spectrum is connected to one of the inputs of the block for determining the offset positions of the significant intervals of the spectrum,

один из выходов блока дискретного преобразования спектра Фурье дополнительно параллельно подключен к входу блока формирования огибающей спектра Фурье, одному из входов первого дополнительного блока нормирования спектра Фурье и одному из входов блока определения позиций значимых интервалов спектра,one of the outputs of the discrete Fourier spectrum transform unit is additionally connected in parallel to the input of the Fourier spectrum envelope forming unit, one of the inputs of the first additional Fourier spectrum normalization unit and one of the inputs of the unit for determining the positions of significant spectrum intervals,

а выходы первого дополнительного блока нормирования спектра Фурье и дополнительного блока формирования огибающей спектра Фурье соединены с соответствующими входами второго дополнительного блока нормирования спектра Фурье.and the outputs of the first additional unit for normalizing the Fourier spectrum and the additional unit for generating the envelope of the Fourier spectrum are connected to the corresponding inputs of the second additional unit for normalizing the Fourier spectrum.

Решению поставленной задачи и достижению указанного технического результата способствуют также частные существенные признаки полезной модели.Particular essential features of the utility model also contribute to the solution of the problem and the achievement of the indicated technical result.

Система искажения голоса диктора снабжена блоком автоматического определения параметров исходного голоса, вход которого подключен к входу цифрового записывающего устройства, а выход - к входу блока формирования параметров изменения огибающей спектра Фурье.The speaker’s voice distortion system is equipped with a unit for automatically determining the parameters of the original voice, the input of which is connected to the input of a digital recording device, and the output is connected to the input of the unit for generating parameters for changing the Fourier spectrum envelope.

На фиг.1 представлена структурная схема системы искажения голоса диктора,Figure 1 presents a structural diagram of a system for distorting the voice of the speaker,

на фиг.2 - график вейвлет - функции, используемой для анализа спектра звука,figure 2 is a graph of a wavelet function used to analyze the spectrum of sound,

на фиг.3 - пример свертки спектра Фурье с вейвлет - функциями в выбранном отсчете спектра,figure 3 is an example of convolution of the Fourier spectrum with wavelet functions in the selected sample of the spectrum,

на фиг.4 - граф схемы динамического программирования на последовательности множеств пар максимумов и их аргументов,figure 4 is a graph of a dynamic programming scheme on a sequence of sets of pairs of maxima and their arguments,

на фиг.5 - выделенные значимые участки спектра, подвергаемые преобразованиям,figure 5 - selected significant sections of the spectrum subjected to transformations,

на фиг.6 - аппроксимация сглаженного спектра звука (сплошная линия), полученного по значимым участкам спектра тремя гауссоидами (пунктирные линии),figure 6 is an approximation of a smoothed spectrum of sound (solid line) obtained from the significant parts of the spectrum by three gaussoids (dashed lines),

на фиг.7 - спектр исходного сигнала (пунктирная линия) и спектр сигнала после процедуры смещения линий (сплошная линия).Fig.7 is the spectrum of the original signal (dashed line) and the spectrum of the signal after the procedure for the displacement of the lines (solid line).

Система искажения голоса диктора содержит генератор 1 базисных сигналов, формирователь 2 параметров разбиения диапазона частот, интеграторы 3, 4, 5, формирователь 6 порогового уровня, блок 7 питания, подключенный к питающим входам составных блоков системы, формирователь 8 параметров смещения позиций значимых интервалов спектра, последовательно соединенные цифровое записывающее устройство 9, блок 10 дискретизации, блок 11 дискретного преобразования Фурье и блок 12 нормирования спектра Фурье, последовательно соединенные блок 13 формирования огибающей спектра Фурье и первый дополнительный блок 14 нормирования спектра Фурье, последовательно соединенные блок 15 формирования параметров изменения огибающей спектра Фурье и дополнительный блок 16 формирования огибающей спектра Фурье, последовательно соединенные компаратор 17, сумматор 18, дополнительный сумматор 19, блок 20 определения максимумов и соответствующих им аргументов, регистратор 21 максимальных значений и соответствующих им аргументов, блок 22 динамического программирования, дополнительный компаратор 23, блок 24 принятия решений, блок 25 определения позиций значимых интервалов спектра, блок 26 определения смещенных позиций значимых интервалов спектра, второй дополнительный блок 27 нормирования спектра Фурье, блок 28 обратного преобразования Фурье, блок 29 памяти, блок 30 сглаживания фазы и блок 31 воспроизведения, при этомThe speaker’s voice distortion system contains a generator of 1 basic signals, a shaper of 2 parameters for splitting the frequency range, integrators 3, 4, 5, a shaper of a threshold level 6, a power supply unit 7 connected to the supply inputs of the component blocks of the system, a shaper of 8 parameters for shifting the positions of significant spectrum intervals, a digital recorder 9, a sampling unit 10, a discrete Fourier transform unit 11 and a Fourier spectrum normalization unit 12, connected in series to a of the Fourier spectrum and the first additional unit 14 for normalizing the Fourier spectrum, a series-connected block 15 for generating parameters for changing the envelope of the Fourier spectrum and an additional block 16 for generating the envelope of the Fourier spectrum, sequentially connected to a comparator 17, adder 18, additional adder 19, and block 20 for determining the maxima arguments, recorder 21 maximum values and their corresponding arguments, block 22 dynamic programming, additional comparator 23, block 24 decision niy, block 25 for determining the positions of significant intervals of the spectrum, block 26 for determining the offset positions of significant intervals of the spectrum, the second additional block 27 for normalizing the Fourier spectrum, block 28 for the inverse Fourier transform, block 29 for memory, block 30 for smoothing the phase and block 31 for reproduction

входы интеграторов 3, 4, 5 соединены с выходом блока 12 нормирования спектра Фурье и выходом генератора 1 базисных сигналов, а выходы параллельно подключены к входам компаратора 17 и сумматора 18,the inputs of the integrators 3, 4, 5 are connected to the output of the Fourier spectrum normalization unit 12 and the output of the basis signal generator 1, and the outputs are connected in parallel to the inputs of the comparator 17 and the adder 18,

выход формирователя 2 параметров разбиения диапазона частот соединен с одним из входов блока 20 определения максимумов и соответствующих им аргументов,the output of the shaper 2 of the parameters for splitting the frequency range is connected to one of the inputs of the unit 20 for determining the maxima and the corresponding arguments,

выход формирователя 6 порогового уровня подключен к задающему входу дополнительного компаратора 23, выход формирователя 8 параметров смещения позиций значимых интервалов спектра соединен с одним из входов блока 26 определения смещенных позиций значимых интервалов спектра,the output of the threshold level driver 6 is connected to the input of the additional comparator 23, the output of the 8 parameters for offsetting the positions of the significant intervals of the spectrum is connected to one of the inputs of the block 26 for determining the offset positions of the significant intervals of the spectrum,

один из выходов блока 11 дискретного преобразования Фурье дополнительно параллельно подключен к входу блока 13 формирования огибающей спектра Фурье, одному из входов первого дополнительного блока 14 нормирования спектра Фурье и одному из входов блока 25 определения позиций значимых интервалов спектра,one of the outputs of the block 11 of the discrete Fourier transform is additionally connected in parallel to the input of the block 13 of the formation of the envelope of the Fourier spectrum, one of the inputs of the first additional block 14 of the normalization of the Fourier spectrum and one of the inputs of the block 25 determining the positions of the significant intervals of the spectrum,

а выходы первого дополнительного блока 14 нормирования спектра Фурье и дополнительного блока 16 формирования огибающей спектра Фурье соединены с соответствующими входами второго дополнительного блока 27 нормирования спектра Фурье.and the outputs of the first additional block 14 of the normalization of the Fourier spectrum and the additional block 16 of the formation of the envelope of the Fourier spectrum are connected to the corresponding inputs of the second additional block 27 of the normalization of the Fourier spectrum.

Также система искажения голоса снабжена блоком 32 автоматического определения параметров исходного голоса, вход которого подключен к входу цифрового записывающего устройства 9, а выход - к входу блока 15 формирования параметров изменения огибающей спектра Фурье.Also, the voice distortion system is equipped with a block 32 for automatically determining the parameters of the original voice, the input of which is connected to the input of a digital recording device 9, and the output is to the input of the block 15 for generating parameters for changing the Fourier spectrum envelope.

Система искажения голоса диктора функционирует следующим образом.The speaker voice distortion system operates as follows.

Акустический сигнал поступает на вход цифрового записывающего устройства 9, на выходе которого формируется записанный оцифрованный сигнал. В блоке 10 дискретизации осуществляется его оконное преобразование (дискретизация), при этом не перекрывающиеся интервалы окон имеют длительность не менее 0,032 с и следуют друг за другом со смещением, длительность которого не превышает 0,010 с.Для получения сигнала, соответствующего каждому окну, в блоке 11 происходит вычисление дискретного преобразования Фурье. Блок 12 определяет спектр Фурье и осуществляет его последующее нормирование в соответствии с зависимостьюThe acoustic signal is fed to the input of a digital recording device 9, at the output of which a recorded digitized signal is generated. In block 10 of sampling, its window conversion (sampling) is carried out, while non-overlapping window intervals have a duration of at least 0.032 s and follow each other with an offset, the duration of which does not exceed 0.010 s. To receive a signal corresponding to each window, in block 11 the discrete Fourier transform is calculated. Block 12 determines the Fourier spectrum and carries out its subsequent normalization in accordance with the dependence

где - компоненты нормированного спектра Фурье,Where - components of the normalized Fourier spectrum,

φ_i - компоненты исходного спектра Фурье,φ _i are the components of the initial Fourier spectrum,

j - номер компонента спектра Фурье,j is the number of the component of the Fourier spectrum,

n - количество компонентов спектра Фурье.n is the number of components of the Fourier spectrum.

Генератор 1 базисных сигналов формирует управляющие сигналы, структура которых определяется параметрами вейвлет-функции W={w(ω,jτ)}_Y (фиг.2), имеющей видThe basis signal generator 1 generates control signals, the structure of which is determined by the parameters of the wavelet function W = {w (ω, jτ)} _Y (Fig. 2), which has the form

где τ -шаг анализа частоты основного тона,where τ is the pitch pitch analysis step,

j- номер отсчета вейвлет-функции, j=0,…,Y; j is the reference number of the wavelet function, j = 0, ..., Y;

ω_min - минимальное значение частоты основного тона (~80 Гц,),ω _min - the minimum value of the frequency of the fundamental tone (~ 80 Hz,),

ω_max - максимальное значение частоты основного тона (~450 Гц),ω _max - the maximum value of the frequency of the fundamental tone (~ 450 Hz),

ω - текущее значение частоты,ω is the current value of the frequency,

π=3,14.π = 3.14.

Управляющие базисные сигналы с выхода блока 1 поступают на входы интеграторов 3-5, которые проводят интегрирование нормированного спектра Фурье φ(ω) с параметрическим классом вейвлет-функций W={w(ω,jπ)}_Y. Результатом работы интегратора 3 является вычисление значения интегралов видаThe control basic signals from the output of block 1 are fed to the inputs of integrators 3-5, which integrate the normalized Fourier spectrum φ (ω) with the parametric class of wavelet functions W = {w (ω, jπ)} _Y. The result of the work of integrator 3 is the calculation of the value of integrals of the form

где переменная ξ определяет местоположение максимума вейвлет-функции на спектре.where the variable ξ determines the location of the maximum of the wavelet function on the spectrum.

Результатом работы интегратора 4 является вычисление значения интегралов видаThe result of the work of integrator 4 is the calculation of the value of integrals of the form

а результатом работы интегратора 5 является вычисление значения интегралов видаand the result of the work of integrator 5 is the calculation of the value of integrals of the form

На фиг.3 представлено примерное наложение вейвлет-функции в выбранном отсчете спектра Фурье. Свертка спектра Фурье с вейвлет-функциями всех возможных масштабов производится в каждом отсчете спектра, причем суммы свертки определяются отдельно для каждой полуволны вейвлет-функции.Figure 3 presents an exemplary overlap of the wavelet function in the selected sample of the Fourier spectrum. The convolution of the Fourier spectrum with wavelet functions of all possible scales is performed in each sample of the spectrum, and the sum of the convolution is determined separately for each half-wave of the wavelet function.

С выхода интеграторов 3-5 сигналы поступают на входы сумматора 18 и компаратора 17. В компараторе 17 проводится сравнение значений сигналов, полученных с выходов интеграторов 3-5 для каждого значения положения вейвлет-функции ξ и каждого значения основного тона jτ. На выходе компаратора появляется положительное значение +U, если выполняются условияFrom the output of the integrators 3-5, the signals are fed to the inputs of the adder 18 and the comparator 17. In the comparator 17, the values of the signals received from the outputs of the integrators 3-5 are compared for each value of the position of the wavelet function ξ and each value of the fundamental tone jτ. A positive value + U appears at the output of the comparator if the conditions are met

Если же условия (6) не выполняются, то на выходе компаратора появляется отрицательное значение -U.If conditions (6) are not satisfied, then a negative value of -U appears at the output of the comparator.

Сумматор 18 при появлении на выходе компаратора 17 отрицательного значения -U, формирует на своем выходе ноль. Если же значение на выходе компаратора положительно +U, то он формирует на своем выходе значение g_j(ξ), которое образуется на основе значенийThe adder 18 when a negative value -U appears at the output of the comparator 17, forms a zero at its output. If the value at the output of the comparator is positive + U, then it generates at its output the value g _j (ξ), which is formed on the basis of the values

сигналов, полученных с выходов интеграторов 3-5, т.е.signals received from the outputs of integrators 3-5, i.e.

В дополнительном сумматоре 19 суммируются значения сигналов g_j(ξ) для различных значений смещения h и частоты основного тона jτAn additional adder 19 summarizes the values of the signals g _j (ξ) for various values of the bias h and the frequency of the fundamental tone jτ

где h- значение смещения.where h is the offset value.

В блоке 20 определения максимумов и соответствующих им аргументов определяются значения максимумов для каждого из трех диапазонов частот основного тона In block 20 determining the maximums and the corresponding arguments, the maximum values for each of the three frequency ranges of the fundamental tone are determined

и соответствующих аргументов максимумовand the corresponding arguments for the maxima

Эти значения сохраняются в регистраторе 21 максимальных значений и соответствующих им аргументов. Границы диапазонов частот (от 90 до 180 Гц, от 180 до 360 Гц, от 360 до 450 Гц) заданы формирователем 2 параметров разбиения диапазона частот основного тона голоса, разбитого на три неперекрывающихся диапазона. Верхнюю границу первого диапазона выбирают из условия G,=2G_min, где G_min - известное минимальное значение частоты основного тона мужского голоса, верхнюю границу второго диапазона - из условия G₂=4G_min, а верхнюю границу третьего диапазона принимают равной известному максимальному значению частоты G_max основного тона женского голоса. По мере накопления регистратором 21 максимальных значений и соответствующих им аргументов 4-х значений они передаются в блок 22 динамического программирования, реализующего поиск наиболее вероятной траектории с использованием алгоритма динамического программирования.These values are stored in the logger 21 maximum values and their corresponding arguments. The boundaries of the frequency ranges (from 90 to 180 Hz, from 180 to 360 Hz, from 360 to 450 Hz) are set by the driver of 2 parameters for dividing the frequency range of the fundamental tone of the voice, divided into three non-overlapping ranges. The upper boundary of the first range is selected from the condition G, = 2G _min , where G _min is the known minimum value of the frequency of the fundamental tone of the male voice, the upper boundary of the second range is from the condition G ₂ = 4G _min , and the upper boundary of the third range is taken equal to the known maximum value of the frequency G _{max the} fundamental tone of the female voice. As the registrar accumulates 21 maximum values and the corresponding arguments of 4 values, they are transferred to the dynamic programming unit 22, which implements the search for the most probable trajectory using the dynamic programming algorithm.

Функционирование блока 22 динамического программирования осуществляется на основе введенной меры вероятностей следования пар (значение максимума, значение аргумента максимума) друг за другомThe functioning of the dynamic programming unit 22 is carried out on the basis of the introduced measure of the probabilities of the succession of pairs (maximum value, maximum argument value) one after another

где σ -дисперсия частоты основного тона,where σ is the dispersion of the fundamental frequency,

t-дискретное время,t-discrete time

- компоненты вектора информативных признаков, взятые в дискретные моменты времени t; t=l,2… - components of the vector of informative features taken at discrete instants of time t; t = l, 2 ...

- максимальное значение интеграла (9), взятое в момент времени t, is the maximum value of the integral (9) taken at time t,

τ - шаг анализа частоты основного тона.τ is the pitch pitch analysis step.

Граф связей во времени между последовательностью состояний представлен на фиг.4, при этом каждое ребро графа соответствует вычислению меры схожести этих пар.The graph of relationships in time between the sequence of states is shown in Fig. 4, with each edge of the graph corresponding to the calculation of the measure of similarity of these pairs.

На выходе блока 22 динамического программирования формируется сигнал, соответствующий значению вероятности наиболее вероятной траектории и поступающий на вход дополнительного компаратора 23 для сравнения с значением порогового уровня, которое задается формирователем 6 порогового уровня. Если значение сигнала с блока 22 динамического программирования превышает значение установленного порога, то блоком 24 принятия решений принимается решение о том, что на интервале времени, на котором получена последовательность из четырех спектров, присутствует основной тон и соответственно вокализированный участок речи. По первому найденному интервалу устанавливается граница начала речи, а по последнему интервалу, на котором вероятность траектории не преодолела порог, устанавливается окончание речи. Управляющий сигнал с выхода блока 24 принятия решений поступает на один из входов блока 25 определения позиций значимых интервалов спектра, на другой вход которого поступает сигнал с выхода блока 11 дискретного преобразования Фурье. На основе значений гармоник частот основного тона и их полос спектра, им соответствующих, в блоке 25 определения позиций значимых интервалов спектра выделяются позиции значимых подвергаемых преобразованиям интервалов спектра (фиг.5, масштаб амплитуды - линейный), информационный сигнал о которых поступает на один из входов блока 26 определения смещенных позиций значимых интервалов спектра. При этом в формирователе 8 параметров смещения позиций значимых интервалов спектра формируются величины смещений позиции значимых интервалов линейчатого спектра, поступающие на другой вход блока 26 определения смещенных позиций значимых интервалов спектра, в результате чего определяются новые позиции значимых интервалов линейчатого спектра. На основе спектра, полученного в блоке 11 дискретного преобразования Фурье, в блоке 13 формирования огибающей спектра Фурье формируется огибающая спектра Е(со)и проводится ее разложение на взвешенную сумму 3-х несимметричных гауссоидAt the output of the dynamic programming unit 22, a signal is generated corresponding to the probability value of the most probable trajectory and fed to the input of an additional comparator 23 for comparison with the threshold level value, which is set by the threshold level generator 6. If the value of the signal from the dynamic programming unit 22 exceeds the value of the set threshold, then the decision unit 24 makes a decision that, in the time interval over which a sequence of four spectra is received, the main tone and accordingly voiced speech section are present. The boundary of the beginning of speech is established by the first found interval, and the end of speech is established by the last interval, at which the probability of the trajectory has not passed the threshold. The control signal from the output of decision block 24 is fed to one of the inputs of block 25 for determining the positions of significant intervals of the spectrum, the other input of which receives a signal from the output of block 11 of the discrete Fourier transform. Based on the values of the harmonics of the frequencies of the fundamental tone and their spectrum bands corresponding to them, in block 25 for determining the positions of the significant intervals of the spectrum, the positions of the significant transformations of the spectrum intervals (Fig. 5, the amplitude scale is linear) are allocated, the information signal of which is fed to one of the inputs block 26 determining the offset positions of significant intervals of the spectrum. At the same time, in the shaper 8 of the parameters for shifting the positions of the significant intervals of the spectrum, the values of the displacements of the positions of the significant intervals of the line spectrum are generated, which are transmitted to the other input of the block 26 for determining the offset positions of the significant intervals of the spectrum, as a result of which new positions of the significant intervals of the linear spectrum are determined. Based on the spectrum obtained in the block 11 of the discrete Fourier transform, in the block 13 of the formation of the envelope of the Fourier spectrum, the envelope of the spectrum E (co) is formed and it is decomposed into a weighted sum of 3 asymmetric gaussoids

где a_i- амплитуда i-ой гауссоиды,where a _i is the amplitude of the i-th Gaussoid,

θ - параметры гауссоидыθ - parameters of the gaussoid

ехр=2,72.exp = 2.72.

В приведенных зависимостях указаны положение i-ой гауссоиды и значения среднеквадратических отклонений слева и справа, соответственно. В данном случае реализуется предположение о том, что звуки речи могут быть описаны с помощью трех формант, и параметр положения гауссоиды, по сути, определяет частоту форманты, а среднее правой и левой дисперсий гауссоиды определяет ширину формантной области. На фиг.6 масштаб изменения амплитуды принят линейным. В первом дополнительном блоке 14 нормирования спектра Фурье проводится нормирование спектра, полученного от блока 11 дискретного преобразования Фурье по огибающей,In the given dependences, the position of the i-th Gaussoid and the values of standard deviations on the left and on the right, respectively, are indicated. In this case, the assumption is realized that speech sounds can be described using three formants, and the position parameter of the Gaussoid, in fact, determines the frequency of the formant, and the average of the right and left Gaussian dispersions determines the width of the formant region. In Fig.6, the scale of the amplitude change is linear. In the first additional block 14 of the normalization of the Fourier spectrum is the normalization of the spectrum obtained from the block 11 of the discrete Fourier transform along the envelope,

определяемой зависимостью (12) и полученной на выходе блока 13 формирования огибающей спектра Фурье.determined by dependence (12) and obtained at the output of block 13 of the formation of the envelope of the Fourier spectrum.

В блоке 15 формирования параметров изменения огибающей спектра Фурье формируются величины изменений амплитуд и частот формант , в соответствии с которыми в дополнительном блоке 16 формирования огибающей спектра Фурье формируется сигнал, соответствующий новой огибающейIn block 15 of the formation of parameters for changing the envelope of the Fourier spectrum, the values of changes in the amplitudes and frequencies of the formants are formed according to which, in the additional block 16 of the formation of the envelope of the Fourier spectrum, a signal is generated corresponding to the new envelope

и поступающий на один из входов второго дополнительного блока 27 нормирования спектра Фурье, осуществляющего перенормирование реальной и мнимых частей преобразования Фурье. С выхода блока 27 сигнал поступает на блок 28 обратного преобразования Фурье, на выходе которого формируется окно искаженного сигнала, помещаемое далее в блок 29 памяти.and arriving at one of the inputs of the second additional unit 27 for normalizing the Fourier spectrum, which renormalizes the real and imaginary parts of the Fourier transform. From the output of block 27, the signal is supplied to the inverse Fourier transform block 28, at the output of which a distorted signal window is formed, which is placed further in the memory block 29.

После накопления в блоке 29 памяти двух окон искаженного сигнала они поступают в блок 30 сглаживания фазы, который предотвращает появление щелчков при наложении сигналов с различными фазами. Результирующий сигнал поступает на блок 31 воспроизведения, в качестве которого могут быть использованы, например, усилитель и акустические колонки.After the accumulation in the memory unit 29 of the two windows of the distorted signal, they enter the phase smoothing unit 30, which prevents clicks when applying signals with different phases. The resulting signal is supplied to the playback unit 31, for which, for example, an amplifier and speakers can be used.

При использовании в составе системы блока 32 автоматического определения параметров исходного голоса акустический сигнал поступает, как на вход цифрового записывающего устройства 9, так и на вход блока 32, на выходе которого формируется информационный сигнал, соответствующий параметрам исходного голоса диктора и поступающий на вход блока 15 формирования параметров изменения огибающей спектра Фурье. В этом случае в блоке 15 формируются смещения амплитуд и частот формант , равныеWhen an automatic determination of the parameters of the initial voice is used as part of the system 32, the acoustic signal is supplied both to the input of the digital recording device 9 and to the input of the block 32, at the output of which an information signal is generated corresponding to the parameters of the original voice of the speaker and fed to the input of the forming unit 15 parameters for changing the envelope of the Fourier spectrum. In this case, in block 15, displacements of the amplitudes and frequencies of the formants are formed equal

где ω1, ω2, ω3, a1, a2, a3 - частоты и амплитуды соответствующих формант исходного голоса,where ω1, ω2, ω3, a 1, a 2, a 3 are the frequencies and amplitudes of the corresponding formants of the original voice,

p1, p2, p3 - равномерно распределенные случайные величины в интервале [- 0.15, 0.15],p1, p2, p3 - uniformly distributed random variables in the interval [- 0.15, 0.15],

d1, d2, d3 - равномерно распределенные случайные величины в интервале [0, 0.15],d1, d2, d3 - uniformly distributed random variables in the interval [0, 0.15],

которые генерируются в блоке 32.which are generated in block 32.

На фиг.7. в логарифмическом масштабе представлены полученные экспериментально спектры исходного сигнала и сигнала после процедуры смещения линий. Питание системы осуществляется от блока 7 питания, подключенного к питающим входам составных блоков системы.7. on a logarithmic scale, the experimentally obtained spectra of the original signal and the signal after the line shift procedure are presented. The system is powered from the power unit 7, connected to the power inputs of the composite units of the system.

Система искажения голоса диктора может применяться для обеспечения возможности искажения голоса диктора в соответствии с заданными параметрами, прослушивания искаженного голоса диктора, выбора конечного варианта искажения, а также для передачи в режиме реального времени искаженного голоса диктора на выход аудиоустройства с возможностью дальнейшей трансляции по каналам связи.The speaker’s voice distortion system can be used to provide the speaker’s voice distortion in accordance with the specified parameters, listen to the speaker’s distorted voice, select the final distortion option, as well as to transmit the distorted speaker’s voice in real time to the output of the audio device with the possibility of further broadcasting through communication channels.

Использование системы искажения голоса диктора позволяет снизить значение вероятности распознавания голоса за счет использования различных вариантов трансформации спектральных характеристик исходного голоса диктора.Using the speaker’s voice distortion system allows reducing the probability of voice recognition through the use of various options for transforming the spectral characteristics of the original speaker’s voice.

Claims

1. A speaker’s voice distortion system, comprising a basic signal generator, a driver for splitting the frequency range, integrators, a threshold driver, a power supply connected to the power inputs of the component blocks of the system, a driver for position offset parameters of significant spectrum intervals, a digital recording device, a block connected in series discretization, a discrete Fourier transform unit and a Fourier spectrum normalization unit, connected in series to a spectral envelope formation unit Fourier and the first additional block for normalizing the Fourier spectrum, a series-connected block for generating parameters for changing the Fourier spectrum envelope and an additional block for generating the envelope for the Fourier spectrum, series-connected comparator, adder, additional adder, block for determining the maxima and their corresponding arguments, the recorder of maximum values and their corresponding arguments, dynamic programming block, additional comparator, decision block, position determination block s the initial intervals of the spectrum, the unit for determining the displaced positions of significant intervals of the spectrum, the second additional unit for normalizing the Fourier spectrum, the inverse Fourier transform unit, the memory unit, the phase smoothing unit and the playback unit, while the inputs of the integrators are connected to the corresponding outputs of the normalizing unit of the Fourier spectrum and the basis signal generator and the outputs are connected in parallel to the inputs of the comparator and adder, the output of the parameterizer for splitting the frequency range is connected to one of the inputs of the block maxima and the corresponding arguments, the output of the threshold level generator is connected to the input of the additional comparator, the output of the parameter for shifting the positions of significant intervals of the spectrum is connected to one of the inputs of the block for determining the offset positions of significant intervals of the spectrum, one of the outputs of the block for discrete Fourier spectrum conversion is additionally connected in parallel to the input of the Fourier spectrum envelope formation unit, one of the inputs of the first additional spectrum normalization unit Fourier and one of the inputs of block position detection significant intervals of the spectrum, and outputs the first additive unit normalization Fourier spectrum and the complementary box forming Fourier spectrum envelope are connected to respective inputs of the second additional unit normalization Fourier spectrum.

2. The voice distortion system of the speaker according to claim 1, which is equipped with a unit for automatically determining the parameters of the original voice, the input of which is connected to the input of a digital recording device, and the output is to the input of the unit for generating parameters for changing the Fourier spectrum envelope.