RU2748935C1

RU2748935C1 - Method of recognition of new low bit rate coding protocols

Info

Publication number: RU2748935C1
Application number: RU2020129303A
Authority: RU
Inventors: Виктор Алексеевич Аладинский; Игорь Леонидович Гатилов; Сергей Владиславович Кузьминский; Павел Леонидович Смирнов; Дмитрий Николаевич Чубатый
Original assignee: Общество с ограниченной ответственностью "Специальный Технологический Центр"
Priority date: 2020-09-03
Filing date: 2020-09-03
Publication date: 2021-06-01

Abstract

FIELD: information technology.

SUBSTANCE: invention relates to the field of information technology, namely to the field of digital communication. A recognition method (NSCR) implemented in vocoders is proposed, a binary digital information stream y is received during the time interval ΔT, a normalized autocorrelation function a is formed on the basis of y, a decision is made on the presence of a block structure in the digital information stream, used to describe the input implementation image in the form of a square matrix M of mean values and the corresponding covariance matrices C. Based on the values of _Vjl, a divergence vector _Vj is formed for each j-th reference image. J vectors _Vj form a matrix of divergence values V of dimension J×L, L is the number of bits between the extrema of the autocorrelation function. A decision is made to discover a new previously unknown NSCR protocol and the number J + 1 is assigned to a new reference description.

EFFECT: technical result consists in reducing the probability of a false alarm and, as a consequence, increasing the recognition reliability (the probability of correct recognition) of new protocols (NSCR).

1 cl, 5 dwg

Description

Изобретение относится к области цифровой связи, а именно, к обработке цифровых потоков канального уровня эталонной модели взаимодействия открытых систем и может быть использовано в широком классе средств обработки цифровых сигналов для распознавания новых протоколов низкоскоростного кодирования речи (НСКР), используемых при передаче речевых сообщений по цифровым каналам связи.The invention relates to the field of digital communication, namely, to the processing of digital streams of the channel level of the reference model of the interaction of open systems and can be used in a wide class of digital signal processing tools for recognizing new low-speed speech coding (LSCR) protocols used in the transmission of speech messages over digital communication channels.

Заявленное техническое решение расширяет арсенал средств аналогичного назначения за счет возможности распознавания новых протоколов НСКР с меньшей вероятностью ложной тревоги.The claimed technical solution expands the arsenal of tools for a similar purpose due to the ability to recognize new NSCR protocols with a lesser likelihood of false alarms.

Известен способ распознавания протоколов НСКР (см. Патент РФ № RU 2610285 С1, МПК G10L 19/008 (2013.01), Н03М 13/03 (2006.01). Способ распознавания протоколов низкоскоростного кодирования. Аладинский В.А., Кузьминский С.В., Смирнов П.Л., Чубатый Д.Н., опубл. 08.02.2017), в котором на основе сравнения эталонных описаний, представленных наборами усредненных значений коэффициента избыточности (КИ), и исследуемого образа, представленного набором значений КИ столбцов прямоугольной матрицы, сформированной из входного цифрового информационного потока, определяют величины отклонения между измеренным набором значений КИ и наборами усредненных значений КИ, в полученной совокупности значений отклонения выбирают минимальное, сравнивают его с пороговой величиной, по результатам сравнения принимают решение о том, что исследуемый цифровой поток сформирован на основе одного из известных протоколов НСКР, в противном случае полагают, что для формирования анализируемого цифрового потока был применен новый (неизвестный ранее) протокол НСКР. Недостатком способа является низкая помехоустойчивость распознавания, что приводит к появлению большого числа ложных решений о применении на линиях радиосвязи новых протоколов НСКР при вероятности битовой ошибки Р_ош>0,03.A known method for recognizing NSCR protocols (see RF Patent No. RU 2610285 C1, IPC G10L 19/008 (2013.01), H03M 13/03 (2006.01). A method for recognizing low-speed coding protocols. Aladinsky VA, Kuzminsky SV, Smirnov P.L., Chubaty D.N., publ. 02/08/2017), in which, based on a comparison of the reference descriptions represented by sets of averaged values of the redundancy factor (RI), and the investigated image, represented by a set of RR values of the columns of a rectangular matrix formed from the input digital information stream, the values of the deviation between the measured set of CI values and the sets of averaged CI values are determined, in the obtained set of values of the deviation, the minimum is selected, it is compared with the threshold value, based on the comparison results, a decision is made that the investigated digital stream is formed on the basis of one from the known NSCR protocols, otherwise it is believed that for the formation of the analyzed digital stream, a new th (previously unknown) NSCR protocol. The disadvantage of this method is the low noise immunity of recognition, which leads to the appearance of a large number of false decisions about the use of new NSCR protocols on radio communication lines with the probability of a bit error P _osh > 0.03.

Наиболее близким к заявленному является способ (прототип) распознавания протоколов низкоскоростного кодирования (см. Патент РФ № RU 2667462 С1, Способ распознавания протоколов низкоскоростного кодирования, МПК G10L 19/008 (2013.01), Н03М 13/03 (2006.01). Аладинский В.А., Вещунин Е.А., Кузьминский С.В., Смирнов П.Л. опубл. 19.09.2018), заключающийся в том, что принимают бинарный цифровой информационный поток у в течение интервала времени ΔT, формируют на основе у нормированную автокорреляционную функцию а, принимают решение о наличии блочной структуры в цифровом информационном потоке у по регулярным с равными интервалами Δτ экстремумам автокорреляционной функции а, делят цифровой информационный поток у на информационные блоки объемом N_б бит каждый с интервалами Δτ между экстремумами автокорреляционной функции а, последовательно присваивают информационным блокам порядковые номера k=1, 2, …, K, начиная с первого информационного блока, формируют прямоугольную информационную матрицу Y размеров K×L, L=N_б, строками которой являются последовательно размещенные друг под другом информационные блоки в соответствии с их порядковыми номерами k=1, 2, …, K, из матрицы Y выделяют столбцы y_z, z=1, 2, …, L; вычисляют по каждому столбцу y_z значение математического ожидания (МО) m_z, на основе полученных значений m_z формируют вектор значений МО m₀=(m₁, m₂, …, m_z, …, m_L) путем последовательного их размещения в соответствии с порядковыми номерами z, на основе полученного вектора m₀ формируют квадратную матрицу М значений МО размера L, строки m_l, l=0, 1, 2, …, L-1, которой содержат значения вектора m₀, последовательно циркулярно сдвинутые влево на величину l; формируют прямоугольные эталонные информационные матрицы на основе которыхClosest to the claimed is a method (prototype) for recognizing low-speed coding protocols (see RF Patent No. RU 2667462 C1, Method for recognizing low-speed coding protocols, IPC G10L 19/008 (2013.01), H03M 13/03 (2006.01). Aladinsky V.A. ., Veshchunin E.A., Kuz'minsky S.V., Smirnov P.L.publ. 19.09.2018), which consists in the fact that they receive a binary digital information stream y during the time interval ΔT, form a normalized autocorrelation function based on y a, make a decision on the presence of a block structure in the digital information stream y according to the extrema of the autocorrelation function a regular with equal intervals Δτ, divide the digital information stream y into information blocks of N _b bits each with intervals Δτ between the extrema of the autocorrelation function a, sequentially assign the information blocks serial numbers k = 1, 2, ..., K, starting from the first information block, form a rectangular information matrix Y of size erov K × L, L = N _b , the rows of which are information blocks sequentially placed under each other in accordance with their ordinal numbers k = 1, 2, ..., K, columns y _z , z = 1, 2, are selected from the matrix Y, ..., L; the value of the mathematical expectation (MO) m _{z is} calculated for each column y _z , based on the obtained values of m _z , a vector of values of MO m ₀ = (m ₁ , m ₂ , ..., m _z , ..., m _L ) is formed by sequentially placing them in in accordance with the ordinal numbers z, on the basis of the obtained vector m ₀ form a square matrix M of MO values of size L, rows m _l , l = 0, 1, 2, ..., L-1, which contains the values of the vector m ₀ , sequentially circularly shifted to the left by the value l; form rectangular reference information matrices on the basis of which

вычисляют векторы эталонных значений МО m_{j эт}, j=1, 2, …, J, и квадратные эталонные ковариационные матрицы {C_{j эт}}J, соответствующие J известным протоколам НСКР; вычисляют значения вероятности правильного распознавания P_jl j-то протокола НСКР, j=1, 2, …, J, по каждой l-й строке m_l матрицы М, принимают решение в пользу того j-го протокола НСКР, для которого обеспечивается максимальное значение P_jl.calculate the vectors of reference values MO m _{j et} , j = 1, 2, ..., J, and square reference covariance matrices {C _{j et} } J corresponding to J known NSCR protocols; the values of the probability of correct recognition P _{jl of the} jth NSCR protocol, j = 1, 2, ..., J, _{are calculated for each l-th row m l of the} matrix M, a decision is made in favor of the j-th NSCR protocol for which the maximum value is provided P _jl .

Способ-прототип обладает высокой точностью распознавания протоколов НСКР в условиях высокого уровня битовых ошибок в исследуемом цифровом информационном потоке у. Однако, недостатком прототипа является появление при вычислении вероятности правильного распознавания неопределенности вида P_jl=0/0 для всех j, l. Последняя возникает в случае подачи на вход цифровых информационных потоков, в том числе сформированных на основе новых протоколов НСКР, за исключением реализаций, для которых имеются эталонные описания. В результате имеет место увеличение вероятности ложной тревоги при распознавании новых протоколов НСКР в условиях уменьшения объема К входного цифрового информационного потока у, особенно при величине К<100. Следствием увеличения количества J эталонных описаний, формируемых за счет неправильно распознанных входных реализаций при обучении системы распознавания протоколов НСКР, является снижение вероятности правильного распознавания.The prototype method has a high accuracy of recognition of the NSCR protocols in conditions of a high level of bit errors in the investigated digital information stream y. However, the disadvantage of the prototype is the appearance when calculating the probability of correct recognition of the uncertainty of the form P _jl = 0/0 for all j, l. The latter arises when digital information streams are fed to the input, including those formed on the basis of new NSCR protocols, with the exception of implementations for which there are reference descriptions. As a result, there is an increase in the probability of a false alarm when recognizing new NSCR protocols in conditions of a decrease in the volume K of the input digital information stream y, especially when K <100. The consequence of an increase in the number J of reference descriptions formed due to incorrectly recognized input realizations during training of the NSCR protocol recognition system is a decrease in the probability of correct recognition.

Целью заявленного технического решения является разработка способа, обеспечивающего снижение вероятности ложной тревоги и, как следствие, повышение достоверности распознавания (вероятности правильного распознавания) новых протоколов НСКР.The aim of the claimed technical solution is to develop a method that reduces the probability of false alarms and, as a consequence, increases the reliability of recognition (the probability of correct recognition) of new NSCR protocols.

Поставленная цель достигается тем, что в известном способе распознавания протоколов НСКР, включающем прием бинарного цифрового информационного потока у в течение интервала времени ΔТ, формирование на основе у нормированной автокорреляционной функции а, принятие решения о наличии блочной структуры в информационном потоке у по регулярным с равными интервалами Δτ экстремумам автокорреляционной функции а, деление цифрового информационного потока у на информационные блоки объемом N_б бит каждый по интервалам между экстремумами автокорреляционной функции а, последовательное присвоение информационным блокам порядковых номеров k=1, 2, …, K, начиная с первого информационного блока, формирование прямоугольной информационной матрицы Y размеров K×L, L=N_б, строками которой являются последовательно размещенные друг под другом информационные блоки в соответствии с их порядковыми номерами k=1, 2, …, K, выделение из матрицы Y столбцов y_z, z=1, 2, …, L, определение значений МО m_z по каждому столбцу y_z, формирование вектора значений МО m₀=(m₁, m₂, …, m_z, …, m_L) последовательным размещением значений МО m_z, формирование квадратной матрицы значений МО М размером L×L, строки m_l, l=0, 1, 2, …, L-1, которой содержат значения вектора то, последовательно циркулярно сдвинутые влево на величину l, формирование прямоугольных эталонных информационных матриц {Y_{j эт}}_J, вычисление на основе эталонных информационных матриц {Y_{j эт}}_J векторов эталонных значений МО m_{j эт}, у=1, 2,…, J и квадратных эталонных ковариационных матриц {C_{j эт}}_J, вычисление значения вероятности правильного распознавания P_jl для каждого j-го протокола НСКР, j=1, 2, …, J, по каждой 1-й строке m_l матрицы М, принятие решения в пользу j-го протокола НСКР, для которого обеспечивается максимальное значение P_jl, при получении результата вычисления вероятности правильного распознавания в виде неопределенности P_jl=0/0 для всех j, l, формируют совокупность прямоугольных матриц {Y_l}_L, столбцы которых сдвинуты циркулярно на l=0, 1, 2, …, L-1 относительно столбцов прямоугольной информационной матрицы Y. Для каждой матрицы Y_l вычисляют соответствующую ей квадратную ковариационную матрицу C_l.This goal is achieved by the fact that in the known method for recognizing NSCR protocols, which includes receiving a binary digital information stream y during the time interval ΔT, forming a normalized autocorrelation function a on the basis of y, making a decision on the presence of a block structure in the information stream y at regular intervals Δτ to the extrema of the autocorrelation function a, dividing the digital information flow y into information blocks of N _b bits each according to the intervals between the extrema of the autocorrelation function a, sequentially assigning sequence numbers k = 1, 2, ..., K to the information blocks, starting from the first information block, forming a rectangular information matrix Y of sizes K × L, L = N _b , the rows of which are information blocks sequentially placed under each other in accordance with their ordinal numbers k = 1, 2, ..., K, extraction of columns y _z , z = from the matrix Y 1, 2, ..., L, determination of values of MO m _z for each column y _z , formation of a vector of values of MO m ₀ = (m ₁ , m ₂ , ..., m _z , ..., m _L ) by sequential placement of values of MO m _z , formation of a square matrix of values of MO M size L × L, rows m _l , l = 0, 1, 2, ..., L-1, which contains the values of the vector then, sequentially circularly shifted to the left by the value l, the formation of rectangular reference information matrices {Y _{j et} } _J , calculation based on the reference information matrices {Y _{j et} } _J vectors of reference values of MO m _{j et} , y = 1, 2, ..., J and square reference covariance matrices {C _{j et} } _J , calculating the value of the probability of correct recognition P _jl for each j-th NSCR protocol, j = 1, 2 , ..., J, for each 1st row m _{l of the} matrix M, making a decision in favor of the jth NSCR protocol, for which the maximum value of P _jl is provided, upon obtaining the result of calculating the probability of correct recognition in the form of uncertainty P _jl = 0/0 for all j, l, form a set of rectangular matrices {Y _l } _L , the columns of which rykh are shifted circularly by l = 0, 1, 2, ..., L-1 relative to the columns of the rectangular information matrix Y. For each matrix Y _{l, the} corresponding square covariance matrix C _{l is} calculated.

Определяют значение дивергенции v_jl между образом входной реализации, представленной строками m_l квадратной матрицы М значений МО и соответствующими ковариационными матрицами C_l, и каждым j-м эталонным образом, представленным вектором m_{j эт} и соответствующей квадратной эталонной ковариационной матрицей С_{j эт.}на основе полученных значений v_jl формируют вектор значений дивергенции v_j=(v_j0, v_j1, …, v_jl …, v_j(L-1)) для каждого j-го эталонного образа. Составляют из J векторов v_j матрицу значений дивергенции V размеров J×L.Determine the value of the divergence v _jl between the image of the input implementation, represented by rows m _{l of the} square matrix of M values of MO and the corresponding covariance matrices C _l , and each j-th reference image represented by the vector m _{j et} and the corresponding square reference covariance matrix C _{j et.} on the basis of the obtained values of v _jl , a vector of divergence values v _j = (v _j0 , v _j1 , ..., v _jl ..., v _{j (L-1)} ) is formed for each j-th reference image. A matrix of values of divergence V of sizes J × L is composed of J vectors v _j.

Принимают решение о обнаружении нового ранее неизвестного протокола НСКР и присваивают условный номер J=J+1 новому эталонному описанию, которое включает вектор математического ожидания m(J+1)_эт=m₀ и квадратную эталонную ковариационную матрицу С(J+1)_эт=С₀, если для всех элементов v_jl матрицы значений дивергенции V выполняется условие v_jl>v_пop=2000 при любых j, l, где v_пop - пороговая величина, или только для одного элемента v_jl<300, а все остальные v_jl>v_пор, принимают решение, что входная реализация у сформирована по j-му известному протоколу НСКР и в информационном потоке у отсутствуют начальные l бит в первом k=1 информационном блоке. В случае 300≤v_jl≤2000 принимают решение о том, что входная реализация у не распознана.A decision is made to detect a new previously unknown NSCR protocol and a conditional number J = J + 1 is assigned to a new reference description, which includes the mathematical expectation vector m (J + 1) _fl = m ₀ and a square reference covariance matrix C (J + 1) _fl = С ₀ , if for all elements v _{jl of the} matrix of divergence values V the condition v _jl > v _pop = 2000 is satisfied for any j, l, where v _pop is the threshold value, or only for one element v _jl <300, and all the rest v _jl > v _then , it is decided that the input implementation y is formed according to the j-th known NSCR protocol and the information stream y has no initial l bits in the first k = 1 information block. In the case of 300≤v _jl ≤2000, a decision is made that the input implementation y is not recognized.

Данному решению соответствует, например, ситуация, когда цифровой информационный поток у содержит менее 100 информационных блоков (т.е. K<100) или количество одиночных битовых ошибок составляет в нем более 10%.This solution corresponds, for example, to the situation when the digital information stream y contains less than 100 information blocks (i.e., K <100) or the number of single bit errors in it is more than 10%.

Благодаря новой совокупности существенных признаков в заявленном способе достигается повышение достоверности распознавания (вероятности правильного распознавания) новых протоколов НСКР за счет более полного описания образа входной реализации, представленного строками m_l квадратной матрицы М значений МО и соответствующими ковариационными матрицами C_l, а также определения дивергенции v_jl между этим образом и эталонными образами, представленных векторами m_{j эт} и соответствующими эталонными ковариационными матрицами C_{j эт.}.Thanks to the new set of essential features in the claimed method, an increase in the recognition reliability (the probability of correct recognition) of new NSCR protocols is achieved due to a more complete description of the input implementation image represented by rows m _{l of the} square matrix M of MO values and the corresponding covariance matrices C _l , as well as determining the divergence v _jl between this image and the reference images represented by vectors m _{j et} and the corresponding reference covariance matrices C _{j et.} ...

Заявленный способ поясняется чертежами, на которых показаны:The claimed method is illustrated by drawings, which show:

на фиг. 1 - зависимость вероятности ложной тревоги Р_лт от количества блоков K, содержащихся в цифровом информационном потоке у, при распознавании протоколов НСКР;in fig. 1 - the dependence of the probability of false alarm P _lt on the number of blocks K contained in the digital information stream y, when recognizing the NSCR protocols;

на фиг. 2 - порядок формирования квадратной матрицы значений МО М размеров Z=L;in fig. 2 - the order of formation of a square matrix of values of MO M dimensions Z = L;

на фиг. 3 - алгоритм распознавания новых протоколов НСКР;in fig. 3 - an algorithm for recognizing new NSCR protocols;

на фиг. 4 - зависимости достоверности D_p принимаемого решения от количества блоков K, содержащихся в цифровом информационном потоке у при распознавании протоколов НСКР;in fig. 4 - dependences of the reliability D _{p of the} decision taken on the number of blocks K contained in the digital information stream y when recognizing the NSCR protocols;

на фиг. 5 - зависимость вероятности ложной тревоги Р_лт от вероятности битовой ошибки Р_ош в цифровом информационном потоке у, при распознавании протоколов НСКР.in fig. 5 - dependence of the probability of false alarm P _lt on the probability of a bit error P _osh in the digital information stream y, when recognizing the NSCR protocols.

Развитие цифровой телефонной радиосвязи ведется по многим направлениям, одним из которых является разработка аппаратно-программных средств НСКР, называемых вокодерами. Например, для применения на линиях радиосвязи диапазона высоких частот (ВЧ) к настоящему моменту разработаны не менее 20-ти различных типов вокодеров и соответствующих им протоколов НСКР (см. Аладинский В.А, Кузьминский С.В. Анализ цифровых потоков на выходах вокодеров, принимаемых на зарубежных линиях радиосвязи диапазона высоких волн // Успехи современной радиоэлектроники. - М.: Радиотехника, №7, 2015 г. - С. 73-76). Известные протоколы НСКР могут быть открытыми или закрытыми. Информация о закрытых протоколах является конфиденциальной. В технике радиосвязи диапазона ВЧ применяются открытые протоколы LPC-10-2400 (STANAG 4198) и MELPe-2400 (STANAG 4591), опубликованные в соответствующих стандартах НАТО. В тоже время применяется закрытый протокол MELPe-600 (STANAG 4591). Новые (неизвестные, не применявшиеся ранее) протоколы НСКР являются закрытыми.The development of digital telephone radio communication is carried out in many directions, one of which is the development of NSCR hardware and software, called vocoders. For example, to date, at least 20 different types of vocoders and their corresponding NSCR protocols have been developed for use on high-frequency (HF) radio communication lines (see Aladinsky V.A., Kuzminsky S.V. Analysis of digital streams at the outputs of vocoders, accepted on foreign lines of radio communication of the range of high waves // Successes of modern radio electronics. - M .: Radiotekhnika, No. 7, 2015 - P. 73-76). Known NSCR protocols can be open or closed. Information about closed protocols is confidential. In HF radio communication technology, the open protocols LPC-10-2400 (STANAG 4198) and MELPe-2400 (STANAG 4591), published in the relevant NATO standards, are used. At the same time, the closed protocol MELPe-600 (STANAG 4591) is used. New (unknown, not previously applied) NSCR protocols are closed.

Появление новых протоколов НСКР на линиях радиосвязи обусловлено стремлением фирм-производителей аппаратуры радиосвязи к повышению занимаемой доли рынка, коммерческой прибыли и защите своих компетенций, с одной стороны, а также к повышению качества связи за счет внедрения новых технических решений, в том числе модификации ранее известных протоколов, с другой стороны. Реализация новых протоколов НСКР в существующих средствах радиосвязи может быть выполнена на программном уровне или подключением внешнего вокодера с помощью имеющихся интерфейсов. Таким образом, применение новых протоколов НСКР при передаче речевых сообщений с помощью средств радиосвязи не является исключительным событием, что определяет необходимость решения этой технической задачи методами распознавания образов.The emergence of new NSCR protocols on radio communication lines is due to the desire of manufacturers of radio communication equipment to increase their market share, commercial profits and protect their competencies, on the one hand, as well as to improve the quality of communication through the introduction of new technical solutions, including modifications of previously known protocols on the other hand. The implementation of the new NSCR protocols in the existing radio communication facilities can be performed at the software level or by connecting an external vocoder using the existing interfaces. Thus, the use of new NSCR protocols when transmitting voice messages using radio communications is not an exceptional event, which determines the need to solve this technical problem using pattern recognition methods.

Использование способа-прототипа распознавания протоколов НСКР в условиях большого количества ошибок (более 10% от общего числа бит) в исследуемых цифровых потоках, полученных при кодировании речевых сообщений с помощью известных протоколов НСКР, вызывает появление результатов вычисления вероятности правильного распознавания в виде неопределенности P_ji=0/0 для всех j, l. Аналогичные результаты имеют место и при меньшем количестве битовых ошибок, если К<100. Это свидетельствует о недостаточной помехоустойчивости прототипа при распознавании новых протоколов НСКР и недостаточной информативности используемого набора признаков для описания образа входной реализации при малом объеме исследуемого информационного потока у, представленного только вектором МО m₀.The use of the prototype method for recognizing NSCR protocols under conditions of a large number of errors (more than 10% of the total number of bits) in the studied digital streams obtained by encoding speech messages using known NSCR protocols, causes the appearance of the results of calculating the probability of correct recognition in the form of uncertainty P _ji = 0/0 for all j, l. Similar results take place with fewer bit errors if K <100. This indicates insufficient noise immunity of the prototype when recognizing new NSCR protocols and insufficient information content of the used set of features to describe the image of the input implementation with a small volume of the investigated information flow y, represented only by the vector MO m ₀ .

Частным случаем является начало работы системы распознавания (начальный режим), при котором отмечается отсутствие эталонных описаний протоколов НСКР. Возникает необходимость в принятии решения о принадлежности входной реализации у к классу j=1 по результатам анализа параметров автокорреляционной функции а, формировании 1-го эталонного описания и дальнейшему сравнению с полученным эталонным описанием других входных реализаций.A special case is the beginning of the recognition system operation (initial mode), in which the absence of reference descriptions of the NSCR protocols is noted. There is a need to make a decision on whether the input realization y belongs to the class j = 1 based on the results of the analysis of the parameters of the autocorrelation function a, the formation of the 1st reference description and further comparison with the obtained reference description of other input implementations.

В общем случае (включающем режим обучения) факт применения нового протокола НСКР необходимо установить при отсутствии его эталонного описания и произвести обучение системы распознавания на основе одной полученной реализации бинарного цифрового информационного потока у, что автоматически выполняется при решении в пользу нового протокола НСКР.In the general case (including the training mode), the fact of using the new NSCR protocol must be established in the absence of its reference description and the recognition system must be trained on the basis of one obtained implementation of the binary digital information flow y, which is automatically performed when deciding in favor of the new NSCR protocol.

Положительный эффект в предлагаемом способе достигается за счет увеличения размерности данных для описания образа входной реализации, представляемых в виде квадратной матрицы М значений МО и соответствующих ковариационных матриц С_l, использованием более информативной меры различения, в качестве которой выступает дивергенция. Последняя позволяет сравнивать образ входной реализации увеличенной размерности с эталонными описаниями, в следствие чего обеспечивается повышение вероятности правильного распознавания новых протоколов НСКР. Кроме того, уменьшение влияния уровня битовых ошибок во входном цифровом информационном потоке у и значений его объема К на качество распознавания новых протоколов НСКР достигается введением дополнительной градации «Принадлежность реализации не установлена» в критерий принятия решения. В результате образы входных реализаций, имеющие по различным причинам существенные искажения, не признаются новыми, что существенно снижает вероятность ложной тревоги при распознавании новых протоколов НСКР.A positive effect in the proposed method is achieved by increasing the dimension of the data to describe the image of the input implementation, represented in the form of a square matrix M of MO values and the corresponding covariance matrices _C1 , using a more informative measure of discrimination, which is divergence. The latter allows comparing the image of the input implementation of increased dimension with the reference descriptions, as a result of which the probability of correct recognition of new NSCR protocols is increased. In addition, a decrease in the influence of the level of bit errors in the input digital information stream y and the values of its volume K on the recognition quality of new NSCR protocols is achieved by introducing an additional gradation "Implementation ownership is not established" in the decision criterion. As a result, the images of input implementations, which for various reasons have significant distortions, are not recognized as new, which significantly reduces the probability of false alarms when recognizing new NSCR protocols.

Реализация заявленного способа может быть осуществлена следующим образом (см. Фиг. 3). На этапе ввода исходных данных целесообразно определить значения параметров ΔT, N_цп цифрового информационного потока у, длительность интервала анализа у и/или емкость анализируемого потока у соответственно, сформировать совокупность {N_б}, включающую все возможные значения этого параметра, в общем случае - ввести эталонные описания J известных протоколов НСКР, включающие совокупности {m_{j эт}}J, {C_{j эт}}_J, в частном случае - установить параметр J=0.The implementation of the claimed method can be carried out as follows (see Fig. 3). At the stage of inputting the initial data, it is advisable to determine the values of the parameters ΔT, N _{cp of the} digital information stream y, the duration of the analysis interval y and / or the capacity of the analyzed stream y, respectively, to form a set {N _b }, including all possible values of this parameter, in the general case - to enter reference descriptions J of known NSCR protocols, including the sets {m _{j et} } J, {C _{j et} } _J , in a particular case - set the parameter J = 0.

После этого вычисляют нормированную автокорреляционную функцию а, которая характеризует линейную зависимость исходной последовательности у от последовательности у(τ), сдвинутой циркулярно на τ=1, 2, …, N_цп - 1 бит по отношению к у. Нормированная автокорреляционная функция а цифрового потока, содержащего речевое сообщение, имеет явно выраженные локальные максимумы, расположенные с периодичностью Δτ. Наличие локальных максимумов автокорреляционной функции, расположенных с периодом Δτ, объясняется тем, что информационные символы, описывающие параметры цифрового потока с речевым сообщением, расположены внутри информационного блока строго в определенных местах, на интервалах Δτ, кратных длине информационного блока. Параметры речевого сигнала обладают значительной линейной зависимостью, поэтому значение АКФ в этих точках будет наибольшим. Искомой автокорреляционной функцией считают совокупность значений видаAfter that, the normalized autocorrelation function a is calculated, which characterizes the linear dependence of the original sequence y on the sequence y (τ), shifted circularly by τ = 1, 2, ..., N _cp - 1 bit with respect to y. The normalized autocorrelation function a of a digital stream containing a speech message has pronounced local maxima located with a periodicity of Δτ. The presence of local maxima of the autocorrelation function, located with a period of Δτ, is explained by the fact that information symbols describing the parameters of a digital stream with a voice message are located inside the information block strictly in certain places, at intervals Δτ, which are multiples of the length of the information block. The parameters of the speech signal have a significant linear relationship, so the ACF value at these points will be the largest. The sought autocorrelation function is considered to be a set of values of the form

где а(τ)=c(τ)/d(y) - коэффициент корреляции; с(τ) - коэффициент ковариации; d(y) - дисперсия цифрового информационного потока у.where a (τ) = c (τ) / d (y) is the correlation coefficient; с (τ) - covariance coefficient; d (y) is the variance of the digital information stream y.

По регулярным с равными интервалами Δτ=N_б экстремумам нормированной автокорреляционной функции а принимают решение о наличии блочной структуры в информационном потоке у. Отсутствие экстремумов нормированной автокорреляционной функции а с равными интервалами Δτ свидетельствует о том, что цифровой информационный поток у не содержит речевое сообщение, а выполнение его анализа завершается выводом результата о том, что входная реализация у не содержит речевое сообщение, а выполнение его анализа завершается выводом результата о том, что входная реализация у не содержит речевое сообщение.By regular with equal intervals Δτ = N _b extrema of the normalized autocorrelation function a, a decision is made on the presence of a block structure in the information flow y. The absence of extrema of the normalized autocorrelation function a with equal intervals Δτ indicates that the digital information stream y does not contain a speech message, and its analysis ends with the output of a result that the input implementation y does not contain a speech message, and its analysis ends with the output of the result that the input implementation y does not contain a speech message.

При наличии экстремумов нормированной автокорреляционной функции а с равными интервалами Δτ вычисляют количество информационных блоков, содержащихся в цифровом информационном потоке у следующим образом:In the presence of extrema of the normalized autocorrelation function a with equal intervals Δτ, the number of information blocks contained in the digital information stream y is calculated as follows:

где

- оператор округления величины до меньшего целого значения.Where

- operator of rounding of a value to a smaller integer value.

Делят информационный поток у на информационные блоки объемом N_б бит каждый. Последовательно присваивают информационным блокам порядковые номера k=1, 2, …, K, начиная с первого информационного блока. Формируют прямоугольную информационную матрицу Y, строками которой являются последовательно размещенные друг под другом информационные блоки в соответствии с их порядковыми номерами k=1, 2, …, K.Divide the information flow from information blocks on a volume N _b bits each. The information blocks are sequentially assigned sequence numbers k = 1, 2, ..., K, starting from the first information block. A rectangular information matrix Y is formed, the rows of which are information blocks sequentially placed one under the other in accordance with their ordinal numbers k = 1, 2, ..., K.

Из информационной матрицы Y выделяют столбцы y_z. Проверяют условие J≥1 (т.е. наличие хотя бы одного эталонного описания), устанавливающее возможность введения режима обучения. _{Columns y z} are separated from the information matrix Y. The condition J≥1 is checked (i.e., the presence of at least one reference description), which establishes the possibility of introducing the learning mode.

Если условие J≥1 не выполняется (это частный случай - начальный режим), то с помощью известных выражений (см. Аладинский В.А., Кузьминский С.В. Метод формирования признаков распознавания протоколов низкоскоростного кодирования речи / Наукоемкие технологии. - М.: Радиотехника. №12, 2015. - С. 20-25) определяют значения МО m_z по столбцам y_z. Из набора значений МО формируют вектор значений математического ожидания вида m₀=(m₁, m₂, …, m_z, …, m_L) размера L. На основе прямоугольной информационной матрицы Y вычисляют квадратную ковариационную матрицу С₀ (см. там же). Далее формируют эталонное описание, которому присваивают порядковый номер j=1, и в этом случае m₀=m_{1 эт}, С₀=C_{1 эт}.If the condition J≥1 is not met (this is a special case - the initial mode), then using well-known expressions (see Aladinsky V.A., Kuzminsky S.V. : Radio engineering. No. 12, 2015. - S. 20-25) determine the values of MO m _z by the columns y _z . From the set of MO values, a vector of values of the mathematical expectation of the form m ₀ = (m ₁ , m ₂ , ..., m _z , ..., m _L ) of size L is formed. On the basis of a rectangular information matrix Y, a square covariance matrix C _{0 is} calculated (see ibid. ). Next, a reference description is formed, which is assigned a serial number j = 1, and in this case m ₀ = m _{1 floor} , C ₀ = C _{1 floor} .

При выполнении условия J≥1 также определяют значения МО m_z, формируют вектор значений математического ожидания вида m₀, квадратную матрицу М=(m₀, m₁, …, m_l, …, m_L-1) размеров L, содержащую строки m_l, получаемые на основе циркулярного сдвига элементов вектора m₀ (см. фиг. 2). Формируют совокупность прямоугольных матриц {Y_l}_L={Y₀, Y₁, …, Y_l, …, Y_L-1}, у которых столбцы сдвинуты циркулярно на l=0, 1, 2, …, L-1 относительно столбцов прямоугольной информационной матрицы Y. По каждой матрице Y_l из совокупности {Y_l}z, вычисляют квадратные ковариационные матрицы C_l.When the condition J≥1 is fulfilled, the values of MO m _z are also determined, a vector of values of the mathematical expectation of the form m ₀ , a square matrix M = (m ₀ , m ₁ , ..., m _l , ..., m _L-1 ) of sizes L, containing rows m _l obtained on the basis of the circular shift of the elements of the vector m ₀ (see Fig. 2). A set of rectangular matrices {Y _l } _L = {Y ₀ , Y ₁ , ..., Y _l , ..., Y _L-1 } is formed, whose columns are shifted circularly by l = 0, 1, 2, ..., L-1 with respect to columns of the rectangular information matrix Y. For each matrix Y _l from the collection {Y _l } z, calculate the square covariance matrices C _l .

Определяют значения дивергенции v_jl между образом входной реализации, представляемым соответствующими векторами m_l и квадратными ковариационными матрицами C_l, и j-м эталонным описанием. Последний представляется вектором m_{j эт} и соответствующей эталонной ковариационной матрицей C_{j эт}. В общем случае выявление нового протокола предполагает выполнение условий m_l≠m_j и C_l≠C_j, то дивергенция, характеризующая разделяющую поверхность образа и эталонного описания, вычисляется по формуле вида (см. Ту Дж., Гонсалес Р. Принципы распознавания образов / Пер. с англ. - М.: Мир, 1978. - 411 с.)The values of the divergence v _jl between the image of the input implementation, represented by the corresponding vectors m _l and square covariance matrices C _l , and the j-th reference description are determined. The latter is represented by the vector m _{j et} and the corresponding reference covariance matrix C _{j et} . In the general case, the identification of a new protocol assumes the fulfillment of the conditions m _l ≠ m _j and C _l ≠ C _j , then the divergence characterizing the separating surface of the image and the reference description is calculated using a formula of the form (see Tu J., Gonzalez R. Principles of pattern recognition / Per. From English .-- M .: Mir, 1978 .-- 411 p.)

где

- след матрицы S размеров Z;

- след матрицы В размеров Z; a_zz, b_zz - элементы диагоналей матриц S и В соответственно;Where

- trace of the matrix S of dimensions Z;

- trace of the matrix B of dimensions Z; a _zz , b _zz are the elements of the diagonals of the matrices S and B, respectively;

Из полученного набора значений v_jl для удобства дальнейшего анализа формируют матрицу значений дивергенции V размеров J×L. Сравнивают значение каждого элемента v_jl матрицы V с пороговым значением v_пор=2000.From the obtained set of values v _jl, for the convenience of further analysis, a matrix of divergence values V of sizes J × L is formed. The value of each element v _{jl of the} matrix V is compared with the threshold value v _pore = 2000.

Если для всех элементов v_jl матрицы V выполняется условие v_jl>v_пор, то принимают решение об обнаружении нового протокола НСКР и присваивают условный номер J-J+1 новому эталонному описанию, которое включает вектор математического ожидания m(j+1)_эт=m₀ и квадратную эталонную ковариационную матрицу С(J+1)_эт=С₀. Выводят сообщение «Новый J+1 протокол НСКР». При невыполнении данного условия и если выполняется условие v_jl<300 только для одного элемента матрицы V, а остальные v_jl>v_пор=2000, то принимают решение, что входная реализация у сформирована по j-му известному протоколу НСКР и в информационном потоке у отсутствуют начальные l бит в первом k=1 информационном блоке. При необходимости проводят дообучение системы распознавания, осуществляют коррекцию j-го эталонного описания m_{j эт}, C_{j эт}. Выводят сообщение «Неполный первый блок входных данных, j-й протокол НСКР».If all elements v _jl matrix V satisfies the condition v _jl> v _then, a decision is made about the discovery of a new protocol NCIS and assigned identification number J-J + 1 new reference description that includes the expectation vector m (j + 1) _fl = m ₀ and a square reference covariance matrix C (J + 1) _et = C ₀ . The message "New J + 1 NSCR protocol" is displayed. If this condition is not met and if the condition v _jl <300 is satisfied only for one element of the matrix V, and the remaining v _jl > v _pores = 2000, then it is decided that the input implementation y is formed according to the j-th known NSCR protocol and in the information flow y there are no initial l bits in the first k = 1 information block. If necessary, additional training of the recognition system is carried out, the j-th reference description is corrected m _{j et} , C _{j et} . The message "Incomplete first block of input data, j-th NSCR protocol" is displayed.

В других случаях, когда справедливо 300≤v_jl≤2000 для всех элементов матрицы V, считают, что входная реализация у не может быть отнесена к какому-либо j-му известному протоколу НСКР или новому протоколу НСКР, выводят сообщение «Принадлежность входной реализации не установлена».In other cases, when 300≤v _jl ≤2000 is true for all elements of the matrix V, it is considered that the input implementation y cannot be attributed to any j-th known NSCR protocol or the new NSCR protocol, the message “The input implementation does not belong to installed ".

Имитационное моделирование заявленного способа распознавания протоколов НСКР выполнено с использованием цифровых информационных потоков, сформированных на основе известных протоколов LPC-10-2400 (STANAG 4197), условный класс j=1, MELPe-2400 (STANAG 4591), j=2, применяемых на линиях радиосвязи диапазона высоких частот, и цифровых информационных потоков, сформированных программным способом вокодером вида MELP-240Q по протоколу НСКР с номером класса j=3, который рассматривается как новый, показало следующее. Принятое пороговое значение D_p=70% достигается на основе предлагаемого способа при значениях K=82, в то время как способ-прототип обеспечивает удовлетворение этого требования при K=124. Достоверность D_p распознавания предлагаемым способом при значениях K≥82 выше, чем при использовании прототипа на величину от (11 до 55)%, что подтверждает эффективность предлагаемых технических решений в условиях малого объема исследуемых цифровых информационных потоков.Simulation modeling of the claimed method for recognizing NSCR protocols is performed using digital information streams formed on the basis of the well-known protocols LPC-10-2400 (STANAG 4197), conditional class j = 1, MELPe-2400 (STANAG 4591), j = 2, used on the lines radio communications of the high frequency range, and digital information streams, generated by the software by the MELP-240Q vocoder according to the NSCR protocol with the class number j = 3, which is considered new, showed the following. The accepted threshold value D _p = 70% is achieved on the basis of the proposed method at values of K = 82, while the prototype method ensures the satisfaction of this requirement at K = 124. The reliability D _{p of} recognition by the proposed method at values of K≥82 is higher than when using the prototype by an amount from (11 to 55)%, which confirms the effectiveness of the proposed technical solutions in a small volume of the investigated digital information flows.

Сравнение помехозащищенности предлагаемого способа и способа-прототипа также выполнено на основе имитационного моделирования при K≥100, битовые ошибки в цифровые информационные потоки вносились по случайному закону с равномерной вероятностью появления ошибки, что соответствует современным условиям формирования дискретных сигналов передачи речевых сообщений, когда используется помехоустойчивое кодирование, перемежение и скремблирование цифровых информационных потоков до передачи сигнала в канал радиосвязи. Принятое максимальное значение вероятности Р_{ош_мах}=15% соответствует значению, при котором еще возможно восстановление речевого сигнала. В результате установлено, что разработанный способ распознавания новых протоколов НСКР обеспечивает меньшую вероятность ложной тревоги Р_ош при равном значении вероятности битовой ошибки Р_ош в цифровом информационном потоке у (см. Фиг. 5) на величину от (2 до 25) %, чем при использовании способа-прототипа, условие Р_лт≤30%, являющееся граничным, для прототипа выполняется при Р_ош≤10%, в то же время для предлагаемого способа - при Р_ош≤14%, что свидетельствует о его лучшей помехозащищенности по сравнению с прототипом.Comparison of noise immunity of the proposed method and the prototype method is also made on the basis of simulation at K≥100, bit errors were introduced into digital information streams according to a random law with a uniform error probability, which corresponds to modern conditions for the formation of discrete signals for the transmission of speech messages when noise-immune coding is used , interleaving and scrambling of digital information streams prior to signal transmission to the radio communication channel. The accepted maximum value of the probability P _{osh_max} = 15% corresponds to the value at which the recovery of the speech signal is still possible. As a result, it was found that the developed method for recognizing new NSCR protocols provides a lower probability of a false alarm P _osh with an equal value of the probability of a bit error P _osh in a digital information stream y (see Fig. 5) by an amount from (2 to 25)% than with using the prototype method, the condition P _lt ≤30%, which is the boundary, is fulfilled for the prototype at P _osh ≤10%, at the same time for the proposed method - at P _osh ≤14%, which indicates its better noise immunity compared to the prototype ...

Claims

A method for recognizing new low-rate speech coding (NSCR) protocols implemented in vocoders, which consists in the fact that they receive a binary digital information stream y during the time interval ΔT, form a normalized autocorrelation function a on the basis of y, decide on the presence of a block structure in the digital information stream y along the extrema of the autocorrelation function a regular with equal intervals Δτ, divide the digital information stream y into information blocks of N _b bits each according to the intervals between the extrema of the autocorrelation function a, sequentially assign the sequence numbers k = 1, 2, ..., K to the information blocks, starting from the first information block, a rectangular information matrix Y of size K × L, L = N _{b is formed} , the rows of which are information blocks sequentially placed under each other in accordance with their ordinal numbers k = 1, 2, ..., K, are extracted from the matrix Y columns y _z , z = 1, 2,…, L; determine for each column y _{z the} value of its mathematical expectation m _z , form a vector of values of the mathematical expectation m ₀ = (m ₁ , m ₂ , ..., m _z , ..., m _L ) size L, based on the resulting vector m ₀ form a square matrix M values of the mathematical expectation of size L × L, lines m _l , l = 0, 1, 2, ..., L-1, which contains the values of the vector m ₀ , sequentially circularly shifted to the left by the value l, according to the available training samples {y _{j et} } _J (j = 1, 2, ..., J) corresponding to J well-known NSCR protocols, rectangular reference information matrices {Y _{j et} } _{J are formed} , on the basis of which vectors of reference values of the mathematical expectation m _{j et} , j = 1, 2 , ..., J, and square reference covariance matrices {C _{j et} } _J , calculate the values of the probability of correct recognition P _{jl of the} j-th NSCR protocol, j = 1, 2, ..., J, for each l-th row m _{l of the} matrix M make a decision in favor of the jth NSCR protocol, for which the maximum value of P _jl is provided, which differs which is that for the value of the probability of correct recognition P _jl = 0/0 for all j, l, a set of rectangular matrices {Y _l } _{L is formed} , the columns of which are shifted circularly by l = 0, 1, 2, ..., L-1 relative to the columns rectangular information matrix Y, for each matrix Y _l calculate its square covariance matrix C _l , determine the values of the divergence v _jl between the images of the input implementation, represented by rows m _{l of the} square matrix M of the mean values and the corresponding covariance matrices C _l , and each j-th standard image represented by the vector m _{j et} and the corresponding square reference covariance matrix C _{j et} , based on the obtained values of v _jl , a divergence vector v _j = (v _j0 , v _j1 , ..., v _jl ..., v _{j (L-1)} ) for each j-th reference image, make up a _{matrix of divergence values V of size J × L from J vectors v j} , decide to detect a new previously unknown NSCR protocol and assign a conditional number р J = J + 1 to the new reference description, which includes the vector of mathematical expectation m _{(J + 1) fl} = m ₀ and the square reference covariance matrix C _{(J + 1) fl} = С ₀ , if for all elements v _{jl of the} matrix of divergence values V, the condition v _jl > v _pores = 2000 for any j, l, where v _pores is the threshold value, or only for one element v _jl <300, and the rest v _jl > v _pores , make a decision that the input realization y is formed according to The j-th known NSCR protocol and the information stream y lack the initial l bits in the first k = 1 information block, and in the case of 300≤v _jl ≤2000, a decision is made that the input implementation of у is not recognized.