WO2019235975A1

WO2019235975A1 - Conversion of a structured data array containing a linguistic sentence

Info

Publication number: WO2019235975A1
Application number: PCT/RU2019/050078
Authority: WO
Inventors: Игорь Петрович РОГАЧЕВ
Original assignee: Rogachev Igor Petrovich
Priority date: 2018-06-07
Filing date: 2019-06-07
Publication date: 2019-12-12
Also published as: RU2685967C1; EA202092988A1

Abstract

The group of inventions relates to solutions in the field of processing data arrays, particularly to solutions in the field of processing structured data arrays containing natural language text, particularly linguistic sentences, and can be used for the preliminary conversion of a structured data array to allow the subsequent processing of same. The technical result achieved by carrying out the present invention consists in the more efficient and precise preliminary processing of a natural language text in order to subsequently index and process same.

Description

TRANSFORMING A STRUCTURED DATA ARRAY CONTAINING A LINGUISTIC PROPOSAL

[0001] FIELD OF THE INVENTION

[0002] The group of inventions relates to solutions in the field of processing data arrays, in particular, to solutions in the field of processing

structured data arrays containing natural language text, in particular, linguistic sentences, and can be used for preliminary transformation of a structured data array to ensure its subsequent processing.

[0003] BACKGROUND OF THE INVENTION

[0004] From the patent of the Russian Federation 2399959 (CJSC "AVIKOMP SERVICEZ"),

published on May 10, 2010 (D1), a method for automatically indexing texts in natural languages is known. The method known from D1 is that the text is segmented in electronic form into elementary units, identifies stable phrases, form sentences, identifies semantically significant objects and semantically significant relationships between them, form many triads for each semantically significant relationship, in which the only triad of the first type corresponds to the relationship established by the semantically significant relationship between the two

semantically significant objects, with each of the triads of the second type corresponding to the value of a specific attribute of one of these semantically significant objects, each of the triads of the third type corresponding to the value of a specific attribute of the semantically significant relation, then all related triangles are indexed

semantically significant relationships, semantically significant objects individually, remember the formed triads in the database and

the resulting indices, together with a link to the source text from which these triads are formed.

[0005] However, the method known from D1 does not have sufficient accuracy of indexing text elements in natural language, which, in turn, affects the accuracy of subsequent processing of the text in natural language and the accuracy of the search in the text in natural language. This is mainly happening due to insufficiently effective preliminary processing of text in a natural language, which, accordingly, does not allow indexing with sufficient accuracy.

[0006] SUMMARY OF THE INVENTION

[0007] Based on this, a technical problem solved by the present invention is the creation of a method and devices that implement the method and / or systems with increased efficiency of preprocessing natural language text for its subsequent indexation and processing.

[0008] Accordingly, the technical result achieved by the implementation of the present invention is to increase the efficiency and accuracy of preprocessing natural language text for its subsequent indexation and processing.

[0009] The technical result is achieved due to the fact that the claimed method of converting a structured data array (SMD) performed by a processor of a computer device containing at least a linguistic sentence is characterized by the following steps: A] forming the first data structure on which the first structure is formed SMD data containing elements of said first data structure, said elements of the first data structure being text elements (TE) of a linguistic pre dix, and

identification data of text elements (identification data of TE), representing for each TE, at least: value of a text element (value of TE) and serial number of a text element (serial number of TE] in a linguistic sentence; B] formation of a database of linguistic features (BDLP ], on which linguistic signs of text elements (linguistic signs of TE) of a linguistic sentence are revealed, from which a database is formed, which is a BDLP of text elements of linguis B] the formation of the second data structure, on which the second SMD data structure is formed containing the elements of the second data structure, said elements of the second data structure being the syntactic units (CE) of the linguistic sentence, formed on the basis of information from the BDLP of the textual elements of the linguistic sentences, and also represent identification data of syntactic units (identification CE data), representing for each CE at least: the value of the syntactic unit (CE value) and the serial (serial) number

(numbers) TE linguistic sentences constituting the CE.

[0010] Embodiments of the present invention relate to methods, devices, systems, and computer-readable storage media for ensuring the efficiency and accuracy of preprocessing natural language text for subsequent indexing and processing.

[0011] A BRIEF DESCRIPTION OF THE DRAWINGS

[0012] Illustrative embodiments of the present invention are described below in detail with reference to the accompanying drawings, which are incorporated herein by reference, and in which:

[0013] FIG. 1 shows an exemplary general flowchart for the steps of the claimed method 100 for converting a structured data array containing at least a linguistic sentence that is the original data structure for the method 100 in question.

[0014] FIG. 2 shows an exemplary general flowchart of the steps of step 101 of generating the first data structure.

[0015] FIG. 3 shows an exemplary general structure of an initial data structure from which a first data structure of a structured data array is formed.

[0016] FIG. 4 shows an exemplary general structure of the generated first data structure.

[0017] FIG. 5 shows an exemplary general flowchart for the steps of step 102 of forming a database of linguistic features,

representing a database of linguistic features of text elements 21 sentences 11.

[0018] FIG. 6 shows an exemplary general structure of the generated database of linguistic features (BDLP), which is BDLP of text elements 21 of the linguistic sentence 11.

[0019] FIG. 7 depicts an exemplary general flowchart of the steps of step 103 of generating a second SMD data structure.

[0020] FIG. 8 depicts an exemplary general structure of a generated second data structure of a structured data array.

[0021] FIG. 9 shows an exemplary general flowchart step 104 of forming a third data structure of the structured data array.

[0022] FIG. 10 shows an exemplary general structure of the generated third data structure of a structured data array.

[0023] FIG. 11 shows an exemplary general flowchart of step 105 of generating a fourth data structure of a structured data array.

[0024] FIG. 12 depicts an exemplary general structure of the generated fourth data structure of a structured data array.

[0025] FIG. 13 shows an exemplary general flowchart of the inventive method 200 for converting a structured data array containing at least syntactic units of a linguistic sentence and identification data of syntactic units, which is the initial data structure for the method 200 in question.

[0026] FIG. 14 depicts an example general flowchart of an identification step 201 of a transformable data structure containing syntactic units that is the original data structure for

the method in question 200.

[0027] FIG. 15 shows an exemplary general data structure,

which is the original data structure for the conversion method 200, which is a transformable data structure containing syntactic units.

[0028] In FIG. 16 shows an exemplary general flowchart for the steps of step 202 of generating a fifth data structure of a structured data array, which is the first data structure for the method 200 in question.

[0029] FIG. 17 depicts an exemplary general structure of a database of linguistic features (BDLP) generated in step 2021, which is a BDLP of text elements 21 of a linguistic sentence 11 contained in elements 22 suitable for transforming a data structure containing syntactic units.

[0030] FIG. 18 depicts an exemplary general structure of the database of linguistic features [BDLP] generated in step 2022, which is the BDLP of the text elements 21 of the linguistic sentence 11 contained in the elements 22 suitable for converting the data structure containing CE. [0031] Ha of FIG. 19 shows an exemplary general structure of the generated fifth data structure of the structured data array, which is the first for the present conversion method 200.

[0032] FIG. 20 depicts an exemplary general flowchart of step 203 of generating the sixth data structure of a structured data array, which is the second data structure for the method 200 in question.

[0033] FIG. 21 shows an exemplary general structure of the database of linguistic features (BDLP) generated as part of step 2031, which is the BDLP of text elements 21 of the linguistic sentence 11 contained in the elements 61 of the fifth data structure of the structured data array.

[0034] FIG. 22 shows an exemplary general structure of the sixth data structure of the structured data array generated, which is the second data structure for the present conversion method 200.

[0035] FIG. 23 shows an exemplary general flowchart of steps 204 of generating a seventh data structure of a structured data array, which is the third data structure for the method 200 in question.

[0036] FIG. 24 shows an exemplary general structure of the generated seventh data structure of the structured data array, which is the third data structure for the present conversion method 200.

[0037] FIG. 25 shows an exemplary general flowchart of the inventive method 300 for converting a structured data array containing at least basic linguistic-logical objects (LLL) of a linguistic sentence and LLL identification data,

which is the original data structure for the method 300 in question.

[0038] FIG. 26 depicts an exemplary general flowchart for identifying a transformable data structure comprising basic linguistic logic objects [OLLO], which is the original data structure for the method 300 in question, that is capable of being converted.

[0039] FIG. 27 depicts an exemplary general data structure that is the starting point for the transform method 300, which is a transformable data structure containing basic linguistic objects [OLLO].

[0040] FIG. 28 shows an exemplary general flowchart step 302 of forming the eighth data structure of the structured data array, which is the first data structure for the method 300 in question.

[0041] FIG. 29 depicts an exemplary general structure of a linguistic feature database (LIB) formed within block 3021, which is a LLLB of text elements 21 of sentence 11 contained in elements 91 suitable for transforming a data structure containing ALLO.

[0042] FIG. 30 shows an exemplary general structure of the generated eighth SMD data structure, which is the first SMD data structure for the present conversion method 300.

[0043] FIG. 31 shows an exemplary general flowchart of the steps of step 303 of generating the ninth SMD data structure, which is the second data structure for the method 300 in question.

[0044] FIG. 32 depicts an exemplary general structure of the generated ninth DMD data structure, which is the second SMD data structure for the present conversion method 300.

[0045] FIG. 33 depicts an example general flowchart of the steps of step 304 of generating the tenth SMD data structure, which is the third data structure for the method 300 in question.

[0046] FIG. 34 depicts an exemplary general structure of a database of linguistic features [BDLP] generated as part of step 3041, which is a BDLP of text elements 21 of sentence 11 contained in elements 13 of the ninth data structure of the DMD.

[0047] FIG. 35 depicts an exemplary general structure of the tenth SMD data structure generated, which is the third SMD data structure for the present conversion method 300.

[0048] FIG. 36 shows an exemplary general flowchart for the steps of step 305 of generating the eleventh SMD data structure, which is the third SMD data structure for the method 300 in question.

[0049] FIG. 37 illustrates an exemplary general structure of the generated eleventh SMD data structure, which is the fourth SMD data structure for the present conversion method 300.

[0050] FIG. 38 depicts an exemplary general flowchart of the inventive method 400 for converting an SMD comprising at least main linguistic-logical objects (LLLF) of the linguistic sentence and LLLF identification data, which is the initial data structure for the method 400 under consideration.

[0051] FIG. 39 depicts an example general flowchart of the steps of step 402 of generating the twelfth SMD data structure, which is the first data structure for the method 400 in question.

[0052] FIG. 40 shows an exemplary general structure of a database of linguistic features (BDLP) generated as part of step 4021, which is BDLP of text elements 21 of sentence 11 contained in elements 91 suitable for transforming a data structure containing ALLO,

generated in step 4021.

[0053] FIG. 41 shows an exemplary general structure of the generated twelfth SMD data structure, which is the first for the present conversion method 400.

[0054] FIG. 42 depicts an example general flowchart of the steps of step 403 of generating the thirteenth SMD data structure, which is the second data structure for the method 400 in question.

[0055] In FIG. 43 depicts an exemplary general structure of a database of linguistic features [BDLP] generated as part of step 4031, which is BDLP of text elements 21 of sentence 11 contained in elements 17 of the twelfth data structure of the DMD.

[0056] FIG. 44 shows an exemplary general structure of the generated thirteenth SMD data structure, which is the second SMD data structure for the present conversion method 400.

[0057] FIG. 45 shows an exemplary general system diagram for

transformations of a structured data array.

[0058] EMBODIMENTS FOR CARRYING OUT THE INVENTION

[0059] The possible implementations of the embodiments of the present invention described in this section are presented in non-limiting legal examples, with reference to specific embodiments of the present invention, which are assumed in all their aspects

illustrative and not limiting. Alternative embodiments of the present invention, not beyond the scope of its legal protection, are obvious to experts in this field, with the usual qualifications on which this invention is intended.

[0060] FIG. 1, as an example, but not limitation, depicts a General diagram of the steps of the claimed method 100 conversion

a structured data array (DMD) containing at least a linguistic sentence (LP), which is the original data structure for the method 100 under consideration. The claimed method 100 of converting an SMD containing at least a linguistic sentence is characterized by: performing the first step 101 of generating the first data structures on which the first SMD data structure is formed containing the elements of said first data structure, said elements of the first data structure being textual linguistic elements (TE)

sentences as well as identification of text elements

(TE identification data), representing for each TE, as an example, but not limitation: the value of the text element (TE value) and the serial number of the text element (TE serial number) in

linguistic sentence; the implementation of step 102 of the formation of a database of linguistic features (BDLP), which reveal

linguistic attributes of text elements (linguistic attributes of TE) of a linguistic sentence from which a database is formed

which is a database of linguistic features of text elements of a linguistic sentence (BDLP TE linguistic

offers); performing step 103 of generating a second data structure, wherein a second SMD data structure is formed comprising elements of said second data structure, said elements of the second data structure being syntactic units (CEs)

linguistic sentences, formed on the basis of information from BDLP TE linguistic sentences, and also represent

identification data of syntactic units (CE identification data), representing for each CE, as an example, but not limitation: the value of the syntactic unit (CE value) and the serial number (s) of the TE of the linguistic sentence constituting CE; the execution of step 104 of the formation of the third data structure, which form the third data structure of the SMD containing the elements of said third data structure, said elements of the third structure data represent linguistic objects (LO) of the linguistic sentence, formed by identifying them with the above

syntactic units, or resulting from the conversion of the mentioned syntactic units, and also represent

identification data of linguistic objects (identification data of LO), representing for each LO, as an example, but not limitation: the value of the linguistic object (value of LO) and the serial (serial) number (numbers) of the TE of the linguistic sentence constituting (components) LO; the execution of the step 105 of the formation of the fourth data structure, which forms the fourth data structure of the SMD containing the elements of the said fourth data structure, moreover,

the elements of the fourth data structure are logical objects (LogO) of a linguistic sentence, formed by correlating linguistic and logical objects and identified with the aforementioned linguistic objects, and also represent the identification data of logical objects (LogO identification data),

representing for each LogO, as an example, but not limitation: the value of the logical object (the value of LogO) and the serial (ordinal) number (s) of the TE of the linguistic sentence that makes up (the components of) the LogO.

[0061] FIG. 2, by way of example, but not limitation, a general flowchart of the steps of step 101 of generating the first data structure is shown. Step 101 is characterized by: performing step 1011 of identifying the source

SMD data structures on which elements 11 of the initial SMD data structure are identified, which are linguistic sentences 11 (LP 11); by performing step 1012 of identifying elements 21 of the first SMD data structure, at which elements 21 of the first SMD data structure are identified, which are text elements (TE) of the linguistic sentence 11, as well as identification data of elements 21, which are for each TE, as an example, but not restrictions: the value 211 of the element 21 of the first SMD data structure and the serial number 212 of the element 21 of the first SMD data structure, and form the first SMD data structure.

[0062] FIG. 3, by way of example, but not limitation, the general structure of the original data structure from which the first SMD data structure is formed is shown. The source data is a SMD containing elements 11 the original data structure, which is a linguistic sentence (LP). Such an array of data is a multitude of

linguistic sentences 11 related to any field of activity and any purpose. Elements 11 lack their unique names characterizing their practical use. In the original data structure, elements 11, as an example, but not limitation, can be referred to as “LP1”, “LP2”, “LPZ”, “LPP”, where n> 1 is the serial number of the element in the linguistic sentence. Linguistic sentence is

a grammatically organized combination of words (many syntactically related words) with semantic and logical completeness. In addition to words, the PL can contain the following objects: numbers (numbers), punctuation marks and indices (constructions of letters, numbers and signs). All of the above objects are components of LP 11 (components of LP) and in the original data structure are separate elements prepared in advance and placed in the original data structure not in the form of linguistic text, but in the form of a structured array (list, list and the like) of individual LP. Such preparatory steps may be carried out by any method known in the art and, accordingly, are not further described.

[0063] The identification of the elements 11 of the original data structure in step 1011 is reduced to providing a classification of the elements that make up the original data structure as linguistic sentences (LP 11). In this case, the components of the drug are all numbers (numbers), punctuation marks and indices (constructions of letters, numbers and signs) contained in the drug and separated by a space.

[0064] The original data structure is the result of a plurality of elements 11 identified in step 1011.

[0065] FIG. 4, by way of example, but not limitation, the general structure of the generated first data structure is shown. The first data structure is a DMD containing the elements 21 of the first data structure, which are the text elements (TE 21) of the linguistic sentence 11 and the TE identification data. The text elements 21 of the linguistic sentence 11 lack their unique names characterizing their practical use. In the data structure, elements 21, as an example, but not limitation, may be referred to as “TE”, “TE2”, “TEZ”, “TEP”, where n> 1 is the serial number of the element in the linguistic sentence 11. TE 21 of the linguistic sentence (PL 11) are components of the PL, that is, words, numbers, punctuation marks or indices (constructions of letters, numbers and signs) contained in LP 11 and separated by a space. The text elements 21 of the linguistic sentence 11 have TE identification data, such as, by way of example, but not limitation: TE value 211 and TE number number 212.

The value of 211 TEQs is, as an example, but not limitation, the set of letters, numbers, and / or punctuation marks that make up TEQs. Serial number 212 of the TE is the serial number of TE 21 in the linguistic sentence 11.

[0066] The formation of the text elements 21 of the first data structure during step 1012 is performed by identifying individual words or groups of words, numbers (numbers) or indices, the numbers (numbers) or indices not separated by a space, as well as punctuation marks. Moreover, it is preferable that the last punctuation mark in linguistic sentence 11 is not taken into account and not

was considered as a text element 21 of the linguistic sentence 11.

[0067] The identification of the value 211 of the text element 21 of the first data structure during step 1012 is carried out by registering the characters (letters, numbers and / or punctuation marks) that make up the text element 21. Identification of the serial number 212 of the text element 21 of the first

data structures during step 1012 are produced by calculating the location of the TE 21 in the linguistic sentence 11. In this case, the first text element 21 in the linguistic sentence 11 receives the serial number “1”, and all

subsequent fuel cells receive a serial number greater than one

serial number of the previous TE 21.

[0068] Generating a first SMD data structure during step

1012 are produced by combining in one data structure the elements 21 of the first SMD data structure, as well as their identification data according to principles and methods known from the prior art, which, accordingly, are not described in further detail below.

[0069] FIG. 5, as an example, but not limitation, depicts a General diagram of the steps of step 102 of the formation of the database of linguistic features, which is a database of linguistic features text elements 21 of sentence 11. Step 102 is characterized by: performing step 1021 of forming linguistic features of text elements 21 of sentence 11, in which for the linguistic analysis of the text element provide identification data of element 21 (value 211 and serial number 212 TE) and get the linguistic characteristics of 213 text elements 21 linguistic sentences 11, as well as the meanings of the mentioned linguistic characteristics 2131; performing step 1022 of creating a database of linguistic features (BDLP), on which form BDLP of text elements 21 of the linguistic sentence 11. In this case, the linguistic feature of the text element 21 will be all linguistic characteristics 213 obtained for the text element 21 during the step 1021, having linguistic characteristics 2131.

[0070] FIG. 6, by way of example, but not limitation, the general structure of the generated database of linguistic features (BDLP) is shown, which is the BDLP of text elements 21 of the linguistic sentence 11. The first part (main) of linguistic characteristics 213 of the text elements 21 of the linguistic sentence 11 may contain morphological, syntactic and semantic characteristics. Moreover, the totality of the values of all the linguistic characteristics of the text element is for each TE 21 of the linguistic sentence 11 its distinctive (unique) linguistic feature in the linguistic sentence. Morphological characteristics preferably indicate the morphological characteristics of TE 21 of the linguistic sentence 11, which can be classified, as an example, but not limited, by the level of nesting (genus-species-subspecies). Moreover, the morphological genera of TE 21 of the linguistic sentence 11 are preferably a word, a number, punctuation marks, other signs;

morphological species - part of speech (for words), type of number (Arabic, Roman), type of punctuation mark (dot, comma, etc.), type of other sign;

morphological subspecies - gender, number, case of parts of speech and the like (for words), number, binary code, index and the like (for numbers).

The syntactic characteristics preferably indicate the many syntactic features of TE 21 of the linguistic sentence 11, among which, by way of example, but not limited to, the following syntactic characteristics of TE 21 of linguistic sentence 11:

syntactic role (predicate, subject, and the like.); syntactic parent (syntactically main word); syntactic descendants (syntactically subordinate words); syntactic compositional connection (the presence of another TE having the same syntactic role and the same syntactic parent).

The semantic characteristics preferably indicate the semantic features of TE 21 of the linguistic sentence 11, among which

to highlight, as an example, but not limitation, the following semantic characteristics of TE 21 of linguistic sentence 11: a semantic group (a group of words that can be attributed to one class, gender, type or subspecies of objects or actions of the world when the signs coincide

mentioned classes, genera, species or subspecies), semantic status

(the semantic meaning of a word or a group of words within the framework of a phrase that refers to a certain conceivable image (object or action). For example, but not limited to, the conceivable image “absence of a seller at the location of the consumer” consists of two elements of the upper level of nesting

(terms): the first is “the absence of the seller”, and the second is the “location of the consumer”, which have the following semantic statuses: the first has the main one (defines the meaning of the term), the second has the additional one (clarifies the meaning of the main term defined earlier)) .

[0071] The formation of the first part of the (main) linguistic

characteristics 213 and their values 2131 for text elements 21

linguistic sentences 11 are preferably produced at step 1021 by a comprehensive linguistic analysis of each text element 21 of the linguistic sentences 11, representing, by way of example, but not limitation, TE analysis based on the TE location in the sentence structure, its meaning, type, classification of its conceivable image and analysis of its relationships with other textual elements in the sentence. Based on the results of a comprehensive analysis, it is preferable to formulate the main linguistic characteristics 213 and enter them at step 1022 into the BDLP in the form of a list of the first part of linguistic characteristics 213 with the values of these characteristics 2131. For example, but not limited to, one of the linguistic characteristics 213 may be a “syntactic role” , with a value of 2131 of this linguistic characteristic “subject”. Such an analysis can be performed by any a method known from the prior art and, accordingly, is not described in further detail below. For example, without limitation, such an analysis can be performed traditionally by a specialist linguist, or using a software algorithm of a linguistic (syntactic) processor. Moreover, if there are a sufficient number of examples, it is possible to perform such an analysis using a statistical processor (neural network) through the use of neural network training technology.

[0072] Based on the revealed first part of the (main) linguistic characteristics of 213 text elements 21 of the linguistic sentence 11 and their meanings 2131, a database of linguistic features is formed, which is the BDLP of the text elements 21 of the linguistic sentence 11. Moreover, the first part of the linguistic characteristics of 213 text elements 21 linguistic sentences 11 and their meanings 2131 forms unique linguistic features of the textual elements of 21 linguistic

Proposals 11.

[0073] FIG. 7, as an example, but not limitation, depicts a General diagram of the steps of step 103 of the formation of the second data structure SMD. Step 103 is characterized by: performing step 1031 of identifying and

the formation of the first elements of the second SMD data structure, on which the elements 31 of the second SMD data are identified and formed, as well as the identification data of the elements 31, which for each element 31, as an example, but not limitation, the value 311 of the element 31 of the second SMD data structure and serial (serial) number (s) 312 of the TE of the linguistic sentence 11 constituting element 31 (meaning and serial numbers of TE of the elements 31); performing step 1032 of generating the second elements of the second SMD data structure, on which the elements 32 of the second SMD data structure are formed, and

the identification data of the elements 32, which for each element 32 is, by way of example, but not limitation, the value 321 of the elements 32 of the second SMD data structure and the sequence numbers 322 of the TE of the linguistic sentence 11 constituting the elements 32 (the value and the sequence numbers of the TE of the elements 32); performing step 1033 of forming a second SMD data structure, in which a second SMD data structure is formed by combining the identified elements 31 and 32 and their identification data (values 311, 321 and serial numbers 312, 322 TE).

[0074] FIG. 8, by way of example, but not limitation, the general structure of the generated second SMD data structure is shown. The second SMD data structure is a SMD containing elements 31 and (or) elements 32, which are the syntactic units (CE) of the linguistic sentence 11 and the CE identification data, representing for each CE, as an example, but not limitation, the value 311 element 31 and (or) value 321 of element 32, as well as serial (ordinal) number (s) 312 and (or) 322 TE 21 of the linguistic sentence constituting

(constituent) elements 31 and / or elements 32. The syntactic units (CE) of sentence 11 are preferably single words and groups of words connected by syntactic link. The CE of the linguistic proposal 11 has their unique names characterizing them (UN CE), which have practical use, as an example, but not limitation:

“Subject”, “predicate”, “direct complement” and the like. In the data structure, elements 31 and 32 are referred to as unique names and, as an example, but not limited to, may be referred to as “UN CE p”, where n> 1 is the ordinal index of the unique name element in the linguistic sentence, starting with “1” for each new UN CE, not previously met in a linguistic sentence. By way of example, but not limitation, to clarify the determination of the ordinal indices of data structure elements having a unique name (CE and others specified below), the determination of the ordinal index of the UN CE can be demonstrated as follows. In the sentence: “The buyer is obliged to pay and pick up the paid goods, containers and packaging” there are seven CEs of four unique names: “subject” - one CE, “predicate” - two CEs, “definition” - one CE and “addition” - three CEs . The data structure will contain one element “subject”, which will have the index “1”, as well as two elements “predicate”, which will have the corresponding indexes “1” and “2”, and the index “1” will have that

“Predicate”, which has a lower serial number of the fuel cell, as well as one element “definition”, which will have the index “1”, as well as three elements

"Addition", which will have the corresponding indices "1", "2" and "3", and the index "1" will have that "addition", which has a lower serial number of the fuel cell. In the future, when mentioning CE as an element of the second data structure refers to UN CE. By their structure, CEs can consist of one word or several words. A single-word CE (one text element 21 of sentence 11) is always element 31, which is the original syntactic object (ISO), that is, a sentence member established as a result of parsing. In the future, when mentioning element 31 of the second data structure, which is ISO, it means the ISO. Moreover, among the members of the proposal, there are many characteristics that can in practice give rise to a list of specified members of the proposal. Thus, the ISO list is not a predefined list, but a variable list, depending on predefined parameters. The same word (text element 21 of sentence 11) in sentence 11 may be

named by different syntactic unit 31, depending on the presence of a given text element 21 of sentence 11 of various linguistic characteristics 213 and their meanings 2131, as well as the relevance of individual

linguistic characteristics. For example, but not limited to, when

identification of ISO "supplement" in which some

linguistic characteristics 213 of this ISO, ISO 31 will be assigned to one unique name of the syntactic units of ISO 31

addition ", and ISO" indirect addition ". When updating all

linguistic characteristics of 213 ISO as separate syntactic units 31 will be identified by ISO "direct complement" and ISO "indirect

addition". Syntactic units of several words (several text elements 21 of sentence 11) are sometimes ISO (element 31), and sometimes

syntax constructs (element 32). ISO (element 31) of several words (text elements 21 of sentence 11) are found, in particular, in ISO

"predicate". For example, but not limited to, the ISO “predicate” - “could be done” - consists of three text elements 21 of sentence 11 due to the linguistic features of the formation of the predicate, which, in addition to the verb “completed”, can have the auxiliary verb “to be” and modality

(modal verb) "could." Syntactic construct 32 always consists of several words (text elements 21 of sentence 11) and is a construct of syntactically related ISOs (elements 31), as an example, but not limitation, the syntactic construction (UN SC) is

“Phrase”, “syntactic turnover” (participial, participial), “Simple sentence”, “complex sentence”, etc. In the future, when mentioning element 32 of the second data structure, which is a syntactic structure (SC), we mean the UN SC. In practice, in order to solve

applied tasks, the formation of a variety of syntactic structures is possible, while the list and description of syntactic structures can be predefined. The list of CE 31 (ISO) and CE 32 (SC) to be identified in Proposition 11 is preferably given

preliminary.

[0075] The identification or formation of the first element 31 of the second SMD data structure in step 1031 is performed by a comprehensive analysis of the values of 2131 linguistic characteristics 213 of the text elements 21 of the linguistic sentence 11. Elements 31 of the second SMD data structure, which are ISO, are identifiable (if they consist of one TE ) or formed (if they consist of several fuel cells) within the framework of step 1031, in fact, are identified with relevant ISO. Relevant ISOs are an actual list of ISOs with current linguistic characteristics, pre-installed and recorded in the first user database (first PBB), which is thus a database of current syntactic units (CE) containing relevant ISO and current SK. The identification and formation (if required) of ISO elements 31 is performed by comparing the values 2131 of the linguistic characteristics 213 of the text elements registered in the database of linguistic features (BDLP) 21

linguistic sentences 11 with predefined values of the linguistic characteristics of relevant ISO. With the coincidence of the values of 2131 linguistic characteristics 213 of the text element 21 of the linguistic sentence 11 and the predefined values of the linguistic

characteristics of relevant ISO identified text element identify and, if necessary, carry out its formation as the corresponding ISO (element 31). Such a comprehensive analysis can be performed by any method known in the art and, accordingly, will not be described in further detail below. For example, without limitation, such a comprehensive analysis can be performed traditionally by a specialist linguist, or using a software algorithm of a linguistic (syntactic) processor. Moreover, if there are enough examples, it is possible to perform such an analysis. using a statistical processor (neural network) through the application of neural network training technology. The formation of the second element 32 of the second SMD data structure within the framework of step 1032 is carried out by a comprehensive analysis of the values of 2131 linguistic characteristics of 213 text elements that are ISO (elements 31). Upon revealing the linguistic characteristics of the text elements that make up the ISO that correspond to the description of the syntactic structure (SC) contained in the database of relevant CEs, the SC (elements 32) are formed from the two or more ISOs mentioned (elements 31). Such a comprehensive analysis can be performed by any method known in the art and, accordingly, will not be described in further detail below. For example, without limitation, such a comprehensive analysis can be performed traditionally by a specialist linguist, or using a software algorithm

linguistic (syntactic) processor. Moreover, if there are a sufficient number of examples, it is possible to perform such an analysis using a statistical processor (neural network) through the use of neural network training technology.

[0076] Identification of the value and serial number (numbers) of the TE of element 31 (ISO) of the second SMD data structure under step 1032 is performed as follows: value (s) 211 text

(text) element (s) 21 of the linguistic sentence 11 constituting the (component) element 31 is identified as the value 311 of element 31 (ISO), and the serial number (numbers) 212 of the mentioned (mentioned) text (text) element (s) ) 21 of the linguistic sentence 11 constituting the (component) element 31, is identified as the serial number of the text element (s) constituting the (element) element 31 (ISO). The identification of the value and sequence numbers of the TE of element 32 (SC) of the second SMD data structure in step 1032 is performed as follows: the values of 211 text elements 21 of the linguistic sentence 11 constituting element 32 are identified as the value 321 of element 32 (SK), and the sequence numbers 212 the said text elements 21 of the linguistic sentence 11 constituting the element 32 are identified as the serial numbers of the text elements 322 constituting the element 32 (SK).

[0077] Generating a second SMD data structure during step 1033 produced by combining in one data structure elements 31 of the second SMD data structure and elements 32 of the second SMD data structure, as well as their identification data according to the principles and methods known from the prior art, which, accordingly, are not described in further detail below.

[0078] Identification of the linguistic characteristics of FCs constituting the elements 31 (ISO) of the second SMD data structure and their values at

Necessities are made by organizing a request to the BDLP, which is formed as part of step 102, consisting of the identification data of the TEs that make up ISO 31, and obtaining the values of 2131 linguistic characteristics 213 of the text elements 21 of the linguistic sentence 11 of which element 31 (ISO) consists. Identification of the linguistic characteristics of TEs that make up the elements 32 (SC) of the second SMD data structure and their values, if necessary, is carried out by organizing a request to the BDLP generated in step 102, which consists of identification data of TEs that make up SC 32 and obtaining values 2131 of linguistic characteristics 213 text elements 21 of the linguistic sentence 11, of which the element 32 (SC) consists. Moreover, as described previously, the linguistic features of the elements 31 (ISO) and 32 (SC) are at least the morphological, syntactic and semantic characteristics of the text elements 21 of the linguistic sentence 11, of which the elements 31 (ISO) and 32 (SC) are composed.

[0079] FIG. 9, as an example, but not limitation, depicts a General diagram of the steps of step 104 of the formation of the third data structure SMD. Step 104 is characterized by: performing step 1041 of identifying a first element of a third DMD data structure, in which elements 41 of a third SMD data structure are identified or formed, and

the identification data of the elements 41, which, for each element 41, is, by way of example, but not limitation, the value 411 of the element 41 of the third SMD data structure and the serial number (s) 412 of the TE of the linguistic sentence 11 constituting the (component) element 41; performing step 1042 of forming the second element of the third SMD data structure, on which the elements 42 of the third SMD data structure are formed, as well as the identification data of the elements 42, which for each element 42, as an example, but not limitation, the value 421 of the element 42 of the third SMD data structure and serial number (numbers) 422 TE of linguistic sentence 11, constituting

(constituent) element 42; performing step 1043 of the formation of the third SMD data structure, in which the third SMD data structure is formed by combining the identified elements 41 and 42 and their

identification data.

[0080] FIG. 10, by way of example, but not limitation, the general structure of the generated third SMD data structure is shown. The third SMD data structure is a SMD containing elements 41 and / or elements 42, which are linguistic objects (LO) of the linguistic sentence 11 and the identification data of the LO, representing for each LO, as an example, but not limitation, the value 411 element 41 and (or) the value 421 of element 42 (the value of elements 41 and (or) 42), as well as the serial number (numbers) 412 and (or) 422 text elements 21 of the linguistic sentence constituting the (component) element 41 and (or) element 42 (orders numbered TE numbers of elements 41 and (or) 42).

A linguistic object (LO) is a special syntactic unit (ELE). The LO of the linguistic sentence 11 has their unique names characterizing them (UN LO), which have practical use as an example, but not limitation, is: "a simple sentence without homogeneous members and turns", "the first phrase" and the like. In the data structure, elements 41 and 42 are referred to by unique names, as an example, but not limitation, can be referred to as “UN LO p”, where n> 1 is the ordinal index of the unique name element in the linguistic sentence, starting with “1” for each new UN LO, not previously met in

linguistic sentence. In the future, when referring to LO as an element of the third data structure or elements 41 and 42 of the third data structure, we mean UN LO. OSE is preliminarily formed as part of a linguological transformation, the result of which is the correlation of linguistic and logical objects and the transformation

syntactic units of sentences in an array of logical objects. The linguistic-logical transformation mechanism allows, on the basis of a previously known idea of the logical model of the sentence (the logical structure of a simple uncomplicated sentence, identical to a simple proposition) to establish a correlation (relationship) between the logical objects of the sentence and syntactic units of a sentence. In the course of forming a table of interconnected (linguistic and logical) objects, in each individual case, such EFs are formed that are the desired linguistic objects (the sought-after LO) that can be identified with the sought-after logical objects (the sought-after LogO). The sought-for LO thus formed can be completely identical to one or several CEs (such a sought LO will generate the first linguistic object (the first LO), which is an element of 41 linguistic sentences 11), or it can be a derived syntactic object obtained by converting one or more CEs to an object that will be an object that is identical to the desired LO (such a sought LO will generate a second linguistic object (second LO), which is an element of 42

linguistic sentences 11). Searched LogOs can be individual logical objects or groups of logical objects that are

logical entities and formed according to the laws of logic (the science of thinking). At the same time, there are basic academic logical entities, as an example, but not restrictions, such as “concept”, “judgment”, “subject of judgment” (something that is affirmed or denied in the judgment), “predicate

judgments ”(that which is affirmed or denied about the subject of judgment). At the same time, for practical purposes, other logical entities can be formed that reflect the needs of the current applied problem and are used as part of the linguistic-logical transformation. For example, without limitation, in a predicate of a judgment, separate more functionally homogeneous logical entities can be distinguished than the predicate of a proposition itself, namely, “the action of a predicate of a proposition”, “an object of a predicate of a proposition” and “circumstances of a predicate of a proposition”.

[0081] The identification of element 41 (first L O) of the third SMD data structure in step 1041 is performed by analyzing the composition of the desired LO. In this case, the composition of the required LOs is preliminarily set in the form of a CE of a unique name (UN CE) or a list of CE of unique names with an indication of the method

the formation of LO, which, without being limited to, can be one of the following methods: identification with the installed (several installed) CE or conversion of the installed (installed) CE according to the previously described method. The aforementioned composition of the desired LO and methods of forming the LO form This is the second user database (the second PBB), which is thus the database of the sought-after LOs, which contains, among other things, the linguistic characteristics of the text elements from which the sought-after LOs are formed. If as a result of the analysis of the composition of a particular LO it is established that

the analyzed LO consists of identifiable CEs (elements 31 or elements 32), then such LOs are identified (if identical to one CE) or formed (if identical to several CEs) as element 41 of the third SMD data structure. The syntactic units (elements 31 or elements 32) of the second SMD data structure identified in the analysis of the composition of linguistic objects in accordance with predefined parameters

the identification or formation of elements 41 of the third SMD data structure contained in the description of the desired linguistic objects is identified with the first LO (first linguistic object (element 41 of the third SMD data structure)), whereby element 41 of the third SMD data structure is formed. An additional part of the element 41 of the third SMD data structure can be service parts of speech (prepositions, conjunctions, particles) and punctuation marks located between syntactic units combined under the element 41 of the third SMD data structure in accordance with the method

the formation of the first LO, set by the user. If, as a result of the analysis of the composition of a particular LO, it is established that the analyzed LO consists of converted CEs, then such a linguistic object will belong to the elements 42 of the third SMD data structure (second LO) and will be formed as part of step 1042. Formation of element 42 of the third SMD data structure ( the second LO) in the framework of step 1042 is produced by converting CE,

predefined in the description of the desired LO of the form "second LO" by

a predefined conversion method to the second PBB.

Converting CE to elements 42 of the third SMD data structure is an optional step for the present conversion method.

structured data array. The present method of transforming a structured data array can be implemented with sufficient efficiency and accuracy even if only element 41 of the third SMD data structure is identified, provided that all the required LRs generate only the first LR that is an element 41 of linguistic sentence 11.

[0082] Identification of the value and serial number (numbers) of the fuel cell, element 41 (first LO) of the third DMD data structure within the framework of step 1041 is performed as follows: the value (s) of 311 or 321 elements (elements) 31 or 32 of the linguistic sentence 11,

identified (identified) with element 41 (first LO),

identify as the value 411 of element 41 (first LO), and ordinal

(serial) number (s) 312 or 322 TE of the element (s) 31 or 32 of the linguistic sentence 11 identified (identified) with element 41 (first LO) is identified as the serial (serial) number (numbers) of 412 text (text) elements ( elements) constituting

(constituent) element 41 (first LO). If the formation of element 41 is carried out according to the requirements of the second PDU by combining several CEs, then the values and serial numbers of TE of service parts of speech are added to the values and numbers of TE of element 41 (first LO) of the third data structure of SMD

(prepositions, conjunctions, particles) and punctuation marks between

unified CE. The identification of the value and serial (ordinal) numbers of the TE of element 42 (second LO) of the third SMD data structure within the framework of step 1042 is carried out as follows: the value (values) of the converted CE (from element (s) 31 and (or) 32 of the linguistic sentence 11), identified (identified) with element 42 (second LO),

identify as the value 421 of element 42 (second L O), and the ordinal

(serial) number (s) of transformed CE (from element (s) 31 and (or) 32 of linguistic sentence 11), identified

(identified) with element 42 (second LO) is identified as the serial number (s) 422 of the text (text) element (s) constituting (making up) element 42 (second LO).

[0083] The formation of the third SMD data structure during step 1043 is performed by combining in one data structure the elements 41 of the third SMD data structure and the elements 42 of the third SMD data structure, as well as their identification data according to the principles and methods known from the prior art, which, respectively further not described in detail.

[0084] Identification of the linguistic characteristics of TEs that make up the elements 41 (first LO) of the third SMD data structure and their values, if necessary, is carried out by organizing a request in the BDLP, which is formed as part of step 102, consisting of TE identification data constituting the first LO 41, and obtaining the values 2131 of the linguistic characteristics 213 of the text elements 21 of the linguistic sentence 11, of which the element 41 (the first LO) consists. Identification of the linguistic characteristics of TEs that make up the elements 42 (second LO) of the third SMD data structure and their values, if necessary, is carried out by organizing a request to the BDLP generated in step 102, which consists of identification data of TEs that make up the second LO 42 and obtaining 2131 linguistic values characteristics 213 of the text elements 21 of the linguistic sentence 11 of which element 42 (the second LO) consists. Moreover, as previously described, the linguistic features of elements 41 (first LO) and 42 (second LO) are at least the morphological, syntactic, and semantic characteristics of the text elements 21 of the linguistic sentence 11, of which the elements 41 (first LO) and 42 (second LO).

[0085] FIG. 11, as an example, but not limitation, depicts a General diagram of the execution stage 105 of the formation of the fourth data structure SMD. Step 105 is characterized by the formation of the element of the fourth data structure, in which the elements 51 of the fourth data structure of the DMD are formed, as well as the identification data of the elements 51, which for each element 51, as an example, but not limitation, the value 511 of the element 51 of the fourth data structure of the DMD and serial number (s) 512 TE of linguistic sentence 11 constituting the element 51.

[0086] FIG. 12, by way of example, but not limitation, the general structure of the generated fourth SMD data structure is shown. The fourth SMD data structure is a SMD containing the elements 51 of the fourth SMD data structure, which are logical entities (LogO) of the linguistic sentence 11 and the LogO identification data, which for each LogO, as an example, but not limitation, the value 511 of the element 51 Fourth SMD Data Structure

(value of element 51) and serial (serial) number (s) of 512 TEs that make up (components) element 51 (serial numbers 512 TEs of element 51). Logical objects (LogO) are logical entities and

formed according to the laws of logic (science of thinking). The LogO of the linguistic sentence 11 has unique names characterizing them (UN LogO), which have practical use, as an example, but not restrictions - “concept”, “judgment”, “subject of judgment” (that which is affirmed or denied in the judgment), “predicate of judgment” (that

affirmed or denied about the subject of judgment and the like. In the data structure, elements 51 are referred to by unique names, as an example, but not limitation, can be referred to as “UN LogO p”, where n> 1 is the ordinal index of the unique name element in the linguistic sentence, starting with “1” for each new UN LogO previously not found in the linguistic sentence of the element in the linguistic sentence. In the future, when mentioning LogO as an element of the fourth data structure or element 51 of the fourth data structure, we mean the UNO LogO. An array of logical objects of a sentence is a different, non-linguistic way of registering the meaning of what is being discussed in a linguistic sentence. Unlike a linguistic sentence, the meaning of which can be saved only if all the text elements of the sentence are registered in one place (as a single unit of information), an array of logical objects allows storing the same information without loss of meaning in a divided form, in many independent places of information storage (as an array of logical objects). Moreover, the formation of an array of logical objects allows not only to divide the single information into parts, but also to systematically structure these parts for any transformed within the framework of this method

linguistic sentences. The ability to register and store logically and systemically structured information in parts opens up new

opportunities for high-precision processing of such information, since in this case the search and analysis of information can technically be carried out not in a single volume of source information (linguistic sentence), but only in certain logical parts of this information, the characteristics and requirements of which correspond to the logical essence and logical subject of the search or analysis. As a result, this method of storing information significantly reduces the volume (zone) of search and analysis, which leads to a decrease in time and an increase in the quality (relevance, accuracy) of search and analysis.

[0087] The formation of the element 51 (LogO) of the fourth SMD data structure in step 105 is performed using the third SMD data structure containing linguistic objects (LO) and a correlation database

linguistic and logical objects, representing in this way third user database (third PBB). The third PBB is formed previously for linguistic-logical conversion. The third PBB is a list of the necessary logical objects that can be highlighted in a linguistic sentence and into which it can eventually be divided. In addition, the third PBB contains a list of linguistic objects

unique names (UN LO) and descriptions of linguistic objects that are supposed to be identical to the desired logical objects

unique names (UN LogO) under the specified conditions (the specified composition of the UN LO (CE list) and (or) the specified CE conversion method, as well as the indicated linguistic characteristics of the mentioned CEs). To form a logical object 51 correlated with the selected linguistic object 41 or 42, a separate LO (from the list of LOs) is selected in the third data structure and, using the third PBB, a logical object (UNLOGO) correlated with UNLO is detected, as an example, but not limitation , choose “the first phrase” (the phrase consisting of a simple sentence without homogeneous members and turns (for example, subject) and all words dependent on it) from the third LO data structure with a unique name (UN LO))) and send this UN LO as a request to the third DBB to search for the selected UN LO in the list of unique names of LOs there. If such a VL LO is detected (for example, “the first phrase”), in accordance with the third PBB, a VL of the logical object identified with it is obtained (in the given example, VL LogO is the “subject of judgment”). In this way, a logical object is formed (element 51 of the fourth SMD data structure) for

the selected linguistic object 41 or 42. The result of the formation of element 51 is the identified UNLOGO.

[0088] The identification of the values and sequence numbers of the TE element 51 (LogO) of the fourth SMD data structure within the framework of step 105 is performed

as follows: the value (s) 211 of the text element (s) 21 of the linguistic sentence 11 constituting (the components) element 41 or 42 correlated with element 51 are identified as the value 511 of element 51 (LogO), and the serial (ordinal) number (numbers) 212 of the said (mentioned) textual element (s) 21 of the linguistic sentence 11 constituting

element 41 or 42 correlated with element 51 is identified as ordinal (ordinal) number (s) of the text (text) element (s) 512 constituting the (component) element 51 (LogO).

[0089] The formation of the fourth SMD data structure during step 105 is performed by combining in one data structure the elements 51 of the fourth SMD data structure, as well as their identification data according to principles and methods known from the prior art, which, accordingly, are not described in further detail below.

[0090] Identification of the linguistic characteristics of TEs constituting the elements 51 (LogO) of the fourth SMD data structure and their values, if necessary, is carried out by organizing a request in the BDLP generated in step 102, consisting of TE identification data constituting LogO 51, and obtaining values 2131 linguistic characteristics of 213 text elements 21 of the linguistic sentence 11, of which consists of element 51 (LogO). Moreover, as previously described by linguistic

signs of elements 51 (LogO) are at least the morphological, syntactic and semantic characteristics of the text elements 21 of the linguistic sentence 11, of which the elements 51 (LogO) are composed.

[0091] Further, after completing the previously described step 103, in order to improve the accuracy of the subsequent search in the structured data array, it becomes possible to further transform the SMD,

containing at least the syntactic units (CE) of the linguistic sentence and their identification data.

[0092] FIG. 13, by way of example, but not limitation, a general flow chart of the steps of the inventive SMD conversion method 200 is shown, comprising at least the syntactic units (CE) of the linguistic sentence and the identity of the CE being the source

the data structure for the method 200 in question. The claimed method 200 for converting an SMD containing at least a linguistic sentence CE and CE identification data is characterized by: performing an identification step 201 of a transformable data structure containing a CE, in which an SMD data structure containing elements of said transformable data structure containing CE, which is the original data structure of the DMD within

the method 200 in question, said elements being suitable for data structure transformations are syntactic units (CE) of a linguistic sentence and CE identification data;

performing step 202 of the formation of the fifth SMD data structure, in which the fifth SMD data structure is formed, which is the first SMD data structure in the framework of the method 200 under consideration, containing elements of said fifth SMD data structure, said elements of the fifth SMD data structure being linguistic logical units ( LLE)

linguistic sentences identified and generated by the results of linguo-logical analysis of syntactic units, and also represent LLE identification data, representing for each LLE, as an example, but not limitation: LLE value and serial (ordinal) number (s) of the Linguistic TE proposal constituting

(constituents) LLE; the execution of step 203 of the formation of the sixth SMD data structure, which forms the sixth SMD data structure, which is the second SMD data structure in the framework of the method 200,

containing elements of the said sixth structure of the SMD, and the mentioned elements of the sixth data structure of the SMD are groups of linguistic logical units (LLE groups) of a linguistic sentence

formed on the basis of the LLE syntactic links scheme in a simple sentence of the original linguistic sentence, and also represent the identification data of the LLE groups, representing for each LLE group, as an example, but not limitation: the value of the LLE group and the sequence numbers of the TE linguistic sentences that make up the group LLE; the execution of the step 204 of forming the seventh SMD data structure, which forms the seventh SMD data structure, which is the final SMD data structure for the method 200 under consideration, containing elements of the seventh SMD data structure, said elements of the seventh SMD data structure being the main linguistic-logical objects ( OLLO) a linguistic sentence formed from LLE groups by eliminating homogeneities in LLE groups, and also represent the identification data of the LLLE, representing amounts to the each OLLO, as an example, but not limited to: the value of OLLO and serial (serial) number (s) TE linguistic offer amounts

(constituents) OLLLO. [0093] FIG. 14, by way of example, but not limitation, a general flowchart is shown for performing a transformable data structure containing syntactic unit (CE) data structure 201 that is the original data structure for the method 200 in question. 201 is a process for identifying a data structure suitable for conversion containing CE, on which the elements 22 of the data structure suitable for transformation containing CE are identified, as well as the identification data of the elements 22, which are for of each element 22, by way of example, but not limitation, the value 221 of element 22 is suitable for converting a data structure containing CE and serial number (s) 222 of the TE of the linguistic sentence 11 constituting (constituting) element 22.

[0094] FIG. 15, as an example, but not limitation, depicts a General data structure, which is the original data structure for the present conversion method 200, which is suitable for

transform data structure containing syntactic units.

The original data structure is a DMD containing elements 22 of a transformable data structure containing CE,

representing syntactic units (CE) and CE identification data, representing for each CE, as an example, but not limitation, the value 221 of element 22 is suitable for converting a data structure containing CE and serial number (s) 222 of the linguistic proposals constituting (constituting) the CE, which, by way of example, but not limitation, have been described previously with reference to FIG. 8 of the method 100. Elements 22 of the linguistic sentence 11 have their unique names characterizing them, having practical use, as an example, but not limitation: “subject”,

"Predicate", "direct complement" and the like. In the data structure, elements 22 are referred to by unique names and, as an example, but not limited to, may be referred to as “UN CE p”, where n> 1 is the index number of the unique name element in the linguistic sentence, starting with “1” for each new CE not previously seen in a linguistic sentence. As mentioned earlier, CE can be of two types: source syntactic objects (ISO), that is, syntactic units - members suggestions identified by parsing, and

syntactic constructions representing a construction from

syntactically related ISOs. Elements 22 of a transformable data structure containing CE are a plurality of ISO

unique names (UN ISO) of the linguistic sentence 11. ISO 22 in a transformable data structure containing CEs are separate elements prepared in advance and placed in a convertible data structure containing CE in the form of a structured array (list, list, etc.) ) individual ISO. Such

preparatory actions can be carried out by any method known in the art or unknown by the method, such as the method specified in the description of method 100, and, accordingly, are not further described.

[0095] The identification of the elements 22 of the transformable data structure containing the CE during step 201 is reduced to the classification of the elements of which the transformable data structure containing the CE consists of the original syntax objects (ISO 22). In this case, individual ISO 22s have a value of 221 and a serial (serial) number

(numbers) 222 TE of the linguistic sentence constituting the syntactic units. The value 221 of ISO 22 is one or more text elements (TE 21) of proposal 11 of which ISO is composed. As a rule, ISOs consist of one TE 21, with the exception of some ISOs.

For example, but not limited to, the ISO "predicate" may consist of three TE 21 - "could be completed." The ISO “predicate” consists of three text elements due to the linguistic features of the formation of the predicate, which, in addition to the verb “completed”, can have the auxiliary verb “to be” and modality

(modal verb) "could."

[0096] The identification of the value 221 of the element 22 (ISO) suitable for converting the data structure containing the CE during step 201 is performed by registering the character set of the TE 21 of proposal 11 constituting

(constituent) element of ISO 22. The identification of the ordinal (ordinal) number (s) of 222TE, constituting (constituent) element 22 (ISO) of a suitable data structure containing CE during step 201 is performed by registering the ordinal (ordinal) number (s) mentioned TE in accordance with its (their) location in the linguistic sentence, provided that the first TE in the linguistic sentence has a serial number of "1", and all subsequent TE have a serial number greater by one than the serial number of the previous TE.

[0097] A transformable data structure containing CE that is the starting point for the present conversion method 200 is the result of a plurality of elements 22, their values 221 and sequence numbers 222 of the FCs constituting the elements 22 identified in step 201.

[0098] Identification of the linguistic characteristics of TEs constituting elements 22 (ISO) of a data structure suitable for transforming CE and their values, if necessary, is carried out by organizing a request in BDLP generated in step 102, which consists of identification data of TEs that comprise ISO 22, and obtaining the values 2131 of the linguistic characteristics 213 of the text elements 21 of the linguistic sentence 11 of which the element 22 (ISO) consists. Moreover, as described previously,

the linguistic features of elements 22 (ISO) are at least the morphological, syntactic and semantic characteristics of the text elements 21 of the linguistic sentence 11 of which the elements 22 (ISO) are composed.

[0099] FIG. 16, by way of example, but not limitation, a general flowchart is shown of the steps of forming the fifth SMD data structure 202, which is the first data structure for the method 200 under consideration. Step 202 is characterized by: performing step 2021 of generating values 2141 of the second part of linguistic characteristics 214 of text elements 21 constituting elements 22 of a transformable data structure containing CE on which values 2141 of the second part are formed

the linguistic characteristics of the 214 text elements 21 constituting the elements 22 and enter the received information into the BDLP of the text elements 21 of the linguistic sentence 11 formed in step 102, forming as a result the BDLP of the text elements 21 of the linguistic sentence 11 formed in the frame of step 2021; by performing step 2022 of generating values 2231 of the third part of the linguistic characteristics 223 of the text elements 21 constituting the elements 22 of a transformable data structure containing CE on which the types of element 22 are identified suitable for transformation of a data structure containing CE for which it is necessary to change some values of linguistic characteristics, the mentioned values of 2231 of the third part of linguistic

characteristics 223 of the text elements 21 constituting the elements 22 and inputting the received information into the BDLP of the text elements 21 of the linguistic sentence 11 generated in step 2021, forming as a result the BDLP of the text elements 21 of the linguistic sentence 11 formed in the framework of step 2022; by performing step 2023 of identifying elements 61 of the fifth DMD data structure, on which elements 61, of the fifth SMD data structure, and identifying elements 61, representing for each element 61, are identified and formed, as an example, but not limitation, the value 611 of element 61 fifth SMD data structure and

serial number (s) 612 TE of the linguistic sentence 11 constituting the component 61 and form the fifth data structure of the SMD.

[0100] FIG. 17, by way of example, but not limitation, the general structure of the database of linguistic features (BDLP) generated in step 2021 is shown, which is BDLP of the text elements 21 of the linguistic sentence 11 contained in the elements 22 suitable for transforming a data structure containing CE. The BDLP generated in step 2021 differs from the BDLP generated in step 102 in the presence of values 2141 of the second part of the linguistic characteristics 214, indicating

syntactic-logical identity of element 22 (ISO). To identify linguistic logical units (LLE) of a linguistic sentence, it is necessary to check for all syntactic units of the ISO type (element 22) that their syntactic and logical roles are identical in the linguistic

Proposition 11. As a result of the above verification of the syntactic units of ISO 22 for their syntactic-logical identity, the presence or absence of the identity of ISO 22 should be established. ISO 22 will be considered

syntactically logical identical if its syntactic and logical roles coincide, and it will be considered syntactically logical non-identical if the mentioned roles do not match. The syntactic role of ISO 22 is the function of a word in a sentence, which is defined as part of the interconnections of words in a sentence. The syntactic role of ISO 22 determines which member of the sentence is a word, and in what syntactic relations it consists of other words. One and the same form of a word can fulfill different functions, that is, be different members of a sentence. The logical role of ISO 22 is a syntactic role identified with the logical function of a word in a simple proposition. View

logical function in a simple proposition is associated with the unique name of a logical object (UN LO), which is an element of a simple proposition, which refers to ISO 22. A simple proposition is a logical entity, which is the primary logical construction of thinking, necessarily containing elements of a simple proposition - the subject of the proposition

(the subject of the surrounding world), and the predicate of judgment (that which is affirmed or refuted about the subject of judgment). In this case, the subject of judgment is one object of the world, and the predicate of judgment consists usually of an action and many objects that are in a certain relationship with each other, which ultimately performs the function of a predicate of judgment, that is, it explains what is affirmed or disproved about the subject of judgment. Therefore, the predicate of judgment can be divided into logical functionally homogeneous parts (logical objects), as an example, but not restrictions, such as an action, an object, and

judgment predicate circumstances. The previously mentioned subject of judgment, as well as the action, object and circumstances of the predicate of judgment may be

unique names of the logical object (UN LO), which is an element of simple judgment. In accordance with the word used (ISO 22) by which elements of simple judgment are expressed, these words (ISO 22) will have a certain logical function, which, in accordance with predefined conditions, can be identified or not

identify with a particular syntactic role. Thus

it becomes possible to check the identity of the syntactic and logical roles of individual ISO 22 linguistic sentences 11 and convert ISO 22 into linguistic logical units (LLE).

[0101] The formation of the second part of the linguistic characteristics 214 and their values 2141 for the text elements 21 of the linguistic sentence 11 contained in the elements 22 in order to identify the syntactic-logical identity of ISO 22 at step 2021 is performed by checking the ISO 22 for the identity of the syntactic and logical roles in the linguistic Proposition 11. The check is a comparison of the values 2131 of the first part of the linguistic characteristics 213 of the text elements 21 constituting the elements 22 with the predefined values of the predefined linguistic characteristics of the text elements 21 constituting the ISO 22. Conditions are predefined for the verification, such as, as example but not

restrictions: lists of pairs of ISO 22 for which the syntactic and logical roles may not coincide, as well as signs of a mismatch of the syntactic and logical roles (some values of the linguistic characteristics of these ISO 22, in the presence of which ISO 22 is identified as ISO 22 with a mismatch of syntactic and logical roles) . ISO 22, which successfully pass

said verification is identified as ISO 22 with matching syntactic and logical roles (SL-identical ISO 22, that is, syntactically logical identical ISO). ISO 22, which do not pass the test because their roles do not match, are identified as ISO 22 with mismatched syntactic and logical roles (SL non-identical ISO 22, i.e. syntactically logical non-identical ISOs). For example, but not

limiting itself, the ISO in the form of a verb has a syntactic role - the "predicate", and the ISO in the form of participle - the syntactic role of the "agreed definition". Despite the fact that the syntactic roles of the verb and the participle do not coincide, from the logical point of view, both the ISO and the other are syntactically the main words in the same element of a simple proposition “the action of the predicate of proposition”, that is, they have the same logical role. As a result, the ISO in the form of a verb passes the verification of the identity of the syntactic and logical roles, since the “verb” is what expresses the action. In this case, ISO in the form

an agreed definition does not pass the test, since a “definition” is a sign, not an action. For this reason, and also under the additional condition that the participle is common (it has dependent words - significant parts of speech), it is considered that the ISO “agreed definition” in the form of participle does not pass the mentioned test and are SL-non-identical to ISO 22. Mentioned conditions used to verify the identity of the syntactic and logical roles, ISO 22 is pre-written in the fourth

user database (fourth PBB).

[0102] The values 2141 (syntactic-logical identity of ISO 22) of the second part of the linguistic values generated as a result of step 2021 characteristics of 214 elements 21 constituting ISO 22 of a data structure suitable for converting CE are introduced into the BDLP of the text elements 21 of the linguistic sentence 11 formed in step 102, thereby forming the BDLP of the text elements 21 of the linguistic sentence 11 formed in step 2021. When this, both parts (213 and 214) of the linguistic characteristics of the text elements 21 of the linguistic sentence 11 and their meanings (2131 and 2141) form unique

linguistic attributes of the elements 22.

[0103] In FIG. 18, by way of example, but not limitation, the general structure of the database of linguistic features (BDLP) generated in step 2022 is shown, which is BDLP of the text elements 21 of the linguistic sentence 11 contained in the elements 22 suitable for transforming a data structure containing CE. BDLP formed in step 2022 differs from BDLP formed in step 2021 in the presence of values 2231 of the third part of linguistic characteristics 223 indicating the type of element 22 (ISO) and actual values of some linguistic characteristics of 213 text elements making up element 22 (ISO) . In order to identify linguistic-logical units (LLE) of linguistic 11, it is necessary for all syntactic units of ISO 22, by which the identity of their syntactic and logical roles was checked, additionally identify types of ISO 22 by the basis of making changes to the values of their linguistic characteristics 213 in accordance with their value is 2141

syntactic-logical identity 214. As a result of a check on step 2021 for syntactic-logical identity, ISO 22 was

it was found that all ISO 22 are divided according to this criterion into SL-identical and SL-identical ISO 22. Moreover, SL-identical ISO 22 can be mutable and unchanged, that is, such SL-non-identical ISO 22, for which, in accordance with a predetermined the condition must be changed 2131 values of some linguistic characteristics 213 or not changed. In addition, in some cases, under predetermined conditions, there may be a requirement to create a duplicate of a SL-non-identical ISO 22, in which the linguistic characteristics do not change. Mentioned conditions and

requirements are established in the fourth PBB.

[0104] The formation of the third part of the linguistic characteristics 223 and their values 2231 for the text elements 21 of the linguistic sentence 11 contained in the elements 22 are produced at step 2022 in order to identify various types of ISO 22 based on syntactic-logical identity (SL-identity) and to duplicate the SL-non-identical ISO 22 by a complex analysis of conditions, established in the fourth PBB, and based on the results of the verification of ISO 22 for SL identity. As a result of the above analysis, a specific view is established for each ISO 22 — the first, second or third view. If, under the conditions established in the fourth PBB, there is a requirement to create a duplicate of SL-non-identical ISO 22, then such ISO 22 is classified as the fourth type of ISO 22. In accordance with the established type of ISO 22

form values 2231 of the third part of the linguistic characteristics of 223 text elements 21 of the linguistic sentence 11 contained in elements 22. The first type of ISO 22 (SL-identical) retains all of their original values 221 and numbers of text elements 222, as well as their linguistic characteristics 213 and 214 with values of 2131 and 2141. In this regard, the values of 2231 linguistic characteristics 213 for such ISO 22 are absent. In the second type, ISO 22 (SL-non-identical variable), their original value 221 and the numbers of text elements 222, as well as the second part of their linguistic characteristics 214 with values of 2141, remain unchanged, but the values of 2131 of some linguistic characteristics 213 of the first part are changed. In this regard, the values 2231 of the linguistic characteristic 223 instead of the variable values 2131 of some linguistic characteristics 213 of such ISO 22 are indicated as established in the fourth PBB. The third type of ISO 22 (SL non-identical unchanging) retains all their original values 221 and text element numbers 222, as well as their linguistic characteristics 213 and 214 with values 2111 and 2141. In this regard, values 2231

there are no linguistic characteristics 213 for such ISO 22. The fourth type of ISO 22 (a duplicate of SL-non-identical unchanged ISOs) remains

their original value 221 and the numbers of text elements 222, as well as the second part of their linguistic characteristics 214 with values 2141, remain unchanged, but the values 2131 of some linguistic characteristics 213 of the first part are changed. In this regard, the values 2231 of the linguistic characteristic 223, instead of the variable values 2131 of some linguistic characteristics 213 of such ISO 22, are indicated as established in the fourth PBB. [0105] The values 2231 generated by the results of step 2022 (modified values of the linguistic characteristics of ISO 22) of the third part of the linguistic characteristics of 223 elements 21 constituting ISO 22 are suitable for

transformations of the data structure containing CE are introduced into the BDLP of the text elements 21 of the linguistic sentence 11, which is formed as part of the step 2021, thus forming the BDLP of the text elements 21 of the linguistic sentence 11, which is formed as part of the step 2022. Moreover, all three parts (213, 214 and 223) linguistic characteristics of text elements 21

linguistic sentences 11 and their meanings (2131, 2141 and 2231) form unique linguistic features of element 22.

[0106] In FIG. 19, by way of example, but not limitation, the general structure of the generated fifth SMD data structure is shown, which is the first for the present conversion method 200. The fifth SMD data structure

represents the SMD containing elements 61 of the fifth data structure of the SMD, which are linguistic logical units (LLE 61)

linguistic sentence 11 and LLE 61 identification data, representing for each LLE, as an example, but not limitation, the value 611 of element 61 of the fifth data structure and the serial number (s) of 612 TEs representing (constituent) LLE 61. U LLE 61 of the linguistic sentence 11 there are unique names characterizing them (UN LLE), which have practical use as an example, but not restrictions: “subject”, “predicate”, “direct complement” and the like. In the data structure, elements 61 are referred to by unique names, and, as an example, but not limitation, can be referred to as “UNLLE p”, where n> 1 is the ordinal index of the unique name element in the linguistic sentence, starting with “1” for each new UN LLE, not previously seen in a linguistic sentence. A linguistic-logical unit (LLE 61) is a syntactic unit of the ISO type, by which a check is made for the identity of its syntactic and logical roles in a linguistic sentence. There can be two types of LLE 61: the first type is

the original ISO 22 that passed the mentioned test, which was contained in a transformable data structure containing CE, and the second type is a duplicate of ISO 22, which was not in the mentioned data structure. The first type of LLE 61 corresponds to all ISO 22 suitable for transformation of the structure data containing CE. The second type of LLE 61 corresponds to those of ISO 22 suitable for the conversion of the data structure containing CE, which are assigned to the fourth type (duplicate of SL-identical identical unchanged ISOs), indicated in the value 2231 of the third part of the linguistic characteristics 223.

[0107] The identification and generation of elements 61 of the fifth DMD data structure during step 2023 is based on the results of step 2022 to identify the types of elements 22 suitable for converting the data structure containing CE. The identification of the elements 61 (LLE) of the first type at step 2023 is carried out by identifying them with the elements 22 (ISO) of the first, second and third types. ISO 22, identified at step 2022 as the fourth type, form new elements - elements 61 of the second type of the fifth data structure of the DMD. The formation of the elements 61 (LLE) of the second type at step 2023 is carried out by identifying them with the elements 22 (ISO) of the fourth type.

[0108] the Identification of the values and serial (serial) numbers (numbers) of the TE element 61 of the first type of the fifth data structure SMD,

identified with the first type of ISO 22, is carried out in step 2023 by identifying the value 611 of element 61 with the value 221 of element 22 of the first type, and the serial number (s) of 612 FCs that make up the element 61 by identifying with the ordinal ( serial) number (numbers) 222 TE, constituting (constituent) element 22 of the first type. The identification of the value and serial number of the TE element 61 of the first type of the fifth SMD data structure, identified with the second type of ISO 22, is carried out in step 2023 by identifying the value 611 of the element 61 with the value 221 of the element 22 of the second type, and

ordinal (ordinal) numbers (numbers) 612ТЭ, constituting element 61, with ordinal (ordinal) numbers 222 ТЭ, constituting (constituting) element 22 of the second kind. The identification of the value and the ordinal number (s) of the TE element 61 of the first type of the fifth SMD data structure identified with the third type of ISO 22 is performed as part of step 2023 by identifying the value 611 of the element 61 with the value 221 of the element 22 of the third kind, and the ordinal ( serial) numbers (numbers) of 612 text elements 21 constituting element 61, with serial (ordinal) number (numbers) 222 of TEs constituting (constituting) element 22 of the third kind. Identification values and serial (serial) numbers (numbers) of TE element 61 of the second type of the fifth data structure of the SMD, identified with the fourth type of ISO 22, is performed as part of step 2023 by identifying the value 611 of element 61 with the value 221 of element 22 of the third type, and the ordinal (ordinal ) numbers (numbers) 612ТЭ of the component (constituent) element 61, with serial (ordinal) number (numbers) 222 of the TE, constituting (constituent) element 22 of the third kind.

[0109] The formation of the fifth SMD data structure during step 2023 is performed by combining in one data structure the elements 61 of the first and second types of the fifth SMD data structure and their identification data according to principles and methods known from the prior art, which, accordingly, are not described in further detail below.

[0110] Identification of the linguistic characteristics of TEs that make up the first or second elements 61 (LLE) of the fifth SMD data structure and their values, if necessary, is carried out by organizing a request in the BDLP generated in step 2022, consisting of TE identification data constituting the first or second element 61 (LLE), and obtaining values (2131, 2141, 2231) of all parts of the linguistic characteristics (213, 214, 223) of the text elements 21 of the linguistic sentence 11 of which the first or second element 61 (LLE) consists. Moreover, as previously described by the linguistic features of the first and second elements 61 (LLE) are at least the morphological, syntactic and semantic characteristics of the text elements 21 of the linguistic sentence 11, of which the elements 61 (LLE) are composed.

[0111] In FIG. 20, by way of example, but not limitation, a general flowchart is shown for performing step 203 of creating the sixth SMD data structure, which is the second data structure for the method 200 under consideration. Step 203 is characterized by: performing step 2031 of generating values 2151 of the fourth part of the linguistic characteristics of 215 text elements 21 constituting elements 61 of the fifth SMD data structure, on which values 2151 of the fourth part of the linguistic characteristics 215 of the text elements 21 constituting the elements 61 are generated and the results obtained from SIC BDLP in the text linguistic elements 21 Proposition 11 that is generated in phase in 2022, forming eventually BDLP text elements 21 linguistic sentence 11 formed in the framework of step 2031; performing step 2032 of forming elements of the sixth SMD data structure, on which the elements of the sixth SMD data structure are formed, as well as the identification data of the elements 71, which for each element 71, as an example, but not limitation, the value 711 of the element of the sixth SMD data structure and serial number (s) 712 TE of the linguistic sentence 11 constituting the element 71; and form a sixth SMD data structure.

[0112] In FIG. 21, by way of example, but not limitation, the general structure of the database of linguistic features (BDLP) generated in step 2031 is shown, which is the BDLP of text elements 21 of the linguistic sentence 11 contained in elements 61 of the fifth data structure of the SMD. BDLP generated in step 2031 differs from BDLP generated in step 2022 in the presence of values 2151 of the fourth part of linguistic characteristics 215, indicating the level of syntactic significance of element 61 (LLE). The level of syntactic significance of element 61 (LLE) characterizes the location of LLE in the syntactic tree of the linguistic sentence 11. The syntactic tree is the syntactic relationship diagram of sentence 11, in which a subordinate is established between individual words (LLE)

syntactic link. Depending on the composition and structure of the proposal

the syntax tree may contain an indefinite amount

syntactic levels and parts of the sentence, which contain the basic logical elements (subject and predicate of judgment). The syntax tree used in the present method contains elements (LLE) of at least three levels of syntactic significance (HSS). The first HSS includes LLE,

located at the top of the syntax tree. The second HSS includes LLEs, which are direct syntactic descendants of syntactic vertices. The third HSS includes all other LLE linguistic

offers. A syntactic vertex is a syntactic unit (LLE) having the syntactic role of “predicate”. A feature of LLE “predicate” is that it can consist of three words. A direct syntactic descendant of a syntactic vertex is a syntactic unit (LLE), which has a direct subordinate relationship with the word of a LLE group of the first level. Establishing the syntactic level of LLE is the syntactic structuring of LLE. [0113] The formation of the fourth part of the linguistic characteristics 215 and their values 2151 for the text elements 21 of the linguistic sentence 11 contained in the elements 61, in order to establish the level of syntactic significance of the elements 61 of the fifth data structure of the SMD during step 2031 is carried out on the basis of a comprehensive analysis of the values of linguistic characteristics all LLE 61 linguistic sentences, on the basis of which the syntactic tree of the linguistic sentence is formed. As a result of ascertaining the position of each LLE in the syntax tree, sentences determine the level of syntactic significance of each LLE 61.

[0114] The values 2151 (level of syntactic significance of LLE 61) of the fourth part of the linguistic

characteristics of 215 elements 21 constituting elements 61 of the fifth SMD data structure are entered into the BDLP of the text elements 21 of the linguistic

sentencesN generated in the framework of step 2022, thus forming the BDLP of the text elements 21 of the linguistic sentence 11, formed in the framework of the step 2031. Moreover, all parts (211, 214, 223, 215) of the linguistic characteristics of the text elements 21 of the linguistic sentence 11 and their meanings ( 2111, 2141, 2231, 2151) form unique linguistic features of elements 61.

[0115] In FIG. 22, by way of example, but not limitation, the general structure of the sixth SMD data structure generated is shown, which is the second data structure for the present conversion method 200. The sixth SMD data structure is a SMD containing elements 71, which are structured groups of linguo logical units (LLE groups) of the linguistic sentence 11 and the identification data of LLE groups, representing for each LLE group, as an example, but not limitation, the value 711 of element 71 of the sixth string data structures and

serial (serial) number (s) of 712 TEs constituting (constituting) the LLE group. LLE groups 71 of the linguistic sentence 11 do not have unique names characterizing them (CN LLE groups) that have practical use. In the data structure, elements 71, by way of example, but not limitation, can be referred to as “LLE group 1”, “LLE group 2”, “LLE group 3”, “LLE group n”, where n> 1 is the element serial number in linguistic sentence. LLE Linguistic Proposal Group 11 - these are syntactic units of the “syntactic construction” form, which are formed from linguo-logical units (elements 61 of the fifth SMD data structure) of different levels of LLE syntactic significance. The first LLE group (first elements 71 of the sixth SMD data structure) includes the main LLEs of the first LLE group, which are the mentioned syntactic vertices (LLE of the first HSS), and other LLEs of the first LLE group (LLE groups 1), which are successively subordinate LLEs, the first of which it has a direct subordinate relationship with the LLE of the first HSS, up to the LLE, which is the main LLE of the second group of LLE. The second LLE group (second elements 71 of the sixth SMD data structure) is referred to as the main LLE of the second LLE group, which are mentioned by the LLE of the second HSS, provided that such LLE has the syntactic role of the subject or any secondary member of the sentence, except for the definition (excluding definitions in the form

common participles or germs), as well as other LLEs of the second LLE group, which are successively subordinate to LLEs, the first of which is LLE of the third HSS, which have a direct subordinate connection with the mentioned main LLE of the second LLE group, up to the last LLEs in the LLE chain with continuous subordinate communication.

[0116] The identification of elements 71 (first and second elements 71) of the sixth data structure CM during step 2032 is performed by

identification of the main LLE of the first and second LLE groups, as well as identification of other LLE of the first and second LLE groups. To identify the main LLEs of the first LLE groups, it is necessary to identify all the LLE “predicate” (including all parts of each LLE “predicate”) of the analyzed sentence. To identify the main LLEs of the second LLE groups, it is necessary to identify all LLEs that have a direct syntactic connection with any part of the LLE “predicate” and which at the same time have the syntactic role of the subject or any secondary member of the sentence, except for the definition (excluding definitions in the form

common participle or gerimony). To identify other LLEs of the first or second LLE group, it is necessary to identify all LLEs that have continuous syntactic subordination, starting from the main LLE of the first or second LLE group and up to another identified main LLE of the first or second LLE group, or to the last LLE in the continuous chain syntactic subordinate link from the main LLE first or second group LLE. The formation of the first elements 71 of the sixth SMD data structure is performed by combining the identified main LLE of the first group LLE and other LLEs for this main LLE. The formation of the second elements 71 of the sixth SM data structure is performed by combining the identified main LLE of the second LLE group and other LLEs for this main LLE.

[0117] Identification of the value and serial number (s) of the TE of the first element 71 (first group of LLEs) of the sixth SMD data structure as part of step 2032 is performed as follows: the value (values) 211 of TE 21 of the linguistic sentence 11 constituting (components) element (s) 61 of which (of) the first element 71 consists,

identify as the value 711 of the first element 71 (the first group of LLEs), and the serial number (s) 212 of the mentioned (mentioned) TE 21 of the linguistic sentence 11 constituting the element (s) 61 of which the first element 71,

identify as serial number (s) 712 TE,

component (s) of the first element 71 (first group LLE).

The identification of the value and serial number of the TE of the second element 71 (second group of LLEs) of the sixth SMD data structure as part of step 2032 is carried out as follows: the value (values) 211 of TE 21 of the linguistic sentence 11 constituting the element (s) ) 61, of which (of which) the second element 71 consists,

identify as the value 711 of the second element 71 (the second group of LLEs), and the serial number (s) 212 of the mentioned (mentioned) TE 21 of the linguistic sentence 11 constituting the (element) element 61 of which (the) second element 71 consists, identify as the serial number (s) 712 TE, constituting (constituent) the second element 71 (second group LLE).

[0118] Generating a sixth SMD data structure during the step

2032 is produced by combining in one data structure the first and second elements 71 of the sixth SMD data structure, as well as their identification data according to principles and methods known from the prior art, which, accordingly, are not described in further detail below.

[0119] Identification of the linguistic characteristics of TE, comprising the first or second elements 71 (the first or second group of LLEs) of the sixth SMD data structure and their values, if necessary, by organizing a request in the BDLP generated in step 2031, consisting of the identification data of the FCs constituting the first or second element 71 (the first or the second group LLE), and obtaining the values (2131, 2141, 2231, 2151) of all parts of the linguistic characteristics (213, 214, 223, 215) of the text elements 21 of the linguistic sentence 11, of which the first or second element 71 consists (the first or second group LLE ) Moreover, as previously described, the linguistic features of the first and second elements 71 (the first and second groups of LLE) are at least the morphological, syntactic and semantic characteristics of the text elements 21 of the linguistic sentence 11, of which the elements 71 (LLE groups) are composed.

[0120] In FIG. 23, by way of example, but not limitation, a general flowchart is shown of the steps of step 204 of generating the seventh SMD data structure, which is the third data structure for the method 200 in question. Step 204 is characterized by: performing step 2041 of identifying the types of elements 71 of the sixth SMD data structure indicating the presence in element 71 of the complications of the linguistic sentence, on which

identify the first and second types of elements 71 of the sixth structure of the SMD data, as well as the identification of the unconverted elements of the seventh

SMD data structures and their identification data, representing for each non-transformed element 81, as an example, but not limitation, the value 811 of the non-transformed element 81 of the seventh SMD data structure and the serial number (s) of 812 TE linguistic sentences 11 constituting ( constituents) the unreformed element 81 (the value and serial numbers of the TE of the unreformed element 81);

performing step 2042 of generating the transformed elements 81 of the seventh SMD data structure, on which the converted elements of the seventh SMD data structure are formed from the second type elements 71, and

the identification data of the transformed elements 81, which for each transformed element 81, is, by way of example, but not limitation, the value 811 of the transformed element 81 of the seventh SMD data structure and the serial number (s) of the 812 TE linguistic sentence 11 constituting (components) transformed item 81 (the value and serial numbers of the TE of the converted element 81); the execution of step 2043 of the formation of the seventh data structure of the SMD, which forms the seventh data structure of the SMD from the converted elements 81 and non-converted elements 81 of the seventh data structure of the SMD.

[0121] In FIG. 24, by way of example, but not limitation, the general structure of the generated seventh SMD data structure is shown, which is the third data structure for the present conversion method 200. The seventh SMD data structure is a SMD containing untransformed elements 81 and transformed elements 81, which are basic linguistic-logical objects (LLL) of the linguistic sentence 11. The LLLL of the linguistic sentence 11 has their unique names characterizing them (UNOLLO), which have practical and use, as an example, but not limitation: "subject of judgment", "predicate of judgment" and the like. In the data structure, elements 81 are referred to by unique names, and, as an example, but not limitation, can be referred to as “UNOLLO p,” where n> 1 is the index number of a unique name element in a linguistic sentence, starting with “1” for each new UN OLLO, not previously seen in a linguistic sentence. Further, at

The mention of OLLO as an element of the seventh data structure means UN OLLO. The main linguistic and logical objects of a sentence are a borderline entity with signs of both linguistic and logical objects.

An entity is an object in a computer system that contains a set of critical rules. From a linguistic point of view, the main linguistic-logical object is a syntactic unit of the form “syntactic construction”, which can be formed from a structured LLE group by heterogenizing the LLE group. The heterogenization of the LLE group is the elimination of homogeneities in the syntactic structure by identifying the syntactic compositional connection in it and transforming the syntactic structure with

homogeneities into several syntactic constructions without homogeneities. A compound link is a link between syntactically equal units of a sentence (in this case, between LLEs) that have the same syntactically main word (main LLE) and the same syntactic role (syntactic role of LLE). LLE between which identified

syntactic compositional connection, are considered “homogeneous members” (LLE- 04). From a logical point of view, OLLOs are logical entities (objects) that are elements of simple judgment, with the help of which they form and convey the idea that something (a predicate of judgment) is affirmed or

refutes the subject of judgment (subject of judgment). Moreover, something (that is affirmed or refuted about the subject of judgment) may be

additionally logically structured. For example, but not limited to, the predicate of judgment can be divided into three logically separable logical entities (objects) - the action of the predicate of judgment, the object of the predicate of judgment and the circumstances of the predicate of judgment. The elements of simple proposition mentioned above for example (the subject of the proposition, the action of the proposition predicate, the object of the proposition predicate, and the circumstances of the proposition predicate) can be

are identified with individual UNOLLOs as separate syntactic constructions of linguistic sentence 11. From a semantic point of view, ALLOs are a generalized conceivable (semantic) image of an element of simple judgment.

[0122] The identification of elements 71 of the first or second type of the sixth SMD data structure in step 2041 is performed by analyzing the linguistic characteristics of the text elements of the sixth SMD data structure constituting element 71 (LLE group), with the aim of identifying syntactic compositional links in the LLE group 71. Syntactic

the composing link between text elements (words, LLE) in the LLE 71 group is identified if the same LLE (words) have the same syntactic parent (the syntactically main word that has a direct syntactic subordinate link to such LLE (words)).

Additional conditions may be the presence of the same syntactic role for such LLEs (words), as well as the presence of a sign of a relationship between such LLEs (words), namely a comma or a compound union. Comparative unions are divided by value into connectives (“and”, “yes” in the meaning of “and”, “neither ... nor”, “also”, “too”, “both ... and”), separation (“ or "," or ... or "," either "," either ... or "," then ... then "," either ... or "," not that ... not that " ) and opposing (“a”, “but”, “however”, “yes” in the meaning of “but”), including comparative ones (“not only ... but also”, “not only ... but and” , "Not only not ... but", "not so much ... how much"). Other own criteria for identifying a compositional connection may be preliminarily set, or, for example, without limitation, ignoring some or an additional condition, for example, not limited to, conditions on the presence of a sign of communication. When identifying the syntactic compositional connection in the LLE group, such LLE groups with homogeneous members are identified as the second type of element 71 of the sixth SMD data structure. All other LLE groups in which the syntactic compositional relationship is not identified

identified as the first type of element 71 of the sixth data structure SMD. In this case, the identified first type of element 71 of the sixth SMD data structure is identified with the non-transformed element 81 of the seventh SMD data structure. The generation of the transformed elements of the seventh SMD data structure (the transformed elements 81 obtained from the second kind of elements 71) at step 2042 is performed based on the syntactic writing link identified at step 2041 and homogeneous members by converting the identified elements 71 of the second type of the sixth SMD data structure. The meaning of the transformation of the second type element 71 at step 2042 is to heterogenize the LLE group, that is, eliminate syntactic homogeneities in the LLE group by forming from the second type element 71 many new LLE groups identical in uniformity to the first type element 71. Moreover, each new LLE group of the said set can contain only one of the mentioned homogeneous members (if the element 71 of the second type contains one row of homogeneous members) or only one unique combination of the said homogeneous members of different rows of homogeneous members (if the element 71 of the second type contains several rows of homogeneous members). Many LLE groups formed from an element 71 of the second type are heterogeneous OLLO (transformed elements 81), that is, formed from one group of LLE 71 of the second type by its heterogeneous transformation

(heterogenization). The heterogenization process of elements of the second type 71 can be carried out, as an example, but not limited, as follows: at the first stage, rows of homogeneous members in the analyzed element 71 of the second type are revealed; in the course of identifying rows of homogeneous members, the rows of homogeneous members themselves and serial numbers of homogeneous members in the series, as well as signs of the connection of each homogeneous member (LLE-OCH) are revealed; at the second stage, chains of homogeneous members are revealed for each identified LLE-OCH element 71 of the second type; in this case, chains of homogeneous members (chains of OCh) are LLE-OCH and a dependent chain of OCH (a chain of syntactically related LLEs, starting from the first syntax descendant of LLE-OCH (syntactically dependent word) and continuing along the chain of syntactic dependence (according to syntactic subordinate relationship) to another LLE-OCH or to the last LLE in element 71 of the second kind;

identified PF chains are classified based on availability

LLE syntax descendant in dependent chain 04, which is not included in dependent chain 04; those chains 04 that have dependent chains 04 without a syntactic descendant outside dependent chain 04 are considered extreme chains of homogeneous members (extreme chains 04), and those chains 04 that have dependent chains 04 with syntactic descendant outside dependent chain 04 are considered internal chains of homogeneous members (internal chains 04); at the third stage, the LLE of an element 71 of the second type is detected that are not included in the chains of homogeneous members (LLE outside the chains 04); at the fourth stage, replaceable dependent chains (replaceable LC) of the element 71 of the second type are formed; at the same time, a replaceable LC is the basis of the transformed elements 81, consisting of one extreme chain 04 and internal chains 04, if such internal chains 04 along a chain of continuous syntactic subordinate communication exist between the extreme chain 04 and LLE outside chains 04; at the fifth stage, heterogeneous basic linguistic-logical objects (ALLO) are formed; at the same time, heterogeneous ALLO is formed from one replaceable LC and LLE outside chain 04 (if there are such LLEs); the number of removable LC elements of the second type 71 determines the number of heterogeneous OLLOs formed from one element 71 of the second type.

[0123] The identification of the types of elements 71 of the sixth SMD data structure in step 2041 does not change the value 711 of element 71 and does not change the sequence numbers of the text elements 21 of the linguistic sentence 11 constituting element 71 (LLE group). The identification of the value and the ordinal number (s) of the TE of the non-transformed element 81 (OLLL) of the seventh SMD data structure as part of step 2041 is performed as follows: the value (values) 211 of the TE 21 of the linguistic sentence 11 constituting the element 71 of the first kind of the sixth The SMD data structures are identified as the value 811 of the non-transformed element 81 (OLLL), and the serial number (s) 712 of the TE 21 of the linguistic sentence 11 constituting the (element) element 71 of which the non-transform This element 81 is identified as ordinal (ordinal) number (s) 812 of the fuel cell constituting the (s) non-transformed element 81 (OLLO). The identification of the value and serial number of the TE of the transformed element 81 (heterogeneous OLLO) of the seventh SMD data structure within the framework of step 2042 is performed as follows:

the value (s) of 211 TE 21 of the linguistic sentence 11 constituting (constituent) element 71 of the second type of the sixth SMD data structure, with the exception of the value (s) of 211 TE 21 of the linguistic sentence 11 constituting (constituting) a separate part of the element 71 of the second type (LLE-OCH), which (which) was (were) removed (deleted) during heterogenization, and signs of communication of the deleted LLE-OCH (syntactically

subordinate remote LLE-OCH composing unions or punctuation marks) are identified as the value 811 of the transformed element 81 (ALLO), and the serial number (s) 212 TE 21 linguistic

Proposals 11 constituting (constituting) element 71 of the second type of the sixth structure of the SMD data, excluding the serial number

(rooms) 212 TE 21 of the linguistic sentence 11 constituting

(components) the separate (s) part (s) of the element 71 of the second type (LLE-OCH), which (which) was (were) deleted (deleted) during heterogenization, and signs of communication of the deleted LLE-OCH (syntactically subordinate to the remote LLE-OCH joint unions or punctuation marks) are identified as the serial number (s) of 812 TEs that make up the transformed element 81 (heterogeneous OLLO).

[0124] Formation of a seventh SMD data structure as part of the step

2043 is produced by combining in one data structure

elements 81 of the seventh SMD data structure identified in step 2041 and formed in step 2042, as well as their identification data according to principles and methods known from the prior art, which, accordingly, are not described in further detail below. The first of the combined elements of the seventh SMD data structure is the element identified in step 2041 as an element of the first kind 71, which is an unreformed element 81, that is, ALLO 81, which coincides with the group LLE 71 without syntactic juxtaposition. The second of the combined elements of the seventh SMD data structure are transformed elements 81, formed in step 2042 as a set of transformed elements 81, which are heterogeneous OLLO, formed from an element 71 of the second kind (LLE groups with syntactic co-linking) by heterogenizing LLE groups of the second kind.

[0125] Identification of the linguistic characteristics of TE;

constituting the first or second elements 81 (ALLO) of the seventh structure of the SMD data and their values, if necessary, by organizing a request to the BDLP generated in step 2031, consisting of TE identification data constituting the first or second element 81 (ALLO), and obtaining the values (2131, 2141, 2231, 2151) of all parts of the linguistic characteristics (213, 214, 223, 215) of the text elements 21 of the linguistic sentence 11, of which the first or second element 81 (OLLO) consists. Moreover, as previously described, the linguistic features of elements 81 (ALLO) are at least the morphological, syntactic, and semantic characteristics of the text elements 21 of the linguistic sentence 11, of which elements 81 (ALLO) consist.

[0126] Further, after completing the previously described step 204, in order to further improve the accuracy of the subsequent search in the structured data array (SMD), it becomes possible to further

transformation of the SMD containing at least the main linguistic-logical objects (LLL) of the linguistic sentence and their identification data.

[0127] In FIG. 25, as an example, but not limitation, depicts a General diagram of the steps of the claimed method 300 for converting SMD containing at least the main linguistic-logical objects (LLL) of a linguistic sentence and LLL identification data,

which is the original data structure for the considered method 300. The claimed method 300 for converting SMD containing at least

The LLLO of the linguistic sentence and the identification data of the LLLF is characterized by: performing the identification step 301 of a suitable data structure containing the LLLF, on which

identifying the SMD data structure containing elements of said transformable data structure containing OLLO, which is the original SMD data structure within the framework of the method 300 in question, said elements of the transformable data structure being the main linguistic-logical objects (OLLL) Linguistic Suggestions and Identity OLLO; performing the step 302 of forming the eighth data structure of the SMD, which forms the eighth data structure of the SMD, which is the first data structure of the SMD for the method 300 under consideration, containing elements of the said eighth data structure of the SMD, the elements of the eighth data structure of the SMD being the original simple linguistic logical constructions (IP LLK) linguistic sentences,

formed from syntactically related OLLL simple sentences of the original linguistic sentence, and also represent

the identification data of the IP LLK, representing, for each IP, LLK, as an example, but not limitation: the value of the IP LLK and serial numbers of TE linguistic sentences that make up the IP LLK; performing step 303 of generating the ninth SMD data structure, which forms the ninth SMD data structure, which is the second SMD data structure for the method 300 under consideration, containing elements of the ninth SMD data structure, said elements of the ninth SMD data structure being simple linguistic-logical constructions (PLCK ) linguistic sentences formed from IP LLK by eliminating homogeneities in IP LLK, and also represent PLCK identification data,

representing for each PLCC, as an example, but not limitation: the meaning of PLCC and serial numbers of TE of the linguistic sentence that make up PLCC; performing the step 304 of forming the tenth SMD data structure, on which the tenth SMD data structure is formed, which is the third SMD data structure for the method 300 under consideration, containing an element of the tenth SMD data structure, said element of the tenth SMD data structure being the initial complex linguistic-logical construction (IP LLK) linguistic sentences,

formed by combining all PLCs taking into account the syntactic links of PLCC in the original linguistic sentence, and also represents the identification information of the LLK IC, which, as an example, but not limitation: the value of the LLK IC and serial numbers of the TE linguistic sentences that make up the LLK IS; performing step 305 of generating the eleventh DMD data structure, in which the eleventh SMD data structure is formed, which is the final SMD data structure for of the method 300 under consideration, containing elements of the eleventh SMD data structure, said elements of the eleventh SMD data structure being complex linguistic-logical constructions (SLLC) of a linguistic sentence formed from the SLC information system by eliminating uniformities in the SLC information system, as well as the SLC data identification data representing, for each SLLK, as an example, but not limitation: the value of SLLK and serial numbers of TE linguistic

proposals making up the SLLK.

[0128] In FIG. 26, by way of example, but not limitation, a general flowchart of an identification step 301 suitable for

transforming the data structure containing the main linguistic-logical objects (LLLO), which is the initial data structure for the considered method 300. Step 301 is characterized by identifying suitable for the conversion of the data structure containing LLLO, on which

identifying elements 91 of a transformable data structure containing OLLL as well as identifying elements 91,

representing, for each OLLO, as an example, but not limitation, the value of 911 element 91 is suitable for converting a data structure containing OLLO and serial number (s) 912 TE

linguistic sentences 11 constituting (constituent) element 91.

[0129] In FIG. 27, as an example, but not limitation, depicts the General data structure that is the source for the present method

transform 300, which is a transformable data structure containing basic linguistic logic objects (LLLO). The original data structure is a DMD containing elements 91 of a transformable data structure containing OLO,

representing the main linguistic-logical objects (OLLL) and

OLLO identification data, representing, for each OLLO, as an example, but not limitation, the value of 911 element 91 is suitable for converting the data structure containing OLLO and serial number (s) 912 TE linguistic sentences constituting

(constituent) OLLO, which, by way of example, but not limitation, have been described previously with reference to FIG. 24 ways 200. Element 91 of the linguistic sentence 11 has unique names that characterize them (CN OLLO), having practical use as an example, but not limitation: “subject of judgment”, “action of predicate of judgment” and the like. Elements 91 are called unique in the data structure.

names, as an example, but not limitation, can be referred to as “UNOLLO p,” where n> 1 is the ordinal index of the unique name element in the linguistic sentence, starting with “1” for each new UNOLLO not previously encountered in the linguistic sentence . As mentioned earlier, from a linguistic point of view, OLLOs are a syntactic unit of the form “syntactic construction”, which can be formed from structured LLE groups; logically, OLLOs are elements of simple judgment, with the help of which something is affirmed or refuted about the subject of judgment; from a semantic point of view, OLLOs are a generalized conceivable (semantic) image of an element of simple judgment. Elements 91 of a transformable data structure containing ALLO constitute a plurality of LLLOs of unique names (UNOLLO) of linguistic sentence 11. ALLO 91 in a transformable data structure containing ALLO are separate elements prepared in advance and placed in a transformable data structure containing OLLO in the form of a structured array (list, list, etc.) of individual OLLOs. Such

preparatory actions can be carried out by any method known in the art or unknown by the prior art, such as the method indicated in the description of method 200, and accordingly are not further described.

[0130] The identification of the elements 91 of the transformable data structure containing the ALLO during step 301 is reduced to the classification of the elements of which the transformable data structure containing the ALLO consists of the main linguistic-logical objects (ALLO 91). Moreover, all OLLO 91 have values of 911 and the serial numbers of the text elements 912 of the linguistic sentence that make up the OLLO. Values 911 OLLO 91 represent one or more text elements (TE 21)

Proposals 11 of which OLLO is composed. The number of TEs that OLLO consists of is associated with the technical ability of the language to describe the object, action or their attributes, or with the need for the author to describe the entities of the environment of the world (objects, actions or their signs) to use a certain method of linguistic description, as an example, but not a limitation, one can cite various following methods of linguistic description of one and the same entity: "old man" = "old man" = "old man" etc. The specified entity can be linguistically expressed through one, two or three TE. Moreover, for example, not limited to, the essence of "right

consumers ”cannot be linguistically expressed in Russian by one TE.

[0131] The identification of the value 911 of the element 91 (ALLO) suitable for converting the data structure containing the ALLO during step 301 is performed by registering the character set of the text elements 21

Proposals 11, of which the OLLL 91 is composed. The identification of the serial number (s) of 912 TEs of which (of which) the element 91 (OLLL) is suitable for transforming the data structure containing the OLLL, during step 301, is performed by registering the ordinal ( serial) numbers (numbers) of the mentioned TE in accordance with their location in the linguistic sentence, provided that the first TE in the linguistic sentence has the number "1", and all subsequent TEs have a sequence number greater than one by a sequence number, previous present TE.

[0132] A convertible data structure comprising

OLLO, which is the source for the present method of conversion 300 is a result of a multitude of elements 91, their values 911 and

sequence numbers 912 TE, constituting the elements 22 identified at step 301.

[0133] Identification of the linguistic characteristics of TEs comprising elements 91 (ALLO) suitable for transforming the data structure containing ALLO, and their values, if necessary, is carried out by organizing a request into the BDLP generated in step 2031, consisting of the identification data of TEs constituting element 91 (ALLO), and obtaining the values (2131, 2141, 2231, 2151) of all parts of the linguistic characteristics (213, 214, 223, 215) of the text elements 21 of the linguistic sentence 11, of which the element 91 (ALLO) consists. Moreover, as described previously

the linguistic features of elements 91 (OLLL) are at least the morphological, syntactic, and semantic characteristics of textual elements 21 of the linguistic sentence 11 of which elements 91 (ОLLO) are composed.

[0134] FIG. 28, by way of example, but not limitation, a general flowchart is shown of the steps of step 302 of generating the eighth SMD data structure, which is the first data structure of the method 300 in question. Step 302 is characterized by: performing step 3021 of generating values 2161 of the fifth part of the linguistic characteristics of 216 elements 21, constituent elements 91 suitable for transforming a data structure containing OLLL which form values 2161 of the fifth

the linguistic characteristics of the 216 elements 21 constituting the elements 91 and inputting the received information into the BDLP of the text elements 21 of the linguistic sentence 11 formed in step 2031, resulting in the BDLP of the text elements 21 of the linguistic sentence 11 formed in the frame of step 3021; performing step 3022 of forming the elements of the eighth data structure of the SMD on which the elements 12 of the eighth data structure of the SMD are formed, as well as the identification data of the elements 12, which for each element 12, as an example, but not limitation, the value 121 of the element 12 of the eighth data structure of the SMD and serial numbers 122 TE of the linguistic sentence 11 constituting the elements 12, and form the eighth data structure of the DMD.

[0135] In FIG. 29, by way of example, but not limitation, the general structure of the database of linguistic features [BDLP] generated in step 3021 is shown, which is BDLP of text elements 21 of sentence 11 contained in elements 91 suitable for transforming a data structure containing OLL. The BDLP generated in step 3021 differs from the BDLP generated in step 2031 in the presence of values 2161 of the fifth

linguistic characteristics 216, indicating the syntactic role of element 91 [OLLLO]. Since, from a linguistic point of view, OLLO is a syntactic construct, it is natural that in one sentence some syntactic constructs relative to other syntactic constructs, if there is a syntactic subordinate relationship between them, are syntactically main or syntactically dependent. According to this principle, some OLLOs can be syntactically main (main OLLOs), while others can be syntactically dependent (dependent OLLOs). the linguistic characteristics of 216 text elements 21 of the linguistic sentence 11 that make up OLLL 91 and their meanings 2161 is reduced to

classifications of ALLO 91 according to their syntactic role. The syntactic role of OLLO is identified with the syntactic role of the main LLE in OLLO 91. Using the value 2161 of the fifth part of the linguistic characteristics of the 216 elements 21 that make up OLLO 91, all OLLO 91 are divided into main OLLO 91 and dependent OLLO 91.

[0136] the Formation of the fifth part of the linguistic characteristics

216 and their values 2161 during step 3021 are produced by analyzing the existing linguistic features of element 91. During this analysis, for example, but not limitation, the following actions can be performed: at the first stage, in all OLLO 91, the main LLE is syntactically identified; at the second stage, among the linguistic features of the main LLEs of the mentioned OLLL 91, the linguistic characteristic responsible for the syntactic role is identified

syntactically the main LLE element 91; at the third stage, the value of the identified linguistic characteristic responsible for

the syntactic role of the syntactically main LLE; if the meaning (the syntactic role of the syntactically main LLE of the element 91) is “predicable”, then such OLLL 91 (the syntactically main LLE of the element 91) gets the value 2161 of the fifth part of the linguistic characteristics 216 - “the main OLLL”; if the meaning is not "predictable", then such an OLLL 91 (syntactically the main LLE of element 91) receives the value 2161 of the fifth part of the linguistic characteristics 216 - "dependent OLLL". As a result, the values of 2161 of the fifth part are formed

linguistic characteristics 216 for all elements of 91 (OLLL).

[0137] Formed according to the results of stage 3021 values 2161

(the syntactic role of the syntactically main LLE element 91) of the fifth part of the linguistic characteristics 216 of the elements 21 constituting the OLLO 91, suitable for transforming the data structure containing the OLLO, are introduced into the BDLP of the text elements 21 of the linguistic sentence 11 formed in step 2031, resulting in the BDLP text elements 21

linguistic sentences 11 formed within the framework of step 3021. Moreover, all parts (211, 214, 232, 215 and 216) of the linguistic characteristics of the text elements 21 of the linguistic sentences 11 and their meanings (2111, 2141, 2321, 2151 and 2161) form unique linguistic signs of the elements 91. [0138] Ha of FIG. 30, by way of example, but not limitation, the general structure of the generated eighth SMD data structure is shown, which is the first SMD data structure for the present conversion method 300. The eighth SMD data structure is a SMD containing elements 12 of the eighth SMD data structure, which are the original simple linguistic-logical constructions (IP LLK 12) of the linguistic sentence 11 and identification data of the IP LLK, representing for each IP LLK, as an example, but not limitation, the value121 element 12 of the eighth data structure and the sequence numbers of the text elements 122 constituting the LC LLK (the value and sequence numbers of the TE element 12). U IP LLK 12

linguistic sentences 11 there are no unique names characterizing them, having practical use. In the data structure, elements 12, as an example, but not limitation, can be referred to as “IP LLK-1”, “IP LLK-2”, “IP LLK-3”, “IP LLK-n”, where n> 1 - serial number of the element in the linguistic sentence. IP LLK offers are

borderline essence between linguistics and logic. From a linguistic point of view, IP LLK is a syntactic unit of the form “syntactic construction”, formed from ALLO by combining ALLO on the basis of the presence of direct syntactic subordinate or composing connection between ALLO.

The mentioned direct syntactic subordinate or composing link means the presence between the elements of different OLLO (between LLE of various OLLO) syntactic subordinate or composing link. From the point of view of syntax, IP LLK is a separate initial simple sentence or an initial simple sentence as part of the initial complex sentence. In this case, the initial simple sentence in the IP LLK may contain homogeneous members, that is, words between which there is a syntactic compositional connection. From a logical point of view, IP LLK is a logical entity that expresses a simple proposition (non-quantifier proposition), which is the primary

logical construction of thinking, with the help of which

the idea that something (the predicate of judgment) is affirmed or refuted about the subject of the judgment (the subject of the judgment) is conveyed. Moreover, something (that is affirmed or refuted about the subject of judgment) may be

additionally logically structured. For example, without limitation, the predicate of judgment can be divided into three logically separable entities - the action of the predicate of judgment, the object of the predicate of judgment, and the circumstances of the predicate of judgment. The elements of simple judgment mentioned for an example can be identified with individual UNOLLO (unique name OLLLO), from which the IP LLK is formed. The quantity of a simple judgment is determined by its "quantitative characteristic", that is, the presence or absence of unambiguity in a simple proposition. By quantitative characteristic

(quantifier) all simple judgments can be divided into two categories:

unambiguous simple judgments and ambiguous simple judgments. Unambiguous simple judgments are quantified simple propositions in which the quantifier index is always defined and equal to unity, that is, each element of a simple proposition has no variability, is always unambiguous. The unambiguity of a simple judgment is manifested through the presence in the composition of a simple judgment of the elements of a simple judgment of one unique denomination (UN) in

a single quantity (for example, not limited to, one subject of a judgment, an object of a judgment predicate, an action of a judgment predicate, or one

homogeneous circumstances of the predicate of judgment). An example of an unambiguous simple judgment is the following proposition: “The goods must be transferred to the buyer by the seller”. This simple judgment does not allow any ambiguity - only “goods”, only “must be transferred”, only

"Buyer" and only "seller". Ambiguous simple judgments are quantified simple propositions in which the quantifier index of at least one element of a simple proposition of a unique name (UN) is always defined and always greater than unity, that is, the element of a simple proposition in an ambiguous simple proposition always has variations, always

ambiguous. The ambiguity of a simple proposition is manifested by the fact that a simple proposition contains several elements of a simple proposition of one unique denomination (UN) (for example, but not limited to, several subjects of the proposition and (or) objects of the predicate of the proposition and (or) the actions of the predicate of the proposition and (or) several similar circumstances judgment predicate). An example of an ambiguous simple judgment can be the following proposition: “The goods must be transferred to the buyer by the seller or manager of the company.” This simple judgment is ambiguous due to the presence in it of two elements of a simple judgment with the same CN - the subjects of judgment are “seller” and “company manager”, to each of which You can apply the predicate of the judgment "must transfer the goods to the buyer." If simple judgments in a certain array of simple judgments (a separate, separate group of simple judgments) have the same quantitative characteristic (the quantifier is equal to one or more than one), then regardless of the type of quantifier of a simple proposition (unambiguous or ambiguous) simple judgments of such an array are “quantifier”. If simple judgments in a certain array of simple judgments (a separate, separate group of simple judgments) do not have the same quantitative characteristic (the quantifier of simple judgments can be either equal to one or greater than unity), then simple judgments of such an array are called

"Non-quantifier" (without a certain quantifier). In connection with the foregoing, IP LLK, formed from the OLLL of the original sentence, between which there is a syntactic coherent connection, can only be classified as a “non-quantitative” simple proposition, since it does not guarantee the existence of the same quantitative characteristic of all simple judgments directly (without any additional processing) formed from an arbitrary initial linguistic sentence.

[0139] The formation of the elements 12 of the eighth data structure of the SMD, representing the initial simple linguological constructions (IP LLK) during step 3022, is based on the results of the identification of the values of 2161 of the fifth part of the linguistic characteristics of the 216 elements 21 constituting the ALLO 91. To form the element 12 of the eighth SMD data structure, it is necessary to carry out the following actions: at the first stage, the main OLLO 91 is selected; at the second stage, all dependent dependent ALLO 91 subordinate to him are identified using the values 2161 of the fifth part of the linguistic characteristics 216 of the elements 21 making up the ALLO 91; at the third stage, among the identified dependent OLLO 91, such OLLO 91 are identified that have a direct subordinate relationship with the previously selected main OLLO 91; at the fourth stage, a preliminary element 12 of the eighth SMD data structure is formed, which is the initial simple linguological structure (IP LLK) by combining the selected “main OLLO” 91 and all identified “dependent OLLOs” 91 having direct subordinate connections with the selected main OLLO 91 ; at the fifth stage identify compiled by OLLO among the already identified dependent OLLOs 91 using the values 2131 of the first part of the linguistic characteristics of 213 elements 21 constituting OLLO 91. To complete the formation of element 12, a request is made to the BDLP generated in step 3021 to check for signs of communication between the combined OLLOs, and, in the presence of such signs, they receive a text element (text elements) identified (identified) as one of (or several of, or a combination of): a punctuation mark, a union or a union word, is located s between the combining OLLO.

[0140] the Identification of the values and sequence numbers of the TE element 12 of the eighth data structure SMD, produced in the framework of step 3022 by

identifying the value 121 of the element 12 with the values 911 of the combined elements 91 and the values of 211 text elements, which are signs of communication according to their unique linguistic characteristics,

identified between the elements to be joined 91, and the sequence numbers 122 of the text elements 21 constituting the element 12 with the sequence numbers 912 of the text elements constituting the elements to be combined 91 and the sequence numbers 21 of the text elements that are signs of communication identified between the elements to be combined 91.

[0141] The formation of the eighth SMD data structure during step 3022 is performed by combining in one data structure the elements 12 of the eighth SMD data structure and their identification data according to principles and methods known from the prior art, which, accordingly, are not described in further detail below.

[0142] Identification of the linguistic characteristics of TEs that comprise elements 12 (IP LLK) of the eighth SMD data structure, and their values, if necessary, is carried out by organizing a request to the BDLP, which is generated as part of step 3021, consisting of identification data of TEs that comprise element 12 (IE LLK ), and obtaining the values (2131, 2141, 2231, 2151, 2161) of all parts of the linguistic characteristics (213, 214, 223, 215, 216) of the text elements 21 of the linguistic sentence 11, which make up element 12 (IP LLK). Moreover, as described earlier, the linguistic features of elements 12 (IP LLK) are at least the morphological, syntactic and semantic characteristics of the text elements 21 of the linguistic Proposals 11, of which the elements 12 consist (IP LLK).

[0143] FIG. 31, by way of example, but not limitation, a general flowchart is shown of the steps of step 303 of generating the ninth SMD data structure, which is the second data structure for the method 300 under consideration. Step 303 is characterized by: performing step 3031 of identifying the types of elements 12 of the eighth SMD data structure indicating the presence in the IP LLK composed by ALLO, on which the first and second types of elements are identified

12 of the eighth SMD data structure, and also identify the non-transformed elements 13 of the ninth SMD data structure and their identification data, which is, for example, but not limited to, the value of the 131 non-transformed element 13 of the ninth SMD data structure and ordinal 132 numbers TE of the linguistic sentence 11 constituting element 13 (meaning and serial numbers of TE of the unreformed element 13); performing the step 3032 of generating the converted elements 13 of the ninth SMD data structure, on which the converted elements 13 of the ninth SMD data structure are formed from the second type elements 12, as well as the identification data of the converted elements 13, which is an example, but not for each converted element 13 restrictions, value 131 of the transformed element

13 of the ninth SMD data structure and serial numbers 132 of the TE of the linguistic sentence 11 constituting element 13 (meaning and serial numbers of the TE of the converted element 13); performing step 3033 of generating the ninth SMD data structure, wherein the ninth SMD data structure is formed from the converted elements 13 and non-converted elements 13 of the ninth SMD data structure.

[0144] FIG. 32, by way of example, but not limitation, the general structure of the generated ninth SMD data structure is shown, which is the second SMD data structure for the present conversion method 300. The ninth SMD data structure is a SMD containing elements 13, which are simple linguistic logical constructions (PLCC) of the linguistic sentence 11, and PLCC identification data, representing for each PLCC, as an example, but not limitation, the value of 131 element 13 of the ninth data structure and order stems rooms 132 text elements constituting PLLK. Elements of the ninth structure SMD data are divided according to the principle of their formation into

non-transformed elements 13 and transformed elements 13. PLCC 13 of the linguistic sentence 11 lacks unique names characterizing them, having practical use. In the data structure, elements 13, by way of example, but not limitation, can be referred to as “PLCK1”, “PLCK2”, “PLCKZ”, “PLLKp”, where n> 1 is the serial number of the element in the linguistic sentence. PLCC of Linguistic Proposal 11 is a boundary entity between linguistics and logic. From a linguistic point of view, PLLC is a syntactic unit of the form “syntactic construction”, formed from IP LLK by heterogenization of IP LLK. From the point of view of syntax, PLCC is a single simple sentence or a simple sentence in the original complex sentence, which does not contain homogeneous members (words between which there is a syntactic juxtaposition). From a logical point of view, PLLC is a logical entity that expresses a simple proposition (quantifier proposition), which is the primary logical construction of thinking, with the help of which the idea is formed and transmitted that something (a predicate of proposition) is affirmed or disproved about the subject of judgment (the subject of the proposition ) Moreover, something (that which is affirmed or disproved about the subject of judgment) can be additionally logically structured. For example, without limitation, the predicate of judgment can be divided into three logically separable entities - the action of the predicate of judgment, the object of the predicate of judgment and the circumstances of the predicate of judgment. The elements of simple judgment mentioned for example can be identified with

certain types of OLLO, from which PLLK is formed. The difference between PLLC and IP LLK in the quantity of simple judgment. If IP LLK admits the possibility of ambiguity in a simple proposition, then PLCC has a quantifier value of unity, which completely identifies PLCC with an absolutely unambiguous simple proposition. PLCC cannot contain several elements of a simple judgment of one unique name (UN). For example, but not limited to, a PLCC may not include several subjects of judgment, and (or) objects of a judgment predicate, and (or) actions of a judgment predicate, and (or) homogeneous circumstances of a judgment predicate. In PLCC, there can be no ambiguity either in the subject of judgment or in the predicate of judgment, regardless of the composition and design of the original linguistic sentence. [0145] the Identification of the types of elements 12 of the eighth data structure of the SMD in the framework of step 3031 is performed by analyzing the linguistic characteristics of the text elements 12 (IP LLK) in order to identify syntactic compositional links in the IP LLK 12. Syntactic

the composer link between OLLO (element 91) in LL LLP 12 is identified if the syntaxically important LLLEs have such OLLLs with the same syntactic parent (the syntactically main word that has such a LLL has a direct syntactic subordinate link). Additional conditions may include the presence of the same syntactic role in the main LLLEs of such OLLLs, as well as the presence of a sign of a connection between such LLLEs, namely, a comma or a compound union. When identifying the syntactic compositional link between the main LLEs of two or more OLLLs, such LL LLs are identified as the second type of element 12 (IL LLK) of the eighth data structure of the SMD. All other elements 12 (IP LLK) in which the syntactic compositional link at the main LLE is not identified as the first type of element 12 (IP LLK) of the eighth data structure of the SMD. Wherein

the identified first type of element 12 of the eighth data structure of the SMD is identified with the non-transformed element 13 of the ninth data structure of the SMD. The formation of the transformed elements of the ninth SMD data structure (transformed elements 13 obtained from the second type of elements 12) at step 3032 is performed based on the syntactic writing connection between the elements 91 (ОЛО) of the element 12 (ЛЛК ИП) identified by the step 3031 by converting the second type element 12 . The meaning of the conversion of the element 12 of the second type at step 3032 is to heterogenize the IP LLK 12, that is, to eliminate syntactic homogeneities in the IP LLK 12 by

the formation of the element 12 of the second type of many new PLCA, identical in uniformity to the element 12 of the first type. Moreover, each new PLLC of the said set can contain only one of the mentioned OLLL 91 with homogeneous members (if the element 12 of the second type contains one row of homogeneous members) or only one unique combination of the mentioned OLLL 91 with homogeneous members of different rows of homogeneous members (if element 12 of the second species contains several rows of OLLO 91 with homogeneous members).

Many PLCA (elements 13) formed from the element 12 of the second kind are heterogeneous PLCA, that is, formed from one PI LLK 12 the second type by its heterogeneous transformation (heterogenization). The heterogenization process of PI LLK 12 of the second type can be carried out, as an example, but not limited, as follows: at the first stage, rows of homogeneous members in the element 12 of the second type are revealed; during the identification of the ranks

homogeneous members reveal the very rows of homogeneous members and serial numbers of homogeneous members in a row, as well as signs of communication of each homogeneous member; at the second stage, chains of homogeneous members are revealed for each identified ALLO containing LLE-OCH (0LL0-04) element 12 of the second type; the chain of homogeneous members (chain 04) is the OLLO-OCH and the dependent chain of homogeneous members (the chain of syntactically related OLLLs, starting from the first syntactic descendant of homogeneous members (syntactically dependent OLLL) and continuing along the chain of syntactic dependence (syntactically

subordinate connection) to another OLLO-OCH or to the last OLLO in element 12 of the second type; identified chains 04 are classified by the fact of the presence of a syntactic descendant LLE in the dependent chain 04, which is not included in the dependent chain 04; those chains 04 that have dependent chains 04 without a syntactic descendant outside the dependent chain of homogeneous members are considered extreme chains of homogeneous members (extreme CSCs), and those chains 04 that have dependent chains 04 with syntactic descendant outside the dependent chain 04 are considered internal chains of homogeneous members (internal chains 04); at the third stage, the OLLO of the element 12 of the second type is detected, which are not included in the chains 04 (OLLO outside the chains 04); at the fourth stage, replaceable dependent chains (replaceable LCs) of element 12 of the second kind are formed; a replaceable SC is the basis of the transformed elements 13, consisting of one extreme chain 04 and internal chains 04, if such internal chains 04 along a chain of continuous syntactic subordinate communication exist between the extreme chain 04 and ALLO outside chains 04; at the fifth stage, heterogeneous PLCs are formed (transformed elements 13); heterogeneous PLLC is formed from one replaceable SC and OLLO outside chain 04 (if there are such OLLOs); the number of replaceable LC elements of the second type 12 determines the number of heterogeneous PLCs (converted elements 13) formed from one element 12 of the second type.

[0146] the Identification of the types of elements 12 of the eighth data structure of the DMD in the framework of step 3031 does not change the value 121 of the element 12 and does not change serial numbers of the text elements 21 of the linguistic sentence 11 constituting the element 12 (IP LLK). The value and serial numbers of the TE of the unreformed element 13 (PLC) of the ninth SMD data structure are identified as part of step 3031 as follows: the values of 211 text elements 21 of the linguistic sentence 11 constituting the element 12 of the first type of the eighth SMD data structure are identified as the value 131 of the unreformed element 13 ( PLCK), and the serial numbers 212 of the text elements 21 of the linguistic sentence 11 constituting the element 12 of which the unreformed element 13 consists are identified as oryadkovye number 132 text elements, components

untransformed element 13 (PLCK). The identification of the values and sequence numbers of the TE of the transformed element 13 (heterogeneous PLC) of the ninth SMD data structure as part of step 3032 is performed as follows:

the values of 211 text elements 21 of the linguistic sentence 11 constituting the element 12 of the second type of the eighth data structure of the SMD, with the exception of the values of 211 text elements 21 of the linguistic

Proposals 11, constituting separate parts of element 12 of the second type (ОЛО-04), which were removed during heterogenization, and signs of communication of remote ОЛО-ОЧ (syntactic subordinate compound unions or punctuation marks synthesized by the main LLL to remote ОЛООЧ) are identified as the value 131 of the transformed element 13 (PLCK), and serial numbers 212 of the text elements 21 of the linguistic sentence 11 constituting the element 12 of the second type of the eighth data structure of the SMD, with the exception of serial numbers 212 of the text elements 21 of the linguistic sentences 11, which are separate parts of element 12 of the second type (ОЛО-ОЧ), which were removed during heterogenization, and signs of communication of remote ОЛО-ОЧ (syntactic subordinate compound unions or punctuation marks synthesized by the main LLL to remote ОЛООЧ) are identified as 132 serial numbers elements making up the transformed element 13 (heterogeneous PLCK).

[0147] The formation of the ninth SMD data structure in step 3033 is performed by combining elements 13 of the ninth SMD data structure and their identification data previously identified in steps 3031 and 3032 using the principles and methods known from the prior art, which respectively further not described in detail. The first of the combined elements of the ninth SMD data structure is the element of the first type identified at step 3031, which is the non-transformed element 13, that is, PLCK 13, which coincides with the IP of the LLK 12 without syntactic coupling. The second of the combined elements of the ninth DMD data structure are

the transformed elements 13 formed in step 3032 as a plurality of transformed elements 13, which are heterogeneous PLCC 13,

formed from the element 12 of the second type (IP LLK with syntactic co-linking) by heterogenization of the IP LLK of the second type.

[0148] The linguistic characteristics of TEs constituting the unreformed or transformed elements 13 (PLCC) of the ninth SMD data structure and their values are identified, if necessary, by organizing a request in the BDLP generated in step 3021, consisting of

TE identification data constituting unreformed or

transformed element 13, and obtaining the values (2131, 2141, 2231, 2151, 2161) of all parts of the linguistic characteristics (213, 214, 223, 215, 216) of the text elements 21 of the linguistic sentence 11 of which

untransformed or transformed element 13 (PLCK or heterogeneous PLCK). Moreover, as previously described by the linguistic features of unreformed and transformed elements 13 (PLCC or heterogeneous PLCC) are at least the morphological, syntactic and semantic characteristics of the text elements 21 of the linguistic sentence 11, of which the elements 13 (PLCC and heterogeneous PLCC) are composed.

[0149] FIG. 33, by way of example, but not limitation, a general flowchart is shown of the steps of step 304 of generating the tenth SMD data structure, which is the third data structure for the method 300 under consideration. Step 304 is characterized by: step 3041 of generating values 2171 of the sixth of the linguistic characteristics of 217 elements 21 constituting elements 13 of the ninth SMD data structure, on which values 2171 of the sixth of the linguistic characteristics of 217 elements 21 are generated,

constituting the elements 13, and inputting the received information into the BDLP of the text elements 21 of the linguistic sentence 11, formed in step 3021, forming as a result the BDLP of the text elements 21 of the linguistic sentence 11, formed in the framework of step 3041; performing step 3042 identification of types of elements 13 of the ninth SMD data structure indicating the presence of composed PLCs, on which the first and subsequent types of elements 13 of the ninth SMD data structure are identified; by performing step 3043 of identifying elements 14 as constituent parts of element 15 of the tenth SMD data structure, which identifies elements 14 of the tenth data structure of the SMD, as well as the identification data of the elements 14, which for each element 14, as an example, but not limitation, the value 141 element 14 of the tenth SMD data structure and serial numbers 142 TE of the linguistic sentence 11 constituting element 14; and form the tenth SMD data structure by combining the elements 14 as components of a single element 15 of the tenth SMD data structure.

[0150] FIG. 34, by way of example, but not limitation, the general structure of the database of linguistic features (BDLP) generated as part of step 3041 is shown, which is BDLP of text elements 21 of sentence 11 contained in elements 13 of the ninth data structure of the DMD. The BDLP generated in step 3041 differs from the BDLP generated in step 3021 in the presence of a sixth of the linguistic characteristics 217 and their values of 2171 text elements 21 contained in elements 13, which establishes the syntactic role of PLCs (elements 13 of the ninth data structure of the DMD).

The syntactic role of PLCA may have meanings: the main PLC or dependent PLC. Since, from a linguistic point of view, PLCC is a syntactic construction, it is natural that in one sentence some syntactic constructs relative to other syntactic constructs, if there is a syntactic subordinate relationship between them, are syntactically main or syntactically dependent. According to this principle, some PLCs can be syntactically main (main PLCs), while others can be syntactically dependent (dependent PLCs).

[0151] The formation of the sixth part of the linguistic characteristics 217 and their values 2171 for the text elements 21 of the linguistic sentence 11 constituting PLCC 13 is performed by classifying PLCC 13 according to their syntactic role during step 3041. The syntactic role of PLCC

identified with the syntactic role of the main LLE in the main OLLO 91 in PLCK 13. In accordance with the value 2171 of the sixth of the linguistic characteristics of the 217 elements 21 constituting PLCC 13, subdivide all PLCC 13 into main PLCC 13 and dependent PLCC 13. The formation of the sixth part of the linguistic characteristics 217 and their values 2171 is performed by analyzing the existing linguistic features of the elements 21 constituting PLCC 13. In the course of such analysis, as an example, but not limitation, the following actions can be performed: at the first stage, all PLCs 13 are identified

syntactically main LLE syntactically main OLLO 91; at the second stage, among the linguistic features of the mentioned main LLEs of the said PLCC 13, the linguistic characteristic responsible for

the syntactic role of the syntactically main LLE element 13; at the third stage, the value of the identified linguistic characteristic responsible for the syntactic role of the syntactically main LLE is checked; if the meaning (the syntactic role of the syntactically important LLE of element 13) is “predicate”, then such a PLCK 13

(syntactically the main LLE of the syntactically main element 91) provide the value 2171 of the sixth of the linguistic characteristics 217- "main PLCK"; if the meaning is not “predictable,” then such PLCC 13 provides a value of 2171 of the sixth of the linguistic characteristics 217 - “dependent PLC”. As a result, values 2171 of the sixth part of the linguistic characteristics 217 are formed for all elements of the PLCC 13.

[0152] The values 2171 generated as a result of step 3041

(the syntactic role of the syntactically main LLE of the syntactically main OLLO element 13) of the sixth part of the linguistic characteristics 217 of the elements 21 constituting the PLCC 13 of the ninth data structure of the DMD is entered into the LLL of the text elements 21 of the linguistic sentence 11, which is formed as part of step 3041. In this case, all parts (211, 214, 232, 215, 216 and 217) linguistic

characteristics of the text elements 21 of the linguistic sentence 11 included in the PLCC 13 and their meanings (2111, 2141, 2321, 2151, 2161 and 2171)

form unique linguistic features of elements 13.

[0153] FIG. 35, by way of example, but not limitation, the general structure of the tenth SMD data structure generated is shown, which is the third SMD data structure for the present conversion method 300. The tenth SMD data structure is a SMD containing element 15, which is the original complex linguistic-logical design (IS LLK) linguistic sentence 11, and the identification data of element 15, which, by way of example, but not limitation, are the value 151 of element 15 of the tenth data structure and the serial numbers 152 of the text elements making up element 15. The original complex LLK 15

linguistic sentence 11 is missing characterizing its unique name (UN), having practical use. The IP LLK of Proposition 11 is a borderline entity between linguistics and logic. From a linguistic point of view, LLK IS is a syntactic unit of the “syntactic construction” form, formed from PLCK by combining PLCK based on the presence of direct syntactic subordinate and composing links between PLCK. IS LLK consists of the number of PLCs that are identified in the original sentence. The mentioned direct syntactic subordinate link means the presence between the elements of different PLCCs (between LLE of different PLCCs) of syntactic subordinate linkage. Direct syntax

composing means the presence of syntactic composing between elements of different PLCCs (between LLE of different PLCCs). In terms of syntax, IP LLK is a transformed source sentence without distorting its meaning. The essence of the transformation of the original linguistic sentence is that all applied linguistic techniques (complications) aimed at simplifying the linguistic forms and structures of statements (negatives) imagined by the author in the text of the linguistic sentence (for example, without limitation, the use of homogeneous members, participles (participles) of turns ) cancel. Instead of the original

linguistic sentences form an array of simple sentences that are syntactically and logically related to each other. The mentioned connections are clarified and then registered in one way or another. The need for such a transformation is relevant because, at the same time as simplification

the use of the aforementioned linguistic techniques leads to the complexity and ambiguity of the logical structure of the original linguistic sentence, to the complexity and ambiguity of the perception of the meaning of the original sentence by the user, as well as to the great technical complexity of the correct intelligent machine analysis of the contents of the original linguistic sentence.

The original sentence thus transformed is subject to the use of the above techniques in the original linguistic sentence), in fact, a complex sentence consisting of converted simple sentences in an amount exceeding the number of initial simple sentences in the original linguistic sentence. However, unlike the original linguistic sentence, all converted simple sentences in the LC LLK are unambiguous simple judgments (conceivable statements or denials). The registration of element 15 (IS LLK) in the form of a data structure (an array of converted PLCs (elements 14)) allows you to identify the multi-level syntactic structure of element 15 (IS LLK) and the relationship between the elements of this structure. The levels of the LLC IC reflect the structure of the syntactic subordinate connection between the structured PLCs (SPLC) as the constituent parts of the PLC IC and are determined by the identified syntactic links between the constituent parts of the element 15 (elements 14 of the linguistic sentence 11.) The element 15 (the PLC LLC) is formed from the mentioned structured PLCs (SPLC) ) - elements 14. Mandatory IS LLK contains only element 14 (SPLK) of the first level (elements 13 of the first kind). The presence of other elements 14 (SPLC) of the second and subsequent levels (elements 13 of the second and subsequent types) is due to

linguistic complications of the original linguistic sentence. The first level SPLCK (elements 13 of the first kind) include PLCK, which are not subordinate simple sentences. The second level SPLCK (elements 13 of the second type) include PLCK, which are subordinate simple sentences, provided that the main one is the first level SPLC. The third level SPLCK (elements 13 of the third type) include PLCKs, which are subordinate simple sentences, provided that the main ones are second level SPLCs. These statements are thus valid for the next level SPLCC (fourth, fifth, sixth, and so on), and the possibility of the existence of such subsequent levels should be obvious to a person skilled in the art.

The syntactic structure of the IC LLK 15, in addition to syntactically subordinate SPLCK, may also contain syntactically composed SPLCK. At the same time, all first-level PLCS (elements 13 of the first kind) are necessarily composed of PLCC with each other. Starting from the second level SPLCK (elements 13 of the second and subsequent types), the one level SPLCK may or may not have syntactic composing connections with each other. The syntactic compositional link between the SLCL in the LLC IC is identified if the syntactically main words (LLEs) have the syntactically main OLLOs of these SLCLs of the same syntactic parent (the syntactically main word that has a direct syntactic subordinate link to such a word (LLE)). Additional conditions may be the presence of the same syntactic role in the main words (LLE) of such SPLCs, as well as the presence of a sign of connection between such words (LLE), namely, a comma or a compound union. IS LLK 15, in contrast to the previously mentioned FE LLK 12 and PLLC 13, can only be singular - from one initial linguistic sentence it is possible to form only one LS LLK 15. From a logical point of view, LS LLK 15 is a logical entity that expresses a complex judgment (non-quantifier proposition), which is a logical construction, the elements of which are simple judgments. The quantity of a complex judgment is determined by its

quantitative characteristic, i.e. presence or absence

ambiguity in a complex judgment. By quantitative characteristics

(quantifier) all complex judgments can be divided into two categories:

unambiguous complex judgments and ambiguous complex judgments.

Unambiguous complex judgments are quantified complex judgments in which the quantifier index is always defined and equal to unity, that is, each element of a complex proposition has no variability, is always unambiguous.

The unambiguity of a complex judgment is manifested by finding elements in a complex judgment that are not able to be interpreted as variants of statements or negations, or as variants of conditionality of the same statement or negation. An example of an unambiguous complex judgment is the following proposition: “If the buyer paid for the goods, then the goods should be transferred to the buyer by the seller”. This complex proposition does not allow any variation - the simple proposition “the goods must be transferred to the buyer by the seller” is due to the only simple proposition “if the buyer paid for the goods”. Ambiguous complex judgments are quantified complex judgments in which the quantifier index is always determined and always greater than unity, that is, at least one element of a complex proposition has variability. The ambiguity of a complex judgment is manifested by finding elements in the composition of a complex judgment that can be interpreted as variants of statements or negations, or as variants of conditionality of one and the same statement or denial. An example of an ambiguous complex judgment is the following proposition: “If the buyer paid for the goods, the goods must be transferred to the buyer by the seller or the goods must be transferred to the buyer by another official

company ". This complex judgment allows for a varied statement - either "the goods must be transferred to the buyer by the seller", or "the goods must be transferred to the buyer by another company official." A complex proposition consists of simple propositions that have the original (established in the original linguistic sentence) syntactic relations between themselves. Due to the fact that such relations can be both syntactic subordinate connections and syntactic composing connections, it is impossible to guarantee the absence in the complex judgment of the variability of statements or negations, or the variability of conditionals of statements (negations). In connection with the foregoing, IS LLK is a complex proposition with an indefinite quantitative characteristic (the quantifier index can be either equal to one or be greater than unity), and such a complex proposition is "non-quantifier". From an analytical point of view, the IP LLK is an information-filled scheme that clarifies the logical construction of a sentence (logical connections between individual simple judgments inherent in the original linguistic sentence). In this regard, the IP LLK is preferably not shown in a linguistic textual form (in the form of a complex sentence consisting of many simple sentences without an inventive connection and participial (active participles) turns, in which there can be either a subordinate or an inventive connection between the mentioned simple sentences) , but in tabular or schematic form. The mentioned tabular or schematic form of the demonstration allows you to quickly and accurately understand and analyze all the used basic logical entities (SPLC) and all logical connections between

the mentioned logical entities of the original linguistic

offers. In particular, not limited to, such a detailed and at the same time structured representation of the original linguistic sentence (a logically connected set of simple sentences without homogeneities) allows to increase the efficiency of processing a linguistic sentence 11 by machine tools since instead of one processed sentence with a complex structure, it becomes possible to process many simple sentences with an identified logical connection between the elements of this set.

[0154] The identification of the types of elements 13 of the ninth SMD data structure during step 3042 is based on the results of identifying the values of 2171 of the sixth of the linguistic characteristics of 217 text elements 21 constituting PLCC 13. To identify the types of PLCC 13, it is necessary, as an example, but not limitation, to carry out the following actions: at the first stage, from all PLCs 13, select PLCs in which the main LLE of the main OLLO element 13 (PLCC) does not have a syntactic parent; such a PLC will be an element 13 of the first type - an indispensable element of the tenth data structure of the DMD; at the second stage, from the remaining PLCC 13, the form of which has not yet been identified, select such PLCs in which the main LLE of the main OLLO element 13 (PLCC) has a syntactic parent in the PLCC identified as element 13 of the first type; such a PLC will be an element 13 of the second type - an optional element of the tenth data structure of the DMD; at the third and subsequent stages, from the remaining PLCC 13, the form of which has not yet been identified, select such PLCs in which the main LLE of the main OLLO element 13 (PLCC) has a syntactic parent in the PLCC identified as element 13 of the second or subsequent types; such a PLC will be an element 13 of the third or subsequent types - an optional element of the tenth data structure of the DMD. The presence of such PLCC is due only to the syntactic complexity of the original linguistic sentence. In addition, an identification step is carried out

connective connections between elements 13 (PLCK). PLCK 13, in which there is no syntactic subordinate connection, but a syntactic compositional link is identified, is assigned to the same form as PLCK 13 with which it is composed. The identification of elements 14 (component parts of element 15) of the tenth data structure of the DMD during step 3043 is performed by

identifications with elements 13 identified at block 3042 as certain types of element 13. Moreover, elements 13 of the first kind

identify as element 14 of the first level, elements 13 of the second type are identified as element 14 of the second level, and so on, elements 13 of the third and subsequent types are identified as element 14 of the third and, respectively, subsequent levels.

[0155] the Identification of the types of elements 13 of the ninth data structure of the DMD in the framework of step 3042 does not change the value 131 of the element 13 and does not change

serial numbers of text elements 21 of the linguistic sentence 11 constituting element 13 (PLCK). The identification of the value and serial numbers of the TE element 14 (LFLC) of the tenth SMD data structure within the framework of step 3043 is carried out as follows: the values of 211 text elements 21 of the linguistic sentence 11 constituting element 13 are identified as the value 141 of element 14 (FLCC), and serial numbers) 212 of the text elements 21 of the linguistic sentence 11 constituting element 13 are identified as serial numbers 142 of the text elements constituting element 14 (CPLC). The identification of the value and serial numbers of the TE element 15 (LC LLK) of the tenth SMD data structure within the framework of step 3043 is carried out as follows: the values of 211 text elements 21 of the linguistic sentence 11, which make up all the elements 14,

identified as the value 151 of element 15 (LC LLK), and serial numbers 212 (of the mentioned text elements 21 of the linguistic sentence 11, constituting all elements 14, identified as serial numbers 152 of text elements constituting element 15 (LC LC).

[0156] The formation of the tenth SMD data structure during step 3043 is performed by combining elements of the tenth SMD data structure 14 as well as their identification data in one data structure. Moreover, the elements 14 combined in one data structure, being the constituent parts of the element 15, thus form the element 15, which is the main element of the tenth data structure of the SMD. The combination of the mentioned elements and their identification data in one data structure is carried out according to the principles and methods known from the prior art, which, accordingly, are not further described in detail. As a result, the tenth data structure of the SMD is an array (list) of structured PLCs (SPLC) - elements 14 of the first and subsequent levels located in the list of SPLC, taking into account the increasing level of element 14 (and indicating this level), as well as taking into account the increasing serial number of the text element of the sign of communication element 14 (and indicating this serial number), as well as the sequence number of the text element of the syntax parent of element 14 (and indicating this serial number). In this case, the first part of the IP LLK and the first serial number in the above list of structured PLCs will be SPLK 14 without a syntactic parent and without a sign of communication. Elements 14 of the tenth SMD data structure are used to construct a diagram or table of logical connections that visually displays the logical construction of the original linguistic sentence (logical connections between individual simple judgments embedded in the original linguistic sentence). In particular, without limiting, as was indicated above with reference to elements 15, such a detailed and at the same time structured representation of the original linguistic sentence (a logically connected set of simple sentences without homogeneities) allows us to increase the efficiency of processing the linguistic sentence 11 by machine tools because instead of one processed sentence with complex structure gives you the opportunity to process many simple sentences with

identified logical connection between the elements of this set.

[0157] Identification of the linguistic characteristics of TEs constituting element 15 (IS LLK) of the tenth data structure of SMD, and their values for

Necessities are made by organizing a request to the BDLP, which is formed as part of step 3041, consisting of the identification data of the fuel cells that make up element 15 (IS LLK), and obtaining the values (2131, 2141, 2231, 2151, 2161, 2171) of all parts of the linguistic characteristics (213 , 214, 223, 215, 216, 217) of the textual elements 21 of the linguistic sentence 11, of which the element 15 (EC LLK) consists. Moreover, as previously described, the linguistic features of element 15 (IS LLK) are at least the morphological, syntactic, and semantic characteristics of the text elements 21 of the linguistic sentence 11, of which element 15 (IS LLK) consist.

[0158] In FIG. 36, by way of example, but not limitation, a general flowchart is shown of the steps of step 305 of generating the eleventh SMD data structure, which is the third SMD data structure for the method 300 under consideration. Step 305 is characterized by: performing step 3051 of identifying views of the element 15 of the tenth SMD data structure indicating for the presence in the LLC IC of composed PLCs, on which the first and second types of element 15 of the tenth structure of the SMD data are identified, as well as identification

untransformed elements 16 of the eleventh data structure of the SMD and their identification data representing for each

untransformed element 16, by way of example, but not limitation, the value 161 of the non-transformed element 16 of the eleventh SMD data structure and serial numbers 162 of the TE of the linguistic sentence 11 constituting the non-transformed element 16; performing step 3052 of generating the transformed elements 16 of the eleventh SMD data structure, on which the converted elements 16 of the eleventh SMD data structure are formed from the second type elements 15, as well as identification data

transformed elements 16, representing for each

the transformed element 16, by way of example, but not limitation, the value 161 of the transformed element 16 of the eleventh SMD data structure and the sequence numbers 162 of the TE of the linguistic sentence 11 constituting the transformed elements 16; the execution of step 3053 formation

of the eleventh SMD data structure, on which the eleventh SMD data structure is formed from the transformed elements 16 of the eleventh

SMD data structures or from an untransformed element 16 of the eleventh SMD data structure.

[0159] FIG. 37, by way of example, but not limitation, the general structure of the generated eleventh SMD data structure is shown, which is the fourth SMD data structure for the present conversion method 300. The eleventh SMD data structure is a SMD containing elements 16 of the eleventh SMD data structure, which are complex linguistic-logical constructions (LLLC) of the linguistic sentence 11 and the logo identification data, representing for each element 16, as an example, but not limitation, beginning 161 of element 16 of the eleventh data structure of the DMD and serial numbers 162 of the text elements making up element 16. The LLLC of the linguistic sentence 11 does not have any unique names characterizing it that have practical use. In the data structure, elements 16, as an example, but not limitation, can be referred to as “SLLK1”, “SLLK2”, “SLLKZ”, “SLLKp”, where n> 1 is the serial number of the element in the linguistic sentence. The SLCC of Linguistic Proposal 11 is a boundary entity between linguistics and logic. From a linguistic point of view, SLLK is a syntactic unit of the form “syntactic construction”, formed from IS LLK by heterogenization of IS LLK. From the point of view of syntax, SLLK is a separate simple sentence (if the original linguistic sentence is a simple sentence without syntactic complications (homogeneous members, revolutions, etc.)) or a complex sentence that does not contain homogeneous members (words between which there is a syntactic writing connection). From a logical point of view, SLLK is a logical entity that expresses a complex proposition (quantifier proposition), which is a complete logical construction of thinking, with the help of which

the idea is formed and transmitted that something (predicate of judgment)

affirmed or disproved about the subject of judgment (subject of judgment).

The difference between SLCK and IS SLC in the quantity of complex judgments. If the IC LLK admits the possibility of ambiguity in a complex proposition, then the LLLC (having a quantifier index equal to one) is an absolutely unambiguous complex proposition. In the composition of SLLK there cannot be several homogeneous (composed) simple judgments. In SLLC, there can be no ambiguity in either affirming (refuting) anything or

conditionality of the statement (refutation), nor with conditionality

elements of approval (refutation), regardless of the composition and

constructions of the original linguistic sentence.

[0160] The type identification of the LLK 15 information system within step 3051 is performed by analyzing the linguistic characteristics of the text elements that make up the LLK 15 system, in order to identify the syntactic compositional links between the SPLK 14 in the LLK 15 IP. The syntactic compositional link between the SPLK 14 is identified if any the syntaxically important LLEs of the syntactically principal LLLOs of such LLLFs of the same syntactic parent (the syntactically major LLLEs of different LLLFs, which have the direct syntax with the word (LLEs) in the compiled LLLFs eskaya subordinate connection).

Additional conditions may include the presence of the same syntactic role in the main LLLOs of the main OLLLs composed of SPLCs, as well as the presence of a sign of connection between the aforementioned composed LLLEs, namely, a comma or a writing union. When identifying syntactic

a link between the main LLEs of the main OLLO of two or more SPLKs such an LLK IS is identified as the second type of element 15. IS LLK 15, in which no syntactic linkage is identified from the main LLE of the main OLLO in SPLC is identified as the first type of element 15. Moreover, the identified first type of element 15 of the tenth SMD data structure is identified with the non-transformed element 16 of the eleventh SMD data structure. The generation of the transformed elements of the eleventh SMD data structure (the transformed elements 16 obtained from the second type elements 15) at step 3052 is performed based on the syntactic writing connection between the SPLK identified at step 3051 by converting the LLK IC of the second kind. The meaning of the conversion of the LLK IC of the second kind at step 3052 is to heterogenize the LLK IC, that is, to eliminate syntactic homogeneities in the LLK IC by forming from the element 15 of the second kind a multitude of new LL LLP identical in uniformity to the element 15 of the first kind. Moreover, each new LLLC IS of the said set can contain only one of the mentioned LLDCs with homogeneous members (if the LLLC IL of the second type contains one row of homogeneous members) or only one unique combination of the said LLDC with homogeneous members of different series of homogeneous members (if the LLLC IS of the second type contains several rows of SPLC with homogeneous members). A plurality of LLK ICs formed from LLK IC (element 15) of the second kind are

heterogeneous SLCK (transformed elements 16), i.e.

formed from a single IC LLK of the second type by its heterogeneous transformation (heterogenization). The process of heterogenization of LC LLK (element 15) of the second type can be carried out, as an example, but not limitation, as follows: at the first stage, rows of homogeneous members in the element 15 of the second type are revealed; in the course of identifying rows of homogeneous members, the rows themselves of homogeneous members and the serial numbers of homogeneous members in the series, as well as signs of the connection of each homogeneous member are revealed; at the second stage, chains of homogeneous members are revealed for each identified SPLCK containing LLE-OCH (SPLLK-OCH) element 15 of the second type; a chain of homogeneous members (a chain of PF) is a DFLC-PF and a dependent chain of homogeneous members (a chain of

syntactically related SPLCKs, starting from the first syntactic descendant of homogeneous members (syntactically dependent SPLCK) and continuing along the chain of syntactic dependence (according to syntactic subordination) to another SPLCK-OCh or to the last PLCK in element 15 of the second kind;

identified chains 04 are classified based on availability

LLE syntax descendant in dependent chain 04 that is not part of dependent chain 04; those chains 04 that have dependent chains 04 without a syntactic descendant outside the dependent chain of homogeneous members are considered extreme chains of homogeneous members (extreme chains 04), and those chains 04 that have dependent chains 04 with a syntactic descendant outside dependent chain 04 are considered internal chains homogeneous members (inner chains 04); at the third stage, the PLLC of element 15 of the second type is detected that are not included in the chains of homogeneous members (PLCC outside chains 04); at the fourth stage, replaceable dependent chains (replaceable LC) of the element 15 of the second type are formed; interchangeable SC is the basis of the transformed elements 16, consisting of one extreme chain 04 and internal chains 04, if such internal chains 04 along a chain of continuous syntactic subordinate communication are between the extreme chain 04 and PLCC outside chains 04; at the fifth stage, heterogeneous SLLK is formed (elements 16); heterogeneous SLLK is formed from one replaceable SC and PLLC outside chain 04 (if there are such PLCs); the number of replaceable LC elements of the second type 15 determines the number of heterogeneous SLLK formed from one element 15 of the second type.

[0161] the Identification of the types of elements 15 of the tenth data structure of the DMD in the framework of step 3051 does not change the value 151 of the element 15 and does not change

serial numbers of text elements 21 of the linguistic sentence 11 constituting element 15 (IP LLK). The value and sequence numbers of the TE of the unreformed element 16 (SLC) of the eleventh SMD data structure are identified as part of step 3051 as follows: the values of 211 text elements 21 of the linguistic sentence 11 constituting the element 15 of the first type of the tenth SMD data structure are identified as the value 161 of the unreformed element 16 (SLLK), and serial numbers 152 of the mentioned text elements 21 of the linguistic sentence 11, constituting the element 15 of the first kind of which the non-transformed element 16 consists, identify as serial numbers 162 of the text elements constituting the non-transformed element 16 (SLLK). The value and serial numbers of the TE of the transformed element 16 (heterogeneous SLCK) of the eleventh SMD data structure are identified as part of step 3052 as follows: values of 211 text elements 21 of the linguistic sentence 11 constituting element 15 of the second type of the tenth SMD data structure, with the exception of the values of 211 text elements 21 linguistic sentences 11 constituting separate parts of element 15 of the second kind

(SPLCK-OCH), which were removed during heterogenization, and signs of communication of remote SPLCK-OCH (syntactically subordinate compound unions or punctuation marks syntactically subordinate to the remote SPLC-OCH), are identified as the value 161 of the transformed element 16 (SLCK), and the serial numbers are 212 text elements 21 linguistic sentences 11 constituting element 15 of the second type of the sixth SMD data structure, with the exception of serial numbers 212 of text elements 21 of linguistic sentences 11 constituting separate parts of element 15 of the second type (SPLC-OCH), which were removed during heterogenization, and signs of communication of remote SPLLK-OCH (syntactic subordinate compound unions or punctuation marks syntactically subordinate to remote SPLLK-OCH) are identified as serial numbers 162 of the text elements making up the transformed element 16 (heterogeneous SLCK).

[0162] The formation of the eleventh SMD data structure in step 3053 is performed by combining in a single data structure,

identified in steps 3051 and 3052 elements 16 of the eleventh

SMD data structures, as well as their identification data according to principles and methods known from the prior art, which, accordingly, are not described in further detail below. The first of the combined elements of the eleventh SMD data structure is the first type of element 15 identified in step 3051, which is an untransformed element 16, that is, SLCK 16,

coinciding with the IP LLK 15 without syntactic writing connection. The second of the combined elements of the eleventh SMD data structure are transformed elements 16, formed at block 3052 as a plurality of transformed elements 16, which are heterogeneous SLL,

formed from an element 15 of the second type (IC LLK with syntactic co-linking) by heterogenization of the IC LLK of the second type.

[0163] Identification of the linguistic characteristics of TEs constituting element 16 (SLLK) of the eleventh SMD data structure and their values, if necessary, is carried out by organizing a request to the BDLP generated in step 3041, consisting of identification data of TEs constituting element 16 (SLLK), and obtaining the values (2131, 2141, 2231, 2151, 2161, 2171) of all parts of the linguistic characteristics (213, 214, 223, 215, 216, 217) of the text elements 21 of the linguistic sentence 11 of which element 16 consists (SLLK). Moreover, as previously described, the linguistic features of element 16 (SLLK) are at least the morphological, syntactic, and semantic characteristics of the text elements 21 of the linguistic sentence 11, which make up the element 16 (SLLK).

[0164] However, after completing the previously described step 301, in order to provide an alternative increase in the accuracy of the subsequent search in a structured data array (SMD), it is alternatively possible to further transform the SMD containing at least basic linguistic-logical objects ( OLLO) linguistic sentences and their identification data. Moreover, it should be obvious to a person skilled in the art that the principles and methods described below, upon completion of the previously described steps 301 of the method 300, can be carried out both simultaneously (in parallel) with the previously described steps 301-305 of the method 300 and not simultaneously (alternatively or sequentially , including regardless of the order), and also that individual principles and methods of the individual steps described below can be implemented in a similar way.

[0165] FIG. 38, as an example, but not limitation, depicts a General diagram of the steps of the claimed method 400 for converting SMD containing at least the main linguistic-logical objects (LLL) of a linguistic sentence and LLL identification data,

which is the original data structure for the considered method 400. The claimed method 400 for converting SMD containing at least

The LLLO of the linguistic sentence and their identification data is characterized by: performing the identification step 301 of a data structure suitable for transforming containing the LLLF, on which

identifying the SMD data structure containing elements of said transformable data structure containing OLLO, which is the original SMD data structure within the framework of the method 400 in question, said elements of the transformable data structure being the main linguistic-logical objects (OLLL)

Linguistic Suggestions and Identity OLLO;

the execution of step 402 of the formation of the twelfth data structure of the DMD, which forms the twelfth data structure of the SMD, which is the first data structure of the SMD for the method 400 under consideration, containing elements said twelfth SMD data structure, wherein said elements of the twelfth SMD data structure are the main linguistic logical elements (ALLE) of the linguistic sentence formed by dividing the ALLO into the semantic parts of ALLO, and also represent the ALLE identification data, representing for each ALLE, as an example, but not limitation: the meaning of ALLE and the ordinal (ordinal) number (s) of TE of the linguistic sentence that make up ALLE; performing step 403 of generating the thirteenth SMD data structure, which forms the thirteenth SMD data structure, which is the final SMD data structure for the method 400 under consideration, containing elements of the thirteenth SMD data structure, said elements of the thirteenth SMD data structure being the original linguistic-logical elements ( ILLE) linguistic sentences,

formed by dividing ALLE into semantic parts of ALLE and forming the initial elements from them, and also represent

ILLE identification data, representing for each ILLE, as an example, but not limitation: the meaning of ILLE and the serial number (s) of the TE of the linguistic sentence constituting

(constituents) ILLE.

[0166] As indicated previously with reference to FIG. 26, by way of example, but not limitation, a general flowchart is shown for performing the SMD data structure identification step 301, which identifies the SMD data structure, which is the original data structure for the method 400 in question, whose elements 91 are the OLLO of the linguistic sentence 11, as well as identification the data of elements 91, representing, for each element 91, as an example, but not limitation, the value of 911 element91 and the serial number (s) of 912 TE linguistic sentences 1 1 constituting the element (s) 91. Described in this way with reference to FIG. 26, the process is not further described in detail. Based on the results of step 301, a data structure is identified,

shown in FIG. 27, the composition of which is described with reference to FIG. 27, respectively, is not further described in detail.

[0167] FIG. 39, as an example, but not limitation, depicts a General diagram of the steps of step 402 of the formation of the twelfth data structure SMD, which is the first data structure for the method 400 under consideration. Step 402 is characterized by: performing step 4021 of generating values of 2181 of the seventh of the linguistic characteristics of 218 elements 21 constituting elements of 91 suitable for transforming a data structure containing ALLO which generates values of 2181 of the seventh of the linguistic

characteristics 218 of the elements 21 constituting the elements 91 and inputting the received information into the BDLP of the text elements 21 of the linguistic sentence 11 formed in step 3041, forming as a result the BDLP of the text elements 21 of the linguistic sentence 11 formed in the step 4021;

performing step 4022 of forming the semantic parts of the elements 91 of a transformable data structure containing ALLO, on which the first and subsequent (second, third, and so on) semantic parts of ALLO are formed, clarifying the semantic structure of ALLO;

performing step 4023 of generating the twelfth data structure of the DMD on which the elements 17 of the twelfth data structure of the SMD are identified, as well as the identification data of the elements 17, which for each element 17, by way of example, but not limitation, the value 171 of the element 17 of the twelfth data structure of the SMD and ordinal (ordinal) number (s) 172 TE of linguistic sentence 11 constituting

(component) element 17, and form the twelfth data structure of the SMD.

[0168] In FIG. 40, by way of example, but not limitation, the general structure of the database of linguistic features (BDLP) generated as part of step 4021 is shown, which is BDLP of text elements 21 of sentence 11 contained in elements 91 suitable for transforming a data structure containing OLL formed within step 4021. The BDLP generated in step 4021 differs from the BDLP generated in step 3021 in the presence of values 2181 of the seventh of the linguistic characteristics 218 indicating the main word of the semantic part of element 91 (ALLO). The reason for revealing the main word of the semantic part of ALLO is that, using the identified main word of the semantic part of ALLO and

previously identified syntactic relationships between words (LLE) in OLLL, you can divide OLLL into separate semantic parts. The semantic parts of OLLO clarify the semantic structure of OLLO, in which individual parts fulfill their limited semantic roles in OLLO. Semantic parts in OLLO make it possible to distinguish in OLLO integrated in it the dominant and clarifying entities of the surrounding world (objects and actions) for the correct identification of their semantic roles and, as a consequence, the meaning of OLLO. The allocation of individual semantic parts in OLLO allows you to establish their unique logical roles in OLLO, which allows you to store these parts separately without losing their semantic logical role in the sentence. Such separate storage allows you to quickly and accurately search for the semantic parts of OLLO, as well as separately from other parts of the linguistic sentence, to analyze and process them differently.

[0169] The formation of the seventh of the linguistic characteristics 218 and their values 2181 during step 4021 is carried out on the basis of a comprehensive analysis of the values of the linguistic characteristics of the text elements 21 included in the elements 91. During this analysis, all text elements 21 constituting the element 91 check for the presence of a characteristic the main word

the semantic part of element 91. A sign of the main word of the semantic part of element 91 can be, as an example, but not limitation, the presence of the following values of linguistic characteristics of a text element (TE):

TE is the main word of OLLO or TE has a dependent TE in the form of an excuse. Other own criteria for identifying the syntactic part of the OLLO can be preliminarily set. For example, but not limited to, a list of parts of speech that indicate the main word of the syntactic part of the OLLO can be refined. If TE has the indicated values of linguistic characteristics, the corresponding values of 2181 of the seventh part of linguistic characteristics 218 are formed - “the main word of the semantic part of OLLO”.

[0170] The values 2181 (the main word of the semantic part of element 91) of the seventh part of the linguistic characteristics 218 of the elements 21 constituting the OLLO 91 suitable for transforming the data structure containing the OLLO are entered into the BDLP of the text elements 21 of the linguistic sentence 11 generated within the framework of step 4021 of step 2031, forming as a result BDLP of the text elements 21 of the linguistic sentence 11, formed as part of step 4021. Moreover, all parts (211, 214, 232, 215, 216 and 218) of the linguistic characteristics of the text elements 21 of the linguistic Proposals 11 and their meanings (2111, 2141, 2321, 2151, 2161 and 2181) form unique linguistic features of elements 91 for alternative ways to improve search accuracy in a structured data array.

[0171] FIG. 41, by way of example, but not limitation, the general structure of the generated twelfth SMD data structure is shown, which is the first for the present conversion method 400. The twelfth SMD data structure is a SMD containing elements 17 of the twelfth SMD data structure, which are the main linguistic logical elements (ALLE) of the linguistic sentence 11 and the ALLE identification data, which are, by way of example, but not limitation, the values of 171 elements 17 of the twelfth data structure and oryadkovye number 172 text elements constituting OLLE. OLLE 12 of the linguistic sentence 11 lacks unique names characterizing them, which have practical use. In the data structure, elements 12, as an example, but not limitation, can be referred to as “ALLE pt,” where n> 1 indicates the serial number of the ALLO in the linguistic sentence, am> 1) indicates the syntactic level of ALLE in the ALLO, starting from the first. The OLLE of Proposition 11 is a boundary entity between linguistics and logic. From a linguistic point of view, ALLE is a syntactic unit of the form

A “syntactic structure” formed from ALLO 91 by highlighting in OLO 91 individual syntactic parts based on the presence of the main words of the syntactic parts in ALLO 91. The main words mentioned in the syntactic part of ALLO 91 are such parts of speech as a noun, pronoun or verb (in any grammatical form), which are either syntactically the main word of OLLO 91, or contain a direct syntactic descendant (a word with which there is a direct syntactic subordinate relationship) with a syntactic role "Pretext". Other criteria for identifying the syntactic part of OLLO 91 can be preliminarily set. For example, not

By limiting ourselves, it is possible to set criteria in the form of a specific list of prepositions, the presence of which indicates the main word of the syntactic part of ALLO 91. From the point of view of syntax, ALL 17 are words and special phrases inside ALLO 91. Depending on the complexity of the linguistic structure, ALLO 91 can be distinguished in it one or more ALLE 17. From a logical point of view, ALLE 17 is a logical entity that is a logically separable part of an element of simple judgment. Moreover, such logically separable parts in their structure and logical role are no different from OLLO 91 for the reason that, in the same way, like OLLO 91, they represent a conceivable (semantic) image of a certain object or action of the surrounding world). Reason for existence

The semantic parts of the image of ALLO 91 is that not every image of ALLO 91 can be linguistically implemented using only one composite element (a group of words without a preposition between words). For example, without limitation, the imaginary image of OLLO 91 about the right to something (for example, without limitation, “the consumer's right to return the goods”) cannot be realized with one constituent element (one group of words without an excuse between words)). In the mentioned example, only two constituent elements - the first - “consumer right”, and the second - “return of goods” - form a conceivable image of an element of simple judgment. The constituent element of ALLO 91 (the semantic part of ALLO) is referred to as the term ALLO 91. C

the semantic point of view the terms OLLO 91, forming OLLO 91

are not the same, that is, they have different semantic roles. Syntactically, the main terms ALLO 91 have a maximum semantic role, that is, they are the terms ALLO 91 of the first semantic part. The term OLLO 91 of the first semantic part defines the semantic vector OLLO 91. For example, without limitation, in the example “consumer right to return the goods”, the semantic essence of the OLLO 91 image is formed by the constituent element “consumer right” rather than the constituent element “return of goods”. The terms OLLO of the second and subsequent semantic parts are syntactically dependent terms OLLO 91, and their presence in OLLO 91 is necessary only to clarify the imaginary image of OLLO 91. The more complex the imaginary image of OLLO 91 is, the more terms OLLO 91 may require to clarify its imaginary image.

Structuring the terms OLLO 91 as the semantic parts of OLLO allows you to correctly perceive the semantic roles of individual terms in OLLO 91. Separating in this way OLLO 91 allows you to intelligently search for information with increased accuracy, since if there is a more structured (by the logical role of words) data array, a more accurate description of the desired objects. As a result of machine processing of such an array of data, the accuracy of the search is increased and the search time is reduced,

[0172] The formation of the parts (semantic parts) of the elements 91 suitable for transforming the data structure containing OLLO, at step 4022, is performed by performing the following steps: at the first stage, from the identified main word of the semantic part of the ALLO 91 identify all syntactically subordinate words, up to the word, which itself is the main word of the semantic part of OLLO 91, but not including this word; at the second stage, the main word of the semantic part of ALLO 91 and all the syntactically subordinate words mentioned, including successively subordinate, form the semantic part of ALLO 91. The serial number of the semantic part in ALLO indicates its semantic role in ALLO and is formed in accordance with the serial number of the text element (TE) of the main word of the semantic part of OLLO 91. The semantic part of OLLO 91 with the lowest sequence number TE of the main word is the first

the semantic part of OLLO 91. The semantic part of OLLO 91, having

the TE number closest to it of the main word of the semantic part is the second semantic part of OLLO 91, and so on. The serial number of ALLE is indicated in the name of ALLE, for example, but not limited to - "ALLE 1.2". Such a name for element 17 suggests that it is the second semantic part in the first OLLO (according to the syntactic level “.2”). The identification of elements 17 of the twelfth SMD data structure during step 4023 is carried out on the basis of step 4022 of forming the semantic parts of elements 91 of a transformable data structure containing OLLO suitable for transformation. All the formed semantic parts of elements 91 are identified with elements 17 of the twelfth data structure of the DMD.

[0173] Identification of the value and serial numbers of the TE of element 17 of the twelfth SMD data structure is performed as part of step 4023 by identifying the value 171 of element 17 with the values of 911

the identified semantic part of element 91, and serial numbers 172 of text elements 21 constituting element 17 by identification with serial numbers 912 of text elements constituting

identified semantic part of element 91.

[0174] The formation of the twelfth SMD data structure during step 4023 is performed by combining the elements 17 in one data structure

the twelfth data structure of the DMD and their identification data according to the principles and methods known from the prior art, which, respectively, are not described in further detail below.

[0175] Identification of the linguistic characteristics of FCs constituting the elements 17 (ALLE) of the twelfth structure of the SMD data and their values for Necessities are made by organizing a request to the BDLP, which is formed as part of step 4021, consisting of the identification data of the fuel cells that make up element 17 (ALLE), and obtaining the values (2131, 2141, 2231; 2151, 2161, 2181) of all parts of the linguistic characteristics (213, 214, 223, 215, 216,218) of text elements 21 of the linguistic sentence 11 of which element 17 (ALLE) consists. Moreover, as previously described, the linguistic features of the elements 17 (ALLE) are at least the morphological, syntactic and semantic characteristics of the text elements 21 of the linguistic sentence 11, of which the elements 17 (ALLE) consist.

[0176] In FIG. 42, by way of example, but not limitation, a general flowchart of the steps of generating the thirteenth SMD data structure 403 is shown, which is the second data structure for the method 400 under consideration. Step 403 is characterized by: performing step 4031 of generating values of 2191 of the eighth of the linguistic characteristics of 219 elements 21, constituent elements 17 of the twelfth structure of the SMD data, on which the values of 2191 of the eighth of the linguistic characteristics of the 219 elements 21 constituting the elements 91 are formed and the resulting information is introduced I BDLP text elements 21 linguistic Proposition 11, formed in 4021 under the stage, forming eventually BDLP text elements 21

linguistic sentence 11, formed in the framework of step 4031;

performing step 4032 of the formation of the semantic parts of the elements 17 of the twelfth data structure of the DMD, on which the first and

subsequent (second, third, and so on, in order) semantic parts of ALLE, clarifying the semantic structure of ALLE; step 4033

the formation of elements 18 of the thirteenth data structure of the SMD, on which form the elements 18 of the thirteenth data structure of the SMD and

the identification data of the elements 18, which, by way of example, but not limitation, represent the values of 181 elements 18 of the thirteenth SMD data structure, serial numbers 182 of the TE of the linguistic sentence 11 constituting the elements 18, and also identify the first and second types of elements 18 of the thirteenth SMD data structure and form the thirteenth SMD data structure.

[0177] FIG. 43, by way of example, but not limitation, the general structure of the linguistic database formed as part of step 4031 is depicted attributes (BDLP), which is BDLP of text elements 21 of sentence 11 contained in elements 17 of the twelfth data structure of the DMD. The BDLP generated in step 4031 differs from the BDLP formed in step 4021 in the presence of 2191 values of the eighth of the linguistic

characteristics 219 indicating the main word of the semantic part of element 17 (ALLE). The reason for revealing the main word of the semantic part of ALLE is that, using the identified main word of the semantic part of ALLE and the previously syntactic links between words (LLE) in ALLE, it is possible to divide ALLE into separate semantic parts. The semantic parts of ALLE clarify the semantic structure of ALLE, in which the individual parts fulfill their limited semantic roles in ALLE. The semantic parts in ALLE allow us to distinguish in ALLE integrated into it the dominant and clarifying entities of the surrounding world (objects and actions) for the correct identification of their semantic roles and, as a result, the meaning of ALLE. The allocation of individual semantic parts in ALLE allows you to establish their unique logical roles in ALLE, which allows you to store these parts separately without losing their semantic logical role in the sentence. This separate storage allows

carry out a quick and accurate search for the semantic parts of ALLE, as well as separately from other parts of the linguistic sentence, to carry out their various analysis and processing.

[0178] The formation of the eighth of the linguistic characteristics 219 and their values 2191 during step 4031 is carried out on the basis of a comprehensive analysis of the values of the linguistic characteristics of the text elements 21 included in elements 17. During this analysis, all text elements 21 constituting element 17 check for the presence of a characteristic the main word

the semantic part of the element 17. A sign of the main word of the semantic part of the element 17 may be, as an example, but not limitation, the presence of a text element (TE) the following values of linguistic characteristics:

TE is the main word of ALLE or TE has the form of a noun, pronoun or verb (in any grammatical form). Other own criteria for identifying the syntactic part of ALLE can be preliminarily set. For example, without limitation, the list of parts of speech that indicate the main word of the syntactic part of ALLE can be clarified. If the TE indicated values of linguistic characteristics are formed the corresponding values of 2191 of the eighth of the linguistic characteristics 219 are “the main word of the semantic part of ALLE”.

[0179] The values 2191 (the main word of the semantic part of element 17) of the eighth part of the linguistic characteristics 219 of the elements 21 constituting the OLLE 17 of the twelfth data structure of the SMD are generated in the LLLB of the text elements 21 of the linguistic sentence 11 generated in step 4021, forming as a result, the BDLP of the text elements 21 of the linguistic sentence 11, formed in the framework of step 4031. Moreover, all parts (211, 214, 232, 215, 216, 218, 219) of the linguistic characteristics of the text elements 21 of the linguistic sentence 11 and and values (2111, 2141, 2321, 2151, 2161, 2181, 2191) form features unique linguistic elements 17 for an alternative method of increasing search accuracy in a structured dataset.

[0180] FIG. 44, by way of example, but not limitation, the general structure of the generated thirteenth SMD data structure is shown, which is the second SMD data structure for the present conversion method 400. The thirteenth SMD data structure is a SMD containing elements 18 of the first and second types of the thirteenth SMD data structure, which are the original linguistic logic elements (ILE)

linguistic sentence 11 and the ILLE identification data, which are for each ILLE, as an example, but not limitation: the value of 181 element 18 of the sixth SMD data structure, ordinal

(serial) number (s) 182 TE of the linguistic sentence 11 constituting the elements 18. ILLE 18 of the linguistic sentence 11 does not have unique names describing them that have practical use. In the data structure, elements 18, by way of example, but not limitation, can be referred to as "ILLE nmk", where n> 1 indicates the sequence number of the ALLO in the linguistic sentence, am> 1 indicates the syntactic level of ALLE in the ALLO, starting from the first, ak> 1 indicates the syntactic level of ILLE in ALLE, starting with the first. ILLE 18 of Proposition 11 is a boundary entity between linguistics and logic. From a linguistic point of view, ILLE 18 is the primary syntactic unit that is a word. From a logical point of view, ILLE 18 is a logical entity that is a logically separable part of the term element of a simple judgments (the term ЛОLLO). Moreover, such logically separable parts in their structure and logical role are no different from ALLE 17 for the reason that, like ALLE 17, they represent a conceivable (semantic) image of a certain object or action of the surrounding world). The reason for the existence of the syntactic parts of ALLE 17 is that not every conceivable image of ALLE 17 can be represented using only one “source element” (a word,

expressing an object or action). For example, without limitation, the conceivable image of OLLE 17 about the rights of someone, for example, without limitation, “consumer rights”, cannot be realized with one initial element. In the mentioned example, only two initial elements - the first one is “rights” and the second one is “consumer” form an imaginable image of the term of an element of simple judgment (the term OLLO). The original element ALLE 17 is referred to as the concept of the term ALLO (PT OLLO) or elements 18 (ILLE). From the semantic point of view, PT OLLO (ILLE 18), forming the term OLLO (ALLE 17) are not equivalent, that is, they have different semantic roles. Syntactically, the main PT OLLOs have the maximum semantic role, that is, the PT OLLOs of the first semantic part. PT OLLO first

semantic parts determine the semantic vector of the term ALLO (ALLE 17). For example, in the mentioned example of “consumer rights”, the essence of the image of the term ALLO (ALLE 17) is precisely “rights” and not “consumer”. PT OLLO of the second and subsequent semantic parts (syntactically dependent PT OLLO) in the term OLLO (ALLE 17) are necessary to clarify the meaning of the term ALLO (ALLE 17). The more complex the conceivable image of the term ALLO (ALLE 17), the more PT OLLO (ILLE 18) it may require. Structuring PT OLLO (ILLE 18) as the semantic parts of ALLE 17 allows us to correctly perceive the semantic roles of individual PT ALLO (ILLE 18) in terms of ALLO (ALLE 17). Element 18 (ILLE) is formed from ALLE 17 by isolating individual

syntactic parts based on the presence of the main words of the syntactic parts of OLLE 17 and the subsequent structuring of text elements of the syntactic part. Depending on the complexity of the linguistic structure of OLLE 17, one or more syntactic parts can be distinguished in it (ILLE 18).

The syntactic part (ILLE 18) OLLE 17 can be only one main word or several syntactically related words

(significant parts of speech), of which the main word is syntactically the main one. Thus the main word of the syntactic part (ILLE 18) OLLE 17 is identified with element 18 of the first kind, and all words syntactically dependent on it (significant parts of speech), if any, are identified with element 18 of the second kind. From a logical point of view, the initial linguistic elements (elements 18) of the first kind are “concepts” - that is, objects or actions of the surrounding world, and elements 18 of the second kind are “signs of a concept”, that is, some semantic characteristics of the mentioned objects and actions.

[0181] The formation of the parts (semantic parts) of the elements 17 of the twelfth SMD data structure is performed at 4032 by performing the following steps: at the first stage, all syntactically subordinate words are identified from the identified main word of the element 17 (ALL), up to the word which itself is the main word of the semantic part of element 17 (ALLE), but not including this word; at the second stage, the main word of the semantic part of element 17 (ALL) and all the mentioned syntactically subordinate words, including successively subordinate ones, form the semantic part of element 17 (ALL). The serial number of the semantic part in ALLE indicates its semantic role in ALLE, and it is formed in

in accordance with the number of the text element (TE) of the main word of the semantic part of ALLE 17. The semantic part of ALL 17 with the lowest TE number of the main word is the first semantic part of element 17 (ALL). The semantic part of OLLE 17, having the number of the main word TE closest to it

the semantic part is the second semantic part of ALLE 17, and so on. The serial number of ILLE is indicated in the name of ALLE, for example, but not limited to - "ILLE 1.2.1." This name of the element 18 suggests that it is the first semantic part in the second ALLE of the first ALLO.

The formation of elements 18 of the thirteenth SMD data structure during step 4033 is based on the results of step 4032 of forming the semantic parts of elements 17 of the twelfth SMD data structure as follows: at the first stage, all significant parts of speech in the semantic parts of elements 17 are identified with elements 18 of the thirteenth SMD data structure; at the second stage, the identified identified main words of the semantic parts of elements 17 are identified with elements 18 of the first kind; in the third stage, all other elements 18 of this semantic part, except for element 18 of the first kind, are identified with elements 18 of the second kind. [0182] Identification of the value and serial number (s) of the TE of element 18 of the twelfth SMD data structure is performed as part of step 4033 by identifying the value 181 of element 18 with the value (s) of 211 textual (text) element (s) 21

linguistic sentence 11 constituting (constituent) element 18, and serial (ordinal) number (s) 182 of textual (text) element (s) 21 constituting (constituting) element 18 by identifying with ordinal (ordinal) number (s) 212 text (text) element (s) 21 constituting the (component) element 18.

The identification of the types of elements 18 of the twelfth data structure of the DMD within the framework of step 4031 does not change the value 181 of element 18 and does not change the serial number (s) of the TE 21 of the linguistic sentence 11,

component (s) element 18 (ILLE).

[0183] The formation of the thirteenth SMD data structure during step 4033 is performed by combining in one data structure the elements 18 of the first and second types of the thirteenth SMD data structure, as well as their identification data according to the principles and methods known from the prior art, which, accordingly, are not described in further detail below. are described.

[0184] Identification of the linguistic characteristics of TEs constituting the elements 18 (LLE) of the thirteenth SMD data structure and their values, if necessary, is carried out by organizing a request in the BDLP generated in step 4031, which consists of the identification data of the TEs constituting the element 18 (LLE), and obtaining the values (2131, 2141, 2231; 2151, 2161, 2181, 2191) of all parts of the linguistic characteristics (213, 214, 223, 215, 216, 218, 219) of the text elements 21 of the linguistic sentence 11 of which element 18 consists (ILLE ) Moreover, as described previously

the linguistic features of elements 18 (ILLE) are at least the morphological, syntactic and semantic characteristics of the text elements 21 of the linguistic sentence 11, of which the elements 18 (ILLE) are composed.

[0185] FIG. 45, by way of example, but not limitation,

illustrated is an exemplary diagram of a conversion system 500

a structured data array, which in a preferred embodiment comprises at least one or more computer devices 501 converting a structured data array containing at least one or more processors 5011 and memory 5012. Said devices 501 converting a structured data array may include, but are not limited to: a personal computer,

laptop computer, tablet computer, PDA, smartphone, phablet and the like. The memory (computer-readable storage medium) 5012 of the structured data array conversion device 501 contains a program code that, when executed, causes said one or more processors 5011 of the said device 501 to perform the actions of the previously described methods for converting a structured data array. In some cases, the computer device 501 may be a server computer device associated with a user

a computer device configured to transmit to the server computer device 501 commands or instructions causing the processor or processors 5011 of the server computer device to execute code

a program that, when executed by the processor or processors of the server computer device 5011, causes the processor or processors 5011 of the server computer device to perform the actions of any of the previously described methods for converting a structured data array. The user computer device 502 may include, but is not limited to: a personal computer, a laptop computer, a tablet computer, a PDA, a smartphone, a phablet, a thin client, and the like. The user computer device 502 may be connected to the server computer device 501 via a wired or wireless connection. Mentioned memory 5012 of the computer device 501 (server computer device 501) contains subject

transforming one or more structured data arrays containing at least a linguistic sentence, and may also contain any of the previously described source data structures for any of the previously described methods for converting a structured data array. Moreover, one or more to be converted

structured data arrays can be downloaded and stored, in particular, in the database 503 of the structured data array conversion system. As an example, but not limitation, computer-readable the storage medium (memory 5012) may include a random access memory (RAM); read-only memory device (ROM); Electrically Erasable Programmable Read-Only Memory (EEPROM); flash memory or other memory technologies; CDROM, digital versatile disc (DVD) or other optical or holographic storage media; magnetic cassettes, magnetic tape, magnetic disk storage device or other magnetic storage devices, carrier waves or other storage medium that can be used to encode the desired

information and which can be accessed through

devices 501. The memory includes a storage medium based on

a computer storage device in the form of volatile or

non-volatile memory, or combinations thereof. Exemplary hardware devices include solid state memory, hard disk drives, optical disk drives, and so on. Stored in memory

an exemplary environment in which, using computer instructions or codes stored in the device’s memory, a procedure can be performed

transformations of a structured data array. The device contains one or more processors 5011, which are designed to perform

computer instructions or codes stored in the device’s memory in order to ensure the implementation of the procedure for converting a structured data array. Computer instructions or codes stored in memory are designed to perform the conversion of a structured data array. System 500 may also include a database (DB) 503. Database 503 may

represent, but not limited to: a hierarchical database, a network database, a relational database, an object database, an object-oriented database, an object-relational database, a spatial database, a combination of these two or more databases, and the like. The database 503 stores data in memory, which may be, but not limited to: read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory, CDROM, digital versatile disk (DVD) or other optical or holographic data carriers; magnetic cassettes, magnetic tape, magnetic disk storage device or other magnetic storage devices, wave carriers or other storage medium that can be used to store the required information, and which can be accessed by device 501 converting a structured data array. The database 503 is used to store data representing at least commands for performing the steps of the previously described methods for converting a structured data array; one or more structured data arrays to be converted containing at least a linguistic sentence, or one of the data structures that can be loaded into the memory 5012 of the device 501 of the conversion of a structured data array described previously for any conversion method; and other data

necessary for the functioning of the system. Sample System 500

transforming a structured data array may further comprise a server computer device 501, which, in addition to the functions described previously, saves and facilitates the manipulation of computer instructions or codes previously described in this document, which, accordingly, are not further described. The server computer device 501, in addition to the functions described above, can provide data exchange control in the structured data array conversion system 500, and also provides data processing provided that one or more than one user computer devices 502 are connected to it. In this case, all computing power, necessary to ensure the implementation of the procedure for converting a structured data array, located on the server computer device 501. The system 500 can also This may include one or more data networks 504. 5 data networks 204 may include, but are not limited to, one or more local area networks (LANs) and / or wide area networks (WANs), or may be an information telecommunication network Internet, or an Intranet, or a virtual private network (VPN) , or a combination thereof, and the like. Server computer device 501 also has the ability to provide virtual

a computing environment (Virtual Machine) for providing interaction between the user computer device 502 and the database 503. The network 504 serves to provide interaction between the computer device 501, the database 503 and the user computer device 502 of the structured data array conversion system 500. In this case, the user computer device 502 may be associated with a server computer device 501 directly, using wire and wireless communication methods and methods known from the prior art, which, respectively, are not described in further detail below.

Claims

Claim

1. Performed by the processor of the computer device

transforming a structured data array (SMD) containing at least a linguistic sentence characterized by the following steps:

A) the formation of the first data structure, which form the first data structure of the SMD containing the elements of the aforementioned first data structure, and said elements of the first data structure are text elements (TE) of the linguistic sentence, and

identification data of text elements (identification data of TE), representing for each TE, at least: value of a text element (value of TE) and serial number of a text element (serial number of TE) in a linguistic sentence;

B) the formation of a database of linguistic features (BDLP), which reveal the linguistic features of text elements

(linguistic features of TE) of a linguistic sentence, from which a database is formed, which is a BDLP of text elements of a linguistic sentence;

B) forming a second data structure on which the second SMD data structure is formed containing the elements of said second data structure, said elements of the second data structure being the syntactic units (CE) of the linguistic sentence,

formed on the basis of information from BDLP text elements

linguistic sentences, and also represent the identification data of the syntactic units (identification data CE),

representing for each CE at least: the value of the syntactic unit (CE value) and the serial (serial) number (s) of the TE

linguistic sentences constituting CE.

2. The method according to p. 1, characterized in that, in the framework of step A), a method is performed characterized by the steps:

A1) identification of the initial SMD data structure on which

identify elements of the original SMD data structure that are linguistic sentences;

98

SUBSTITUTE SHEET (RULE 26) A2) identifying the elements of the first SMD data structure, on which the elements of the first SMD data structure are identified, which are text elements (TE) of the linguistic sentence, and

TE identification data, representing for each TE, at least: the value of the TE and the serial number of the TE, and form the first data structure of the SMD.

3. The method according to p. 2, characterized in that in the framework of step A1)

provide a classification of the elements that make up the original data structure as linguistic sentences.

4. The method according to p. 2, characterized in that in the framework of step A2)

identify the elements of the first SMD data structure, which are text elements (TE) of the linguistic sentence, by identifying individual words or groups of words, numbers (numbers) or indices, and the numbers (numbers) or indices are not separated by a space, as well as punctuation marks.

5. The method according to claim 4, characterized in that the last punctuation mark in the linguistic sentence is not taken into account during identification and is not considered as a text element of the linguistic sentence.

6. The method according to p. 2, characterized in that in the framework of step A2)

the value of the text element is identified by registering the characters (letters, numbers and (or) punctuation marks) that make up the text element, and the sequence number of the text element is identified by calculating the location of the text element in the linguistic

offer.

7. The method according to p. 6, characterized in that the first text element in the linguistic sentence receives the serial number "1", and all

subsequent text elements receive a sequence number that is one greater than the sequence number of the previous text element.

8. The method according to p. 1, characterized in that stage B) is characterized by the implementation of the steps:

B1) the formation of the linguistic characteristics of the text elements of the linguistic sentence, on which for the linguistic analysis of the text element provide the identification data of the element (value and serial number of the text element) and get the linguistic characteristics of the text elements of the linguistic sentence, as well

99

SUBSTITUTE SHEET (RULE 26) the meanings of said linguistic characteristics 2131;

B2) the formation of a database of linguistic features (BDLP), on which form BDLP text elements of a linguistic sentence; moreover, the linguistic feature of the text element is all received for the text element in the course of stage B1)

linguistic characteristics possessing values of linguistic characteristics.

9. The method according to and. 8, characterized in that in the framework of stage B1)

linguistic characteristics and their meanings are formed by a complex linguistic analysis of each text element of a linguistic sentence, which, by way of example, but not limited to, provides a comprehensive analysis of a text element based on the location of a text element in the structure of a linguistic sentence, its meaning, type, classification of its conceivable image and analysis its connections with other textual elements in a linguistic sentence, after which they produce

the formation of basic linguistic characteristics

10. The method according to and. 8, characterized in that the formation of BDLP is carried out by making formed in the framework of stage B1)

linguistic characteristics and their values in the BDLP in the form of a list of the first part of linguistic characteristics with the values of these characteristics.

11. The method according to and. 1, characterized in that stage B) is characterized by the implementation of the steps:

B1) identification and formation of the first elements of the second SMD data structure, on which the first elements of the second SMD data structure are identified and formed, which are the first syntactic units, as well as the identification data of the first syntactic units,

representing for each first syntactic unit at least the value of the first syntactic unit and the serial number of the text element or text elements of the linguistic sentence constituting the first syntactic unit;

B2) the formation of the second elements of the second SMD data structure, on which the second elements of the second SMD data structure are formed,

being the second syntactic units, as well as the identification data of the second syntactic units, representing for each such

one hundred

SUBSTITUTE SHEET (RULE 26) syntactic unit, at least the meaning of the second syntactic unit and serial numbers of textual elements of the linguistic

sentences that make up the second syntactic unit;

OT) the formation of the second SMD data structure, on which the second SMD data structure is formed by combining the first syntactic units, the second syntactic units and their identification data.

12. The method according to claim 11, characterized in that, as part of step B1), the first syntactic units are identified or formed by a comprehensive analysis of the linguistic characteristics of the text elements of the linguistic sentence, the first syntactic units consisting of one text element being identified, and the formation the first syntactic units consisting of several text elements are exposed, while identification or formation is carried out by identifying Ia first syntactic units

relevant source syntax objects, representing a list of source syntactic objects, pre-installed and recorded in the first user database.

13. The method according to p. 11, characterized in that, as part of step B2) the formation of the second syntactic units of two or more first

syntactic units are produced by a complex analysis of the values of the linguistic characteristics of the text elements, which are the first syntactic units, when identifying the linguistic characteristics of the text elements that make up the first syntactic units that correspond to the description of the syntactic structure contained in the first user database.

14. The method according to p. 11, characterized in that, in the framework of step B1), the identification of the value and serial (ordinal) numbers (numbers) of the TE of the first syntactic units is performed as follows: value

(values) of the text (s) element (s) of the linguistic sentence constituting (making up) the first syntactic unit, is identified as the value of the element of the first syntactic unit, and the serial (serial) number (s) of the mentioned (mentioned) text (text) element (s) ) of a linguistic sentence constituting (constituting) the first syntactic unit, is identified as

101

SUBSTITUTE SHEET (RULE 26) the ordinal number of the text element (s) 312 constituting the first syntactic unit.

15. The method according to p. 11, characterized in that, as part of step B2), the identification of the values and sequence numbers of the TE of the second syntactic units is performed as follows: the values of text elements

the linguistic sentences constituting the second syntactic unit are identified as the value of the second syntactic unit, and the serial numbers of the text elements of the linguistic sentences constituting the second syntactic unit are identified as the ordinal numbers of text elements constituting the second syntactic unit.

16. The method according to p. 11, characterized in that, as part of the OT stage), the formation of a second SMD data structure is performed by combining the first syntactic units and second syntactic units in one data structure.

17. The method according to p. 12, characterized in that, in the framework of stage B1), the linguistic characteristics of TEs that make up the first syntactic units and their values are identified, if necessary, by organizing a request to the BDL formed in stage B), consisting of TE identification data constituting the first syntactic unit, and obtaining the values of the linguistic characteristics of the text elements of the linguistic sentence that make up the first syntactic unit.

18. The method according to p. 13, characterized in that, as part of step B2), the linguistic characteristics of TEs that make up the second syntactic units and their values are identified, if necessary, by organizing a request to the BDL formed in the framework of step B), consisting of TE identification data constituting the second syntactic unit, and obtaining the values of the linguistic characteristics of the text elements of the linguistic sentence that make up the second syntactic unit.

19. A computer device for converting a structured data array containing at least a linguistic sentence containing at least:

a computer device processor, and

102

SUBSTITUTE SHEET (RULE 26) a memory containing program code, which, when executed by the processor of a computer device, causes the processor of the computer device to perform the actions of the method according to any one of paragraphs. 1-18.

20. A system for converting a structured data array containing at least a linguistic sentence containing at least:

server computer device being a computer

the device according to claim 19, and

a user computer device configured to transmit commands or commands to the server computer device,

causing the processor of the server computer device to execute program code, which, when executed by the processor of the server computer device, causes the processor of the server computer device to perform the actions of the method according to any one of paragraphs. 1-18.

21. A computer-readable storage medium containing program code, which, when executed by a processor of a computer device, causes the processor of the computer device to perform the actions of the method according to any one of paragraphs. 1-18.

103

SUBSTITUTE SHEET (RULE 26)