WO2020191993A1

WO2020191993A1 - Method for syntactic parsing of natural language

Info

Publication number: WO2020191993A1
Application number: PCT/CN2019/100638
Authority: WO
Inventors: 秦一男; 朱江
Original assignee: 北京语自成科技有限公司
Priority date: 2019-03-22
Filing date: 2019-08-14
Publication date: 2020-10-01
Also published as: CN110020434B; CN110020434A

Abstract

Disclosed is a method for syntactic parsing of natural language. The present invention addresses technical drawbacks of the Berkeley Parser and the Stanford Parser, two leading natural language syntactic parsers that are internationally recognized in the field of computer science, and discloses a technical solution to resolve these drawbacks. The present invention establishes a novel mathematical model for describing sentences, and proposes a computer-based syntactic parsing method based thereon. The present invention integrates, by using technical means, three aspects of computer processing of natural language, namely, lexical analysis, syntactic parsing, and semantic analysis, in an organic manner, and strengthens mutual constraints of these three aspects, thereby improving the ability of a computer to resolve structural ambiguity. The present invention involves high technical complexity, achieves a high level of integration, has a wide range of applications, involves a large amount of computation, and complies with natural principles of mathematics and computer science, thereby enhancing accuracy of computer syntactic parsing.

Description

A method of natural language syntactic analysis

This application claims the priority of the Chinese patent application filed on March 22, 2019 with the application number 201910224013.X and the title of the invention "a method of natural language syntactic analysis", the entire content of which is incorporated into this application by reference in.

Technical field

The invention relates to the field of computer data processing, in particular to a method of natural language syntax analysis.

Background technique

Natural language processing (NLP) is a very important direction in the field of computer science and artificial intelligence. It studies various theories and methods that can realize effective communication between humans and computers using natural language.

Syntactic analysis (syntactic parsing) is one of the key tasks in natural language processing (NLP). The basic task of syntactic analysis is to determine the syntactic structure of a sentence or the interdependence between words within a sentence. Among the existing various syntactic analysis techniques, the Probabilistic Context Free Grammars (PCFG method for short) is a technique widely used in the field of computer science. The PCFG method calculates the matching probability of syntactic rules and selects the syntactic analysis result with the highest probability as the final syntactic structure. In addition to the PCFG method, Dependency Parsing is also a syntactic analysis technique often used in the field of computer science.

Berkeley Parser and Stanford Parser are two internationally leading natural language syntactic analysis devices recognized by the computer science community today. These two kinds of natural language syntax analysis devices both use the lexicalized PCFG method (Lexicalized Probabilistic Context-Free Grammars). While using the lexicalized PCFG method to make syntactic analysis results, Stanford Parser also gave the syntactic analysis results made using dependency analysis methods.

However, Berkeley Parser and Stanford Parser still have some serious technical loopholes.

Special Note:

<1>, the erroneous syntactic analysis results given by Stanford Parser mentioned below include not only the results made by Stanford Parser using the lexical PCFG method, but also the results made by Stanford Parser using the dependency analysis method, that is, Stanford Parser Both the lexicalized PCFG method and the dependency analysis method are wrong results.

<2>, the following researches and comments on the existing computer syntactic analysis technology only involve the PCFG method, not the dependency analysis method.

The first type of technical vulnerabilities:

Since January 2014, the inventor of this patent application has been observing the parsing effects of Berkeley Parser and Stanford Parser online for a long time, and found that these two natural language syntactic analysis devices are effective in English sentences. men who were appointed did't other the liberals wasn't remarked up by the press. The analysis results from January 2014 to the filing date of this patent application-March 22, 2019 have been wrong! This sentence was given by the linguist David R. Dowty in a linguistic monograph written by him. There are no grammatical and logical errors in this sentence, and it is fully in line with the expression habits of written English. The results given by Berkeley Parser and Stanford Parser are exactly the same, and the results are as follows: [See Figure 1]

①That men didn't bother;

②who were appointed;

③the liberals wasn't remarked up by the press.

Among them, ① is the main sentence, which is the core sentence of the whole sentence; ③ is the object of ①, that is, the object clause; ② is the attributive clause, which modifies men; That is the qualifier, which modifies men. In the result, That modifies men is wrong, That as a qualifier cannot modify the plural of a noun; the liberals wasn't remarked is wrong, and the singular and plural of the subject and the predicate are not properly matched.

The correct result of this sentence should be: wasn't remarked up by the press is the core sentence of the whole sentence, that is, the core subject-predicate collocation of the whole sentence; that men didn't other the liberals is the subject in the core sentence, that is , The subject clause in the core sentence; who were appointed is the attributive clause, which modifies men. That in this sentence should be parsed as a subordinate conjunction of the leading subject clause. In English, unless the subject clause is surrounded by left and right quotation marks, the subordinating conjunction that that leads the subject clause cannot be omitted, even in spoken language.

There is also: As of the filing date of this patent application-March 22, 2019, Berkeley Parser and Stanford Parser's online analysis results for the English sentence "That something you learned is wrong is known to the public." are also wrong! There are no grammatical and logical errors in this sentence, and it is fully in line with the expression habits of written English. The results given by Berkeley Parser and Stanford Parser are exactly the same, and the results are as follows: [See Figure 2]

① That something is known to the public;

②you learned is wrong.

Among them, ① is the core sentence of the whole sentence, which is the core subject-predicate collocation of the whole sentence; ② is the attributive clause, which modifies the indefinite pronoun something; That is the qualifier, which modifies something. In the result, That modifies something is wrong. As an indefinite pronoun, something cannot be modified by any qualifier, and of course it cannot be modified by the qualifier that. learned and is wrong cannot be classified under the same verb phrase, learned and is wrong are two different predicates that belong to two clauses respectively.

The correct result of this sentence should be: is known to the public is the core sentence of the whole sentence, that is, the core subject-verb collocation of the whole sentence; That something is wrong is the subject in the core sentence, that is, the subject clause in the core sentence ; That is the subordinate conjunction that leads the subject clause; you learned is the attributive clause, which modifies something.

The common syntactic structure feature of the aforementioned two sentences is that each sentence has a subject clause guided by the subordinating conjunction that, and both have an attributive clause that can be regarded as being inserted into the aforementioned subject clause in a way of overall insertion. From the perspective of English linguistics, all English sentences with the above-mentioned syntactic structure characteristics will often be parsed by Berkeley Parser and Stanford Parser with serious errors!

In the subsequent example operation part, the inventor of this patent application will give the following mathematical model, which is denoted as the Q model. The aforementioned two sentences are sentences that conform to the Q model. The specific meaning of the Q model will be explained in the subsequent example operations.

Suppose S is an English sentence, and there are at least the following three subject-predicate collocations in S (represented by 6-element functions):

f(c ₁ ,l ₁ ,x ₁ ,r ₁ ,y ₁ ,z ₁ );

g(c ₂ ,l ₂ ,x ₂ ,r ₂ ,y ₂ ,z ₂ );

h(c ₃ ,l ₃ ,x ₃ ,r ₃ ,y ₃ ,z ₃ ).

Note: 1, 2, and 3 as the subscripts of the independent variables are just for distinguishing each other, and do not represent the actual sequence meaning.

f, g, h meet the following three conditions:

①l ₂ ＝that;

②f(c ₁ ,l ₁ ,g(c ₂ ,l ₂ ,x ₂ ,r ₂ ,y ₂ ,z ₂ ),r ₁ ,y ₁ ,z ₁ );

③g[h(c ₃ ,l ₃ ,x ₃ ,r ₃ ,y ₃ ,z ₃ )].

There are many example sentences that Berkeley Parser and Stanford Parser made incorrect syntactic analysis results, but due to the length of this patent application, the inventor cannot list them one by one. Only a few of them are listed as follows:

(1) That men who were appointed didn't bother the liberals wasn't remarked up by the press.

(2) That something you learned is wrong is known to the public.

(3) That something you learned is now outdated is known to the public.

(4) That men didn't bother the liberals wasn't remarked up by the press.

(5) That men didn't bother the liberal wasn't remarked up by the press.

(6) That men who were appointed othered the liberals wasn't remarked up by the press.

(7) That men who were appointed didn't bother the liberal wasn't remarked up by the press.

(8) That men who were appointed didn't other the liberals was remarked up by the press.

(9) That officials who were appointed didn't bother the liberals wasn't remarked up by the press.

(10) That officials who were appointed didn't the liberals was remarked up by the press.

(11) That men didn't think the liberals othered the students wasn't remarked up by the press.

(12), That men didn't think the liberal othered the students wasn't remarked up by the press.

(13) That men didn't think the liberals othered the students was remarked up by the press.

(14) That men didn't think the liberals othered the students who studied hard wasn't remarked upon by the press.

(15), That men thought the liberals othered the students wasn't remarked up by the press.

(16) That men thought the liberals othered the students was remarked up by the press.

(17) That officials didn't think the liberals othered the students wasn't remarked up by the press.

(18), That officials didn't think the students othered the liberals wasn't remarked up by the press.

(19) That officials thought the liberals othered the students who studied hard wasn't remarked up by the press.

(20), That men thought the liberals didn't both the musicians who worked hard was remarked up by the press.

(21) That men thought the liberals didn't other the diplomas who worked hard was remarked up by the press.

(22), That boys thought the liberals didn't other the musicians who worked hard was remarked up by the press.

(23) That girls thought the liberals didn't bother the musicians who worked hard was remarked up by the press.

(24) That men didn't bother the boys who studied hard wasn't remarked up by the press.

(25), That men didn't bother the boys who studied hard was remarked up by the press.

(26) That men didn't bother the students who studied hard wasn't remarked up by the press.

(27), That men didn't bother the students who studied hard was remarked up by the press.

(28) That men bother the officials who were appointed wasn't remarked up by the press.

(29) That men bother the officials who were appointed was remarked up by the press.

(30), That food which the company provided to the school attracted the attention of the public wasn't remarked up by the press.

(31), That money which the company provided to the school attracted the attention of the public wasn't remarked upon by the press.

(32), That Jobs which the company provided to the college attracted the attention of the public wasn't marked up by the press.

(33) That food which the company provided to the school attracted the attention of the public was remarked up by the press.

(34) That money which the company provided to the school attracted the attention of the public was remarked up by the press.

(35) That Jobs, which the company provided to the college attracted the attention of the public was remarked upon by the press.

(36), That something you learned about America's ancient history is wrong is likely.

(37) That something about America's ancient history is wrong is likely.

(38), That something Tom learned about America's ancient history is wrong is known to his classmates.

(39), That nuclear war would be madness does not mean that it will not happen.

(40), That near all behavior is learned behavior is a basic assumption that has been put forward by the social scientists.

(41), I don't know whether that girls are well protected represents something good.

(42), I don't know whether that girls are well protected represent good men.

(43), I can understand what that food should be conservative indicators.

(44), I can understand what that water should be conserved indicators.

(45), That what you learned is wrong is known to the public.

(46), That what you learned is now outdated is known to the public.

(47), What that women are amicably treated indicators is not clear.

(48) That what made the students happy didn't both the teachers wasn't remarked up by the press.

(49) That what made the students happy othered the teachers wasn't remarked up by the press.

(50) That what made the students happy othered the teachers was remarked up by the press.

As of the filing date of this patent application-March 22, 2019, the syntactic analysis results given by Berkeley Parser and Stanford Parser on the above sentence are still wrong! There are no grammatical and logical errors in the above sentences, and they are completely in line with the expression habits of written English. Each of the above sentences contains the subject clause guided by that, in which that is a subordinate conjunction (the lexical label is IN); and Berkeley Parser and Stanford Parser misinterpret that in the above sentence as a qualifier (the lexical label is DT). From the perspective of English linguistics, subordinating conjunctions and qualifiers are two different parts of speech with completely different syntactic functions. The differences are very large. Therefore, the aforementioned error is a serious error. In addition to the aforementioned errors, there are many other errors in the above sentence, not to list them all. In the example operation part of this patent application, some of the sentences mentioned above will also be used.

In addition, let’s look at two difficult sentences, as shown below. These two sentences are given by the linguist David R. Dowty in a linguistics monograph written by him:

(51) That that men were appointed didn't bother the liberals wasn't remarked up by the press.

(52) That that that men were appointed didn't bother the liberals wasn't remarked upon by the press upset many women.

There are no grammatical and logical errors in these two sentences, and they are fully in line with the expression habits of written English. Both of these two sentences contain the subject clause guided by that, in which that is a subordinate conjunction (the morphological label is IN); and Berkeley Parser and Stanford Parser have serious errors in the analysis of that in the above two sentences. In the example operation part of this patent application, the above two sentences will be used. In particular, it is pointed out that the above sentences (1)-(52), all of which can use the scheme of this patent application to obtain the correct syntactic analysis result.

The inventor of this patent application used a set of syntactic parsers developed in China to compare with Berkeley Parser and Stanford Parser. This set of syntactic parser developed in China uses the lexical PCFG method, which has the same technical principles as Berkeley Parser and Stanford Parser, and the parsing effect is very similar. Using this set of syntactic parsers developed in China, the inventor of this patent application has done the following syntactic analysis experiment: For the example sentence "That men who were appointed did not have the liberals wasn't remarked up by the press." , The lexical analysis result of this example sentence is limited to That/INmen/NNSwho/WPwere/VBDappointed/VBNdid/VBDn't/RBbother/VBthe/DTliberals/NNSwas/VBDn't/RBremarked /VBNupon/RPby/IN the/DTpress/NN./. This is a lexical analysis result that can be considered correct in English linguistics. It is required to provide 1000 syntactic analysis results with the highest probability, and combine the aforementioned The results are arranged in descending order of probability, and finally the syntactic analysis result with the highest ranking 74th is the result that can be considered correct in English linguistics. The results before the 74th ranking are all incorrect. Also for the aforementioned example sentence, the lexical analysis result of this example sentence is limited to That/IN men/NNS who/WPwere/VBDappointed/VBNdid/VBDn't/RBbother/VBthe/DTliberals/NNSwas/VBD n't/RBremarked/VBNupon/INby/INthe/DTpress/NN./. This is also a lexical analysis result that can be considered correct in English linguistics. It requires 1,000 syntactic analysis with the highest probability As a result, the results are arranged in descending order of probability. Finally, the result of syntactic analysis with the highest ranking 52nd is a result that can be considered correct in English linguistics. The results before the 52nd ranking are all incorrect.

Another example: For the example sentence "That something you learned is wrong is known to the public.", the lexical analysis result of this example sentence is limited to That/INsomething/NN you/PRP learned/VBDis/VBZ wrong/JJis/VBZ known /VBN to/TO the/DT public/NN./. This is a lexical analysis result that can be considered correct in English linguistics. It is required to provide 1000 syntactic analysis results with the highest probability, and the results are increased according to the probability To the small arrangement. In the end, the result of syntactic analysis with the highest ranking 52 is a result that can be considered correct in English linguistics. The results before the 52nd ranking are all incorrect.

It can be seen that using the aforementioned syntactic parser developed in China, the probability of the correct syntactic analysis result obtained by restricting the correct lexical analysis result for the aforementioned two example sentences is very low, ranking after 50. The inventor of this patent application has done a lot of experiments on many sentences that are similar in syntax to the aforementioned two example sentences, and the correct syntactic analysis results obtained are similar to the situation of the aforementioned two example sentences, often with very low probability rankings. result.

Based on the aforementioned comparative study, the inventor of this patent application has reason to believe that if Berkeley Parser and Stanford Parser are used to analyze the aforementioned two example sentences according to the correct lexical notation given above, the result will be similar to the syntactic analysis developed in China. The results obtained by the analyzer are similar, that is, the probability of correct syntactic analysis results is relatively low. If you want to correct the correct syntactic analysis results of the aforementioned two example sentences by slightly adjusting statistical models and parameters within the existing theoretical and technical framework, it is difficult to do; and once the statistics are adjusted significantly Models and parameters will be at the cost of losing many of the current excellent performance. For example, after greatly adjusting the statistical model and parameters, the syntactic parser is likely to make mistakes in the current sentences that can analyze the correct results, or make Sentences that can currently output results are not output.

In summary, the inventor of this patent application believes that the above-mentioned first type of technical vulnerabilities are likely to be the technical blind spots and blind spots of Berkeley Parser and Stanford Parser, as well as current PCFG methods (including lexical PCFG methods). ) The theoretical and technical bottleneck. For the PCFG method (including the lexicalized PCFG method), it is difficult to completely break through the bottleneck within its existing theoretical and technical framework. Imagine this: If you select a series of sentences with the characteristics of that leading subject clause, etc. as the corpus, construct a characteristic corpus, and then use a syntactic parser developed based on the PCFG method (including the lexicalized PCFG method) Analyze each sentence in the corpus, for example: use Berkeley Parser and Stanford Parser for analysis, then the recall rate will be very low.

The second type of technical vulnerabilities: please see the following sentence:

(1), Jack met the patient the nurse the clinic had hired sent to the doctor.

(2), This is the malt the rat the cat the dog worried killed ate.

(3), Jack met the boy the nurse the doctor the clinic had hired sent to the ward introduced to the patient.

(4), Jack met the boy the patient introduced to the nurse the doctor the clinic had hired sent to the ward.

(5), Jack met the boy the patient took to the ward the doctor the clinic had hired sent the nurse to.

(6), Jack ate the food the patient the nurse the clinic had hired sent to the doctor took to the ward.

(7), Jack ate the food the patient took to the nurse the doctor the clinic had hired sent to the ward.

(8), Jack ate the food the patient took to the ward the doctor the clinic had hired sent the nurse to.

(9) That men the nurse the doctor the clinic had hired sent to the ward introduced to the cleaners didn't bother the patients wasn't marked up by the press.

(10) That men the cleaner introduced to the nurses the doctor the clinic had hired sent to the ward didn't bother the patients wasn't marked up by the press.

(11) That men the nurse sent to the ward introduced to the cleaners didn't other the patients wasn't remarked up by the press.

(12) That men the cleaner introduced to the nurses the doctor sent to the ward didn't the other the patients wasn't remarked up by the press.

The first sentence above was given by the linguist David R. Dowty in a linguistic monograph written by him; the second sentence was extracted from an English poem by the linguist. There are no grammatical and logical errors in the above 12 sentences. The above 12 sentences all contain the omission of the clause guiding words; in English, the clause guiding words are not arbitrarily omitted, and the omission of the clause guiding words must meet the grammatical requirements; the above 12 sentences contain the omission of the clause guiding words , All meet the requirements of English grammar. As of the filing date of this patent application-March 22, 2019, the syntactic analysis results given by Berkeley Parser and Stanford Parser on the above 12 sentences are still wrong!

The above 12 sentences all cleverly show the features of deep recursive nesting of sentences, and are flexibly integrated into the syntax rules of some omission clause guide words in English; on this basis, the 9th to the 12th sentences It further incorporates the characteristics of the leading subject clause of that mentioned above, and the 9th to 12th sentences all conform to the Q model mentioned above. It is true that the analysis of the syntactic structure contained in the above sentence is very difficult. It is not appropriate to demand that the computer can perfectly reach the level of human intelligence at this stage, but the problem is objective. There are many similar sentences, so I won’t list them all. In particular, it is pointed out that all the above sentences (1)-(12) can use the scheme of this patent application to obtain correct syntactic analysis results.

Based on a large number of comparative experiments, the inventor of this patent application believes that after the first type of technical vulnerabilities, the second type of technical vulnerabilities are likely to be another technical blind spot and blind spot of Berkeley Parser and Stanford Parser. It is also another theoretical and technical bottleneck of the current PCFG method (including the lexicalized PCFG method). For the PCFG method (including the lexicalized PCFG method), it is difficult to completely break through this bottleneck within the existing theoretical and technical framework. Due to space limitations, I will not elaborate too much.

The third type of technical vulnerabilities: please see the following sentence:

①Part of the reason why Charles Dickens loved his own novel was that it was rather closely modeled on his own life.

②Part of the reason Charles Dickens loved his own novel was that it was rather closely modeled on his own life.

As of the filing date of this patent application-March 22, 2019, Berkeley Parser and Stanford Parser gave the syntactic analysis results of the first sentence above are correct, and the syntactic analysis results given in the second sentence above are all Incorrect!

The basic frameworks of the syntactic structure of the above two sentences are equivalent. The only difference is: the first sentence retains the relative adverb why of the introductory attributive clause, while the second sentence omits the relative adverb why of the introductory attributive clause, which is omitted here. Fully conforms to English syntax rules. For the sentences that are equivalent to the basic frameworks of the two syntactic structures, Berkeley Parser and Stanford Parser parsed two completely different results. This shows that the PCFG method (including the lexicalized PCFG method) on which Berkeley Parser and Stanford Parser and the aforementioned two parsers are based, does not effectively distinguish the primary and secondary relationships between the various language components within the sentence , The processing is not in place, so there will be errors in parsing roughly equivalent minor changes in syntactic structure.

During long-term observation, the inventor of this patent application often encountered unstable analysis results similar to the above sentence. Sometimes, even if you change a simple adverb in the original sentence that is insignificant in the syntactic structure, the two results of Berkeley Parser and Stanford Parser will be greatly changed. There are many similar sentences, so I won’t list them all. In particular, it is pointed out that all of the above two sentences can use the scheme of this patent application to obtain correct syntactic analysis results.

Reflection and summary of the aforementioned three types of technical vulnerabilities:

The inventor of this patent application believes that the aforementioned three types of technical vulnerabilities are serious technical hidden dangers of Berkeley Parser and Stanford Parser, and also expose serious theoretical defects of the PCFG method (including the lexicalized PCFG method). The reasons for the aforementioned three types of technical vulnerabilities are likely to be as follows:

[1]. The randomness of the corpus conflicts with some basic syntactic functions and definitions inherent in natural language itself.

From a statistical point of view, in any English corpus, the probability of a clause acting as the subject of a sentence is usually far less than the probability of a noun acting as the subject of a sentence; but from the perspective of natural language, clauses can act as the subject of a sentence, and nouns can act as the subject of a sentence. Sentence subject is a basic syntactic function defined by English itself, and both are a possibility defined by English itself. Therefore, the probability of the two in linguistic theory is equal. Further, from a statistical point of view, in any English corpus, the probability that the subject clause guided by that acts as the subject of a sentence is usually much less than the probability that a noun acts as the subject of the sentence; but from the perspective of natural language, that guided by that The subject clause can act as the subject of the sentence, which is also a basic syntactic function derived from the definition of English itself, and it is also a possibility of the definition of English itself. Therefore, the subject clause guided by that acts as the subject of the sentence and the noun acts as the subject of the sentence. The probability difference in linguistic theory is much smaller than the probability difference reflected in the English corpus. As a result, there is a conflict between the randomness of the corpus and some basic syntactic functions and definitions inherent in natural language itself.

[2]. For some important structural features of natural language, the PCFG method (including the lexicalized PCFG method) has insufficient countermeasures and is not in place.

For the distinction between the main component and the modifier in the sentence, the distinction between the main structure and the secondary structure in the sentence, the description of the discrete long-distance correlation situation and the deep recursive nesting situation, etc., these problems It is an important issue related to the structural characteristics of natural language. The PCFG method (including the lexicalized PCFG method), which is the technical principle of Berkeley Parser and Stanford Parser, has insufficient countermeasures against the above-mentioned problems, and there are some points that cannot be taken into account.

[3] The lexical analysis and syntactic analysis of natural language should be mutually constrained, but in actual natural language processing (NLP) projects, lexical analysis and syntactic analysis are separated into two independent parts.

Lexical analysis, syntactic analysis and semantic analysis are in a relationship of mutual reference and mutual restraint. However, in actual natural language processing projects, lexical analysis, syntactic analysis, and semantic analysis are usually carried out independently of each other, and lexical analysis is done independently without relying on syntactic analysis. This arrangement mainly considers the computational complexity and model complexity in natural language processing engineering. However, this arrangement is likely to seriously affect the accuracy of the syntactic analysis results, that is, if the computer makes a misjudgment in the lexical analysis link, then this misjudgment cannot be corrected at all in the other links of the syntactic analysis that will be performed next. And constraints, which have a negative impact on the accuracy of the syntactic analysis results.

Summary of the invention

In view of this, the purpose of the present invention is to provide a natural language syntactic analysis method, including:

S1. Read the sentence data structure to be parsed, and perform preprocessing operations on the sentence data structure to be parsed;

S2. For each word list (i), read the sentence data structure to be parsed after the aforementioned preprocessing: if there is a predicate verb unit in the sentence to be parsed, then generate a word list (ii); There is no predicate verb unit in the sentence, then the sentence is analyzed by the method of probability combined with syntactic rules or the dependency analysis method, and the result of the aforementioned analysis is used as the final analysis result of the computer, and then the corresponding word list is cleared (i) And does not generate a word list (ii);

S3. For each predicate element, generate a corresponding predicate vector; the predicate vector includes a parallel guide element, a subordinate guide element, a subject element, a predicate element, a first position object element, and a second position object element;

Wherein, the predicate element is the corresponding predicate verb unit, or the corresponding adjacent predicate verb combination unit; the predicate element number is the corresponding predicate verb unit number, or the corresponding adjacent predicate verb combination unit number ；

Wherein, the possible value of the coordinate introductory element is one of the coordinate related word units used to connect sentences with a number less than the corresponding predicate element number, or an empty unit; the coordinate related word unit that is not used to connect sentences cannot be used as a coordinate introductory The possible values of the element;

Wherein, the possible value of the subordinate introductory element is one of the subordinate related word units whose number is smaller than the corresponding predicate element number, or one of the adjacent and juxtaposed subordinate related word combination units whose number is smaller than the corresponding predicate element number, or the number is smaller than One of the interrogative unit of the corresponding predicate element number, or one of the adjacent interrogative combination units with a number smaller than the corresponding predicate element number, or an empty unit;

Wherein, the possible value of the subject element is one of the basic noun units whose number is less than the corresponding predicate element number, or one of the adjacent and parallel basic noun combination units whose number is less than the corresponding predicate element number, or the number is less than the corresponding One of the infinitive vectors corresponding to the infinitive element of the predicate element number, or a gerund whose number is less than the corresponding predicate element number-the gerund corresponding to the present participle element-one of the present participle vectors, or one of the corresponding predicate element numbers One of the predicate vectors corresponding to the predicate element, or an empty unit;

Wherein, the possible value of the object element in the first position is one of the basic noun units whose number is greater than the number of the corresponding predicate element and less than the number of the first predicate element that appears after the predicate element, or the number is greater than the corresponding predicate element The element number is less than one of the adjacent basic noun combination units of the first predicate element number that appears after the predicate element, or the number is greater than the corresponding predicate element number and less than the first predicate element that appears after the predicate element. One of the infinitive vectors corresponding to the infinitive element of a predicate element number, or a gerund whose number is greater than the number of the corresponding predicate element and less than the number of the first predicate element that appears after the predicate element-the verb corresponding to the present participle element Noun-one of the present participle vectors, or one of the predicate vectors corresponding to the predicate element with a larger number than the corresponding predicate element, or an empty unit; the predicative component corresponding to the predicate element that meets the aforementioned requirements is also regarded as the first position object element deal with;

Among them, if the corresponding predicate element is a unit composed of a verb that can accept a double object or a verb that can accept an object combined with an object complement, and the corresponding object element in the first position is a basic noun unit or an adjacent basic noun Combination unit, then the possible value of the object element at the second position is one of the basic noun units with a number greater than the number of the corresponding object element at the first position and less than the number of the first predicate element that appears after the predicate element, or One of the adjacent basic noun combination units whose number is greater than the number of the corresponding object element in the first position and less than the number of the first predicate element that appears after the predicate element, or corresponds to the predicate element whose number is greater than the corresponding predicate element One of the predicate vectors of, or an empty unit; if the corresponding predicate element is a unit composed of a verb that can accept a double object or a verb that can be combined with an object complement, and the corresponding object element in the first position is neither a basic noun If the unit is not an adjacent basic noun combination unit, then the value of the object element in the second position is an empty unit; if the corresponding predicate element is a verb that is complemented by neither a double object nor an unacceptable object combined with an object The possible value of the object element in the second position is an empty unit; among them, the verb of the double-object can be accessed or the verb of the complementary object combined with the object complement and the unacceptable double-object Verbs that cannot accept an object combined with an object complement can be summarized and given in advance by querying a dictionary or statistically; define the verbs that can accept double objects or the verbs that can accept an object combined with the object complement and the said both. Verbs that cannot accept double objects and cannot accept an object combined with an object complement will help reduce the complexity of calculations;

S4. For each infinitive element, generate a corresponding infinitive vector; for each gerund-present participle element, generate a corresponding gerund-present participle vector; for each past participle element, generate a corresponding past participle vector; For each preposition element, a corresponding preposition vector is generated; according to the possible values of the infinitive element, the infinitive first-position object element, and the infinitive second-position object element, obtain the infinitive vector corresponding to each infinitive element All possible values of; According to the possible values of the gerund-present participle element, gerund-present participle object element in the first position, gerund-present participle object element in the second position, obtain each gerund-present participle The gerund corresponding to the element-all possible values of the present participle vector; according to the possible values of the past participle element and the past participle object element, all possible values of the past participle vector corresponding to each past participle element are obtained; State the possible values of preposition elements and preposition object elements, and obtain all possible values of the preposition vector corresponding to each preposition element;

Wherein, the infinitive vector includes infinitive elements, infinitive first-position object elements, and infinitive second-position object elements;

The infinitive element is the corresponding infinitive verb unit, or the corresponding adjacent infinitive verb combination unit; the infinitive element number is the corresponding infinitive verb unit number, or the corresponding adjacent infinitive infinitive Verb combination unit number;

The possible value of the object element in the first position of the infinitive is one of the basic noun units whose number is greater than the number of the corresponding infinitive element and less than the number of the first predicate element that appears after the infinitive element, or the number is greater than the corresponding The number of the infinitive element of and is less than one of the adjacent basic noun combination units of the first predicate element number that appears after the infinitive element, or the number is greater than the number of the corresponding infinitive element and less than the number of the infinitive element One of the infinitive vectors corresponding to the infinitive element of the first predicate element number that appears after the element, or one of the infinitive vectors whose number is greater than the corresponding infinitive element number and less than the number of the first predicate element that appears after the infinitive element Noun-the gerund corresponding to the present participle element-one of the present participle vectors, or one of the predicate vectors corresponding to the predicate element with a larger number than the corresponding infinitive element, or an empty unit; the infinitive element corresponds to the predicative that meets the aforementioned requirements Component, also treated as an object element in the first position of the infinitive;

If the corresponding infinitive element is a unit composed of a verb that can accept a double object or a verb that can accept an object combined with an object complement, and the object element in the first position of the corresponding infinitive is a basic noun unit or an adjacent basic Noun combination unit, then the possible value of the object element in the second position of the infinitive is a basic number greater than the number of the object element in the first position of the corresponding infinitive and less than the number of the first predicate element that appears after the infinitive element One of the noun units, or one of the adjacent basic noun combination units whose number is greater than the number of the object element in the first position of the corresponding infinitive and less than the number of the first predicate element that appears after the infinitive element, or one of the corresponding basic noun combination units One of the predicate vectors corresponding to the predicate element with the larger number of the infinitive element, or an empty unit; if the corresponding infinitive element is a unit composed of a verb that can accept a double object or a verb that can accept an object combined with an object complement, and corresponds to The object element in the first position of the infinitive is neither a basic noun unit nor an adjacent basic noun combination unit, then the value of the object element in the second position of the infinitive is an empty unit; if the corresponding infinitive element is A unit composed of verbs that can neither accept a double object nor a non-acceptable object combined with an object complement, then the value of the object element in the second position of the infinitive is an empty unit; among them, the verb that can accept a double object or can The verbs that receive the object combined with the object complement and the verbs that can neither receive the double object nor the object combined with the object complement can be summarized and given in advance by querying the dictionary or statistics; define the said acceptable double object The verbs or the verbs that can accept the object and the object complement and the verbs that can not accept the double object or the unacceptable object and the object complement can help reduce the complexity of calculation;

Wherein, the gerund-present participle vector includes gerund-present participle element, gerund-present participle first position object element, gerund-present participle second position object element;

The gerund-present participle element is the corresponding gerund-present participle unit, or the corresponding adjacent gerund-present participle combination unit; the gerund-present participle element number is the corresponding gerund-present participle Unit number, or corresponding adjacent parallel gerund-present participle combination unit number;

The possible value of the object element in the first position of the gerund-present participle is a basic number greater than the number of the corresponding gerund-present participle element and less than the number of the first predicate element that appears after the gerund-present participle element One of the noun units, or one of the adjacent basic noun combination units whose numbers are greater than the corresponding gerund-present participle element number and less than the number of the first predicate element that appears after the gerund-present participle element, or One of the infinitive vectors corresponding to the infinitive element whose number is greater than the corresponding gerund-present participle element number and less than the first predicate element number that appears after the gerund-present participle element, or the number is greater than the corresponding gerund -The present participle element number is less than the gerund with the number of the first predicate element that appears after the present participle element-the gerund corresponding to the present participle element-one of the present participle vectors, or more than the corresponding gerund- One of the predicate vectors corresponding to the predicate element with the higher number of the present participle element, or an empty unit; the predicative component corresponding to the gerund-present participle element that meets the aforementioned requirements is also treated as the object element in the first position of the gerund-present participle;

If the corresponding gerund-present participle element is a unit composed of a verb that can accept a double object or a verb that can accept an object combined with an object complement, and the corresponding gerund-present participle first position object element is a basic noun unit or An adjacent basic noun combination unit, then the possible value of the object element in the second position of the gerund-present participle is that the number is greater than the number of the object element in the first position of the corresponding gerund-present participle and is smaller than the object element number in the first position of the gerund -One of the basic noun units of the first predicate element number that appears after the present participle element, or the number is greater than the corresponding gerund-number of the object element in the first position of the present participle and less than the number that appears after the gerund-present participle element One of the adjacent and juxtaposed basic noun combination units of the first predicate element number, or one of the predicate vectors corresponding to the predicate element with a larger number than the corresponding gerund-present participle element, or an empty unit; if the corresponding gerund- The present participle element is a unit composed of a verb that can accept a double object or a verb that can accept an object combined with an object complement, and the corresponding gerund-the object element in the first position of the present participle is neither a basic noun unit nor an adjacent juxtaposition The basic noun combination unit of, then the value of the object element in the second position of the gerund-present participle is the empty unit; if the corresponding gerund-present participle element is composed of both unacceptable double objects and unacceptable objects combined with object complements The unit of the verb constituted by the verb, then the value of the object element in the second position of the gerund-present participle is the empty unit; wherein the verb that can accept the double object or the verb of the object complement and the Verbs that can neither accept double objects nor accept objects combined with object complements can be summarized and given in advance by querying the dictionary or statistics; define the verbs that can accept double objects or the verbs that can accept objects combined with object complements Verbs and the mentioned verbs that can neither take double objects nor can take the object combined with the object complement will help reduce the complexity of calculation;

Wherein, the past participle vector includes past participle elements and past participle object elements;

The past participle element is the corresponding past participle unit, or the corresponding adjacent past participle combination unit; the past participle element number is the corresponding past participle unit number, or the corresponding adjacent past participle combination unit number ；

If the corresponding past participle element is a unit consisting of a verb that can accept a double object or a verb that can be combined with an object complement, then the possible value of the past participle object element is that the number is greater than the number of the corresponding past participle element and less than One of the basic noun units of the first predicate element number that appears after the past participle element, or the number greater than the corresponding past participle element number and less than the first predicate element number that appears after the past participle element One of the basic noun combination units that are adjacent to each other, or one of the predicate vectors corresponding to the predicate element with a larger number than the corresponding past participle element, or an empty unit; if the corresponding past participle element is composed of neither a double object nor an object Combining the unit composed of the verb of the object complement, then the value of the object element of the past participle is the empty unit; wherein, the verb that can be accessed by the double object or the verb that can be combined with the object complement and the verb of the object complement. Verbs that cannot accept a double object or an object combined with an object complement can be summarized and given in advance by querying a dictionary or statistics; define the verbs that can accept a double object or a verb that can accept an object combined with an object complement and The described verbs that can neither accept double objects nor accept objects combined with object complements help to reduce the complexity of calculation;

Wherein, the preposition vector includes a preposition element and a preposition object element;

The preposition element is a corresponding preposition unit, or a corresponding adjacent preposition combination unit; the preposition element number is a corresponding preposition unit number, or a corresponding adjacent preposition combination unit number;

The possible value of the preposition object element is the first basic noun unit whose number is greater than the number of the corresponding preposition element and appears after the preposition element, or the number is greater than the number of the corresponding preposition element and appears after the preposition element The first adjacent basic noun combination unit, or the first gerund-present participle vector whose number is greater than the corresponding preposition element number and appears after the preposition element, or the number is greater than the corresponding preposition element number and is The first infinitive vector that appears after the preposition element, or the preposition vector corresponding to the preposition element whose number is greater than the corresponding preposition element number and is adjacent to the number sequence of the preposition element number, or the preposition vector that is greater than the corresponding preposition element number One of the predicate vectors corresponding to the predicate element, or an empty unit;

S5. The infinitive vector, the gerund-present participle vector, the past participle vector and the preposition vector are collectively referred to as auxiliary vectors; for each auxiliary vector in the sentence to be parsed, any possible value corresponding to the auxiliary vector is selected. In this way, a set of possible values corresponding to all auxiliary vectors is obtained; the possible values corresponding to the aforementioned set of all auxiliary vectors are regarded as a set, which is called an auxiliary system;

S6. Given a standard backbone system arbitrarily, collocation with a corresponding auxiliary system; replace every element outside the excluded vector in each auxiliary vector in the aforementioned auxiliary system with the corresponding number; after replacing the number, check The auxiliary system; if the following unreasonable situation occurs in the auxiliary system, then the auxiliary system is removed; if the following unreasonable situation does not occur in the auxiliary system, then the auxiliary system is retained; the remaining auxiliary system The system is called the specification auxiliary system; the predicate vectors mentioned in the following all refer to the predicate vectors in the aforementioned canonical backbone system;

S6.1. If the same number or the same predicate vector or the same infinitive vector or the same gerund-present participle vector or the same preposition vector appears in two different auxiliary vectors, then the auxiliary system is unreasonable, Clear the auxiliary system;

S6.2. If the same number or the same predicate vector or the same infinitive vector or the same gerund-present participle vector appears in an auxiliary vector and in a predicate vector at the same time, then the auxiliary system is unreasonable and the auxiliary system is removed. system;

S6.3. If two numbers in reverse order appear in an auxiliary vector, then the auxiliary system is unreasonable, and the auxiliary system is cleared;

S6.4. Substituting any two auxiliary vectors that have elements between the two into the relationship, all of which are equivalently substituted; if there is a cross-substitution contradiction between the vectors, then the auxiliary system is unreasonable, and the auxiliary system is cleared; if If two numbers in reverse order appear after equal substitution, then the auxiliary system is unreasonable. Clear the auxiliary system;

S6.5. Substituting any auxiliary vector and any predicate vector that have elements between the two elements into the relationship, all of which are equivalently substituted; if there is a contradiction in the substitution between the vectors, then the auxiliary system is unreasonable, and the Auxiliary system; if there are two numbers in reverse order after equal substitution, then the auxiliary system is unreasonable, and the auxiliary system is cleared;

S6.6. After the inspection, restore to the original state before the inspection for use in subsequent operations;

S7. Generate residual noun system and A-B-C joint system;

S7.1. Given a canonical backbone system and a canonical auxiliary system corresponding to the canonical backbone system, the remaining basic noun units and adjacent parallel basic noun combinations that do not enter the aforementioned canonical backbone system and standard auxiliary system The whole unit is regarded as a set, which is called a residual noun system; each element in the residual noun system is called a residual noun element; the number of a residual noun element is the basic corresponding to the residual noun element The number of the noun unit or the basic noun combination unit; for each remaining noun element, a corresponding remaining noun vector is generated; the remaining noun vector includes only the remaining noun elements, that is, the remaining noun vector and the remaining noun elements are in one-to-one correspondence ；

S7.2. A normative backbone system, a normative auxiliary system and a residual noun system corresponding to each other in the manner described in S7.1 constitute an A-B-C joint system;

S8. For any given ABC joint system, perform the overall blanking operation for the ABC joint system; each slot can receive at most one vector in an overall blanking operation, or no vector, that is, no blanking operation ; Before the overall blanking operation, clear the empty unit; in the overall blanking operation, the vector that constructs a space and receives other vectors into the space is recorded as the received vector; the vector that inserts the space of other vectors is recorded as the inserted vector ；

S8.1. In the aforementioned ABC joint system, for each element in each vector that can be replaced by other vectors, all the corresponding vectors are used for equivalent substitution, regardless of whether the corresponding vector is a predicate vector or an auxiliary vector Vector; perform the aforementioned equal substitution until all the other vectors in each vector are replaced; after the aforementioned equal substitution, if a vector is substituted into another vector, then cancel the substitution into the other vector The original position of the vector in the ABC joint system, so that the two vectors after the aforementioned equal substitution operation are completely integrated; through equal substitution, all the original vectors in the ABC joint system are transformed into mutual differences. There is a new vector in which the elements are substituted; taking equal substitution as the limit, the vector in the ABC joint system before the equal substitution is called the I type vector, and the vector in the ABC joint system after the equal substitution It is called a type II vector; obviously, a certain type I vector and a certain type II vector can be the same vector, that is, a vector may not change before and after the equivalent substitution;

S8.2. Perform the first round of the overall blanking operation in the ABC joint system: take any type II vector ω as the receiving vector of the first round of the overall blanking operation; label each of the vectors ω one by one according to a predetermined direction The order value of an element; according to the order value that has been marked, the i-th element in the vector ω can be selected, and a unique space is constructed only on the first side of the element; after the space is created, any one that excludes the aforementioned vector ω The second type of vector μ outside is used as the insertion vector for the first round of the overall blanking operation; in the way of overall blanking, the vector μ is inserted into the space corresponding to the aforementioned i-th element, and then a new vector is generated. The generated vector is denoted as [ω] ⁱ +<μ; the vectors obtained through the overall blanking operation in the ABC joint system are collectively referred to as type III vectors; the order value of the overall blanking labeling in each round is limited to this Used in a round of overall plug-in process;

S8.3. Perform the second round of the overall blanking operation in the ABC joint system: take the type III vector [ω] ⁱ +<μ as the receiving vector of the second round of the overall blanking operation; according to the predetermined direction, the slave vector Each element from the first element on the first side in [ω] ⁱ +<μ to the first element on the second side inside the vector μ contained in the vector [ω] ⁱ +<μ is marked with an order value; vector The rest of the elements in [ω] ⁱ +<μ are not marked with the order value; according to the marked order value, the j-th element is taken, and only a unique space is constructed on the first side of the element; after the space is created, you can take any A type II vector ξ that has not been used in any previous steps is used as the insertion vector for the second round of the overall blanking operation; the vector ξ is inserted into the space corresponding to the j-th element in the overall blanking manner, and then a new , The newly generated vector is marked as [[ω] ⁱ \μ] ^j +<ξ; or

Take the type III vector [ω] ⁱ +<μ as the receiving vector for the second round of the overall blanking operation; label each element in the vector [ω] ⁱ +<μ according to the predetermined direction; Sequence value, any take the kth element in the vector [ω] ⁱ +<μ, and only construct a unique vacancy on the first side of the element; after creating a vacancy, take any second II that has not been used in any previous steps The class vector ξ is used as the insertion vector for the second round of the overall blanking operation; the vector ξ is inserted into the space corresponding to the k-th element in the overall blanking method, and then a new vector is generated, and the newly generated vector is recorded as ( [ω] ⁱ +<μ) ^k +<ξ; According to this method, the overall interpolation operation is performed. If the same result appears after the execution of S8.4, then the same result will be merged into one result, that is, the same merged vector Merge into a flat vector;

S8.4. In the aforementioned ABC joint system, the overall insertion operation given in S8.3 is repeatedly executed in the following way: take the newly generated vector obtained from the previous round of overall insertion operation as a new round of overall Insert the received vector of the null operation, and any type II vector that has not been used in any previous steps is used as the insertion vector of the new round of the overall null operation; repeat the overall insert operation until all the II types After all the vectors are inserted into the space, it is recorded as the exhaustion of all the insertion vectors, and a type III vector is obtained while all the vectors are inserted. The type III vector obtained while inserting the exhaustion into the vector is recorded as the combined vector; S8.3 Contains 2 types of overall blanking operation methods. For the selection of the overall blanking operation method in S8.3, the previous and subsequent steps should be consistent; arrange the type II vectors used in each round of the overall blanking operation in order, Until all the insertion vectors are exhausted, a blanking scheme corresponding to the ABC joint system is formed; the operations from S8.2 to S8.4 are repeated to exhaust every round of blanking operations involved in the blanking scheme Receiving the space corresponding to each element in the vector, that is, each combined vector involved in the exhaustive insertion scheme;

S8.5. Check the result generated by S8.4: replace with a number; if two numbers in reverse order appear in a combined vector, then the combined vector is unreasonable, clear the combined vector; if it does not appear in a combined vector If the number is reversed, the combined vector is reasonable, and the combined vector is retained;

S8.6. After converting all the type I vectors in the aforementioned ABC joint system into type II vectors, first replace each type II vector in the ABC joint system with corresponding numbers, and then execute the aforementioned The overall blanking operation; according to any given blanking scheme corresponding to the ABC joint system, in each round of the overall blanking operation, a blank is constructed on the first side of each element in the receiving vector, and then Start to filter reasonable gaps; compare the greater or less than relationship between the first number on the left or right side inserted into the vector and the adjacent number on the left or right corresponding to the gap to be filtered, and only select the number sequence to avoid occurrence Inversely, the space that is greater than or less than the relationship is regarded as a reasonable space, and the empty space is inserted, and the remaining space is regarded as an unreasonable space, and no space is inserted; if there is no reasonable space in the receiving vector, then the above-mentioned empty insertion scheme is unreasonable , End the blanking scheme, and replace other blanking schemes; using this method for optimization, the obtained combined vector can be directly recorded as a reasonable combined vector, without the need to reverse the numbering order;

S8.7. Use the principle of multiplication in combinatorics to exhaust all ABC joint systems corresponding to each word list (ii); further, by permuting and combining all type II vectors in each ABC joint system, exhaustive All the blanking schemes corresponding to each ABC joint system; further, the operations from S8.2 to S8.6 are repeated for each blanking scheme until all the stitching vectors corresponding to each blanking scheme are exhausted;

S8.8. Syntactic rule check: Use the syntactic rules of natural language, and use the method of probability combined with syntactic rules or dependency analysis method to check each reasonable combination vector and its corresponding ABC joint system; the aforementioned use Syntactic rules inspection should include the use of event object verbs and non-event object verbs; the event object verbs refer to verbs in natural language that can only use events as objects but not people or things as objects; The non-event object verbs refer to verbs in natural language that can only take people or things as objects, but not events; event object verbs and non-event object verbs can be summarized in advance by querying a dictionary or statistics Give

S8.9. While executing S8.8, repair the syntactic structure; the said syntactic structure repair uses the method of probability combined with syntactic rules or the method of dependency analysis to re-excavate the missing syntactic information, and repair the previous Defects in the obtained syntactic structure; this link can also be repaired through the syntactic structure, distinguishing and adjusting the primary and secondary status of each vector in the syntactic structure of the reserved ABC joint system;

S8.10. Residual noun check: use probability combined with syntactic rules or dependency analysis method to find reasonable residual nouns and unreasonable residual nouns, and discard the A-B-C joint system containing unreasonable residual nouns;

S9. Take the basic framework of the syntactic structure of the sentence to be parsed described by the several ABC joint systems retained by S8 as the standard, and use the method of probability combined with syntactic rules or the dependency analysis method to analyze the sentence to be parsed to obtain sufficient numbers Among the complete syntactic structures of, find the most suitable complete syntactic structure that meets the aforementioned criteria;

S10. Based on several complete syntactic structures generated by S9, using semantic processing methods to find the most suitable semantic relationship subject to the aforementioned syntactic structure constraints, and then take the aforementioned complete syntactic structure corresponding to the semantic relationship as the final Syntactic analysis results.

Preferably, step S1 includes:

S1.1. For the part of speech of each word in the sentence to be parsed, automatic computer analysis and labeling are performed to generate the result of lexical analysis;

S1.2. For natural language elements such as predicate verbs, basic noun phrases, basic adjective phrases, and basic adverb phrases in the sentence to be parsed, automatic computer analysis and labeling; for adjacent noun phrases and adjacent parallel noun phrases Natural language elements such as adjective phrases and adjacent adverb phrases are automatically analyzed and labeled by computer;

S1.3. Combine various adjacent part-of-speech units, and record the merged adjacent part-of-speech units as a corresponding part-of-speech unit;

S1.4. For the language information in the sentences to be parsed as described in S1.2 and S1.3, open a list of words and write them as word list (i); word list (i) includes words and word correspondences The attributes of the words, the position information of the words in the sentence, punctuation marks and their position in the sentence;

S1.5. For the various possible results of lexical analysis, use combinatorial mathematics related methods to generate multiple different word lists (i) to accommodate multiple structural ambiguities; for the multiple different words generated above List (i) is distinguished by different numbers; in the preprocessing operation, the restrictions on the lexical analysis results are relaxed, and multiple different lexical analysis results caused by structural ambiguities are passed through multiple different word lists ( i) Keep it and leave it to the subsequent syntactic analysis link and semantic processing link for identification and screening, that is, through the subsequent syntactic analysis link and semantic processing link, the various lexical analysis results are restricted, thereby increasing the final selection of the correct The possibility of lexical analysis results;

S1.6. For each word list (i), use the method of probability combined with syntactic rules or dependency analysis to check out special sentence patterns such as interrogative sentences, omission sentences, and inverted sentences, and perform corresponding morphological processing of their predicates , In order to deal with the subsequent steps;

S1.7. For each word list (i), remove adverb units, adjective units, adjacent adverb units, adjacent adjective units, interjection units, simple parentheses in non-sentence forms, and particle units , Adjacent juxtaposed particle units, adjacent juxtaposed qualifier units without structural ambiguity, mixed modifier units, impurity components in sentences waiting to be resolved; commas on both sides of non-sentence simple parentheses units waiting to be resolved are removed Contains minor punctuation marks.

Preferably, the step S2 includes:

S2.1. For each word list (i), read the sentence data structure that has been preprocessed to be parsed, and the sentence data structure that has been preprocessed includes the following information:

(1) Coordinate related word units used to connect sentences;

(2) The coordinate related word unit not used to connect sentences; the role of the coordinate related word unit not used to connect sentences is to connect various coordinate components within the sentence;

(3) Predicate verb unit, subordinate related word unit, basic noun unit, infinitive verb unit, gerund-present participle unit, past participle unit, preposition unit, adjacent predicate verb combination unit, adjacent parallel subordinate related words Combination unit, adjacent parallel basic noun combination unit, adjacent parallel infinitive verb combination unit, adjacent parallel gerund-present participle combination unit, adjacent parallel past participle combination unit, adjacent parallel preposition combination unit ；

(4) Interrogative unit, adjacent interrogative combination unit, and structurally ambiguous qualifier unit;

(5), including the parenthesis component of the predicate verb unit;

(6), the main punctuation marks;

S2.2. Generate a word list (ii) for the sentence data structure in the aforementioned S2.1; the word list (ii) includes the aforementioned words, the attributes corresponding to the aforementioned words, and the comparison of the aforementioned words according to the natural language sequence The numbers and main punctuation marks are marked in descending order of numbers.

Preferably, the step S3 includes:

S3.1. Obtain all the predicate vectors corresponding to each predicate element according to the possible values of the predicate element, the parallel guide element, the subordinate guide element, the subject element, the first position object element, and the second position object element Possible values; the predicate vector includes a parallel guide element, a subordinate guide element, a subject element, a predicate element, a first-position object element, and a second-position object element;

S3.2. For each predicate vector in the sentence to be parsed, choose any possible value corresponding to the predicate vector to obtain a set of possible values corresponding to the entire predicate vector; correspond to the aforementioned set of all predicate vectors The possible values of is arranged in a fixed order to form a matrix of n rows and 6 columns; the aforementioned matrix of n rows and 6 columns is called a backbone system;

S3.3. Replace every element outside of each predicate vector in any given backbone system with a corresponding number; after replacing the number, check the backbone system; if in the backbone system If the following unreasonable conditions occur, then the backbone system should be cleared; if the following unreasonable conditions do not occur in the backbone system, then the backbone system should be retained; the remaining backbone system is called the standardized backbone system:

S3.3.1. Check the aforementioned backbone system: compare the word list (ii), if there is a parallel related word unit or subordinate related word unit or adjacent parallel subordinate related word combination unit for connecting sentences that does not enter the main system, then the main The system is unreasonable, clear the backbone system;

S3.3.2. Check the aforementioned backbone system: If the same number or the same predicate vector or the same infinitive vector or the same gerund-present participle vector appears in two different predicate vectors, then the backbone system is unreasonable To clear the backbone system;

S3.3.3. Check the aforementioned backbone system: if there are two numbers in reverse order in a predicate vector, then the backbone system is unreasonable, and the backbone system is cleared;

S3.3.4. Check the aforementioned backbone system: replace any two predicate vectors with elements in the relationship between them, all of which are replaced by equal amounts; if there is a cross contradiction between the substitutions between the vectors, then the backbone system is unreasonable. Clear the backbone system; if two numbers in reverse order appear after equal substitutions, then the backbone system is unreasonable, and the backbone system is cleared;

S3.3.5. After the inspection, return to the original state before the inspection for use in subsequent operations.

Preferably, in the process of executing S3.2, the inspection program of S3.3 is executed synchronously to prevent the generation of unreasonable backbone systems.

Description of the drawings

Through the following description of the embodiments of the present invention with reference to the accompanying drawings, the above and other objectives, features, and advantages of the present invention will be more apparent, in the accompanying drawings:

Figure 1 is a screenshot of the wrong analysis result of the example sentence "That men who wereappointeddidn'tbother the liberals wasn'tremarkedupon by the press" made by Berkeley Parser;

Figure 2 is a screenshot of the wrong analysis result of the example sentence "Thatsomething you learned is wrong is known to the public." made by Berkeley Parser;

Fig. 3 is a schematic diagram of the first correct analysis result for the example sentences "That men who were appointed, didn't other, the liberals wasn't remarked, up by the press." provided by the present invention;

Fig. 4 is a schematic diagram of the second correct analysis result for the example sentences "That men who were appointed did not have the liberals wasn't remarked up by the press." provided by the present invention;

Fig. 5 is a schematic diagram of the correct analysis result of the example sentence "That something you learned is wrong is known to the public." provided by the present invention;

Figure 6 is a screenshot of the wrong parsing result of the example sentence "That that men were appointed didn't other the liberals wasn't remarked up by the press." made by Berkeley Parser;

Figure 7 is a screenshot of the wrong parsing result of the example sentence "That that that men were appointed didn't other the liberals wasn't remarked up by the press upset many women." by Berkeley Parser;

FIG. 8 is a schematic diagram of the correct analysis result of the example sentence "That that men were appointed did not't other the liberals wasn't remarked up by the press." provided by the present invention;

Fig. 9 is a schematic diagram of the correct analysis result of the example sentences "That that that men were appointed the liberals wasn't remarked upon by the press upset many women." provided by the present invention;

Figure 10 is the correct analysis result of the example sentence "Behaviorists suggest the child who is raised in an environment where there are many stimuli which develop his or her capacity for appropriate response response greater" provided by the present invention.

FIG. 11 is a schematic diagram of the correct analysis result of the example sentence "Believing that what he wants, Tom works hard in the company." provided by the present invention;

Figure 12 is a screenshot of the wrong analysis result of the example sentence "A study of travelers conducted by the website TripAdvisor names Yangshuo as one of the top 10destinations in the world." made by Berkeley Parser;

Figure 13 is a schematic diagram of the correct analysis result of the example sentence "A study of travelers conducted by the website TripAdvisor names Yangshuo as one of the top 10 destinations in the world." provided by the present invention;

Figure 14 is a screenshot of the wrong analysis result of the example sentence "That near all behavior is learned behavior is a basic assumption that has been put forward by the social scientists." made by Berkeley Parser;

Fig. 15 is a schematic diagram of the correct analysis result of the example sentence "That near all behavior is learned behavior is a basic assumption that has been put forward by the social scientists." provided by the present invention;

Figure 16 is a screenshot of the error analysis result of the example sentence "Jack met the patient the nurse" by Berkeley Parser; the clinic had hired sent to the doctor;

Figure 17 is a schematic diagram of the correct analysis result of the example sentence "Jack met the patient the nurse the clinic had hired to the doctor" provided by the present invention;

Figure 18 is a screenshot of the wrong analysis result of the example sentence "Jack met the boy the nurse had hired sent to the ward introduced to the patient." made by Berkeley Parser;

Figure 19 is a schematic diagram of the correct analysis result of the example sentences "Jack met the boy the nurse the doctor the clinic had hired sent to the ward introduced to the patient" provided by the present invention;

Figure 20 is a screenshot of the wrong analysis result of the example sentence "This is the malt the rat the cat the dog worried killed by Berkeley Parser";

FIG. 21 is a schematic diagram of the correct analysis result of the example sentence "This is the malt the rat the cat the dog worried killed ate." provided by the present invention;

Figure 22 is a screenshot of the wrong analysis result of the example sentence "Part of the reason Charles Dickens loved his own novel was that it was rather closely modeled on his own life." made by Berkeley Parser;

Figure 23 is a schematic diagram of the correct analysis result of the example sentence "Part of the reason Charles Dickens loved his own novel was closely modeled on his own life." provided by the present invention;

Figure 24 is a step diagram (1) of the first overall inserting method for Example 1;

Figure 25 is a step diagram (2) of the first overall inserting method for Example 1;

Figure 26 is a step diagram (3) of the first overall inserting method for Example 1;

Figure 27 is a step diagram (4) of the first overall inserting method for Example 1;

Figure 28 is the basic frame diagram of the syntactic structure described by the A ₁ -B ₁ -C ₁ joint system of Example 1;

Figure 29 is a step diagram (1) of the second method of overall insertion for example 1;

Figure 30 is a step diagram (2) of the second method of overall insertion for example 1;

FIG. 31 is a step diagram of the optimization method for the first and second overall interpolation methods of Example 1;

Figure 32 is the basic frame diagram of the syntactic structure described by the A ₁ -B ₁ -C ₁ joint system of Example 2;

Figure 33 is the basic frame diagram of the syntactic structure described by the A ₁ -B ₁ -C ₁ joint system of Example 3;

Figure 34 is the basic frame diagram of the syntactic structure described by the A ₁ -B ₁ -C ₁ joint system of Example 4;

Figure 35 is a five-wheel overall inserting operation diagram of Example 5;

Figure 36 is a basic frame diagram of the syntactic structure described by the A ₁ -B ₁ -C ₁ joint system of Example 6;

Figure 37 is an intuitive morphological diagram of the complete syntactic structure corresponding to the A _a -B _a -C _a joint system of Example 8;

Figure 38 is an intuitive morphological diagram of the complete syntactic structure corresponding to the A _b -B _b -C _b joint system of Example 8;

Figure 39 is a semantic relationship diagram of the syntactic structure constraint corresponding to the A _a -B _a -C _a joint system of Example 8;

Fig. 40 is a semantic relation diagram of syntactic structure constraints corresponding to the A _b -B _b -C _b joint system of Example 8;

Figure 41 is _a diagram of the overall insertion process of the complete syntax structure corresponding to the A _a -B _a -C _a joint system of Example 9;

Figure 42 is a diagram of the overall insertion process of the complete syntax structure corresponding to the A ₁ -B ₁ -C ₁ joint system of Example 10;

Figure 43 is a diagram of the overall insertion process of the complete syntax structure corresponding to the A ₁ -B ₁ -C ₁ joint system of Example 11;

Figure 44 is a diagram of the overall insertion process of the complete syntax structure corresponding to the A ₁ -B ₁ -C ₁ joint system of Example 17;

Figure 45 is a schematic diagram of the correct analysis result of the example sentence "That men the next the doctor the clinic had hired sent to the ward introduced to the cleaners didn't other the patients wasn't marked up by the press." provided by the present invention;

Figure 46 is a screenshot of the example sentence "That men the nurse the clinic had hired sent to the cleaners didn't bother the patients wasn't marked up by the press." by Berkeley Parser. ；

Figure 47 is a diagram of the overall insertion process of the complete syntax structure corresponding to the A ₁ -B ₁ -C ₁ joint system of Example 18;

Figure 48 is a schematic diagram of the correct analysis result of the example sentence "That men the cleaner introduced to the nurses the doctor the clinic had hired to the ward didn't other the patients wasn't marked up by the press." provided by the present invention;

Figure 49 is a screenshot of the example sentence "That men the cleaner introduced to the nurses the doctor the clinic had hired sent to the ward didn't bother the patients wasn't marked up by the press." taken by Berkeley Parser. ；

Figure 50 is a schematic diagram of all the links and algorithms included in the second calculation area (β area).

Specific implementation mode: Introduce some important definitions, which will be used in the following explanation:

The natural language for the following explanations, including but not limited to English language. The internal components of the sentence are divided into 4 categories: impurity components, main components, auxiliary components, and remaining noun components.

In the process of computer syntactic analysis, first, the adverb unit, adjective unit, interjection unit, particle unit, mixed modifier unit, adjacent adverb unit, and adjacent adjective unit are waiting to be analyzed for the impurities in the sentence Remove. Secondly, taking the predicate as the unit, each subject-predicate collocation (simple sentence) in the sentence to be parsed and its main components are processed into a predicate vector, and then all the predicate vectors form a matrix structure of n rows and 6 columns, as The backbone system. Thirdly, each auxiliary component such as infinitive structure, past participle structure, and preposition structure is processed into an auxiliary vector, and then all auxiliary vectors form a set as an auxiliary system. Finally, select a reasonable collocation from the possible main system and auxiliary system as the normative main system and normative auxiliary system, and then process each remaining noun component that cannot enter the normative main system and normative auxiliary system into a residual noun vector , And then all remaining noun vectors form a set as the remaining noun system.

Definition 1: Define +< as an ordered addition operation in mathematics: Let S be an English sentence to be parsed, and let a and b be two different words in the sentence S to be parsed. If (a, b) If +< is satisfied, then the number of word a in sentence S is less than the number of word b in sentence S, that is, a+<b means that the number of word a in sentence S is less than the number of word b in sentence S.

Definition 2: Let S be an English sentence to be parsed, and let f be any predicate vector in the English sentence S. Define 6 variables c, l, x, r, y, z related to the predicate vector f: record c as the coordinating guide element in the predicate vector f; record l as the subordinate guide element in the predicate vector f, Denote x as the subject element in the predicate vector f, r as the predicate element in the predicate vector f, y as the object element in the first position in the predicate vector f, and z as the second element in the predicate vector f Location object element. If c, l, x, r, y, z are regarded as 6 independent variables, then the predicate vector f can be regarded as a 6-ary function composed of the aforementioned 6 independent variables. Therefore, after removing the adverb unit, adjacent adverb unit, mixed modifier unit, interjection unit, particle unit and other impurity components in the predicate vector f, a 6-element function can be obtained that describes the main component of the predicate vector f Expression: f(c,l,x,r,y,z)=c+<l+<x+<r+<y+<z. It is also possible to use the representation method in mathematical set theory to record the aforementioned predicate vector f as a 6-element ordered group (c, l, x, r, y, z).

Definition 3: Suppose the aforementioned sentence S to be parsed has n predicates. According to the aforementioned definition, each predicate vector corresponding to n predicates is expressed in the form of a 6-element function, and the sentence S to be analyzed can be expressed as a matrix structure of n rows and 6 columns. If each independent variable in the matrix is assigned a specific value, that is, each predicate vector in the matrix is assigned a specific value, then the matrix also obtains a set of specific values accordingly. The set of specific values corresponding to the aforementioned matrix structure is called a backbone system of sentence S, which is also called an A system. As follows:

Definition 4: Define the 6 auxiliary vectors in the sentence. Suppose in the aforementioned sentence S to be analyzed: record the infinitive vector as g[To VB](u,v); record the gerund-present participle vector as g[VBG](u,v); record the past participle The vector is denoted as g[VBN](u,v); the preposition vector is denoted as g[PREP](u). For multiple auxiliary vectors of the same type that appear in the same sentence, they are distinguished by number marks, such as: g[To VB,1](u,v), g[To VB,2](u,v), ……, or g[VBG,1](u,v), g[VBG,2](u,v),……, or g[VBN,1](u), g[VBN,2](u ),……, or g[PREP,1](u), g[PREP,2](u),……. Among them, the independent variables u and v in each auxiliary vector respectively represent the first-position object element or the second-position object element or the object element named after the name of the auxiliary vector.

Special note: various forms that belong to the category of verb infinitives are expressed by g[To VB](u,v), for example: the forms expressed using computational linguistic symbols To VB, To VB VBN, To VB VBN VBN, To VB VBG, etc.; various forms that belong to the category of gerunds-present participles are expressed by g[VBG](u,v), for example: forms expressed using computational linguistic symbols VBG, VBG VBN, VBG VBN VBN and many more.

Definition 5: Record all auxiliary vectors as a set, which is called the auxiliary system of the sentence S to be parsed, also called the B system. As follows:

Note: The "number mark" in Definition 3, Definition 4 and Definition 5 is only used to distinguish and mark between multiple similar vectors. It is not the same concept as the "number" in the proposal of this application, so do not confuse it.

Definition 6: The aforementioned predicate vector, auxiliary vector, and the remaining noun vectors mentioned in the solution of this application are collectively referred to as language vectors. Any given two language vectors α and β, and α and β are not residual noun vectors, if the language vector β acts as the subject element of α or the object element in the first position or the object element in the second position or the infinitive in the language vector α One position object element or infinitive second position object element or gerund-present participle first position object element or gerund-present participle second position object element or past participle object element or preposition object element, then it is called language vector α It has a compound relationship with β, which is recorded as vector α compounding vector β, or vector β compounding vector α. The compound relationship between language vectors is also referred to as "element substitution relationship" in the solution of this application.

Two special notes: (i) Auxiliary vector has certain particularity. Usually the predicate vector is compounded with the auxiliary vector; but sometimes the other way around, the auxiliary vector is compounded with the predicate vector. In this regard, the solution of this application has done corresponding technical processing. (ii) The concept of overall interpolation between language vectors mentioned below is subject to the explanation of S8 of the solution of this application.

Take English as an example to illustrate a rule. The composition of sentences follows this rule: the main part of the syntactic structure of any complex sentence is based on the combination of multiple language vectors and the overall interpolation. It is composed of a combination. From the mathematical point of view of probability and statistics, the above rule is a deterministic event, which can be verified by performing statistics in a corpus, that is, in any English sentence sample space with a standard sentence as a sample, the above rule is complicated The probabilities of the sentence are all 1. The above-mentioned law is the source of the common long-distance related problems and deep recursive nesting problems in computer natural language processing, and is also an important starting point for the present invention to solve the technical problems.

In this patent application, based on the relevant natural laws of mathematics and computer science, comprehensive use of mathematics and computer science methods such as exhaustion, permutation and combination, comparison of natural numbers, excluding the reverse order of natural numbers, and probability calculations, establishes the mathematical model needed to solve the problem.

Example operation:

Example 1: That men who were appointed didn't bother the liberals wasn't remarked up by the press.

This example sentence is preprocessed to generate a word list (i-a) and a word list (i-b). Since the word that in the example sentence has structural ambiguity, that may be both a subordinate related word unit and a qualifier unit, so two word lists (i) are generated, and the two word lists (i) Give different marks.

When there is structural ambiguity in a sentence, multiple word lists (i) need to be drawn up for the sentence; the number of word lists (i) can be obtained according to the number of structural ambiguities, using the principle of multiplication in combinatorics. This example sentence also contains a structural ambiguity: "upon" may be both a particle and a preposition, but due to space limitations, it will not be analyzed specifically.

Word list (i-a):

Word list (i-b):

For the above word list (ia) and word list (ib), remove the adjective unit, adverb unit, adjacent adjective unit, adjacent adjacent adverb unit, non-sentence simple parenthesis unit, and particle unit , Adjacent parallel particle units, interjection units and other natural language elements as impurities, and then read the pre-processed sentence data structure to be parsed, and generate the corresponding word list (ii-a) and word list (ii -b), as shown below.

Word list (ii-a):

Word list (ii-b):

Next, this patent application takes the word list (i-a) and the corresponding word list (ii-a) as examples to analyze and explain:

This example sentence has 3 predicate verb units were appointed, didn't bother, wasn't remarked; it can be seen that this example sentence contains 3 predicate elements, which are recorded as r ₁ , r ₂ , and r ₃ in turn; furthermore, for these 3 Predicate elements to generate corresponding predicate vectors f ₁ , f ₂ , f ₃ ; the value of each element in the predicate vectors f ₁ , f ₂ , and f ₃ is as follows:

①For f ₁ :

R ₁ all the possible values of all referred to as {r _1}; S3 based on the information in the application program, it is _{clear: {r 1} = {were} appointed};

All possible values of c ₁ is the entire note {c _1}; S3 based on the information in the application program, can be _{obtained: {c 1} = {e} }.

All possible values for all note l ₁ {l _1}; S3 based on the information in the application program, can be _{obtained: {l 1} = {That} , who, e}.

All possible values of x ₁ is referred to all {x _1}; S3 based on the information in the application program, can be _{obtained: {x 1} = {men} , e}.

Y ₁ all the possible values of all referred to as {y _1}; S3 based on the information in the application program can be _{obtained: {y 1} = {f} 2, f 3, e}.

Z ₁ all the possible values of all referred to as {z _1}; unit while the current corresponding to the predicate element were appointed by the object can be accessed in conjunction with the verb object complement, but the predicate element position corresponding to a first object element , Is neither a basic noun unit nor an adjacent basic noun combination unit, then according to the information in the application plan S3, we can get: {z ₁ }={e}. Verbs that can accept double objects, such as: give, buy, sell, offer, etc.; can accept verbs that combine objects with object complements, such as: make, name, call, find, etc.; the aforementioned verbs can be searched in dictionaries or statistics Summarize and give in advance.

In the foregoing process, all possible values of each element in the predicate vector f ₁ have been generated. All possible values of the predicate vector f ₁ can be obtained by performing related calculations of combinatorial mathematics on all possible values of each element in f ₁ .

Similar to the aforementioned process of generating each element in the predicate vector f ₁ , there is the following process of generating each element in the predicate vector f ₂ and f ₃ :

②For f ₂ : {r ₂ }={didn't bother}; {c ₂ }={e}, {l ₂ }={That,who,e}, {x ₂ }={men,f ₁ ,e}, {y ₂ }={the liberals,f ₃ ,e}, {z ₂ }={e}.

③For f ₃ : {r ₃ }={wasn't remarked}; {c ₃ }={e}, {l ₃ }={That,who,e}, {x ₃ }={men,the liberals ,f ₁ ,f ₂ ,e}, {y ₃ }={the press,e}, {z ₃ }={e}.

On the basis of generating all possible values of each element in the predicate vectors f ₂ and f ₃ , all possible values of the predicate vectors f ₂ and f ₃ can be obtained by comparing each of f ₂ and f ₃ respectively. All possible values of an element are obtained by related calculations of combinatorial mathematics.

In summary, this example sentence has 3 predicate verb units, including 3 predicate elements, and for these 3 predicate elements, corresponding predicate vectors f ₁ , f ₂ , f _{3 are} generated; predicate vectors f ₁ , f ₂ , f The value of each element in ₃ is as follows:

①For f ₁ there are: {r ₁ }={were appointed}; {c ₁ }={e}, {l ₁ }={That,who,e}, {x ₁ }={men,e}, { y ₁ }={f ₂ ,f ₃ ,e}, {z ₁ }={e}.

After generating all possible values of each element in the predicate vectors f ₁ , f ₂ , and f ₃ , all possible values of each of the three predicate vectors can be obtained by comparing f ₁ , f ₂ , and f ₃ respectively All possible values of each element of is obtained by related calculations of combinatorial mathematics.

According to the information in the application plan S3.2, this example sentence has three predicate vectors, the main system of this example sentence should be composed of a matrix with 3 rows and 6 columns, and its abstract form is as follows:

A backbone system is also an A system. Denote the entire backbone system corresponding to this example as {A}; denote the cardinality of the set {A} as ∣A∣. The total of all possible values of the predicate vector f ₁ is recorded as the set {f ₁ }; the cardinality of the set {f ₁ } is recorded as ∣f ₁ ∣. The same treatment is adopted for other predicate vectors and elements. Then use the multiplication principle in combinatorics:

∣f ₁ ∣=∣c ₁ ∣×∣l ₁ ∣×∣x ₁ ∣×∣r ₁ ∣×∣y ₁ ∣×∣z ₁ ∣=1×3×2×1×3×1=18

∣f ₂ ∣=∣c ₂ ∣×∣l ₂ ∣×∣x ₂ ∣×∣r ₂ ∣×∣y ₂ ∣×∣z ₂ ∣=1×3×3×1×3×1=27

∣f ₃ ∣=∣c ₃ ∣×∣l ₃ ∣×∣x ₃ ∣×∣r ₃ ∣×∣y ₃ ∣×∣z ₃ ∣=1×3×5×1×2×1=30

Thus: ∣A∣=∣f ₁ ∣×∣f ₂ ∣×∣f ₃ ∣=18×27×30=14580, a total of 14580 backbone systems are generated.

The above process can be simplified according to claim 5 in the application solution, and the generation and checking of the backbone system can be executed simultaneously, thereby reducing the complexity of calculation.

In the backbone system generated above, that is, among the 14580 matrices with 3 rows and 6 columns generated above, 5 matrices are randomly selected, and the 5 matrices are determined according to the requirements from S3.3.1 to S3.3.4 in the application plan. Check. For ease of presentation, the inventor of this patent application directly replaced any of the five matrices previously selected with numbers, and the numbers correspond to the word list (ii-a). When replacing numbers, the empty cell e remains unchanged, as follows Shown.

The first matrix:

The second matrix:

The third matrix:

The fourth matrix:

The fifth matrix:

According to the requirements in the application scheme S3.3.1, check the first matrix mentioned above: the matrix omits the subordinate related word unit who, whose number is 3. The matrix is unreasonable, that is, the backbone system is unreasonable, clear the backbone system;

According to the requirements in the application plan S3.3.2, check the aforementioned second matrix: In this matrix, the same number 2 appears in two different predicate vectors f ₁ and f ₂ respectively. The matrix is unreasonable, namely The backbone system is unreasonable, clear the backbone system;

According to the requirements in the application plan S3.3.2, check the aforementioned third matrix: In this matrix, the same predicate vector f ₁ appears in two different predicate vectors f ₂ and f ₃ , and the matrix is unreasonable , That is, the backbone system is unreasonable, clear the backbone system;

According to the requirements in the application scheme S3.3.3, check the aforementioned second matrix again: in this matrix, two

numbers

3 and 2 appear in the predicate vector f ₁ in reverse order. The matrix is unreasonable, that is, the backbone system is not Reasonable, clear the backbone system; obviously, the second matrix violated the requirements in the application plan twice.

Application programs in accordance with the requirements of S3.3.4, the fourth check matrix: the matrix, occurs inside predicate vector f ₂ f _3, f ₃ and predicate vector f ₂ also appears inside, which makes it impossible to f ₂ = e+<1+<2+<5+<f ₃ +<e is substituted into f ₃ , and f ₃ ＝e+<e+<f ₂ +<7+<e+<e is substituted into f ₂ , which is a substitution cross contradiction. The matrix is unreasonable, that is, the backbone system is unreasonable, clear the backbone system.

According to the above requirements in the application plan, check the aforementioned fifth matrix: the fifth matrix does not violate any requirement in the application plan S3.3. Therefore, the aforementioned fifth matrix is a canonical backbone system, or a canonical A system. Among the 14,580 3-row and 6-column matrices generated above, there are other standardized backbone systems, which are not listed one by one. Record the fifth matrix mentioned above as the standard A ₁ system. Restore the standard A ₁ system to the following form:

According to S3.3.5 of the application plan, after the inspection, it is restored to the original state before the inspection for future use.

According to the information in application S4, this example sentence has only one preposition unit by, and for the preposition unit by, a corresponding auxiliary vector g[PREP](u) is generated.

According to the information in Application S4, it is obvious that:

g[PREP](u)=by+<(u): PREP=by, u={the press,e}.

All possible values of g[PREP](u)=by+<(u) are: set {by+<the press, by+<e}.

According to the information in the application plan S5, it can be seen that from the aforementioned set {by+<the press,by+<e}, two auxiliary systems can be obtained, that is, two B systems can be obtained, denoted as B ₁ system and B ₂ System; might as well set B ₁ ={g[PREP](u)=by+<the press}, B ₂ ={g[PREP](u)=by+<e}.

Now, given the aforementioned standard A ₁ system, the subsequent operations are consistent with the standard A ₁ system.

Replace the aforementioned B ₁ and B ₂ systems with numbers:

B ₁ ={g[PREP](u)=8+<9}, B ₂ ={g[PREP](u)=8+<e}.

After inspection, the B ₁ system and B ₂ system meet the requirements from S6.1 to S6.5 in the application plan. Since the structures of the B ₁ system and the B ₂ system are relatively simple, they are easy to verify and do not elaborate.

It can be seen that, given the aforementioned standard A ₁ system, the B ₁ system and the B ₂ system are both standard auxiliary systems. The B ₁ system and the B ₂ system can be further denoted as the standard B ₁ system and the standard B ₂ system.

According to S6.6 of the application plan, after the inspection, it is restored to the original state before the inspection for future use.

Generate C system and A-B-C joint system:

Combine the aforementioned canonical A ₁ system and canonical B ₁ system together, and no corresponding residual nouns are generated, then the corresponding residual noun system is recorded as

Combine the aforementioned canonical A ₁ system and canonical B ₂ system to produce a corresponding residual noun system, denoted as C ₂ system, C ₂ ={the press}.

So far, two ABC joint systems have been obtained: A ₁ -B ₁ -C ₁ joint system and A ₁ -B ₂ -C ₂ joint system.

Next, take the A ₁ -B ₁ -C ₁ joint system for the overall plug-in operation. The A ₁ -B ₁ -C ₁ combined system is as follows:

B ₁ ={g[PREP](u)=by+<the press};

The vectors in the above A ₁ -B ₁ -C ₁ joint system are all the type I vectors before the equivalent substitution. Through equivalent substitution, all the type I vectors in the A ₁ -B ₁ -C ₁ joint system are converted into type II vectors, as shown below:

B ₁ ={g[PREP](u)=by+<the press};

After clearing the empty cell e, all the type II vectors in the A ₁ -B ₁ -C ₁ joint system are as follows:

B ₁ ={g[PREP](u)=by+<the press};

The first method of overall insertion:

Next, start the overall plug-in operation, as shown in Figure 24. Take the two vectors shown in the figure as the receiving vector and the insertion vector of the first round of the overall nulling operation, and mark the receiving vector as ω and the insertion vector as μ. Take the right side as the first side, and label the order value of each element in the vector ω one by one from right to left. After labeling the order value, take the second element in the vector ω, and construct a unique space only on the right side of the element. Insert the vector μ into the space corresponding to the second element in the way of overall insertion, and then generate a new vector.

The aforementioned newly generated vector is shown below. This vector is a type III vector obtained through the overall blanking operation in the A ₁ -B ₁ -C ₁ joint system, and this newly generated vector is marked as [ω] ² +<μ, the first round of overall blanking The operation is complete.

That men didn’t other the liberals who were appointed wasn’t remarked

As shown in Figure 25, the newly generated vector [ω] ² +<μ is taken as the reception vector for the second round of the overall blanking operation. From the vector of [ω] ² + <[mu] the first element on the right side until the start vector [ω] ² + <first element of each element of the vector μ μ left inside contains up who, denoted by order value ; The rest of the elements in the vector [ω] ² +<μ are not marked with order values. Take the third element in the vector [ω] ² +<μ that has been marked with the order value, and only construct a unique space on the right side of the element. After making the empty space, take the preposition vector g[PREP](u)=by+<the press as the insertion vector for the second round of the overall empty insertion operation, and record the insertion vector as ξ. Insert the vector ξ into the space corresponding to the aforementioned third element in the way of overall insertion, and then generate a new vector.

The aforementioned newly generated vector is shown below. This vector is a type III vector obtained through the overall void operation in the A ₁ -B ₁ -C ₁ joint system, and it is also a combined vector. Denote this vector as [[ω] ² \μ] ³ +<ξ. That men didn't bother the liberals who by the press were appointed wasn't remarked

Replace the aforementioned merged vector with the number as follows. After inspection, the number in the reverse order appeared in the vector. Obviously the merged vector is unreasonable, so clear the merged vector.

1 2 5 6 3 8 9 4 7

Perform the first round of the overall insertion operation again, as shown in Figure 26. Take the two vectors ω and μ mentioned above, respectively, as the reception vector and the insertion vector of the first round of the overall blanking operation. Take the right side as the first side, and label the order value of each element in the vector ω one by one from right to left. After the order value is marked, the fourth element in the aforementioned vector ω is taken, and a unique space is constructed only on the right side of the element. Insert the aforementioned vector μ into the space corresponding to the aforementioned fourth element in the way of overall insertion, and then generate a new vector, which is obtained by the overall insertion operation in the A ₁ -B ₁ -C ₁ joint system A type III vector of, mark this newly generated vector as [ω] ⁴ +<μ, and the first round of overall interpolation is completed.

Perform the second round of the overall insertion operation again, as shown in Figure 27. Take the newly generated vector [ω] ⁴ +<μ as the receiving vector for the second round of the overall nulling operation. From the vector of [ω] ⁴ + <a first element on the right side until the start of the vector [mu] [ω] ⁴ + <a first element of each element of the vector μ μ left inside contains up who, denoted by order value ; The rest of the elements in the vector [ω] ⁴ +<μ are not marked with order values. Take the first element in the vector [ω] ⁴ +<μ that has an order value, and only construct a unique space on the right side of the element. After making the empty space, take the preposition vector g[PREP](u)=by+<the press as the insertion vector for the second round of the overall empty insertion operation, and record the insertion vector as ξ. Insert the vector ξ into the space corresponding to the first element mentioned above in the way of overall insertion, and then generate a new vector.

The aforementioned newly generated vector is shown below. This vector is a type III vector obtained through the overall void operation in the A ₁ -B ₁ -C ₁ joint system, and it is also a combined vector. Denote this vector as [[ω] ⁴ \μ] ¹ +<ξ. That men who were appointed didn't bother the liberals wasn't remarked by the press

Replace the aforementioned merged vector with the number as follows. After inspection, there is no serial number in the vector. The merged vector is reasonable, keep the merged vector, and keep the A ₁ -B ₁ -C ₁ joint system, and wait for subsequent operations.

1 2 3 4 5 6 7 8 9

The above-mentioned overall blanking operation corresponds to the blanking scheme: ω→μ→ξ.

As for the subsequent exhaustion of the space corresponding to each element in each receiving vector in each round of the emptying operation involved in the above-mentioned emptying scheme, that is, exhausting every merged vector involved in the above-mentioned emptying scheme, You can imitate the aforementioned operations, not to list them all.

In summary, through the A ₁ -B ₁ -C ₁ joint system, the general syntactic structure of Example 1 is obtained, that is, the A ₁ -B ₁ -C ₁ joint system describes the basic framework of the syntactic structure of Example 1. As shown in Figure 28.

Exhaust all the plug-in solutions corresponding to any A-B-C joint system:

For example: the aforementioned A ₁ -B ₁ -C ₁ joint system contains 3 type II vectors ω, μ, ξ; for the aforementioned 3 type II vectors, follow the permutation formula in combinatorics

Perform calculations to obtain all the blanking schemes corresponding to the A ₁ -B ₁ -C ₁ joint system as follows: ω→μ→ξ (plan 1), μ→ω→ξ (plan 3), ξ→μ→ω (plan 5) ),

ω→ξ→μ (plan 2), μ→ξ→ω (plan 4), ξ→ω→μ (plan 6).

The subsequent exhaustion:

As for the subsequent exhaustion of all ABC joint systems corresponding to each word list (ii), exhaustion of all insertion schemes and all merging vectors corresponding to each ABC joint system, relevant calculations such as multiplication and permutation and combination in combinatorics can be used. The methods are gradually exhausted in accordance with the aforementioned operations, and will not be listed one by one.

The second method of overall insertion:

The second method of overall blanking is to mark every element in the receiving vector with a sequence value in each round of the overall blanking operation, and then you can take any element that has been marked with a sequence value, construct a gap and perform the blanking operation .

In the second overall blanking method, there are no restrictions on the order value of each round of blank insertion and the selection of spaces; in the first overall blanking method, every subsequent round from the second round of overall blanking The overall round insertion is limited to the position of the first element on the second side of the previous round insertion vector contained in the received vector, the order value is marked and the space selected. When all the stitching vectors corresponding to a certain joint system are exhausted, the first overall interpolation method will not produce the same stitching vector; the second overall inserting method may produce the same result, that is, the same stitching vector will be generated, and the result will be the same Combine into one result. The operation process of the second overall plug-in method is shown in Figure 29 and Figure 30.

Optimization of the first overall interpolation method and the second overall interpolation method:

The above process can be further optimized according to S8.6 of the application plan. S8.6 of the application plan is the optimization of the steps from S8.2 to S8.5 of the application plan, that is, the optimization of the first and second overall insertion methods mentioned above.

According to S8.6 of the application plan, after the equivalent substitution operation of S8.1 is performed, each type II vector in the A ₁ -B ₁ -C ₁ joint system is replaced with a corresponding number, as shown below Show:

That men didn’t other the liberals wasn’t remarked: 1 2 5 6 7

who were appointed: 3 4 by the press: 8 9

In the following, only the first round of the overall plug-in operation is taken as an example to illustrate the optimization method of the application scheme S8.6. This optimization method is applicable to all the aforementioned first and second overall inserting methods.

Now, given a blanking scheme corresponding to the A ₁ -B ₁ -C ₁ joint system, suppose that in the first round of the overall blanking operation corresponding to the blanking scheme, the vector (1 2 5 6 7) is the receiving vector, The vector (3 4) is the insertion vector. Take the right side as the first side, and construct a space on the right side of each element inside the receiving vector, as shown below:

1_______ 2_______ 5_______ 6_______ 7_______

Start to filter reasonable slots: because 7>3, the numbered group (3 4) cannot be inserted into the slot corresponding to number 7, and there is no empty operation for this slot; because 6>3, the numbered group (3 4) cannot be inserted If the vacancy is in the vacancy corresponding to 6, the vacancy does not have an insert operation; because 5>3, the numbered group (3 4) cannot be vacant in the vacancy corresponding to the number 5, and the vacancy does not have an insert operation; because 2<3 and 4 <5, the numbered ordered group (3 4) can be inserted into the slot corresponding to number 2, and the slot can be inserted into the slot, as shown in Figure 31.

Steps after the overall blanking operation and checking whether the combined vector is reasonable:

Next, for the norm A system that can generate a reasonable merging vector, that is, a matrix that can generate a reasonable merging vector, use the method of probability combined with syntax rules or dependency analysis to check the syntax rules. Consider constructing a set of syntactic rules, which contains a limited number of syntactic rules. This set of syntactic rules can also be used in the syntactic structure fixes mentioned later. The set of syntactic rules includes but not limited to the following syntactic rules:

① In English, unless the subject clause is surrounded by left and right quotation marks, the subordinating conjunction that of the leading subject clause cannot be omitted; further, unless the subject clause is surrounded by left and right quotation marks, any leading word that leads to the subject clause cannot be omitted. The omission is reflected in the matrix structure: if a certain x _i element in the matrix is served by a certain predicate vector f _j , then the l _j element in the aforementioned f _j cannot be an empty unit, that is, l _j ≠e.

② In English, if the predicate is in the passive voice, on the premise that it does not include special syntactic phenomena, then the predicate cannot have a corresponding second-position object. Reflected in the matrix structure: If one of the elements r _i is the passive matrix, then the aforementioned r _i z _i corresponding to the unit must be empty, i.e., z _i = e.

③In English, on the premise that no special syntactic phenomenon is included, if the predicate is in the passive voice, and the predicate is a unit composed of verbs that can neither accept a double object nor an object combined with an object complement, then the predicate is both There can be no corresponding first-position object and no corresponding second-position object. Reflected in the matrix structure: if a certain r _i element in the matrix is in the passive voice, and the r _i element is a unit composed of a verb that can neither accept a double object nor an object combined with an object complement, then it is the same as the aforementioned r Both y _i and z _i corresponding to _i must be empty units, that is, y _i =e and z _i =e.

④In English, subject and predicate must be consistent in the singular and plural concepts without including special syntactic phenomena; although there are some nouns with the same singular and plural forms in English, they will interfere with the judgment of the aforementioned problems. But these nouns can be summarized and given in advance by querying the dictionary or statistics. Subject and predicate must maintain the same rules in singular and plural, which is easy to handle in matrix structure.

⑤In English, most of the prepositions, such as in, on, at, to, with, for, about, etc., cannot be followed by the object clauses that lead or omit that; a few prepositions, such as except, besides , But, etc., can be followed by that guided or omitted object clauses of that.

⑥ Use the rules of event object verbs and non-event object verbs to check; the event object verbs in this patent application refer to verbs in natural language that can only use events as objects but not people or things as objects; this patent application The non-event object verb in, refers to a verb in natural language that can only use people or things as objects but not events as objects. For example, bother in English is a typical non-event object verb, which can accept people or Things are used as objects, but object clauses that lead or omit that are not allowed; event object verbs and non-event object verbs can be summarized and given in advance by querying the dictionary or statistics; event object verbs and non-event object verbs The concept plays an important role in the computer's natural language syntactic analysis; this patent application also lists event object verbs and non-event object verbs as a syntactic rule, and checks are performed according to this rule.

⑦Some special syntactic phenomena in English, inverted sentences or omitted sentences, etc., are not listed one by one.

Next, take another A ₂ -B ₁ -C ₂₁ joint system that generates a reasonable combined vector, as shown below. When checking the syntactic rules of the joint system, it was found that x=the liberals and r=wasn't remarked violated the requirements of the aforementioned syntactic rule ④. A ₂ -B ₁ -C ₂₁ combined system is discarded.

B ₁ ={g[PREP](u)=by+<the press};

Next, take another A ₃ -B ₁ -C ₃₁ joint system that generates a reasonable combined vector, as shown below. When checking the syntactic rules of the joint system, it was found that x=the liberals and r=wasn't remarked violated the requirements of the aforementioned syntactic rule ④, while r=didn't bother and y=f ₃ violated the aforementioned The requirements of syntactic rules ⑥. The A ₃ -B ₁ -C ₃₁ joint system violates the aforementioned syntax rules in two places, and the A ₃ -B ₁ -C ₃₁ joint system is discarded.

B ₁ ={g[PREP](u)=by+<the press};

Special note: In any ABC joint system corresponding to the aforementioned word list (ii-b), the computer initially divides the structurally ambiguous qualifier unit That and the basic noun unit men into the same language segment, which is processed as That modifies men; but That modifies men is an obvious syntactic error, which can be easily recognized and eliminated by the computer in the subsequent syntactic rule checking, because according to English syntax rules, That as a qualifier cannot modify the plural form of a countable noun, men. As a result, all A-B-C joint systems generated by the word list (ii-b) will be treated as unreasonable A-B-C joint systems and removed.

The aforementioned A ₁ -B ₂ -C ₂ combined system is as follows. The aforementioned A ₁ -B ₂ -C ₂ joint system also generates a reasonable combined vector. Run the remaining noun checking program on the aforementioned A ₁ -B ₂ -C ₂ joint system to check whether the remaining noun the press of the C ₂ system is a reasonable remaining noun. If the remaining noun the press is a reasonable remaining noun, then the A ₁ -B ₂ -C ₂ joint system is retained; if the remaining noun the press is an unreasonable remaining noun, then the A ₁ -B ₂ -C ₂ joint system is discarded.

B ₂ ={g[PREP](u)=by+<e}; C ₂ ={the press}

Use probability combined with syntactic rules or dependency analysis methods to check remaining nouns. For example, in English, appositions can use independent nouns, the independent nominative structure of non-predicate verbs can use independent nouns, and the title of articles with colons often use independent nouns, and so on. If the method of combining probability with syntactic rules is used, then the aforementioned linguistic phenomenon is the syntactic rule corresponding to reasonable residual nouns. On the basis of these syntactic rules, special statistics can also be made for the aforementioned syntactic rules in the corpus, and the corresponding probability can be calculated.

If we use the method of probability and syntactic rules mentioned above, it is easy to check: the remaining noun the press of the C ₂ system is an unreasonable remaining noun. Therefore, the A ₁ -B ₂ -C ₂ joint system is discarded.

After various previous treatments, only the aforementioned A ₁ -B ₁ -C ₁ combined system remained, and all other combined systems were discarded due to their own unreasonable factors.

The A ₁ -B ₁ -C ₁ joint system depicts the basic framework of the syntactic structure of Example 1, as shown in Figure 28. Compared with the word list 1, there is still an impurity ingredient missing at present. In order to obtain the complete syntactic analysis result of Example 1, the basic framework of the syntactic structure obtained above can be combined with the method of combining the probability with the syntactic rules or the method of dependency analysis. Specifically, if the method of combining probability with syntactic rules is adopted, then according to the lexical mark of example sentence 1 given in the word list (i), sorted in descending order of probability, the acquisition is not in conflict with the basic framework of the aforementioned syntactic structure. The most probable computer analysis result. The method of combining probability with syntactic rules includes, but is not limited to: probabilistic context-free grammar and lexicalized probabilistic context-free grammar.

For example: Suppose, according to the lexical mark of example sentence 1 calibrated in the word list (i), using the method of probability combined with syntactic rules, 10000 syntactic analysis results generated by the computer are obtained, and the aforementioned results are sorted in descending order of probability . Among them, the results ranked 1st to 19th all conflict with the basic framework of the syntactic structure described by the aforementioned A ₁ -B ₁ -C ₁ joint system, and the result ranked 20th is incompatible with the aforementioned syntactic structure. The basic framework does not conflict, then the result ranked 20th is the computer analysis result that does not conflict with the basic framework of the aforementioned syntactic structure and has the highest probability, and this result is regarded as the final correct result. In the form of strings commonly used in the field of computer science, the results of several syntactic analysis above are expressed as follows:

1), (ROOT(S(NP(IN That)(NP(NNSmen)(SBAR(WPwho)(S(VBDwere)(VBNappointed))))))(VP(VBDdid)(RBn't )(VP(VBbother)(NP(NP(DTthe)(NNSliberals))(VP(VBDwas)(RBn't)(VP(VBNremarked)(ADVP(RPupon)(PP(INby) (NP(DT the)(NN press)))))))))(..)))

The probability of the result ranked 1st is: 0.00010738

2),(ROOT(S(IN That)(NP(NNSmen)(SBAR(WPwho)(S(VBDwere)(VBNappointed)))))(VP(VBDdid)(RBn't)(VP (VBbother)(NP(NP(DTthe)(NNS liberals))(VP(VBDwas)(RBn't)(VP(VBNremarked)(ADVP(RPupon)(PP(INby)(NP( DT the)(NN press))))))))))(..)))

The probability of the second place result is: 0.00010621

3), (ROOT(S(NP(IN That)(NP(NNSmen)(SBAR(WPwho)(S(VBDwere)(VBNappointed))))))(VP(VBDdid)(RBn't )(VP(VBbother)(NP(NP(DTthe)(NNSliberals))(VP(VBDwas)(RBn't) (VP(VBNremarked)(PP(RPupon)(PP(INby) (NP(DT the)(NN press)))))))))(..)))

The probability of the result ranked 3rd is: 0.00010403

20), (ROOT(S(NP(IN That)(NP(NP(NNSmen)(SBAR(WPwho)(S(VBDwere)(VBNappointed)))))(VP(VBDdid)(RBn' t)(VP(VBbother)(NP(DTthe)(NNS liberals))))))(VP(VBDwas)(RBn't)(VP(VBNremarked)(ADVP(RPupon)(PP( IN by)(NP(DTthe)(NN press))))))(..)))

The probability of the result ranked 20th is: 0.00010196

In summary, after the aforementioned series of processing, the syntactic analysis result of Example 1 is obtained. The result is a result that can be considered correct in English linguistics. In the form of a string commonly used in the field of computer science, the result is expressed as follows: [See Figure 3]

Note: Figure 3 is a schematic diagram corresponding to the string form, and the same applies to the following.

(ROOT(S(NP(IN That)(NP(NP(NNSmen)(SBAR(WPwho)(S(VBDwere)(VBNappointed)))))(VP(VBDdid)(RBn't)( VP(VBbother)(NP(DTthe)(NNS liberals))))))(VP(VBDwas)(RBn't)(VP(VBNremarked)(ADVP(RPupon)(PP(INby) (NP(DT the)(NN press))))))(..)))

In the first half of the specification, the following mathematical model is mentioned, and this mathematical model is referred to as the Q model.

f(c ₁ ,l ₁ ,x ₁ ,r ₁ ,y ₁ ,z ₁ );

g(c ₂ ,l ₂ ,x ₂ ,r ₂ ,y ₂ ,z ₂ );

h(c ₃ ,l ₃ ,x ₃ ,r ₃ ,y ₃ ,z ₃ ).

f, g, h meet the following three conditions:

①l ₂ ＝that;

③g[h(c ₃ ,l ₃ ,x ₃ ,r ₃ ,y ₃ ,z ₃ )].

Explanation: the meaning of f(c ₁ ,l ₁ ,g(c ₂ ,l ₂ ,x ₂ ,r ₂ ,y ₂ ,z ₂ ),r ₁ ,y ₁ ,z ₁ ) is that the predicate vector g is the predicate vector The subject clause of f. The meaning of g[h(c ₃ ,l ₃ ,x ₃ ,r ₃ ,y ₃ ,z ₃ )] is that the predicate vector h is inserted into a certain position of the predicate vector g in a way of inserting the entire space. The meaning of l ₂ =that is that the leading word of the predicate vector g is that. Correspondingly, the meaning of the Q model is: the predicate vector g is the subject clause of the predicate vector f, and the leading word of the predicate vector g is that, and the predicate vector h is inserted into a certain position of the predicate vector g in a way of overall insertion.

Example 1 conforms to the above-mentioned Q model. The verification is as follows. The auxiliary components and the empty unit e are omitted:

f(c ₁ ,l ₁ ,x ₁ ,r ₁ ,y ₁ ,z ₁ )=g(c ₂ ,l ₂ ,x ₂ ,r ₂ ,y ₂ ,z ₂ )+<wasn't+<remarked;

g(c ₂ ,l ₂ ,x ₂ ,r ₂ ,y ₂ ,z ₂ )=That+<men+<didn't+<bother+<the+<liberals;

f(c ₁ ,l ₁ ,g(c ₂ ,l ₂ ,x ₂ ,r ₂ ,y ₂ ,z ₂ ),r ₁ ,y ₁ ,z ₁ )=(That+<men+<didn't+<bother+<the+<liberals)+<wasn't+<remarked;

h(c ₃ ,l ₃ ,x ₃ ,r ₃ ,y ₃ ,z ₃ )=who+<were+<appointed;

g[h(c ₃ ,l ₃ ,x ₃ ,r ₃ ,y ₃ ,z ₃ )]=That+<men+[who+<were+<appointed]+<didn't+<bother+<the+<liberals.

What needs to be pointed out is that from a mathematical point of view, all English sentences that conform to the above Q model will often be used by Berkeley Parser and Stanford Parser (Berkeley Parser) until the filing date of this patent application-March 22, 2019. Stanford Parser) parsed the seriously wrong result!

Example 2: That something you learned is wrong is known to the public.

That in this example sentence creates structural ambiguity. However, due to limited space, only the word list (ii) that uses That as a subordinate related word unit for preprocessing is given, as shown below. The adjective wrong in this example sentence serves as the predicative of the clause and is the main component of the sentence. However, in order to facilitate computer processing, the adjective wrong is temporarily removed in the preprocessing step according to the operation of the application plan. The predicative wrong of the subordinate sentence can be repaired in the subsequent syntactic structure repair link.

According to the scheme of this application, for example sentence 2, the following A-B-C joint system can be generated:

B ₁ ={to+<the public};

Through the above A ₁ -B ₁ -C ₁ joint system, the basic framework of the syntactic structure of example sentence 2 is obtained, as shown in Figure 32.

The complete syntactic analysis result of Example 2 is expressed as a string as follows: [See Figure 5]

(ROOT(S(SBAR(IN That)(S(NP(NNsomething)(SBAR(PRPyou)(VBDlearned)))(VP(VBZis)(JJwrong))))(VP(VBZis)( VP(VBN known)(PP(TO to)(NP(DT the)(NN public)))))(..)))

Special note: Example 2 also conforms to the Q model mentioned above. As of the filing date of this patent application-March 22, 2019, Berkeley Parser and Stanford Parser gave wrong results for this example sentence!

Example 3: That that men were appointed didn't other the liberals wasn't remarked up by the press.

The two thats in this example have structural ambiguities. However, due to limited space, only the word list (ii) that uses 2 that as subordinate related word units for preprocessing is given, as shown below:

According to the scheme of this application, for example sentence 3, an A-B-C joint system can be generated as follows:

B ₁ ={g[PREP](u)=by+<the press};

Through the above A ₁ -B ₁ -C ₁ joint system, the basic framework of the syntactic structure of example sentence 3 is obtained, as shown in Figure 33.

The complete syntactic analysis result of Example 3 is expressed as a string as follows: [See Figure 8]

(ROOT(S(SBAR(IN That)(S(SBAR(IN That)(S(NP(NNSmen))(VP(VBDwere)(VBNappointed)))))(VP(VBDdid)(RBn' t)(VP(VBbother)(NP(DTthe)(NNS liberals))))))(VP(VBDwas)(RBn't)(VP(VBNremarked)(ADVP(RPupon)(PP( IN by)(NP(DTthe)(NN press))))))(..)))

Special note: As of the filing date of this patent application-March 22, 2019, Berkeley Parser and Stanford Parser gave wrong results for this example sentence!

Example 4: That that that men were appointed didn't other the liberals wasn't remarked up by the press upset many women.

The three thats in this example all produce structural ambiguity. However, due to limited space, we only give a word list (ii) with 3 that as subordinate related word units for preprocessing, as shown below:

According to the scheme of this application, for example sentence 4, an A-B-C joint system can be generated as follows:

B ₁ ={g[PREP](u)=by+<the press};

Through the above A ₁ -B ₁ -C ₁ joint system, the basic framework of the syntactic structure of example sentence 4 is obtained, as shown in Figure 34.

The complete syntactic analysis result of Example 4 is expressed as a string as follows: [See Figure 9]

(ROOT(S(SBAR(IN That)(S(SBAR(IN That)(S(SBAR(IN That)(S(NP(NNSmen))(VP(VBDwere)(VBNappointed)))))(VP (VBDdid)(RBn't)(VP(VBbother)(NP(DTthe)(NNSliberals))))))(VP(VBDwas)(RBn't)(VP(VBNremarked)( ADVP(RPupon)(PP(INby)(NP(DTthe)(NNpress)))))))(VP(VBDupset)(NP(JJmany)(NNSwomen)))(..) ))

Supplementary explanation for cases 3 and 4: In the first half of the description,

cases

3 and 4 have been mentioned. These two sentences have no grammatical and logical errors, and both contain the nested structure of the subject clause guided by that. Among them, that is all subordinate conjunctions (the lexical label is IN); and Berkeley Parser and Stanford Parser gave incorrect syntactic analysis results for Examples 3 and 4! Especially for a total of 5 subordinating conjunctions that in these two sentences, neither Berkeley Parser nor Stanford Parser gave completely correct results! In addition, for the word list (ii) that records that as a structurally ambiguous qualifier unit (ii) and its corresponding ABC joint system, the computer initially classifies the structurally ambiguous qualifier unit that and the basic noun unit men in the same language In the fragment, but that modifies men is an obvious syntax error, and this error can be easily identified and eliminated by the computer in the subsequent syntactic rule checking. Therefore, the word lists (ii) generated in Examples 3 and 4 that mark that as a structurally ambiguous qualifier unit (ii) will be cleared by the computer.

Example 5: Behaviorists suggest the child who is raised in an environment where they are many stimuli which develop his or her capacity for appropriate responses will experience greater intellectual development.

Due to limited space, only the preprocessed word list (ii) is given, as shown below:

The example sentence has 5 predicate verb units suggest, is raised, there are, develop, and will experience; therefore, this example sentence contains 5 predicate elements, which are recorded as r ₁ , r ₂ , r ₃ , r ₄ , and r ₅ in turn; These 5 predicate elements generate corresponding predicate vectors f ₁ , f ₂ , f ₃ , f ₄ , and f ₅ ; according to the information of the application plan S3, the predicate vectors f ₁ , f ₂ , f ₃ , f ₄ , f ₅ The value of each element of is as follows:

① For f ₁ : {r ₁ }={suggest}; {c ₁ }={e}, {l ₁ }={e}, {x ₁ }={Behaviorists,e}, {y ₁ }={ the child, f ₂ , f ₃ , f ₄ , f ₅ , e}, {z ₁ } = {e}.

②For f ₂ there are: {r ₂ }={is raised}; {c ₂ }={e}, {l ₂ }={who,e}, {x ₂ }={Behaviorists,the child,f ₁ , e}, {y ₂ } = {an environment, f ₃ , f ₄ , f ₅ , e}, {z ₂ } = {e}.

③For f ₃ : {r ₃ }={there are}; {c ₃ }={e}, {l ₃ }={who,where,e}, {x ₃ }={Behaviorists, the child, an environment, f ₁ ,f ₂ ,e}, {y ₃ }={many stimuli,f ₄ ,f ₅ ,e}, {z ₃ }={e}.

④For f ₄ : {r ₄ }={develop}; {c ₄ }={e}, {l ₄ }={who,where,which,e}, {x ₄ }={Behaviorists, the child, an environment,many stimuli,f ₁ ,f ₂ ,f ₃ ,e}, {y ₄ }={capacity,responses,f ₅ ,e}, {z ₄ }={e}.

⑤For f ₅ : {r ₅ }={will experience}; {c ₅ }={e}, {l ₅ }={who,where,which,e}, {x ₅ }={Behaviorists, the child ,an environment,many stimuli,capacity,responses,f ₁ ,f ₂ ,f ₃ ,f ₄ ,e},{y ₅ }={development,e},{z ₅ }={e}.

Special note: In English, the there be sentence pattern is essentially an inverted sentence pattern, and the subject of the there be sentence pattern is the language unit after the be verb. In this patent application, for the convenience of computer processing, all the language units located after the be verb are treated as the language units at the object position. When it comes to the subsequent syntactic structure repair link, special syntactic phenomena including there be sentence patterns and inverted sentence patterns can be handled appropriately.

Denote the entire backbone system corresponding to this example as {A}; denote the cardinality of the set {A} as ∣A∣. Let the total of all possible values of the predicate vector f ₁ be {f ₁ }; let the cardinality of the set {f ₁ } be ∣f ₁ ∣. The same treatment is adopted for other predicate vectors and elements. Then use the multiplication principle in combinatorics:

∣f ₁ ∣=∣c ₁ ∣×∣l ₁ ∣×∣x ₁ ∣×∣r ₁ ∣×∣y ₁ ∣×∣z ₁ ∣=1×1×2×1×6×1=12

∣f ₂ ∣=∣c ₂ ∣×∣l ₂ ∣×∣x ₂ ∣×∣r ₂ ∣×∣y ₂ ∣×∣z ₂ ∣=1×2×4×1×5×1=40

∣f ₃ ∣=∣c ₃ ∣×∣l ₃ ∣×∣x ₃ ∣×∣r ₃ ∣×∣y ₃ ∣×∣z ₃ ∣=1×3×6×1×4×1=72

∣f ₄ ∣=∣c ₄ ∣×∣l ₄ ∣×∣x ₄ ∣×∣r ₄ ∣×∣y ₄ ∣×∣z ₄ ∣=1×4×8×1×4×1=128

∣f ₅ ∣=∣c ₅ ∣×∣l ₅ ∣×∣x ₅ ∣×∣r ₅ ∣×∣y ₅ ∣×∣z ₅ ∣=1×4×11×1×2×1=88

∣A∣=∣f ₁ ∣×∣f ₂ ∣×∣f ₃ ∣×∣f ₄ ∣×∣f ₅ ∣=389283840, generating a total of 389283840 backbone systems.

The above process can be simplified according to claim 5 in the application solution, and the generation and checking of the backbone system can be executed simultaneously, thereby reducing the complexity of calculation. According to the information in S4, this example sentence generates 2 auxiliary vectors, as shown below:

g[PREP,1](u)=in+<(u): PREP=in, u={an environment, f ₃ , f ₄ , f ₅ , e}.

g[PREP, 2](u)=for+<(u): PREP=for, u={responses, f ₅ , e}.

Denote the entire auxiliary system corresponding to this example as {B}; denote the cardinality of the set {B} as ∣B∣. Record the total of all possible values of the auxiliary vector g[PREP,1](u) as {g[PREP,1](u)}; record the base of the set {g[PREP,1](u)} as ∣g[PREP,1](u)∣. The same treatment is applied to the auxiliary vector g[PREP,2](u). Using the multiplication principle in combinatorics: ∣B∣=∣g[PREP,1](u)∣×∣g[PREP,2](u)∣=3×5=15, a total of 15 auxiliary systems are generated.

Take a through standardized basic system checks, referred to as specification A ₁ system; take a Regulatory A ₁ system with the checked specification assistance system, referred to regulate B ₁ system; and the specification A ₁ system and regulate B ₁ system The remaining noun system that matches is recorded as the C ₁ system. An ABC joint system is thus obtained, denoted as A ₁ -B ₁ -C ₁ joint system. As follows:

B ₁ ＝{g[PREP,1](u)=in+<an environment,g[PREP,2](u)=for+<responses}

Taking the right side as the first side, five rounds of overall insertion operation are required. Due to space limitations, the inventor of this patent application presented the five-round plug-in in a simple manner. As shown in Figure 35.

After the above five rounds of overall inserting operations, the combined vector is obtained, as shown below:

Behaviorists suggest the child who is raised in an environment where there are many stimuli which develop capacity for responses will experience development

Replace the above flattened vector with a number, as shown below. After checking, there is no serial number in reverse order in the combined vector. Obviously, the combined vector is reasonable, and the combined vector is retained.

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17

The above-mentioned five rounds of overall insertion operation is the general syntactic structure of the sentence 5 obtained through the A ₁ -B ₁ -C ₁ joint system, that is, the basic framework of the syntactic structure of the sentence 5.

The above process can be further optimized according to S8.6 of the application plan. Replace the A ₁ -B ₁ -C ₁ joint system with a number, as shown below:

B ₁ ={g[PREP,1](u)=6+<7,g[PREP,2](u)=14+<15};

According to S8.6 of the application scheme, the numbering of the above-mentioned A ₁ -B ₁ -C ₁ combined system is optimized. After optimization, the same result as the aforementioned overall insertion operation was obtained.

According to the basic framework of the syntactic structure provided by the A ₁ -B ₁ -C ₁ joint system, the complete syntactic analysis result of Example 5 is obtained. The result is a result that can be considered correct in English linguistics, expressed as a string as follows: [See Figure 10] (ROOT(S(NP(NNS Behaviorists))(VP(VBP suggest)(SBAR(S( NP(NP(DT the)(NN child))(SBAR(WHNP(WP who))(S(VP(VBZ is)(VP(VBN raised)(PP(IN in)(NP(NP(DT an)( NN environment))(SBAR(WHADVP(WRB where))(S(NP(EX there))(VP(VBP are)(NP(NP(JJ many)(NNS stimuli))(SBAR(WHNP(WP which)) (S(VP(VBP develop)(NP(NP(PRP$his)(CC or)(PRP$her)(NN capacity))(PP(IN for)(NP(JJ appropriate)(NNS responses)))) ))))))))))))))(VP(MD will)(VP(VB experience)(NP(JJR greater)(JJ intellectual)(NN development)))))))(..) ))

Example 6: Believing that what he wants will occur, Tom works hard in the company.

The example sentence has 3 predicate verb units wants, will occur, and works; therefore, this example sentence contains 3 predicate elements, which are sequentially denoted as r ₁ , r ₂ , and r ₃ ; and then for these 3 predicate elements, the corresponding predicate vector f is generated ₁ ,f ₂ ,f ₃ ; This example sentence contains 1 gerund-present participle element. Let the gerund-present participle element correspond to the gerund-present participle vector as g[VBG](u,v); according to the application plan For the information of S3, the value of each element in the predicate vectors f ₁ , f ₂ , and f ₃ is as follows:

① For f ₁ there are: {r ₁ }={wants}; {c ₁ }={e}, {l ₁ }={that,what,e}, {x ₁ }={he,g[VBG]( u,v),e}, {y ₁ }={f ₂ ,f ₃ ,e}, {z ₁ }={e}.

②For f ₂ : {r ₂ }={will occur}; {c ₂ }={e}, {l ₂ }={that,what,e}, {x ₂ }={he,g[VBG] (u,v),f ₁ ,e}, {y ₂ }={Tom,f ₃ ,e}, {z ₂ }={e}.

③For f ₃ : {r ₃ }={works}; {c ₃ }={e}, {l ₃ }={that,what,e}, {x ₃ }={he,Tom,g[VBG ](u,v),f ₁ ,f ₂ ,e}, {y ₃ }={the company,e}, {z ₃ }={e}.

∣f ₁ ∣=∣c ₁ ∣×∣l ₁ ∣×∣x ₁ ∣×∣r ₁ ∣×∣y ₁ ∣×∣z ₁ ∣=1×3×3×1×3×1=27

∣f ₂ ∣=∣c ₂ ∣×∣l ₂ ∣×∣x ₂ ∣×∣r ₂ ∣×∣y ₂ ∣×∣z ₂ ∣=1×3×4×1×3×1=36

∣f ₃ ∣=∣c ₃ ∣×∣l ₃ ∣×∣x ₃ ∣×∣r ₃ ∣×∣y ₃ ∣×∣z ₃ ∣=1×3×6×1×2×1=36

Thus: ∣A∣=∣f ₁ ∣×∣f ₂ ∣×∣f ₃ ∣=27×36×36=34992, generating 34992 backbone systems in total.

According to the information in the application plan S4, this example sentence generates two auxiliary vectors g[VBG](u,v) and g[PREP](u):

g[VBG](u,v)=Believing+<(u)+<e: VBG=Believing, u={he, f ₁ , f ₂ , f ₃ , e}.

g[PREP](u)=in+<(u): PREP=in, u={the company,e}.

Denote the entire auxiliary system corresponding to this example as {B}; denote the cardinality of the set {B} as ∣B∣. Record the total of all possible values of the auxiliary vector g[VBG](u,v) as {g[VBG](u,v)}; record the base of the set {g[VBG](u,v)} as ∣g[VBG](u,v)∣. The same process is applied to the auxiliary vector g[PREP](u). Using the principle of multiplication in combinatorics: ∣B∣=∣g[VBG](u,v)∣×∣g[PREP](u)∣=5×2=10, generating 10 auxiliary systems in total.

Take an inspected standard backbone system and record it as the standard A ₁ system; take an inspected standard auxiliary system that matches the standard A ₁ system and record it as the standard B ₁ system; the remaining noun system that will match the standard B ₁ system , Marked as C ₁ system. An ABC joint system is thus obtained, denoted as A ₁ -B ₁ -C ₁ joint system. As follows:

B ₁ ={g[VBG](u,v)=Believing+<f ₂ +<e,g[PREP](u)=in+<the company}

Take another ABC joint system, as shown below, and mark it as A ₂ -B ₂ -C ₂₂ joint system.

B ₂ ={g[VBG](u,v)=Believing+<he+<e,g[PREP](u)=in+<the company}

Take another ABC joint system, as shown below, and mark it as A ₂ -B ₁ -C ₂₁ joint system.

B ₁ ={g[VBG](u,v)=Believing+<f ₂ +<e,g[PREP](u)=in+<the company};

C ₂₁ ＝{he}

Take the left side as the first side, construct an empty position, and then perform the overall emptying operation. After the overall blanking operation, the A ₂ -B ₂ -C ₂₂ joint system did not generate a reasonable split vector, and the A ₂ -B ₂ -C ₂₂ joint system was cleared during the overall blanking link. Next, for the A ₁ -B ₁ -C ₁ joint system and the A ₂ -B ₁ -C ₂₁ joint system that are retained after the overall insertion operation, the method of probability combined with syntactic rules can be used to check the remaining nouns. After inspection, it is found that the remaining nouns he in the C ₂₁ system are not independent nouns that can be used by appositions, independent nouns that can be used by independent nominative structures that are not non-predicate verbs, independent nouns often used in article titles that are not collocation colons, and so on. Therefore, the remaining noun he in the C ₂₁ system is an unreasonable remaining noun. A ₂ -B ₁ -C ₂₁ has an error in the combined system, so discard it.

After various necessary treatments, in the end only the A ₁ -B ₁ -C ₁ joint system remained, and the other joint systems were discarded due to their own unreasonable factors. The basic framework of the syntactic structure of example sentence 6 corresponding to the A ₁ -B ₁ -C ₁ joint system is shown in Figure 36.

Further, according to the basic framework of the syntactic structure described by the A ₁ -B ₁ -C ₁ joint system, the method of combining probability with syntactic rules is adopted, and the probability is sorted from largest to smallest, so as to obtain the basic framework without conflict with the aforementioned syntactic structure. And the most probable computer analysis result. After the aforementioned series of processing, the complete syntactic analysis result of Example 6 is obtained. The result is a result that can be considered correct in English linguistics, expressed as a string as follows: [See Figure 11]

(ROOT(S(S(VP(VBGBelieving)(SBAR(IN that)(S(SBAR(WHNP(WPwhat))(S(NP(PRPhe))(VP(VBZwants))))(VP( MD will)(VP(VBoccur)))))))(,,)(NP(NNPTom))(VP(VBZworks)(ADVP(RBhard))(PP(IN)(NP(DTthe )(NN company))))(..)))

Example 7: A study of travelers conducted by the website TripAdvisor names Yangshuo as one of the top 10 destinations in the world.

Conducted in this example sentence has structural ambiguity. Due to limited space, only word lists (ii-a) and (ii-b) are given.

Word list (ii-a):

Word list (ii-b):

Based on the word list (ii-a), the A _a -B _a -C _a joint system is as follows:

A _a ＝e e A study names Yangshuo e

Note: The word list (ii-a) contains only one predicate, so the matrix structure of the standard A _a system degenerates into a predicate vector.

B _a ={g[PREP,1](u)=of+<travelers,g[PREP,2](u)=by+<the website,

g[PREP,3](u)=as+<one,g[PREP,4](u)=of+<destinations,

g[PREP,5](u)=in+<the world,g[VBN](u)=conducted+<e}

C _a ={TripAdvisor}

The A _b -B _b -C _b joint system generated according to the word list (ii-b) is as follows:

B _b ={g[PREP,1](u)=of+<travelers,g[PREP,2](u)=by+<the website,

g[PREP,3](u)=as+<one,g[PREP,4](u)=of+<destinations,

g[PREP,5](u)=in+<the world}

C _b ={TripAdvisor}

After the overall blanking operation, we arrived at the syntactic rule inspection link and found that: x ₂ = f ₁ in the vector f ₂ in the A _b -B _b -C _b joint system, that is, the vector f ₁ is the subject clause of f ₂ , and the vector f ₁ l ₁ = e, and f ₁ vector is not enclosed in quotation marks around, which violates the previously mentioned an English syntactic rules. Therefore, the A _b -B _b -C _b joint system has errors and is discarded.

After each step, the A _a -B _a -C _a joint system has no errors and is reserved. Finally, the complete syntactic structure of the example sentence 7 corresponding to the A _a -B _a -C _a joint system is obtained, which is expressed as a string as follows: [see Figure 13]

(ROOT(S(NP(NP(DT A)(NN study))(ADJP(PP(IN of)(NP(NNStravelers)))(VP(VBNconducted)(PP(INby)(NP(NP( DTthe)(NNwebsite))(NP(NNPTripAdvisor)))))))(VP(VBZnames)(NP(NNPYangshuo))(PP(IN as)(NP(NP(CDone))(PP (IN of)(NP(NP(DTthe)(JJtop)(CD10)(NNS destinations))(PP(IN in)(NP(DTthe)(NNworld)))))))(. .)))

As of the filing date of this patent application-March 22, 2019, Berkeley Parser and Stanford Parser gave wrong results for this example sentence! The structural ambiguity between the past participle of this example sentence and the general past tense of the predicate verb (conducted in this example sentence) is a common structural ambiguity.

Example 8: That near all behavior is learned behavior is a basic assumption that has been put forward by the social scientists.

There is structural ambiguity between That and is learned in this example. Due to limited space, only word lists (ii-a) and (ii-b) are given. Word list (ii-a):

Word list (ii-b):

Based on the word list (ii-a), the A _a -B _a -C _a joint system is as follows:

B _a ={g[VBN](u)=learned+<e,g[PREP](u)=by+<scientists};

B _b ={g[PREP](u)=by+<scientists};

The complete syntactic structure of example sentence 8 obtained from the A _a -B _a -C _a joint system and its corresponding reasonable stitching vector:

(ROOT(S(SBAR(IN That)(S(NP(ADJP(RBnearly)(DTall))(NNbehavior))(VP(VBZis)(NP(VBNlearned)(NP(NNbehavior))) )))(VP(VBZis)(NP(NP(DTa)(JJbasic)(NNassumption))(SBAR(WHNP(WPthat))(S(VP(VBZhas)(VP(VBNbeen)( VP(VBN put)(ADVP(RB forward))(PP(INby)(NP(DTthe)(JJ social)(NNS scientists)))))))))(..)))

The syntactic structure can be used to repair this link, distinguish and adjust the primary and secondary status of each vector in the syntactic structure of the A _b -B _b -C _b joint system, so as to obtain A _b -B _b -C _b The complete syntactic structure corresponding to the joint system. The said distinction and adjustment of the primary and secondary status of each vector in the syntactic structure of the A _b -B _b -C _b joint system specifically refers to: which predicate vector serves as the main sentence and which predicate vector Make clauses, and adjust the predicate vector serving as the main clause and the predicate vector serving as the clause, etc.

The complete syntactic structure of example sentence 8 obtained from the A _b -B _b -C _b joint system and its corresponding reasonable combined vector:

(ROOT(S(SBAR(IN That)(S(NP(ADJP(RBnearly)(DTall))(NNbehavior))(VP(VBZis)(VP(VBNlearned)))))(NP(NN behavior))(VP(VBZis)(NP(NP(DTa)(JJbasic)(NNassumption))(SBAR(WHNP(WPthat))(S(VP(VBZhas)(VP(VBNbeen)( VP(VBN put)(ADVP(RB forward))(PP(INby)(NP(DTthe)(JJ social)(NNS scientists)))))))))(..)))

The intuitive form of the complete syntactic structure corresponding to the A _a -B _a -C _a joint system is shown in Figure 37.

The intuitive form of the complete syntactic structure corresponding to the A _b -B _b -C _b joint system is shown in Figure 38.

Then, the method of semantic processing is used to filter out the best syntactic analysis results. The semantic processing methods include, but are not limited to, semantic analysis methods based on λ-calculus, semantic analysis methods based on semantic fields and semantic networks, semantic analysis methods based on knowledge graphs, semantic analysis methods based on semantic graph models, and semantic analysis methods based on semantic graph models. The relationship calculates the probability and selects the semantic analysis method with the largest probability among them, and so on. The semantic processing method generally needs to be based on the sufficient restriction of the syntactic structure on the semantic relationship. The premise that the syntactic structure fully restricts the semantic relationship means that the syntactic structure preliminarily determines the meaning of each word in the sentence and the mutual collocation relationship between the meanings of the words. For example: According to the complete syntactic structure corresponding to the A _a -B _a -C _a joint system, the first That in this example sentence is a subordinate conjunction that leads the subject clause, and the corresponding meaning of the first That is "no meaning"; The complete syntactic structure corresponding to the A _b -B _b -C _b joint system. In this example, the first That is a subordinate conjunction that guides the adverbial clause at the beginning of the sentence, and the corresponding meaning of the first That is "because"; The complete syntactic structure corresponding to the A _a -B _a -C _a joint system. In this example, learned is the past participle acting as an attributive, and the corresponding semantics of learned is "learned"; according to the A _b -B _b -C _b joint system Corresponding to the complete syntactic structure, in this example sentence is and learned jointly act as a predicate, then the corresponding semantics of is learned is "to be learned"; etc. In particular, in order to achieve the aforementioned effects, a syntactic-semantic constraint relational database that meets the aforementioned requirements can be constructed in a targeted manner.

Hypothesis: On the premise that the syntactic structure has sufficient constraints on the semantic relationship, calculate the probability of the semantic relationship corresponding to the aforementioned two complete syntactic structures and select the result with the highest probability. The process is as follows:

The semantic relationship of the A _a -B _a -C _a joint system that is subject to the aforementioned syntactic structure constraints is shown in Figure 39.

The semantic relationship of the A _b -B _b -C _b joint system that is subject to the aforementioned syntactic structure constraints is shown in Figure 40.

Take the aforementioned complete syntactic structure corresponding to the A _a -B _a -C _a joint system with the largest semantic relationship probability as the final result of the syntactic analysis of this example sentence. The result is presented again in string form as follows: [See Figure 15]

Special note: For the result of lexical analysis that marked That as a structurally ambiguous qualifier unit, the computer will initially classify the structurally ambiguous qualifier unit That and the basic noun unit allbehavior in the same language segment, and treat them as That modifier all behavior; and That modifies all behavior is an obvious syntax error, which can be easily identified and eliminated by the computer in the subsequent syntactic rule checking process. Therefore, the word list (ii) that marked That as a structurally ambiguous qualifier unit will be cleared by the computer.

As of the filing date of this patent application-March 22, 2019, Berkeley Parser and Stanford Parser gave wrong results for this example sentence!

Example 9: Jack met the patient the nurse the clinic had hired sent to the doctor.

The sentence in this sentence has structural ambiguity. Due to limited space, only word lists (ii-a) and (ii-b) are given.

Word list (ii-a):

JackJack	metmet	the patientthe patient	the nursethe nurse	the clinictheclinic
基本名词单元Basic noun unit	谓语动词单元Predicate verb unit	基本名词单元Basic noun unit	基本名词单元Basic noun unit	基本名词单元 Basic noun unit
11	22	33	44	55
had hiredhad hired	sentsent	toto	the doctorthe doctor	..
谓语动词单元Predicate verb unit	谓语动词单元Predicate verb unit	介词单元Preposition unit	基本名词单元Basic noun unit		句号period
66	77	88	99	无编号No number

Word list (ii-b):

JackJack	metmet	the patientthe patient	the nursethe nurse	the clinictheclinic
基本名词单元Basic noun unit	谓语动词单元Predicate verb unit	基本名词单元Basic noun unit	基本名词单元Basic noun unit	基本名词单元 Basic noun unit
11	22	33	44	55
had hiredhad hired	sentsent	toto	the doctorthe doctor	..
谓语动词单元Predicate verb unit	过去分词单元Past participle unit	介词单元Preposition unit	基本名词单元Basic noun unit		句号period
66	77	88	99	无编号No number

Based on the word list (ii-a), the A _a -B _a -C _a joint system is as follows:

B _a ={g[PREP](u)=to+<the doctor};

B _b ={g[VBN](u)=sent+<e,g[PREP](u)=to+<the doctor}; C _b ={the nurse}

When checking the remaining nouns of the A _b -B _b -C _b joint system that is retained after the overall insertion operation, the method of probability combined with syntactic rules can be used. After inspection, it is found that the remaining noun the nurse of the C _b system is not an independent noun that can be used by appositions, an independent noun that can be used by independent nominative structures that are not non-predicate verbs, and an independent noun that is often used in article titles that are not collocated with a colon, etc. . Therefore, the remaining noun the nurse in the C _b system is an unreasonable remaining noun. The A _b -B _b -C _b joint system has an error and is discarded.

The overall insertion process of the complete syntactic structure corresponding to the A _a -B _a -C _a joint system is shown in Figure 41.

After each step, the A _a -B _a -C _a joint system has no errors and is reserved. Finally, the complete syntactic structure of example sentence 9 corresponding to the A _a -B _a -C _a joint system is obtained. In the form of a string, the result is expressed as follows: [See Figure 17]

(ROOT(S(NP(NNP Jack))(VP(VBDmet)(NP(NP(DTthe)(NNpatient))(SBAR(S(NP(NP(DTthe)(NNnurse))(SBAR( S(NP(DT the)(NNclinic))(VP(VBDhad)(VP(VBNhired))))))(VP(VBDsent)(PP(TOto)(NP(DTthe)(NNdoctor )))))))))(..)))

Example 9 was mentioned in the first half of the description. As of the filing date of this patent application-March 22, 2019, Berkeley Parser and Stanford Parser gave wrong results for this example sentence! The structural ambiguity between the past participle of this example sentence and the general past tense of the predicate verb (sent in this example sentence) is a common structural ambiguity.

Due to limited space, the following example sentences are only briefly explained:

Example 10: Jack met the boy the nurse the doctor the clinic had hired sent to the ward introduced to the patient.

This example sentence can obtain the correct final syntactic analysis result through the following A ₁ -B ₁ -C ₁ joint system.

Example 10 was mentioned in the first half of the description. The computer analysis process of Example 10 is similar to that of Example 9.

B ₁ ={g[PREP,1](u)=to+<the ward,g[PREP,2](u)=to+<the patient};

The overall insertion process of the complete syntactic structure corresponding to the A ₁ -B ₁ -C ₁ joint system is shown in Figure 42.

After each step, the A ₁ -B ₁ -C ₁ joint system has no errors and is reserved. Finally, the complete syntactic structure of example sentence 10 corresponding to the A ₁ -B ₁ -C ₁ joint system is obtained. In the form of a string, the result is expressed as follows: [See Figure 19]

(ROOT(S(NP(NNP Jack))(VP(VBDmet)(NP(NP(DTthe)(NNboy))(SBAR(S(NP(NP(DTthe)(NNnurse))(SBAR( S(NP(NP(DT the)(NN doctor))(SBAR(S(NP(DTthe)(NNclinic))(VP(VBDhad)(VP(VBNhired))))))(VP(VBD sent)(PP(TO to)(NP(DTthe)(NNward)))))))(VP(VBD introduced)(PP(TO to)(NP(DTthe)(NNpatient))))) )))(..)))

Example 11: This is the malt the rat the cat the dog worried killed ate.

Example 11 was mentioned in the first half of the description. Example 11 is similar to the computer analysis process of Example 10.

The overall insertion process of the complete syntactic structure corresponding to the A ₁ -B ₁ -C ₁ joint system is shown in Figure 43.

After each step, the A ₁ -B ₁ -C ₁ joint system has no errors and is reserved. Finally, the complete syntactic structure of example sentence 11 corresponding to the A ₁ -B ₁ -C ₁ joint system is obtained. In the form of a string, the result is expressed as follows: [See Figure 21]

(ROOT(S(NP(PRP This))(VP(VBZis)(NP(NP(DTthe)(NNmalt))(SBAR(S(NP(NP(DTthe)(NNrat))(SBAR( S(NP(NP(DTthe)(NN cat))(SBAR(S(NP(DTthe)(NNdog))(VP(VBD Worried)))))(VP(VBDkilled)))))( VP(VBD ate)))))))(..)))

Example 12: Part of the reason Charles Dickens loved his own novel was that it was rather closely modeled on his own life.

Example 12 was mentioned in the first half of the description. Another example of "Part of the reason why Charles Dickens loved his own novel was closely modeled on his own life." The computer analysis process and results in Example 12 are similar.

B ₁ ={g[PREP](u)=on+<life};

Finally, the complete syntactic structure of example sentence 11 corresponding to the A ₁ -B ₁ -C ₁ joint system is obtained. In the form of a string, the result is expressed as follows: [See Figure 23]

(ROOT(S(NP(NP(NN Part))(PP(IN of)(NP(NP(DTthe)(NN reason))(SBAR(S(NP(NNPCharles)(NNPDickens))(VP( VBDloved)(NP(PRP$his)(JJown)(NNnovel))))))))(VP(VBDwas)(SBAR(IN That)(S(NP(PRPit))(VP(VBD was)(VP(ADVP(RBrather)(RBclosely))(VBNmodeled)(PP(INon)(NP(PRP$his)(JJown)(NNlife))))))))(.. )))

Example 13: He said he wanted to improve the vineyard to allow visitors to enjoy local food and that in this way, he could make more money.

This example sentence can obtain the correct final syntactic analysis result through the following A ₁ -B ₁ -C ₁ joint system. This example sentence contains two juxtaposed object clauses.

B ₁ ={g[To VB,1](u,v)=to improve+<the vineyard+<e,g[To VB,2](u,v)=to allow+<visitors+<e,g[To VB, 3](u,v)=to enjoy+<local food+<e,g[PREP](u)=in+<this way};

Example 14: I will buy the car which my father needs and the bike which my brother wants.

Syntactic structure repair is another link that is carried out at the same time as the syntactic rule check in the proposal of this application. Syntactic structure repair adopts the method of probability combined with syntactic rules or the method of dependency analysis, and the missing complex inverted sentence patterns, missing long-distance verb-object relations, missing long-distance parallel components, missing adjectives as predicative components, and missing prepositions Phrases are used as predicative components, missing infinitive structures are used as complementary components of the object, missing gerund-present participle structures are used as complementary components of the object, missing past participle structures are used as complementary components of the object, and missing prepositional phrases are used Syntactic information such as the complement of the object is re-excavated, and the defects in the syntactic structure obtained before are repaired accordingly. For example: in this example, the car and the bike are juxtaposed as the object of will buy, and the car and the bike are separated by the attributive clause which my father needs. By patching this link through the syntactic structure, the car and the bike can be combined into one object element. For the two attributive clauses which are inserted respectively after the car and the bike, which my father needs and which my brother wants, they are treated as separate insertions of two basic noun units within the same object element. In addition, the and in this example sentence belongs to "coordinate related word units not used to connect sentences".

C ₁ = {the bike} is repaired by the syntactic structure: will buy the car and the bike.

Example 15: Determining where we are in relation to our surroundings remains an essential skill for our survival.

The in relation to in this example sentence has structural ambiguity. On the one hand, it can be understood that in relation to is a complete compound preposition. On the other hand, it can be understood that in relation to is a combination of the prepositional phrase in relation and the preposition to. Constitute a whole. In the following A ₁ -B ₁ -C ₁ joint system, in relation to is treated as a compound preposition. The compound prepositional phrase in relation to our surroundings serves as the predicative of the clause and is the main component of the sentence. However, in order to facilitate computer processing, according to the operation of the application plan, the compound prepositional phrase in relation to our surroundings is not counted in the matrix, and can be followed The in relation to our surroundings is repaired as a clause predicative in the syntactic structure repair link.

B ₁ ={g[VBG](u,v)=Determining+<f ₁ +<e,g[PREP,1](u)=in relation to+<our surroundings,g[PREP,2](u)=for+ <our survival};

Example 16: Tom washed and polished his car, after he gave his brother a present.

Washed and polished in this example sentence is a combined unit of adjacent predicate verbs, washed and polished constitutes a predicate element; given is a verb that can accept double objects, which can be summarized and given in advance by querying a dictionary or statistics.

Example 17: That men the nurse the clinic had hired sent to the ward introduced to the cleaners didn't bother the patients wasn't marked up by the press.

Example 17 was mentioned in the first half of the description. Example 17 conforms to the Q model mentioned above, the verification is omitted.

B ₁ ={g[PREP,1](u)=to+<the ward,g[PREP,2](u)=to+<the cleaners,

g[PREP,3](u)=by+<the press};

The overall insertion process of the complete syntactic structure corresponding to the A ₁ -B ₁ -C ₁ joint system is shown in Figure 44.

After each step, the A ₁ -B ₁ -C ₁ joint system has no errors and is reserved. Finally, the complete syntactic structure of the sentence 17 corresponding to the A ₁ -B ₁ -C ₁ joint system is obtained. In the form of a string, the result is expressed as follows: [See Figure 45]

(ROOT(S(SBAR(IN That)(S(NP(NP(NNSmen))(SBAR(S(NP(NP(DTthe)(NNnurse))(SBAR(S(NP(NP(DTthe) (NN doctor))(SBAR(S(NP(DTthe)(NNclinic))(VP(VBDhad)(VP(VBNhired))))))(VP(VBDsent)(PP(TOto)( NP(DT the)(NNward))))))(VP(VBD introduced)(PP(TO)(NP(DTthe)(NNScleaners)))))))(VP(VBDdid)( RB n't)(VP(VBbother)(NP(DTthe)(NNSpatients))))))(VP(VBDwas)(RBn't)(VP(VBNremarked)(ADVP(RPupon) (PP(IN by)(NP(DT the)(NN press))))))(..)))

As of the filing date of this patent application-March 22, 2019, Berkeley Parser and Stanford Parser gave wrong results for this example sentence! [See Figure 46]

Example 18: That men the cleaner introduced to the nurses the doctor the clinic had hired sent to the ward didn't bother the patients wasn't marked up by the press.

Example 18 was mentioned in the first half of the description. Example 18 conforms to the Q model mentioned above, the verification is omitted.

B ₁ ={g[PREP,1](u)=to+<the nurses,g[PREP,2](u)=to+<the ward,

g[PREP,3](u)=by+<the press};

The overall insertion process of the complete syntactic structure corresponding to the A ₁ -B ₁ -C ₁ joint system is shown in Figure 47.

After each step, the A ₁ -B ₁ -C ₁ joint system has no errors and is reserved. Finally, the complete syntactic structure of example sentence 18 corresponding to the A ₁ -B ₁ -C ₁ joint system is obtained. In the form of a string, the result is expressed as follows: [See Figure 48]

(ROOT(S(SBAR(IN That)(S(NP(NP(NNSmen))(SBAR(S(NP(DTthe)(NNcleaner))(VP(VBDintroduced)(PP(TOto)(NP (NP(DT the)(NNS nurses))(SBAR(S(NP(NP(DTthe)(NN doctor))(SBAR(S(NP(DTthe)(NNclinic))(VP(VBDhad)( VP(VBN hired)))))))(VP(VBD sent)(PP(TO to)(NP(DTthe)(NNward)))))))))))))(VP(VBDdid)( RB n't)(VP(VBbother)(NP(DTthe)(NNSpatients))))))(VP(VBDwas)(RBn't)(VP(VBNremarked)(ADVP(RPupon) (PP(IN by)(NP(DT the)(NN press))))))(..)))

As of the filing date of this patent application-March 22, 2019, Berkeley Parser and Stanford Parser gave wrong results for this example sentence! [See Figure 49]

Summary of the invention:

The solution of this patent application aims to solve specific technical problems in computer natural language processing, and organically unifies the three aspects of computer-executed lexical analysis, syntactic analysis, and semantic analysis, so that these three aspects are cross-referenced. Restrict and correct each other. In the scheme of this patent application, the inventor established a new set of mathematical models suitable for computer processing to describe sentences. The mathematical model describing the sentence has a clear and accurate structure, strong expressive ability and practicability. The length of each formula contained in the model is limited, conforms to the natural laws of mathematics and computer science, and helps improve computer processing The accuracy of natural language. On this basis, the inventor gave a set of methods for using computers to analyze the syntactic structure of sentences. The method of using a computer to analyze the syntactic structure of a sentence conforms to the laws of nature, has a wide range of application, high accuracy, and a very large amount of calculation. Distributed computing is recommended. In particular, it is pointed out that all sentences that appear in the specification of this patent application can use the scheme of this patent application to obtain correct syntactic analysis results. The scheme of this patent application can be divided into the following 4 calculation areas:

The first calculation area: α area

In the α area, read the sentence data structure to be parsed, and perform preprocessing operations on the sentence data structure to be parsed; read the sentence data structure to be parsed after the aforementioned preprocessing; for the sentence data structure to be parsed without the predicate verb unit Analyze the sentence, use probability combined with syntactic rules or dependency analysis method to analyze the sentence, and take the aforementioned analysis result as the final analysis result of the computer; for the sentence to be parsed with predicate verb unit, generate a list of related words , And generate the predicate vector, auxiliary vector, and remaining noun vector corresponding to the aforementioned word list, and then generate the ABC joint system corresponding to the aforementioned word list.

It should be noted that: for each word list (i), use probability combined with syntactic rules or dependency analysis methods to check out special interrogative sentences, ellipsis sentences, partial inverted sentences, etc., and perform morphological processing on their predicates in order to Follow-up operations.

For example: When did you leave the house?

The form of processing as declarative sentence is: When+<you+<(did)leave+<the house+<(.)

The second calculation area: β area

In the β area, for any A-B-C joint system generated in the α area, the overall insertion operation, the syntax rule check, the syntax structure repair, and the remaining noun check are performed. This calculation area makes full use of natural laws, through screening and inspection, to generate the general syntactic structure of the sentence to be parsed, that is, the basic framework for generating the syntactic structure of the sentence to be parsed.

Furthermore, using the principle of multiplication in combinatorial mathematics, all ABC joint systems corresponding to each word list generated in the α region are exhausted; further, by permuting and combining all the correlation vectors in each ABC joint system, each one is exhausted. All the blanking schemes corresponding to the ABC joint system; further, the calculation of the β area is repeated for each blanking scheme until all the vacancies and all the combined vectors involved in each blanking scheme are exhausted.

For all the links and algorithms of the β area, please refer to Figure 50 of the specification. Among them, the three links of A, B, and C constitute the ABC joint system; D=ψ(A,B,C) is an algorithm for overall insertion and elimination of the inverse order of natural numbers; E={σ(1),σ(2),... …,Σ(m)} is the algorithm for each sub-item required for syntactic rule checking and syntactic structure repair; F=Φ(NP) is the algorithm for checking remaining nouns; G=ε(↑↓) is the aforementioned exhaustive sum The aforementioned algorithm for the β region is repeatedly executed.

Judging whether the remaining nouns are reasonable or not is the technical balance point for controlling the computer syntax analysis process in the proposal of this application. The A-B-C joint system preserved in the β area depicts the general syntactic structure of the sentence to be parsed, and immediately depicts the basic framework of the syntactic structure of the sentence to be parsed.

The third calculation area: γ area

In the γ region, the basic framework of the syntactic structure of the sentence to be parsed described by the several ABC joint systems retained in the β region is used as the standard, and obtained by analyzing the sentence to be parsed using the method of probability combined with syntactic rules or the dependency analysis method Among the sufficient number of complete syntactic structures, find the most suitable complete syntactic structure that meets the aforementioned criteria.

The fourth calculation area: δ area

In the δ region, based on several complete syntactic structures of the sentence to be parsed generated in the γ region, the method of semantic processing is adopted to find the most suitable semantic relationship subject to the aforementioned syntactic structure constraints, and then the semantic relationship corresponds to The foregoing complete syntactic structure is used as the final syntactic analysis result, and the result is output. The semantic processing method generally needs to be based on the sufficient restriction of the syntactic structure on the semantic relationship. The premise that the syntactic structure fully restricts the semantic relationship means that the syntactic structure preliminarily determines the meaning of each word in the sentence and the mutual collocation relationship between the meanings of the words.

Explanation: The Greek lowercase letters from α to β and the English capital letters from A to G involved in the above four calculation areas are sequence marks, which represent the operation sequence of each calculation area, each link, and each algorithm.

The above descriptions are only preferred embodiments of the present invention and are not used to limit the present invention. For those skilled in the art, the present invention can have various modifications and changes. Any modification, equivalent replacement, improvement, etc., made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

A method of natural language syntactic analysis, including:

S1. Read the sentence data structure to be parsed, and perform preprocessing operations on the sentence data structure to be parsed;

S2. For each word list (i), read the sentence data structure to be parsed after the aforementioned preprocessing: if there is a predicate verb unit in the sentence to be parsed, then generate a word list (ii); There is no predicate verb unit in the sentence, then the sentence is analyzed by the method of probability combined with syntactic rules or the dependency analysis method, and the result of the aforementioned analysis is used as the final analysis result of the computer, and then the corresponding word list is cleared (i) And does not generate a word list (ii);

S3. For each predicate element, generate a corresponding predicate vector; the predicate vector includes a parallel guide element, a subordinate guide element, a subject element, a predicate element, a first position object element, and a second position object element;

Wherein, the predicate element is the corresponding predicate verb unit, or the corresponding adjacent predicate verb combination unit; the predicate element number is the corresponding predicate verb unit number, or the corresponding adjacent predicate verb combination unit number ；

Wherein, the possible value of the coordinate introductory element is one of the coordinate related word units used to connect sentences with a number less than the corresponding predicate element number, or an empty unit; the coordinate related word unit that is not used to connect sentences cannot be used as a coordinate introductory The possible values of the element;

Wherein, the possible value of the subordinate introductory element is one of the subordinate related word units whose number is smaller than the corresponding predicate element number, or one of the adjacent and juxtaposed subordinate related word combination units whose number is smaller than the corresponding predicate element number, or the number is smaller than One of the interrogative unit of the corresponding predicate element number, or one of the adjacent interrogative combination units with a number smaller than the corresponding predicate element number, or an empty unit;

Wherein, the possible value of the subject element is one of the basic noun units whose number is less than the corresponding predicate element number, or one of the adjacent and parallel basic noun combination units whose number is less than the corresponding predicate element number, or the number is less than the corresponding One of the infinitive vectors corresponding to the infinitive element of the predicate element number, or a gerund whose number is less than the corresponding predicate element number-the gerund corresponding to the present participle element-one of the present participle vectors, or one of the corresponding predicate element numbers One of the predicate vectors corresponding to the predicate element, or an empty unit;

Wherein, the possible value of the object element in the first position is one of the basic noun units whose number is greater than the number of the corresponding predicate element and less than the number of the first predicate element that appears after the predicate element, or the number is greater than the corresponding predicate element The element number is less than one of the adjacent basic noun combination units of the first predicate element number that appears after the predicate element, or the number is greater than the corresponding predicate element number and less than the first predicate element that appears after the predicate element. One of the infinitive vectors corresponding to the infinitive element of a predicate element number, or a gerund whose number is greater than the number of the corresponding predicate element and less than the number of the first predicate element that appears after the predicate element-the verb corresponding to the present participle element Noun-one of the present participle vectors, or one of the predicate vectors corresponding to the predicate element with a larger number than the corresponding predicate element, or an empty unit; the predicative component corresponding to the predicate element that meets the aforementioned requirements is also regarded as the first position object element deal with;

Among them, if the corresponding predicate element is a unit composed of a verb that can accept a double object or a verb that can accept an object combined with an object complement, and the corresponding object element in the first position is a basic noun unit or an adjacent basic noun Combination unit, then the possible value of the object element at the second position is one of the basic noun units with a number greater than the number of the corresponding object element at the first position and less than the number of the first predicate element that appears after the predicate element, or One of the adjacent basic noun combination units whose number is greater than the number of the corresponding object element in the first position and less than the number of the first predicate element that appears after the predicate element, or corresponds to the predicate element whose number is greater than the corresponding predicate element One of the predicate vectors of, or an empty unit; if the corresponding predicate element is a unit composed of a verb that can accept a double object or a verb that can be combined with an object complement, and the corresponding object element in the first position is neither a basic noun If the unit is not an adjacent basic noun combination unit, then the value of the object element in the second position is an empty unit; if the corresponding predicate element is a verb that is complemented by neither a double object nor an unacceptable object combined with an object The possible value of the object element in the second position is an empty unit; among them, the verb of the double-object can be accessed or the verb of the complementary object combined with the object complement and the unacceptable double-object Verbs that cannot accept an object combined with an object complement can be summarized and given in advance by querying a dictionary or statistically; define the verbs that can accept double objects or the verbs that can accept an object combined with the object complement and the said both. Verbs that cannot accept double objects and cannot accept an object combined with an object complement will help reduce the complexity of calculations;

S4. For each infinitive element, generate a corresponding infinitive vector; for each gerund-present participle element, generate a corresponding gerund-present participle vector; for each past participle element, generate a corresponding past participle vector; For each preposition element, a corresponding preposition vector is generated; according to the possible values of the infinitive element, the infinitive first-position object element, and the infinitive second-position object element, obtain the infinitive vector corresponding to each infinitive element All possible values of; According to the possible values of the gerund-present participle element, gerund-present participle object element in the first position, gerund-present participle object element in the second position, obtain each gerund-present participle The gerund corresponding to the element-all possible values of the present participle vector; according to the possible values of the past participle element and the past participle object element, all possible values of the past participle vector corresponding to each past participle element are obtained; State the possible values of preposition elements and preposition object elements, and obtain all possible values of the preposition vector corresponding to each preposition element;

Wherein, the infinitive vector includes infinitive elements, infinitive first-position object elements, and infinitive second-position object elements;

The infinitive element is the corresponding infinitive verb unit, or the corresponding adjacent infinitive verb combination unit; the infinitive element number is the corresponding infinitive verb unit number, or the corresponding adjacent infinitive infinitive Verb combination unit number;

The possible value of the object element in the first position of the infinitive is one of the basic noun units whose number is greater than the number of the corresponding infinitive element and less than the number of the first predicate element that appears after the infinitive element, or the number is greater than the corresponding The number of the infinitive element of and is less than one of the adjacent basic noun combination units of the first predicate element number that appears after the infinitive element, or the number is greater than the number of the corresponding infinitive element and less than the number of the infinitive element One of the infinitive vectors corresponding to the infinitive element of the first predicate element number that appears after the element, or one of the infinitive vectors whose number is greater than the corresponding infinitive element number and less than the number of the first predicate element that appears after the infinitive element Noun-the gerund corresponding to the present participle element-one of the present participle vectors, or one of the predicate vectors corresponding to the predicate element with a larger number than the corresponding infinitive element, or an empty unit; the infinitive element corresponds to the predicative that meets the aforementioned requirements Component, also treated as an object element in the first position of the infinitive;

If the corresponding infinitive element is a unit composed of a verb that can accept a double object or a verb that can accept an object combined with an object complement, and the object element in the first position of the corresponding infinitive is a basic noun unit or an adjacent basic Noun combination unit, then the possible value of the object element in the second position of the infinitive is a basic number greater than the number of the object element in the first position of the corresponding infinitive and less than the number of the first predicate element that appears after the infinitive element One of the noun units, or one of the adjacent basic noun combination units whose number is greater than the number of the object element in the first position of the corresponding infinitive and less than the number of the first predicate element that appears after the infinitive element, or one of the corresponding basic noun combination units One of the predicate vectors corresponding to the predicate element with the larger number of the infinitive element, or an empty unit; if the corresponding infinitive element is a unit composed of a verb that can accept a double object or a verb that can accept an object combined with an object complement, and corresponds to The object element in the first position of the infinitive is neither a basic noun unit nor an adjacent basic noun combination unit, then the value of the object element in the second position of the infinitive is an empty unit; if the corresponding infinitive element is A unit composed of verbs that can neither accept a double object nor a non-acceptable object combined with an object complement, then the value of the object element in the second position of the infinitive is an empty unit; among them, the verb that can accept a double object or can The verbs that receive the object combined with the object complement and the verbs that can neither receive the double object nor the object combined with the object complement can be summarized and given in advance by querying the dictionary or statistics; define the said acceptable double object The verbs or the verbs that can accept the object and the object complement and the verbs that can not accept the double object or the unacceptable object and the object complement can help reduce the complexity of calculation;

Wherein, the gerund-present participle vector includes gerund-present participle element, gerund-present participle first position object element, gerund-present participle second position object element;

The gerund-present participle element is the corresponding gerund-present participle unit, or the corresponding adjacent gerund-present participle combination unit; the gerund-present participle element number is the corresponding gerund-present participle Unit number, or corresponding adjacent parallel gerund-present participle combination unit number;

The possible value of the object element in the first position of the gerund-present participle is a basic number greater than the number of the corresponding gerund-present participle element and less than the number of the first predicate element that appears after the gerund-present participle element One of the noun units, or one of the adjacent basic noun combination units whose numbers are greater than the corresponding gerund-present participle element number and less than the number of the first predicate element that appears after the gerund-present participle element, or One of the infinitive vectors corresponding to the infinitive element whose number is greater than the corresponding gerund-present participle element number and less than the first predicate element number that appears after the gerund-present participle element, or the number is greater than the corresponding gerund -The present participle element number is less than the gerund with the number of the first predicate element that appears after the present participle element-the gerund corresponding to the present participle element-one of the present participle vectors, or more than the corresponding gerund- One of the predicate vectors corresponding to the predicate element with the higher number of the present participle element, or an empty unit; the predicative component corresponding to the gerund-present participle element that meets the aforementioned requirements is also treated as the object element in the first position of the gerund-present participle;

If the corresponding gerund-present participle element is a unit composed of a verb that can accept a double object or a verb that can accept an object combined with an object complement, and the corresponding gerund-present participle first position object element is a basic noun unit or An adjacent basic noun combination unit, then the possible value of the object element in the second position of the gerund-present participle is that the number is greater than the number of the object element in the first position of the corresponding gerund-present participle and is smaller than the object element number in the first position of the gerund -One of the basic noun units of the first predicate element number that appears after the present participle element, or the number is greater than the corresponding gerund-number of the object element in the first position of the present participle and less than the number that appears after the gerund-present participle element One of the adjacent and juxtaposed basic noun combination units of the first predicate element number, or one of the predicate vectors corresponding to the predicate element with a larger number than the corresponding gerund-present participle element, or an empty unit; if the corresponding gerund- The present participle element is a unit composed of a verb that can accept a double object or a verb that can accept an object combined with an object complement, and the corresponding gerund-the object element in the first position of the present participle is neither a basic noun unit nor an adjacent juxtaposition The basic noun combination unit of, then the value of the object element in the second position of the gerund-present participle is the empty unit; if the corresponding gerund-present participle element is composed of both unacceptable double objects and unacceptable objects combined with object complements The unit of the verb constituted by the verb, then the value of the object element in the second position of the gerund-present participle is the empty unit; wherein the verb that can accept the double object or the verb of the object complement and the Verbs that can neither accept double objects nor accept objects combined with object complements can be summarized and given in advance by querying a dictionary or statistically; define the verbs that can accept double objects or accept objects combined with object complements Verbs and the mentioned verbs that can neither accept double objects nor combine object complements can help reduce the complexity of calculations;

Wherein, the past participle vector includes past participle elements and past participle object elements;

The past participle element is the corresponding past participle unit, or the corresponding adjacent past participle combination unit; the past participle element number is the corresponding past participle unit number, or the corresponding adjacent past participle combination unit number ；

If the corresponding past participle element is a unit consisting of a verb that can accept a double object or a verb that can be combined with an object complement, then the possible value of the past participle object element is that the number is greater than the number of the corresponding past participle element and less than One of the basic noun units of the first predicate element number that appears after the past participle element, or the number greater than the corresponding past participle element number and less than the first predicate element number that appears after the past participle element One of the basic noun combination units that are adjacent to each other, or one of the predicate vectors corresponding to the predicate element with a larger number than the corresponding past participle element, or an empty unit; if the corresponding past participle element is composed of neither a double object nor an object Combining the unit composed of the verb of the object complement, then the value of the object element of the past participle is the empty unit; wherein, the verb that can be accessed by the double object or the verb that can be combined with the object complement and the verb of the object complement. Verbs that cannot accept a double object or an object combined with an object complement can be summarized and given in advance by querying a dictionary or statistics; define the verbs that can accept a double object or a verb that can accept an object combined with an object complement and The described verbs that can neither accept double objects nor accept objects combined with object complements help to reduce the complexity of calculation;

Wherein, the preposition vector includes a preposition element and a preposition object element;

The preposition element is a corresponding preposition unit, or a corresponding adjacent preposition combination unit; the preposition element number is a corresponding preposition unit number, or a corresponding adjacent preposition combination unit number;

The possible value of the preposition object element is the first basic noun unit whose number is greater than the number of the corresponding preposition element and appears after the preposition element, or the number is greater than the number of the corresponding preposition element and appears after the preposition element The first adjacent basic noun combination unit, or the first gerund-present participle vector whose number is greater than the corresponding preposition element number and appears after the preposition element, or the number is greater than the corresponding preposition element number and is The first infinitive vector that appears after the preposition element, or the preposition vector corresponding to the preposition element whose number is greater than the corresponding preposition element number and is adjacent to the number sequence of the preposition element number, or the preposition vector that is greater than the corresponding preposition element number One of the predicate vectors corresponding to the predicate element, or an empty unit;

S5. The infinitive vector, the gerund-present participle vector, the past participle vector and the preposition vector are collectively referred to as auxiliary vectors; for each auxiliary vector in the sentence to be parsed, any possible value corresponding to the auxiliary vector is selected. In this way, a set of possible values corresponding to all auxiliary vectors is obtained; the possible values corresponding to the aforementioned set of all auxiliary vectors are regarded as a set, which is called an auxiliary system;

S6. Given a standard backbone system arbitrarily, collocation with a corresponding auxiliary system; replace every element outside the excluded vector in each auxiliary vector in the aforementioned auxiliary system with the corresponding number; after replacing the number, check The auxiliary system; if the following unreasonable situation occurs in the auxiliary system, then the auxiliary system is removed; if the following unreasonable situation does not occur in the auxiliary system, then the auxiliary system is retained; the remaining auxiliary system The system is called the specification auxiliary system; the predicate vectors mentioned in the following all refer to the predicate vectors in the aforementioned canonical backbone system;

S6.1. If the same number or the same predicate vector or the same infinitive vector or the same gerund-present participle vector or the same preposition vector appears in two different auxiliary vectors, then the auxiliary system is unreasonable, Clear the auxiliary system;

S6.2. If the same number or the same predicate vector or the same infinitive vector or the same gerund-present participle vector appears in an auxiliary vector and in a predicate vector at the same time, then the auxiliary system is unreasonable and the auxiliary system is removed. system;

S6.3. If two numbers in reverse order appear in an auxiliary vector, then the auxiliary system is unreasonable, and the auxiliary system is cleared;

S6.4. Substituting any two auxiliary vectors that have elements between the two into the relationship, all of which are equivalently substituted; if there is a cross-substitution contradiction between the vectors, then the auxiliary system is unreasonable, and the auxiliary system is cleared; if If two numbers in reverse order appear after equal substitution, then the auxiliary system is unreasonable. Clear the auxiliary system;

S6.5. Substituting any auxiliary vector and any predicate vector that have elements between the two elements into the relationship, all of which are equivalently substituted; if there is a contradiction in the substitution between the vectors, then the auxiliary system is unreasonable, and the Auxiliary system; if two numbers in reverse order appear after equal substitution, then the auxiliary system is unreasonable, and the auxiliary system is cleared;

S6.6. After the inspection, restore to the original state before the inspection for use in subsequent operations;

S7. Generate residual noun system and A-B-C joint system;

S7.1. Given a canonical backbone system and a canonical auxiliary system corresponding to the canonical backbone system, the remaining basic noun units and adjacent parallel basic noun combinations that do not enter the aforementioned canonical backbone system and standard auxiliary system The whole unit is regarded as a set, which is called a residual noun system; each element in the residual noun system is called a residual noun element; the number of a residual noun element is the basic corresponding to the residual noun element The number of the noun unit or the basic noun combination unit; for each remaining noun element, a corresponding remaining noun vector is generated; the remaining noun vector includes only the remaining noun elements, that is, the remaining noun vector and the remaining noun elements are in one-to-one correspondence ；

S7.2. A normative backbone system, a normative auxiliary system and a residual noun system corresponding to each other in the manner described in S7.1 constitute an A-B-C joint system;

S8. For any given ABC joint system, perform the overall blanking operation for the ABC joint system; each slot can receive at most one vector in an overall blanking operation, or no vector, that is, no blanking operation ; Before the overall blanking operation, clear the empty unit; in the overall blanking operation, the vector that constructs a space and receives other vectors into the space is recorded as the received vector; the vector that inserts the space of other vectors is recorded as the inserted vector ；

S8.1. In the aforementioned ABC joint system, for each element in each vector that can be replaced by other vectors, all the corresponding vectors are used for equivalent substitution, regardless of whether the corresponding vector is a predicate vector or an auxiliary vector Vector; perform the aforementioned equal substitution until all the other vectors in each vector are replaced; after the aforementioned equal substitution, if a vector is substituted into another vector, then cancel the substitution into the other vector The original position of the vector in the ABC joint system, so that the two vectors after the aforementioned equal substitution operation are completely integrated; through equal substitution, all the original vectors in the ABC joint system are transformed into mutual differences. There is a new vector in which the elements are substituted; taking equal substitution as the limit, the vector in the ABC joint system before the equal substitution is called the I type vector, and the vector in the ABC joint system after the equal substitution It is called a type II vector; obviously, a certain type I vector and a certain type II vector can be the same vector, that is, a vector may not change before and after the equivalent substitution;

S8.2. Perform the first round of the overall blanking operation in the ABC joint system: take any type II vector ω as the receiving vector of the first round of the overall blanking operation; label each of the vectors ω one by one according to a predetermined direction The order value of an element; according to the order value that has been marked, the i-th element in the vector ω can be selected, and a unique space is constructed only on the first side of the element; after the space is created, any one that excludes the aforementioned vector ω The second type of vector μ outside is used as the insertion vector for the first round of the overall blanking operation; in the way of overall blanking, the vector μ is inserted into the space corresponding to the aforementioned i-th element, and then a new vector is generated. The generated vector is denoted as [ω] i +<μ; the vectors obtained through the overall blanking operation in the ABC joint system are collectively referred to as type III vectors; the order value of the overall blanking labeling in each round is limited to this Used in a round of overall plug-in process;

S8.3. Perform the second round of the overall blanking operation in the ABC joint system: take the type III vector [ω] i +<μ as the receiving vector of the second round of the overall blanking operation; according to the predetermined direction, the slave vector Each element from the first element on the first side in [ω] i +<μ to the first element on the second side inside the vector μ contained in the vector [ω] i +<μ is marked with an order value; vector The rest of the elements in [ω] i +<μ are not marked with the order value; according to the marked order value, the j-th element is taken, and only a unique space is constructed on the first side of the element; after the space is created, you can take any A type II vector ξ that has not been used in any previous steps is used as the insertion vector for the second round of the overall blanking operation; the vector ξ is inserted into the space corresponding to the j-th element in the overall blanking manner, and then a new , The newly generated vector is marked as [[ω] i \μ] j +<ξ; or

Take the type III vector [ω] i +<μ as the receiving vector for the second round of the overall blanking operation; label each element in the vector [ω] i +<μ according to the predetermined direction; Sequence value, any take the kth element in the vector [ω] i +<μ, and only construct a unique vacancy on the first side of the element; after creating a vacancy, take any second II that has not been used in any previous steps The class vector ξ is used as the insertion vector for the second round of the overall blanking operation; the vector ξ is inserted into the space corresponding to the k-th element in the overall blanking method, and then a new vector is generated, and the newly generated vector is recorded as ( [ω] i +<μ) k +<ξ; According to this method, the overall interpolation operation is performed. If the same result appears after the execution of S8.4, then the same result will be merged into one result, that is, the same merged vector Merge into a flat vector;

S8.4. In the aforementioned ABC joint system, the overall insertion operation given in S8.3 is repeatedly executed in the following way: take the newly generated vector obtained from the previous round of overall insertion operation as a new round of overall Insert the received vector of the null operation, and any type II vector that has not been used in any previous steps is used as the insertion vector of the new round of the overall null operation; repeat the overall insert operation until all the II types After all the vectors are inserted into the space, it is recorded as the exhaustion of all the insertion vectors, and a type III vector is obtained while all the vectors are inserted. The type III vector obtained while inserting the exhaustion into the vector is recorded as the combined vector; S8.3 Contains 2 types of overall blanking operation methods. For the selection of the overall blanking operation method in S8.3, the previous and subsequent steps should be consistent; arrange the type II vectors used in each round of the overall blanking operation in order, Until all the insertion vectors are exhausted, a blanking scheme corresponding to the ABC joint system is formed; the operations from S8.2 to S8.4 are repeated to exhaust every round of blanking operations involved in the blanking scheme Receiving the space corresponding to each element in the vector, that is, each combined vector involved in the exhaustive insertion scheme;

S8.5. Check the result generated by S8.4: replace with a number; if two numbers in reverse order appear in a combined vector, then the combined vector is unreasonable, clear the combined vector; if it does not appear in a combined vector If the number is reversed, the combined vector is reasonable, and the combined vector is retained;

S8.6. After converting all the type I vectors in the aforementioned ABC joint system into type II vectors, first replace each type II vector in the ABC joint system with corresponding numbers, and then execute the aforementioned The overall blanking operation; according to any given blanking scheme corresponding to the ABC joint system, in each round of the overall blanking operation, a blank is constructed on the first side of each element in the receiving vector, and then Start to filter reasonable gaps; compare the greater or less than relationship between the first number on the left or right side inserted into the vector and the adjacent number on the left or right corresponding to the gap to be filtered, and only select the number sequence to avoid occurrence Inversely, the space that is greater than or less than the relationship is regarded as a reasonable space, and the empty space is inserted, and the remaining space is regarded as an unreasonable space, and no space is inserted; if there is no reasonable space in the receiving vector, then the above-mentioned empty insertion scheme is unreasonable , End the blanking scheme, and replace other blanking schemes; using this method for optimization, the obtained combined vector can be directly recorded as a reasonable combined vector, without the need to reverse the numbering order;

S8.7. Use the principle of multiplication in combinatorics to exhaust all ABC joint systems corresponding to each word list (ii); further, by permuting and combining all type II vectors in each ABC joint system, exhaustive All the blanking schemes corresponding to each ABC joint system; further, the operations from S8.2 to S8.6 are repeated for each blanking scheme until all the stitching vectors corresponding to each blanking scheme are exhausted;

S8.8. Syntactic rule check: Use the syntactic rules of natural language, and use the method of probability combined with syntactic rules or dependency analysis method to check each reasonable combination vector and its corresponding ABC joint system; the aforementioned use Syntactic rules inspection should include the use of event object verbs and non-event object verbs; the event object verbs refer to verbs in natural language that can only use events as objects but not people or things as objects; The non-event object verbs refer to verbs in natural language that can only take people or things as objects, but not events; event object verbs and non-event object verbs can be summarized in advance by querying a dictionary or statistics Give

S8.9. While executing S8.8, repair the syntactic structure; the said syntactic structure repair uses the method of probability combined with syntactic rules or the method of dependency analysis to re-excavate the missing syntactic information, and repair the previous Defects in the obtained syntactic structure; this link can also be repaired through the syntactic structure, distinguishing and adjusting the primary and secondary status of each vector in the syntactic structure of the reserved ABC joint system;

S8.10. Residual noun check: use probability combined with syntactic rules or dependency analysis method to find reasonable residual nouns and unreasonable residual nouns, and discard the A-B-C joint system containing unreasonable residual nouns;

S9. Take the basic framework of the syntactic structure of the sentence to be parsed described by the several ABC joint systems retained by S8 as the standard, and use the method of probability combined with syntactic rules or the dependency analysis method to analyze the sentence to be parsed to obtain sufficient numbers Among the complete syntactic structures of, find the most suitable complete syntactic structure that meets the aforementioned criteria;

S10. Based on several complete syntactic structures generated by S9, using semantic processing methods to find the most suitable semantic relationship subject to the aforementioned syntactic structure constraints, and then take the aforementioned complete syntactic structure corresponding to the semantic relationship as the final Syntactic analysis results.
The method according to claim 1, wherein the preprocessing operation comprises:

S1.1. For the part of speech of each word in the sentence to be parsed, automatic computer analysis and labeling are performed to generate the result of lexical analysis;

S1.2. For natural language elements such as predicate verbs, basic noun phrases, basic adjective phrases, and basic adverb phrases in the sentence to be parsed, automatic computer analysis and labeling; for adjacent noun phrases and adjacent parallel noun phrases Natural language elements such as adjective phrases and adjacent adverb phrases are automatically analyzed and labeled by computer;

S1.3. Combine various adjacent part-of-speech units, and record the merged adjacent part-of-speech units as a corresponding part-of-speech unit;

S1.4. For the language information in the sentences to be parsed as described in S1.2 and S1.3, open a list of words and write them as word list (i); word list (i) includes words and word correspondences The attributes of the words, the position information of the words in the sentence, punctuation marks and their position in the sentence;

S1.5. For the various possible results of lexical analysis, use combinatorial mathematics related methods to generate multiple different word lists (i) to accommodate multiple structural ambiguities; for the multiple different words generated above List (i) is distinguished by different numbers; in the preprocessing operation, the restrictions on the lexical analysis results are relaxed, and multiple different lexical analysis results caused by structural ambiguities are passed through multiple different word lists ( i) Keep it and leave it to the subsequent syntactic analysis link and semantic processing link for identification and screening, that is, through the subsequent syntactic analysis link and semantic processing link, the various lexical analysis results are restricted, thereby increasing the final selection of the correct The possibility of lexical analysis results;

S1.6. For each word list (i), use probability combined with syntactic rules or dependency analysis methods to check out special sentence patterns such as interrogative sentences, omission sentences, and inverted sentences, and perform corresponding morphological processing of their predicates , In order to deal with the subsequent steps;

S1.7. For each word list (i), remove adverb units, adjective units, adjacent adverb units, adjacent adjective units, interjection units, simple parentheses in non-sentence forms, and particle units , Adjacent juxtaposed particle units, adjacent juxtaposed qualifier units without structural ambiguity, mixed modifier units, impurity components in sentences waiting to be resolved; commas on both sides of non-sentence simple parentheses units waiting to be resolved are removed Contains minor punctuation marks.
The method according to claim 1, wherein the step S2 comprises:

S2.1. For each word list (i), read the sentence data structure that has been preprocessed to be parsed, and the sentence data structure that has been preprocessed includes the following information:

(1) Coordinate related word units used to connect sentences;

(2) The coordinate related word unit not used to connect sentences; the function of the coordinate related word unit not used to connect sentences is to connect various coordinate components within the sentence;

(3) Predicate verb unit, subordinate related word unit, basic noun unit, infinitive verb unit, gerund-present participle unit, past participle unit, preposition unit, adjacent predicate verb combination unit, adjacent parallel subordinate related words Combination unit, adjacent parallel basic noun combination unit, adjacent parallel infinitive verb combination unit, adjacent parallel gerund-present participle combination unit, adjacent parallel past participle combination unit, adjacent parallel preposition combination unit ；

(4) Interrogative unit, adjacent interrogative combination unit, and structurally ambiguous qualifier unit;

(5), including the parenthesis component of the predicate verb unit;

(6), the main punctuation marks;

S2.2. Generate a word list (ii) for the sentence data structure in the aforementioned S2.1; the word list (ii) includes the aforementioned words, the attributes corresponding to the aforementioned words, and the comparison of the aforementioned words according to the natural language sequence The numbers and main punctuation marks are marked in descending order of numbers.
The method according to claim 1, wherein the step S3 comprises:

S3.1. Obtain all the predicate vectors corresponding to each predicate element according to the possible values of the predicate element, the parallel guide element, the subordinate guide element, the subject element, the first position object element, and the second position object element Possible values; the predicate vector includes a parallel guide element, a subordinate guide element, a subject element, a predicate element, a first-position object element, and a second-position object element;

S3.2. For each predicate vector in the sentence to be parsed, choose any possible value corresponding to the predicate vector to obtain a set of possible values corresponding to the entire predicate vector; correspond to the aforementioned set of all predicate vectors The possible values of is arranged in a fixed order to form a matrix of n rows and 6 columns; the aforementioned matrix of n rows and 6 columns is called a backbone system;

S3.3. Replace every element outside of each predicate vector in any given backbone system with a corresponding number; after replacing the number, check the backbone system; if in the backbone system If the following unreasonable conditions occur, then the backbone system should be cleared; if the following unreasonable conditions do not occur in the backbone system, then the backbone system should be retained; the remaining backbone system is called the standardized backbone system:

S3.3.1. Check the aforementioned backbone system: compare the word list (ii), if there is a parallel related word unit or subordinate related word unit or adjacent parallel subordinate related word combination unit for connecting sentences that does not enter the main system, then the main The system is unreasonable, clear the backbone system;

S3.3.2. Check the aforementioned backbone system: If the same number or the same predicate vector or the same infinitive vector or the same gerund-present participle vector appears in two different predicate vectors, then the backbone system is unreasonable To clear the backbone system;

S3.3.3. Check the aforementioned backbone system: if there are two numbers in reverse order in a predicate vector, then the backbone system is unreasonable, and the backbone system is cleared;

S3.3.4. Check the aforementioned backbone system: replace any two predicate vectors with elements in the relationship between them, all of which are replaced by equal amounts; if there is a cross contradiction between the substitutions between the vectors, then the backbone system is unreasonable. Clear the backbone system; if two numbers in reverse order appear after equal substitutions, then the backbone system is unreasonable, and the backbone system is cleared;

S3.3.5. After the inspection, return to the original state before the inspection for use in subsequent operations.
The method according to claim 4, wherein in the process of executing S3.2, the inspection program of S3.3 is executed synchronously to prevent the generation of an unreasonable backbone system.