Embodiment
Specific implementation process of the present invention is as follows:
Step 1:
(1a) semantic matching relationship between 1-definition vocabulary is defined:
Definition 1: in lexical semantic knowledge base, any two notional word W
xand W
ybetween the inherent semantic relation that has, be called semantic matching relationship; With function match (W
x, W
y) representing its level of intimate, the value of function is exactly semantic matches value; Semantic matching relationship and concrete statement have nothing to do; If W
xwith W
ybetween there is no semantic matching relationship, then match (W is set
x, W
y)=MAX, MAX is a large constant.
(1b) 2 are defined: any notional word W in statement
i(removing predicate head) all semanteme is modified in another one notional word W
gi, claim W
giw
isemantic modification target.
(1c) 3 are defined: in specific grammatical analysis option A
iwhen, suppose that V is predicate head, S is the actor of V, and O is the recipient of V, Wi be a notional word in statement and! (W
i∈ S, V, O}), W
giw
isemanteme modify target, with function match (W
i, W
gi) represent its semantic matches value, so, the semantic matches value Value of whole statement
aican represent with formula (1):
The number (not comprising S, V, O) of target modified in the semanteme of S and O to be V, n be notional word, K
sVOand K
wifor weights coefficient. it should be noted that value less expression semantic matches degree is larger.
(1d) the best grammatical analysis axiom of axiom 1-is supposed: suppose that a statement has m kind grammatical analysis scheme, meet the grammatical analysis option A of semantic logic most
isatisfy condition: A
i=argmin (Value
ai), namely semantic matches is worth minimum grammatical analysis scheme is best grammatical analysis scheme.
Step 2:
(2a) define 4-key concept: build set before semantic base, without the need to carry out semantical definition and at Sentence analysis and reasoning time to be carried out the limited assemble of symbol of special semantic interpretation process by system, key concept quantity is few, uses C
wB={ W
b1, W
b2... W
bkrepresent, give tacit consent to each key concept W
biinherit in root concept.
(2b) define 5-lattice: the special key concept set representing the Deep Semantics relation of correlation circumstance between action genus and other things, use C
vC={ C
1, C
2... C
mrepresent; Such as " reason " is the lattice representing action reason.
(2c) define 6-mode: the special key concept set representing the executing state of action genus, use C
vM={ M
1, M
2... M
nrepresent; Such as " completing " is the mode that an expression action has been finished.
(2d) define the semantic relation that 7-is basic: build set before semantic base, unmodifiable and at Sentence analysis time need to carry out the semantic relation of special semantic interpretation process, comprise following semantic relation:
R
c(V, C, W) lattice semantic relation: in order to represent that action genus V and concept W has the Deep Semantics relation that lattice are C, wherein C ∈ C
vC;
R
m(V, M) modal semantic relation: in order to represent that the executing state of action genus V is M, wherein C ∈ C
vM;
R
aP(W, P) attribute semantic relation: in order to represent that concept P is the semantic relation of an attribute of noun genus W;
R
aS(W, S) state semantic relation: in order to represent that concept S is the semantic relation of a state of noun genus W;
R
d(W) semantic relation is inherited: can only use once in the semantical definition formula of concept, such as W
1inherit in W
2;
R
p(W
1, W
2) finite aggregate of part of semantic relation is described: concept W is described
2semanteme be W
1the part of semanteme;
R
w(W
1, W
2) finite aggregate of whole relation is described: concept W is described
1semanteme comprises W
2;
R
vS(V, W) represents that the actor of action V is the semantic relation of W, R
vO(V, W) represents that the recipient of action V is the semantic relation of W.
(2e) define 8-and expand semantic relation: building the semantic relation set newly defined in semantic base process, quantity is not limit. when Sentence analysis, all expansion semantic relations have unified disposal route, do not carry out special process. use R
r(W
1, W
2) represent, illustrate that the concept W be defined is W
1and W
2between a kind of semantic relation; In non-a defined formula, use R
r(W, W
1, W
2) represent W
1and W
2between there is the expansion semantic relation of W by name.;
(2f) based on basic description logic, to its concept definition regularly 1-7 limit and convert, become Concept Semantic define method herein.
Rule 1-concept definition rule:
1) if W
b1, W
b2key concept, R
1, R
2semantic relation, then new symbol
R
2(W
b2) be concept;
2) if W
b1, W
b2concept, R
1, R
2semantic relation, then new symbol
R
2(W
b2) be concept;
Do not have in rule 1 concept also, concept hands over, the service regeulations of measure word, their processing mode is shown in regular 2-2; Rule 2-3 is in order to the definition mode of concept is converted into one group of semantic relation, and requires that noun meets single principle inherited.
Rule 2-concept processing rule also: if W
1, W
2concept, W
1, W
2most recent co mmon ancestor concept be W
p, when new ideas W has W=W
1uW
2semantic time, due to single inheritance rules, W can be defined as W=R
d(W
p) I (R
p(W
p, W
1) U R
p(W
p, W
2)); Such as, parents=R
d(people) I (R
p(people, father) U R
p(people, mother)).
The processing rule that rule 3-concept is handed over: if W
1, W
2concept, when new ideas W has W=W
1i W
2semantic time, due to single inheritance rules, W can be defined as W=R
d(W
1) I R
w(W
1, W
2) or W=R
d(W
2) I R
w(W
2, W
1).
Rule 2 Sum fanction 3 only by concept also, concept hand over converted a kind of representation, only need specify that the certain semantic of these two kinds of representations is explained, this is done to solve the flexible means of one of many succession issues and bonding inheritance principles, act on the interface be similar in java, with the speed of the retrieval and coupling of accelerating concept.
In natural language, noun all can semantically directly or indirectly inherited in key concept, in order to the more clear semanteme representing noun accurately, adopts single principle inherited when requiring that semantic nouns represents, for having the concept inheriting semanteme, adopt regular 2 Sum fanction 3 to process more; Transitive verb semantically to represent noun to another noun do the action applied; Intransitive verb then represents the one change of noun self; Adjective is at the state semantically all represented between noun or noun or attribute; Adverbial word semantically all to represent the implementation status of action or mode and correlation circumstance or lattice; Therefore, the semanteme of all kinds of vocabulary in natural language can be represented by the satisfied mode of regular 4.
Rule 4-concept classification definition rule: in natural language, concept is classified by character and is expressed as noun, verb, adjective, adverbial word; Suppose the definition representing concept W with Def (W), Num (R, W) should meet following rule for the occurrence number of semantic relation R in definition, every class vocabulary definitions:
The list of noun is inherited: satisfy condition
Concept W;
Verb: satisfy condition
Adjective: satisfy condition
Adverbial word: satisfy condition
The processing rule of rule 5-measure word: measure word
generality quantifier and existential quantifier
, do not obtain special treating, by as " number of times " one of lattice semantic relation value represents the number of times of action, the value as " quantity " attribute semantic relation represents the number of noun; Because this does not affect the syntactic structure analysis of natural language, make a concrete analysis of according to the value of " number of times " lattice of action and noun " quantity " attribute when reasoning, detailed disposal route slightly.
Rule 6-example arranges rule: when defined notion W, if the concept W in definition
ioccur m time, and n that occurs for this m time referring to semantic { S
1, S
2... S
n, then it is available that { W, W#1...W#n-1}, n that distinguishes W is semantic, and W#i may be interpreted as example when reasoning.
Rule 7-polysemant disposes rule: have a lot of polysemant in natural language, if polysemant W has n semantic { S
1, S
2... S
n, then for each concrete semantical definition concept, { W@1, W@2...W@n} distinguishes and represents this n different semanteme to define n concept altogether.
(2g) suppose axiom 2: inherit semantic relation and have unidirectional delivery, lower floor's concept is inherited Upper Concept and is had semantic relation.
Theorem 1. is according to inheritance R
d, all nouns form one tree.
According to regular 1-7 and definition 4-8, the semanteme of vocabulary in natural language can be defined; Suppose to represent semantic relation with a directed line segment, according to the definition Sum fanction of theorem 1 and vocabulary, the semanteme of vocabulary W can be set by noun in one group of directed line segment represent.
Step 3:
(3a) semantic matching relationship between noun-noun is defined:
Definition 9-conjunctive word collects: the set of all vocabulary comprised in nominal definition formula, uses C
rWrepresent; The conjunctive word of such as, noun W in accompanying drawing Fig. 1 collects
C
RW={W,W
P,W
r1,W
r2,W
r3,W
r4,W
r5,W
v,W
vc}
In analysis hereafter with ∝ for representing succession semantic relation, W
x∝ W
yrepresent W
xinherit in W
y, and specify W ∝ W.
(1) basic semantic matching relationship
Definition 10-direct semantics matching relationship: if vocabulary W
x, W
ymeet following condition, use symbol W
xw
yrepresent:
Condition: suppose W
yconjunctive word to collect be C
wY, then
Work as W
xw
ytime, match (W
x, W
y)=K
t* d (W
x, W
z);
K
tfor matching relationship coefficient, according to the type of mated relation R, be set to different constants, have 1≤K
t≤ 3;
Such as: { W in accompanying drawing Fig. 1
dr1, W
dvc, W
dr2, W
d2, W
dr3, W
dr4, W
dr5in each vocabulary with W, there is direct semantics matching relationship.
Definition 11-inherits semantic matching relationship: if vocabulary W
x, W
ymeet following condition, use symbol W
x//W
yrepresent:
Condition:
Work as W
x//W
ytime, match (W
x, W
y)=match (W
x, W
z)+d (W
y, W
z);
Such as: { W in accompanying drawing Fig. 1
dr1, W
dvc, W
dr2, W
d2, W
dr3, W
dr4, W
dr5and W
d1, W
d2there is succession semantic matching relationship;
Definition semantic distance function d (W
x, W
y): represent two vocabulary W with inheritance
x, W
ybetween succession number of times.
(2) semantic matching relationship is comprised
The explicit semantic relation of inclusion of definition 12-: if vocabulary W
x, W
ymeet following condition, use symbol W
x⊙ W
yrepresent;
Condition: there is concept W
z, satisfy condition
Work as W
x⊙ W
ytime, there is match (W
x, W
y)=K
p* (d (W
x, W
z), K
pfor comprising matching relationship coefficient.
Definition 13-implicit semantic relation of inclusion: if vocabulary W
x, W
ymeet following condition, use symbol W
xzero W
yrepresent;
Condition: there is concept W
z, satisfy condition
Work as W
xzero W
ytime, match (W
x, W
y)=K
p* (d (W
z, W
y)).
Definition 14-comprises semantic matching relationship: if vocabulary W
x, W
ymeet following condition, use symbol W
x◎ W
yrepresent:
Condition:
Work as W
x◎ W
ytime, match (W
x, W
y)=
min{match(W
X,W
Z)+match(W
Z,W
Y),match(W
X,W
Y)}
Theorem 3: when vocabulary WX, WY meet WX ◎ WY, WY has the semantic relation of WX;
(3b) semantic matching relationship between noun-verb
The semantic matching relationship of noun-verb can be divided into two classes:
1) SVO semantic matching relationship: the recipient of actor or action may be done in noun
2) lattice semantic matching relationship: noun and verb have lattice semantic matching relationship
Suppose that verb be gerund of executing in the definition of V, V is S
0, be O by gerund
0; Due to define time by S
0be set to the top noun may implementing V, O
0be set to the top noun bearing this action, so only have same S
0or O
0there is the noun S of certain relation and noun O just likely to perform an action V, namely form the semantic matches of SVO; SVO semantic matches has 6 kinds of situations, and it is worth available Value
sVOrepresent, computing formula is as follows:
Value
SVO=match(S,S
0)+match(O,O
0)
The conventional SVO semantic matching relationship of definition 15-: satisfy condition (S ∝ S
0) ∩ (O ∝ O
0).
Definition 16-heavy duty SVO semantic matching relationship: satisfy condition:
For noun S and O and verb V, do not meet in the definition of V SVO coupling time, and the defined declaration of S, O they meet SVO coupling;
Example: ring=R
d(ornaments) ∩ R
vS(wearing, people) ∩ R
vO(wearing, ornaments) ∩ R
c(wearing, position, hand), owing to containing R in " ring "
vS(wearing, people), so { people, wears, ring } forms heavily loaded SVO semantic matching relationship.
Definition 17-comprises SVO semantic matching relationship: satisfy condition ((S ◎ S
0) ∩ (O ∝ O
0)) ∪ ((S ∝ S
0) ∩ (O ◎ O
0));
Example: class=R
d(set) ∩ R
w(set, student), because " student " can " eat " " meal ", " student " is the part of " class ", so { class, eats, meal } forms overall SVO semantic matching relationship.
The similar SVO semantic matching relationship of definition 18-: satisfy condition ((S ∽ S
0) ∩ (O ∝ O
0)) ∪ ((S ∝ S
0) ∩ (O ∽ O
0)).
Definition 19-likens SVO semantic matching relationship: under the following conditions, may there is metaphor in conjecture statement:
Condition 1: conventional SVO semantic matching relationship, heavily loaded SVO semantic matching relationship can be met without any noun in whole statement, comprise SVO semantic matching relationship, similar SVO semantic matching relationship;
Condition 2: there is noun S or O in statement, meets! (S ∝ S
0) ∩ (O ∝ O
0), guess and liken S to S
0;
Or condition 3: there is noun S or O in statement, meet (S ∝ S
0) ∩! (O ∝ O
0), guess and liken O to O
0;
For metaphor SVO semantic matching relationship, Value
sVO=K
f* (match (S, W
p)+match (O, W
p))
K
ffor weights coefficient, W
ps and S
0most recent co mmon ancestor.
Definition 20-lattice semantic matching relationship: for noun W and verb V, meet
Match (W, V)=K
c* d (W, W
c), K
cfor weights coefficient.
(3c) semantic matching relationship of noun-adjective
For adjective W
vAwith noun W
n, meet
Match (W
vA, W
n)=K
a* d (W
n, W), K
a=1, be weights coefficient;
(3d) semantic matching relationship arranged side by side
Semantic matching relationship arranged side by side only for the judgement of parallel construction in statement, to determine the scope of conjunction.
Definition 21-semantic similitude: because nominal definition have employed single method inherited, two noun W
x, W
yalthough there is no inheritance in definition, at semantically W
xbut may be W
yone, the concept be equivalent in description logic contains, and uses symbol W
x∽ W
yrepresent; Tableau algorithm in description logic can be improved, to judge Concept Semantic similarity relation.
Definition 22-noun semantic matching relationship arranged side by side: for two noun W
x, W
yavailable match (W
x, W
y)=K
t* (d (W
x, W
e)+d (W
y, W
e)) calculate a numerical value, as heuristic information, W
ew
x, W
ynearest common ancestor's node; When meeting W
x∽ W
ytime, also may be coordination.
Definition 23-verb semantic matching relationship arranged side by side: for two verb V
x, V
yavailable match (W
x, W
y)=K
t* (d (S
x0, S
y0)+d (O
x0, O
y0)) calculate a numerical value, as heuristic information, { S
x0, S
y0, O
x0, O
y0w
x, W
yactor in definition and by dynamic person.
(3e) semantic matching relationship between other class vocabulary
The semantic matching relationship of adverbial word: about adverbs modify adjective and adverbial word, also has very complicated situation, wouldn't discuss herein, suppose adverbial word can semantic matches in verb, adjective and adverbial word, specify match (W
1, W
2the semantic matching relationship of)=0. measure word: if lexicon should preserve the incidence relation of measure word and noun. measure word W can modification noun W
n, then match (W, W is specified
n)=0; Otherwise match (W, W
n)=MAX; The semantic matching relationship of pronoun: refer to relation according to pronoun, replaces to corresponding noun and processes by pronoun, such as " I " by " people " process.
(3f) syntactic match relation
Pay special attention to: various semantic matching relationship are above inherent, have nothing to do with concrete statement. in concrete statement, the vocabulary of some type possible is modified mutually, but semantic relation not inherent between vocabulary itself, just may there is a kind of grammatical phenomenon of semantic modified relationship in this statement, mainly comprise following two kinds of situations:
(1) modified relationship between uncommon part of speech: between verb-verb; Between adverbial word-noun; Between adjective-verb etc.; Such as " like swimming " " to be frank " etc.; These all belong to syntactic match relation, do not have inherent semantic matching relationship, just in statement, have phraseological modified relationship between vocabulary itself; In Sentence analysis process, its semantic matches is worth available match (W
x, W
y)=MAX/K
gcalculate, K
gtype weights (generally K
g< 1.5).
(2) the flexible conversion of parts of speech, such as adjective often can be applied flexibly as adverbial word, and this situation is not considered herein.
Step 4:
(4a) three levels and the syntax thereof of semantic structure are defined
Sentence analysis to be carried out according to semantic model herein, the statement abstract representation method of applicable semantic model must be had; Any statement is all formed through iteration by statement relatively simple for structure, and phrase is seen as an ingredient in statement; In order to meet the semantic analysis needs of semantic model, the semantic structure of statement can be divided into three levels according to the complexity of semantic structure and feature: simple sentence, special simple sentence, complex sentence.
Definition 24-simple sentence: only have a verb or adjective to make the statement C of predicate
s, available grammar G
1carry out abstractdesription;
By the thought design grammar G of case grammar
1, mentality of designing: suppose that V is predicate, S is the actor of V; O is the recipient of V, A
bit is preposition attribute; A
ait is postpositive attributive; P
dbe the adverbial modifier or complement, be equivalent to one group of lattice in case grammar; P
cit is the lattice content of; N is noun; N
pfor noun phrase;
Grammar G
1in the mentality of designing of key rule as follows:
1) C
s→ P
da
bsA
ap
dvP
da
boA
ap
d, the appearance order of SVO has 10 kinds, and accompanying drawing Fig. 2 is one wherein;
2) S → n|SA
aa
bs, actor done in multiple vocabulary, as the S in Fig. 2;
3)P
D→P
C|P
DP
C
S, O, A
b, A
a, P
cin the service regeulations of the vocabulary such as preposition, conjunction, auxiliary word, number, measure word can write out easily.
Definition 25-special simple sentence: there is multiple verb or adjective, but semantically not comprise the statement of subordinate clause, available grammar G
2carry out abstractdesription;
Grammar G
2mentality of designing: guarantee can not produce on the basis of subordinate clause, to grammar G
1the few rule of middle interpolation can generating grammar G
2, mainly contain following 2 kinds of situations:
1) situation of predicate made in multiple verb or adjective
2) S, O, A made in verb or adjective
b, A
a, P
csituation
Grammar G
2key be verb phrase V
vnoun phrase N directly can not be followed in front and back
p, namely can not there is N
p+ V
vor V
v+ N
p.
Definition 26-complex sentence: in grammar G
2the regular N of middle interpolation
p→ C
s, form grammar G
3; Because regular N
p→ C
sillustrate that a simple sentence or special simple sentence can make any composition in a complex sentence, achieve simple sentence recurrence, therefore grammar G
3complex sentence can be described.
(4b) thinking of best grammatical analysis scheme is obtained
(1) lexical ambiguity digestion procedure
Suppose W
1w
2... W
klexical semantic number be respectively n
1, n
2... n
k, carry out fully intermeshing for each semanteme, result is { L
1, L
2... L
m, then M=n
1* n
2* ... * n
k, suppose one of them
w
mn-th meaning of a word, then L
ic
sa sequence of words without lexical ambiguity; Exhaustive each { L in parsing process
1, L
2... L
manalysis result, select best L
ijust lexical ambiguity can be cleared up.
(2) analytical mathematics
According to axiom 1, obtain all grammatical analysis schemes, for each grammatical analysis option A
i, calculate A according to formula 1
icorresponding semantic matches value, and select best grammatical analysis scheme.
The simple clause of definition 27-: meet grammar G in statement
1or G
2substring be simple clause;
Suppose that target axiom modified in axiom 4-semanteme: supposing that target modified in the semanteme of notional word W is W
gi, then for the simple clause C meeting semantic logic in statement
s, meet (W ∈ C
s) → (W
gi∈ C
s); For the attribute A of next-door neighbour S
b, meet (W ∈ A
b) → (W
gi∈ (A
b∪ S)), similar process can be done in the attribute of next-door neighbour O; For the adverbial modifier or complement P
d, meet (W ∈ P
d) → (W
gi∈ (P
d∪ V)).
According to semantic feature of modifying target, all grammatical analysis schemes can be divided into 2 layers:
1) simple clause's level grammatical analysis scheme;
2) the grammatical analysis scheme of simple clause inside.
(4c) best grammatical analysis scheme is obtained
(1) Rule of judgment of simple clause can be summed up
For statement C
s, carry out grammar G
1, G
2, G
3cYK Algorithm Analysis, the substring s (i, j) meeting table 1 conditional is the simple clause that can sum up;
Table 1 can sum up the Rule of judgment of simple clause
(2) bottom-up simple clause sums up method
Available bottom-up simple clause sums up method and asks for best subordinate clause level grammatical analysis scheme, sees algorithm 4:
The simple clause of algorithm 1-sums up method:
1) for statement C
s, according to the Rule of judgment of table 1, find out the substring set { s that can sum up corresponding to simple clause
1, s
2... s
m;
2) for each clause s
i, calculate (algorithm 2) simple clause s
ibest semantic matches value, by s
ibe summed up as N
p, N is set
pend semantic;
3) C is made
sequal to sum up result, by the simple clause s in recursive procedure
ithe summation of best semantic matches value, carry out the recurrence of step 1-3;
4) there is best complete simple clause s corresponding to sentence semantic matches value
iscope and end order be best grammatical analysis scheme.
The best semantic matches value calculating simple clause is the key of algorithm, and concrete grammar is shown in algorithm 2;
In the algorithm, have employed the method for exhaustion when simple clause selects, theoretic best grammatical analysis scheme can be obtained; But the calculated amount of this method is comparatively large, not easily realizes; When clause's quantity can be returned too much, can only from can the good simple clause of several semantic matches degree be selected to carry out recursive search, to ask for suboptimum grammatical analysis scheme apodosis.
(3) semanteme is summed up
In algorithm 1, by simple clause C
sbe summed up as N
pafter, N
pdo not have semanteme, the semantic matches cannot carrying out next step calculates, and the mode of solution is as follows:
1) regulation carrys out N by summing up
pcan be matched with any vocabulary W, semantic matches value is:
Match (N
p, W) and=MAX/K
c(generally there is K
c> 1)
2) if N
pmake S or O of new goal clause, then can by N
psemanteme be set to former C
sin S or O.
(4d) the best semantic matches value of simple clause is obtained
Calculate the best semantic matches value of simple clause, according to axiom 1 and axiom 4, there is multiple grammatical analysis scheme simple clause inside, all grammatical analysis schemes must be obtained, for often kind of grammatical analysis scheme, the semanteme of its notional word is modified target and is determined, just can calculate the semantic matches value under this grammatical analysis scheme according to formula 1, the grammatical analysis scheme with minimum semantic matches value is exactly required analysis result;
The grammatical analysis scheme of simple clause inside can be divided into 3 layers: 1) SVO combination level; 2) A
a, P
d, A
blevel; 3) A
a, P
d, A
binner grammatical analysis scheme; Wherein best grammatical analysis scheme is selected by algorithm 2;
The best semantic matches value of the simple clause of algorithm 2-:
1) if simple clause is special simple sentence, all methods being summed up as simple sentence are found
2) for often kind of resolution principle, special simple sentence is summed up as simple sentence
3) for this simple sentence, all possible SVO combined method is found out
4) for often kind of SVO combined method, by C
sbe segmented into { L
1, L
2..L
n; If S or O is phrase, then carry out algorithm 3
5) each segmentation L
iinside A can be comprised at most
a, P
d, A
bthree partial contents, find out L
iin all A
a, P
d, A
bdivision methods
6) for often kind of A
a, P
d, A
bdivision methods, the means combined by syntax and semantics the matching analysis, are determined that target modified in the semanteme of each notional word, make A
a, P
d, A
bsemantic matches value minimum
7) ask for the semantic matches value of full sentence, select the minimum corresponding analytic process of semantic matches value as the grammatical analysis scheme of the best
Suppose
for simple clause C
scarry out grammar G
1the operation result of CYK algorithm, expression can generate the grammar symbol collection of substring s (i, j).
(1) the grammatical analysis scheme of SVO combination level
In simple sentence, suppose noun W
1and W
2meet SVO with verb V to mate, then { W
1, V, W
2a SVO combination, but S or O may be a phrase, when there is { W in sentence
1, V, W
3and { W
2, V, W
3sVO coupling, and V, W
3not at W
1and W
2centre, and W
1and W
2middle substring s (m, n) meets:
time, at W
1+ s (m, n)+W
3the phrase formed is S, in like manner can obtain longer S or O.
Algorithm 3-S or O segmentation:
1) obtain phrase S or O, find out in S all nouns meeting SVO coupling, be assumed to be { n
1, n
2..n
m}
2) according to { n
1, n
2..n
mphrase S is divided into m-1 section, according to regular S → n|SA
aa
bs is known each not for empty segmentation may comprise A
aa
b
(2) A
a, P
d, A
bthe division methods of level
Suppose segmentation L
isubstring be s (m, n), then meet
p, q is grammatical A
a, P
d, A
bdivision methods, segmentation result is: A
a=s (m, p), P
d=s (p, q) A
b=s (q, n);
(3) A
a, P
d, A
binner best grammatical analysis scheme
Theorem 2:A
a, P
d, A
binner best grammatical analysis scheme works as A
a, P
d, A
bthe grammatical analysis scheme that interior each notional word is corresponding under having best semantic modification target conditions;
Definition 28-simple noun phrase: do not comprise verb and adjectival noun phrase is exactly simple noun phrase;
Theorem 3: the attribute A of simple sentence
aor A
bbest grammatical analysis many equivalents in the best grammatical analysis scheme of a simple noun phrase.
Prove: only comprise a verb due to simple sentence or predicate V made in adjective, therefore A
a, A
bin do not comprise verb and adjective, according to axiom 4, A
a, A
bgrammatical analysis many equivalents in simple noun phrase N
pgrammatical analysis scheme, N
p∈ { (A
b+ S), (A
a+ S), (A
b+ O), (A
a+ O) };
Simple noun phrase N
pdifferent grammatical analysis schemes only by conjunction, preposition, auxiliary word, measure word impact; The key of grammatical analysis is selected conjunction, preposition, auxiliary word, the scope of measure word and their end order;
The determination of A, scope: in conjunction, preposition, auxiliary word, measure word, suppose w
bfor preposition type, w
mfor preposition type, then its scope can be summed up as two kinds of forms 1) ..N
bn..N
b1... w
m..N
a1..N
am..; 2) w
b..N
bn..N
b1... w
m..N
a1..N
am...; Wherein { N
b1, N
b2... N
bnthe noun of first half in scope, { N
a1, N
a2... N
amit is the noun of latter half in scope;
Custom modified in backward semanteme according to Chinese, can at { N
a1, N
a2... N
amin find out grammatical and N
b1there is the noun N of best semantic matches value
ajas boundary after scope; The scope prezone of form 1 can be determined by similar method;
(2) determination of end order: conjunction, preposition, auxiliary word, measure word and scope thereof should be summed up by certain grammar rule, the available method of exhaustion obtains their the best end order; Generally, after simple sentence statement to carry out repeatedly segmentation, A
a, A
bin comprise conjunction, preposition, auxiliary word, measure word number n be generally less than 4, there is calculating feasibility.
Definition 29 (noun sequences): do not exist conjunction, preposition, auxiliary word, measure word simple noun phrase be noun sequence.
After conjunction, preposition, auxiliary word, measure word are all summed up, simple noun phrase has just been known as a noun sequence, and conjunction, preposition, auxiliary word, scope of quantifier inside may also exist one or two noun sequence in addition; In noun sequence, only noun affects semantic modified relationship, and custom modified in the backward semanteme according to Chinese, supposes that noun sequence is L
n=W
1w
2... W
m; Then determine L by semanteme
nin arbitrarily noun to modify target concrete grammar (approximation method) as follows:
Target modified in the best semanteme of algorithm 4-noun sequence:
Set C is set
wfor sky, for L
nin each noun W
iif, match (W
i, W
m) < MAX, by W
ibe added to C
w, do to operate as follows:
1) C is supposed
welement be W by order successively
1-W
2-...-W
n, be then done as follows: by L
nbe divided into n+1 section, the semanteme modification target arranging them is W
m, and recurrence is carried out to each section;
2) C is worked as
win when only having a noun, carry out step 3) and step 4);
3) forward direction modified relationship is set: for any segmentation, if there is W
xw
x+1... W
y-1w
y, satisfy condition: 1. any W
x+1... W
ybetween noun and W
yafter the semantic matches value of noun be MAX; 2. match (W
y, W
x) < MAX; Then W is set
ysemanteme modify target be W
x; Then W is set
x+1... W
y-1between the modification target W of noun
y;
4) if L
nin also have noun W
ydo not modify target, then arranging its modification target is W
y+1.
P
danalytical approach be similar to A
a, A
b, key carries out boundary line delimitation according to preposition, the content in preposition scope is also converted into a simple noun phrase; Due to comparatively loaded down with trivial details, do not discuss in detail here.
(4) disposal route of special simple sentence
Sum up all Non-Definite Verb/adjectives, statement is converted into simple sentence, select best end scheme, disposal route is as follows:
1) statement is carried out to the CYK algorithm of grammar G 2, find the verb or Adjective Phrases that likely do predicate, may kinds of schemes be had.
2) for each scheme, sum up remaining verb (or adjective), choose semantic matches and be worth minimum analytical plan.
Also need when summing up Non-Definite Verb or adjective to arrange to sum up semanteme, such as " beautiful necklace ", summing up semanteme is " necklace ", does not discuss in detail here.
Step 5:
According to the grammatical analysis result with best semantic matches value, simple sentence can be converted into a knowledge point, each simple clause of complex sentence is converted into knowledge point, and whole complex sentence is converted into one group of knowledge point.
Example: " moulding is that the green bronze ornaments of insect are welcome by Brazilian girl very much to statement." knowledge five-tuple, in table 2:
The more educated example of table 2 statement
After statement being converted into the knowledge point of depositing with structural data form, just can carry out various Intelligent Information Processing to these knowledge datas.
Above-described embodiment is only for the invention example is clearly described, and the restriction not to the invention embodiment.For those of ordinary skill in the field, can also make other changes in different forms on the basis of the above description.Here exhaustive without the need to also giving all embodiments.And thus the apparent change of amplifying out or variation be still among the protection domain of the invention claim.