CN1588537A

CN1588537A - Method for semantic analyzer bead on grammar model

Info

Publication number: CN1588537A
Application number: CNA2004100667933A
Authority: CN
Inventors: 朱杰; 熊英
Original assignee: Shanghai Jiaotong University
Current assignee: Shanghai Jiaotong University
Priority date: 2004-09-29
Filing date: 2004-09-29
Publication date: 2005-03-02
Anticipated expiration: 2024-09-29
Also published as: CN1310171C

Abstract

The invention is a method of establishing semantic analyzer based on syntactic model in the field of intelligent information processing, using high-layer semantic information of a telephone dial system to establish a syntactic model, and applying the syntactic model to semantic analysis, automatically splitting phonetic stream, organically combining phonetic-Chinese character conversion with voice analysis, including two aspects of syntactic model establishment and semantic analysis. It advances a method of using high-layer semantic information in a syntactic model to split the phonetic stream. It is a semantic analyzer able to eliminate ambiguity and split sentences. It can perfectly analyze the semantic information of the sentences in or out of the grammatical rule.

Description

Foundation is based on the method for the semantic analyzer of syntactic model

Technical field

The present invention is a kind of method of setting up semantic analyzer that relates to the intelligent information processing technology field, is specifically related to the method for a kind of foundation based on the semantic analyzer of syntactic model.

Background technology

Typical conversational system is made up of modules such as speech recognition, natural language understanding, dialogue management, natural language generation, phonetic syntheses.The research of natural language understanding module is of long duration, and studies main flow at present for to realize with rule-based language understanding method, promptly carries out syntax-semantic parsing according to the existing syntax, and its basis is a Formal Language Theory.Common natural language understanding module mainly uses context-free grammar CFG (Context Free Grammar) to describe and the analyzing and processing written word.Spoken its characteristics of having compared with written word: the sentence formula is simpler, and syntactic structure is more random, often with multiple conversational language phenomenon (repeat, revise, refer to, omission etc.).And Chinese characters spoken language is compared with Oral English Practice, and the sentence formula is more flexible, and word order is more random.Traditional CFG syntax are difficult to represent effectively the many phenomenons in Chinese characters spoken language ground.And the spoken dialogue system that relates to voice also comprises noise, ambiguous, pet phrase, eats the spoken voice of sound, change of tune or the like now, so spoken dialog is the difficult point that natural language understanding technology is realized, but also is the key point of application system applicability.In the spoken language, people's language is very random, can omit, corrigendum, flashback or the like, and these spoken phenomenons are traditional only insoluble based on the understanding system institute of lexical analysis, can solve well and introduce based on crucial semantic technology.

Because above all factors, fully the speech recognition device based on the CFG of syntax rule is fragile, find by literature search, Hacioglu, people such as K are at " Acoustics, Speech and Signal Processing, 2001 IEEE " Volume:1,2001 Page (s): " the Dialog-context dependent language modeling combining n-grams andstochastic context-free grammars Acoustics; Speech; and SignalProcessing; 2001.Proceedings (" IEEE acoustics; voice and signal Processing "; " in conjunction with the language model based on conversational system of the N unit syntax and context-free grammar ") proposes speech recognition device and will utilize bi-gram (bigram) language model and context-free grammar (CFG) simultaneously in the literary composition that delivers on the 537-540vol.1.Because can not get semantic information from the N unit syntax (N-gram) language model, a kind of solution is to utilize language model to choose the highest optimal path of score.But for the application system of specific area (as: weather, flight, lodging, traffic, tourism, air ticket, train ticket are ordered or the like), such method is not optimum, because do not make full use of the high-layer semantic information in these fields.

Summary of the invention

The objective of the invention is at the deficiencies in the prior art, be primarily aimed at the phone automatic switching system of phonetic dialing, propose a kind of method of setting up based on the semantic analyzer of syntactic model, make that the Chinese-character phonetic letter after the speech recognition is changed, and guarantee that conversion method reaches optimum.

The present invention is achieved through the following technical solutions, the present invention has made full use of the high-layer semantic information of dialing system, set up syntactic model, and this syntactic model is applied to semantic analysis, automatic segmentation phonetic stream, Chinese-character phonetic letter conversion and semantic analysis are combined, comprise two aspects of foundation, semantic analysis algorithm of syntactic model:

(1) described syntactic model, be a notion transfer network that has weight, representing the transfer between notion and notion, whole grammer is made up of syntax rule in layer, the high-layer semantic information of having represented the dialing system has constituted the semantic concept transfer network BSCTN of bigram.Transfer between notion is stipulated by the syntax rule in the syntactic model.Each notion in the syntactic model is called " grammar concept ", and each grammar concept is corresponding to grammatical attribute in each layer.Make up the grammer of getting up like this, expression is flexible, and clear concept is realized simple.

(2) described semantic analysis algorithm mainly is three row's fork rules that are applied in " dial system ":

Row's fork rule one: according to syntactic model BSCTN, use loose syntax rule (loose grammar) G0, whole sentence is analyzed, get rid of the sentence that does not meet syntax rule.

The syntax rule of " loose " is defined as: allow to connect the speech that exceeds dictionary and exceed syntax rule after each grammar concept, the grammatical attribute that defines these speech is-1, and can be transferred in other all grammar concept in the sub-grammer layer and go.

S^{*} = \underset{S_{i} &Element; G 0}{Y} S_{i}

S _iFor carry out the possible path of i bar of cutting, S according to loose syntax rule G0 ^*Be cutting route optimum in all cutting route.

Using loose syntax rule is in order to analyze the sentence that exceeds syntax rule neatly.With ' 0 ', ' 1 ' the life and death state of representing every paths, ' 0 ' expression ' extremely ', ' 1 ' represents ' life '.After the row's of application fork rule one,, illustrate that this sentence does not meet syntax rule if the state of all cutting route is ' 0 ' all.Then the state in all paths is changed into ' 1 ' by ' 0 ', continue application rule two, three and arrange fork.

For example: ask the qing[modal particle]-dial bo[to dial]-dial bo[to dial]-five wu[numerals]-four si[numerals]

Do not transfer to ' dialling ' owing to do not allow notion ' to dial ' in the syntax rule, so according to syntax rule G0, the state of all cutting route all is ' 0 ', as seen this sentence does not meet syntax rule.For the row's of utilization fork rule, extract relevant semantic information, the state of all cutting route is changed into ' 1 ', the rule row fork below using again.

Row's fork rule two: longest match principle.For ' 1 ', the order of the state in all the other paths is for ' 0 ' the order of the state of the cutting route that contains minimum grammar concept (minimum pinyin word).By the calculation syntax mark, choosing the minimum path of grammer mark is optimal path.

S^{*} = \underset{s}{\arg \min} score (s) = \underset{s}{\arg \min} score ({ph}_{1} K {ph}_{n}) = \underset{s}{\arg \min} Σ_{i = 1}^{n} score ({ph}_{i})

S is the path that cutting obtains, and score (s) is used for obtaining the grammer mark in this path, score (ph _i) obtain the grammer mark of present node.If the grammatical attribute of pinyin word is-1, then the grammer mark is 10, otherwise the grammer mark is 1.10 and 1 has differed an order of magnitude, is enough to the quality in path is distinguished.

Experiment showed, that application rule two can get rid of a large amount of ambiguity paths fast.

Row's fork rule three: grammatical sentence has obtained optimum explanation, but if contain the speech that exceeds dictionary or exceed syntax rule in the statement, then may also exist many cutting route after handling through above-mentioned row's fork, further analyze.

Strict syntax rule G1 is defined as: grammatical attribute does not allow it to transfer to other notion for-1 notion is left out.

Analyze each state and be ' 1 ' cutting route, skip grammatical attribute and be-1 speech, save the pinyin word (speech of continuous identical grammatical attribute is only got) of the same syntax of figs attribute of repetition.Syntax rule G1 calculation syntax mark according to strictness.Choosing the minimum path of score then is optimal path.

S^{*} = \underset{G 1}{\arg \min} score (s) = \underset{G 1}{\arg \min} Σ_{i = 1}^{n} score ({ph}_{i} | {ph}_{i - 1}, K {ph}_{1}, {ph}_{0})

S is the path that cutting obtains, and score (s) obtains the grammer mark in this path, score (ph _i| ph _I-1, K, ph ₁, ph ₀) obtain the grammer mark that in the past phase of history node is transferred to present node.If this shifts grammaticality G1, then the grammer mark is 1, otherwise the grammer mark is 10.10 and 1 has differed an order of magnitude, is enough to the quality in path is distinguished.

Below the inventive method is further described, step is as follows:

1,, sets up " grammar concept " of dial system according to the syntactic features of dialing system;

2,, set up the syntax rule of the high-layer semantic information that comprises dial system by " grammar concept ".

3, according to syntax rule and " grammar concept ", set up the stratificational grammar model.

(1) ground floor is total grammer layer (grammar-all), and the system that controlling is in big transfer aspect semantic.

(2) second layer is subject method (grammar), is controlling the transfer between the notion in total grammer layer.

(3) the 3rd layers is sub-grammer (sub-grammar), has stipulated the formation of the grammar concept that defines in subject method layer.

(4) the 4th layers is speech layer (phrases), and corresponding to the set of the concrete speech of notion in the sub-grammer layer, the speech in the identity set has identical semantic information.

(5) layer 5 is word (character) layer, is representing how word forms speech.

(6) layer 6 is syllable (syllable) layer, and the syllabary that each band is transferred is shown consonant, vowel structure (initial-final), and these recognition units (initials, finals) are called phoneme (phonemes).

4, set up the classificating word dictionary.

5, search the classificating word dictionary, carry out the complete trails coupling, obtain all possible cutting route of phonetic stream, the information that each node in the path comprises is pinyin word and corresponding grammatical attribute thereof.

6, write down " life and death " situation of every paths with a variable, ' 1 ' is ' life ', and ' 0 ' is ' extremely '.

7, judge whether to exist many cutting route, then forward step 8 to if exist, otherwise forward step 9 to.

8, use three divergent rules of row successively and get rid of the cutting route that violates grammar, up to only staying an optimal path;

9,, search syntactic category speech dictionary and just can successfully convert pinyin word to Chinese word according to last cutting result.

In the 8th step, need to prove: when the row's of application fork rule is got rid of the cutting route that violates grammar, because grammer is a hierarchical model, so judging that current node is whether during grammaticality, consider several historical nodes of front of present node, searching route judges whether current sub-grammer layer is covered, if cover and then get back to upper strata subject method layer, seek follow-up subject method layer notion; Otherwise the node of current sub-grammer layer moves one backward, after the set of all descendant nodes that obtain historical path, judges whether present node belongs to the node in the set.

The present invention proposes a kind of method of utilizing the high-layer semantic information in the syntactic model to come cutting phonetic stream, this is a kind of semantic analyzer that can get rid of the ambiguity partition statement.This analyzer can both analyze semantic information well to sentence in the syntax rule and the sentence that exceeds syntax rule.Utilize the syntactic model that contains high-layer semantic information proposed by the invention organically to combine Chinese-character phonetic letter conversion and semantic analysis efficiently, and the syntactic structure of this layering realizes simple, the operational efficiency height, it is strong to arrange divergent ability, can be used for simple man-machine interactive system.

Description of drawings

Below in conjunction with drawings and Examples the present invention is described in further detail:

Six layers of structural drawing in Fig. 1 syntactic model of the present invention

Among the figure, solid arrow direction (from left to right) has been represented the path direction when prediction algorithm launches in the grammer path in the speech recognition, the path direction when on behalf of the grammer path, the dotted arrow direction recall.

The searching route example of Fig. 2 semantic analysis algorithm of the present invention

Among the figure, sub-grammer node layer s1 is gone in search from subject method node layer g1 toward lower floor.Transfer to s2 again to s3 from node s1.After node s3 covers, will get back to the upper strata, g1 transfers to g2 as can be known, gets back to sub-grammer layer and obtains node s4, so forward s4 to from s3.

The effect synoptic diagram of Fig. 3 semantic analyzer of the present invention.

Embodiment

Below in conjunction with the explanation of " dialing " statement example in " dial system ", understand technical scheme of the present invention better.

The special-shaped word of unisonance of considering Chinese is a lot, and well-behaved parser is that unit flows cutting to phonetic with the speech, and the phonetic of speech is called " pinyin word ".

According to statistics, the unisonance allograph of Chinese is a lot, the individual character pinyin word is average corresponding 12 Chinese characters.But two word pinyin word are on average corresponding to 1.46 Chinese words; The speech that three words or three words are above, phonetic and Chinese word almost are one to one.As seen it is more much smaller than single unisonance allograph possibility occurring the unisonance homophone to occur, is a kind of efficient ways so utilize speech for unit changes the phonetic circulation into Chinese sentence.

Embodiment

1,, sets up " grammar concept " of dial system according to the syntactic features of dialing system.For example:, set up " grammar concept " of " dial system " according to the syntactic features of " dial system ".As [beginning], [modal particle], [dialling], [name], [place name], [phone], [numeric string] etc.

2,, set up the syntax rule of the high-layer semantic information that comprises the dialing system by " grammar concept ".For example: [beginning]-[modal particle] | [dialling], [dialling]-[name] | [place name] | [numeric string] ... ..

3, according to syntax rule and " grammar concept ", set up the stratificational grammar model.For example: the syntactic model of " dial system " of our exploitation has mainly utilized wherein three layers of the stratificational grammar model introduced previously.

Subject method layer: beginning-modal particle | dial, dial-name | numeral | phone

Sub-grammer layer: dial=dial name=surname+appellation | surname+name | surname+name+name

The speech layer: dial=[dial, change, switching is transferred to, and connects, and beats ... ]

4, set up the classificating word dictionary.As [# dial (dial, change, connect, switching ...) # modal particle (please, trouble, excuse me ...) the # numeric string (0,1,2,3...) # surname (Zhao, money, grandson, Lee ...) # appellation (sir, Ms, Miss, teacher ...) ... ]

8, the divergent rule of the row of application is got rid of irrational cutting route successively, up to only staying an optimal path.

For example phonetic flows: jie tong ba ba.

Connecting jie-tong[dials]-eight ba[numerals]-eight ba[numerals].

Meeting jie[dials]-logical tong[-1]-father ba-ba[appellation].

Connecting jie-tong[dials]-father ba-ba[appellation].

The row's of application fork rule one, cutting route iii. is not excluded owing to meeting G0. and path i. and ii. have been saved, and can obtain optimal path according to rule two is i..

According to above step, " dialing " statement in " dial system " is tested.7 subject method layer notions are arranged, 10 sub-grammer layer notions, 200 of entries in this model.Test set 1 is 200 of grammatical phonetic test statements, and test set 2 is 60 test statements that do not meet syntax rule.

Experiment 1: when segmentation rules 1 adopts the syntax rule G1 of loose syntax rule G0 and strictness respectively, respectively the cutting route number is added up.

Table 1: cutting route is added up after adopting different syntax rules

Test set	Grammer in row's fork rule 1	The complete trails coupling	After the application rule 1	After the application rule 2
Test set	Grammer in row's fork rule 1	The complete trails coupling	After the application rule 1	After the application rule 2	Test set 1	G0	4656	327	200
G1	4656	228	200			G0	4656	327	200
G1	4656	228	200	Test set 2		G0	471	246	168
G1	471	426	218			G0	471	246	168

By table 1 as seen, for grammatical test set 1, employing G1 is better than G0 work; But, adopt G0 better than G1 for the test set 2 that exceeds grammer.Taken all factors into consideration above-mentioned two kinds of situations, this method has adopted grammer G0 in row's fork rule 1.

Experiment 2: added up and arranged the effect of divergent regular 1-2-3 for row's fork, the data in the table 2 are number of path.

Table 2: row's fork effect of rule

Test set	The complete trails coupling	After the application rule 1	After the application rule 2	After the application rule 3
Test set	The complete trails coupling	After the application rule 1	After the application rule 2	After the application rule 3	Test set 1	4656	327	200	-
Test set 2	471	246	168	66	Test set 1	4656	327	200	-

1 of test set has been used the divergent regular 1-2 of row, and test set 2 has been used regular 1-2-3.As seen row's fork effect of rule 1 and rule 2 is very big.The good results are evident for the row of test set 2 fork for rule 3.

Experiment 3: after statistics is handled through cutting and row's fork, the speech under phonetic level and the Chinese level and the semantic tagger accuracy of sentence.

Table 3: the semantic tagger accuracy of speech and sentence

Test set	The phonetic level		The Chinese level
	The phonetic level		The Chinese level		Pinyin word	The phonetic sentence	Chinese word	Middle sentence
	Test set 1	100	100	97.5	Pinyin word	The phonetic sentence	Chinese word	Middle sentence	95
Test set 2	Test set 1	100	100	97.5	95	90	92	85	95

By table 3 as seen, this semantic analyzer is fine to grammatical sentence work, for the sentence that does not meet syntax rule, stronger analytic function is arranged also.

Claims

1, a kind of method of setting up based on the semantic analyzer of syntactic model, it is characterized in that, utilize the high-layer semantic information of dialing system, set up syntactic model, and this syntactic model is applied to semantic analysis, automatic segmentation phonetic stream combines Chinese-character phonetic letter conversion and semantic analysis, comprises two aspects of foundation, semantic analysis algorithm of syntactic model:

(1) described syntactic model, it is a notion transfer network that has weight, representing the transfer between notion and notion, whole grammer is made up of syntax rule in layer, the high-layer semantic information of having represented the dialing system has constituted the semantic concept transfer network BSCTN of binary, and the transfer between notion is stipulated by the syntax rule in the syntactic model, each notion in the syntactic model is called " grammar concept ", and each grammar concept is corresponding to grammatical attribute in each layer;

Row's fork rule one: according to syntactic model BSCTN, use loose syntax rule G0, whole sentence is analyzed, get rid of the sentence that violates grammar;

Row's fork rule two: longest match principle, for ' 1 ', the order of the state in all the other paths is for ' 0 ' the order of the state of the cutting route that contains minimum grammar concept, and by the calculation syntax mark, choosing the minimum path of grammer mark is optimal path;

Row's fork rule three: grammatical sentence has obtained optimum explanation, but if contain the speech that exceeds dictionary or exceed syntax rule in the statement, then may also exist many cutting route after handling through above-mentioned row's fork, analyze by hand and judge.

2, foundation according to claim 1 is based on the method for the semantic analyzer of syntactic model, it is characterized in that, described loose syntax rule, be defined as: allow to connect the speech that exceeds dictionary and exceed syntax rule after each grammar concept, the grammatical attribute that defines these speech is-1, and can transfer in other all grammar concept in the sub-grammer layer and go.

3, foundation according to claim 1 is characterized in that based on the method for the semantic analyzer of syntactic model, below by step it is done further to limit:

(1), sets up " grammar concept " of dial system according to the syntactic features of dialing system;

(2) set up the syntax rule of the high-layer semantic information comprise the dialing system by " grammar concept ";

(3) according to syntax rule and " grammar concept ", set up the stratificational grammar model;

(4) set up the classificating word dictionary;

(5) search the classificating word dictionary, carry out the complete trails coupling, obtain all possible cutting route of phonetic stream, the information that each node in the path comprises is pinyin word and corresponding grammatical attribute thereof;

(6) write down " life and death " situation of every paths with a variable, ' 1 ' is ' life ', and ' 0 ' is ' extremely ';

(7) judge whether to exist many cutting route, then change step (8), otherwise change step (9) if exist;

(8) use three divergent rules of row successively and get rid of the cutting route that violates grammar, up to only staying an optimal path;

(9) according to last cutting result, search syntactic category speech dictionary, pinyin word is converted to Chinese word.

4, foundation according to claim 3 is characterized in that based on the method for the semantic analyzer of syntactic model, in the step (3), sets up the stratificational grammar model, and is specific as follows:

(1) ground floor is total grammer layer, and the system that controlling is in big transfer aspect semantic;

(2) second layer is the subject method, is controlling the transfer between the notion in total grammer layer;

(3) the 3rd layers is sub-grammer, has stipulated the formation of the grammar concept that defines in subject method layer;

(4) the 4th layers is the speech layer, and corresponding to the set of the concrete speech of notion in the sub-grammer layer, the speech in the identity set has identical semantic information;

(5) layer 5 is the word layer, is representing how word forms speech;

(6) layer 6 is a syllablic tier, and the syllabary that each band is transferred is shown the consonant, vowel structure, and these recognition units are called phoneme.

5, foundation according to claim 3 is based on the method for the semantic analyzer of syntactic model, it is characterized in that, in the described step (8), when the row's of application fork rule is got rid of the cutting route that violates grammar, because grammer is a hierarchical model, so judging that current node is whether during grammaticality, consider several historical nodes of front of present node, searching route, judge whether current sub-grammer layer is covered, if cover and then get back to upper strata subject method layer, seek follow-up subject method layer notion; Otherwise the node of current sub-grammer layer moves one backward, after the set of all descendant nodes that obtain historical path, judges whether present node belongs to the node in the set.