CN1588537A - Method for semantic analyzer bead on grammar model - Google Patents

Method for semantic analyzer bead on grammar model Download PDF

Info

Publication number
CN1588537A
CN1588537A CNA2004100667933A CN200410066793A CN1588537A CN 1588537 A CN1588537 A CN 1588537A CN A2004100667933 A CNA2004100667933 A CN A2004100667933A CN 200410066793 A CN200410066793 A CN 200410066793A CN 1588537 A CN1588537 A CN 1588537A
Authority
CN
China
Prior art keywords
layer
semantic
grammar
grammer
rule
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2004100667933A
Other languages
Chinese (zh)
Other versions
CN1310171C (en
Inventor
朱杰
熊英
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Jiaotong University
Original Assignee
Shanghai Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Jiaotong University filed Critical Shanghai Jiaotong University
Priority to CNB2004100667933A priority Critical patent/CN1310171C/en
Publication of CN1588537A publication Critical patent/CN1588537A/en
Application granted granted Critical
Publication of CN1310171C publication Critical patent/CN1310171C/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention is a method of establishing semantic analyzer based on syntactic model in the field of intelligent information processing, using high-layer semantic information of a telephone dial system to establish a syntactic model, and applying the syntactic model to semantic analysis, automatically splitting phonetic stream, organically combining phonetic-Chinese character conversion with voice analysis, including two aspects of syntactic model establishment and semantic analysis. It advances a method of using high-layer semantic information in a syntactic model to split the phonetic stream. It is a semantic analyzer able to eliminate ambiguity and split sentences. It can perfectly analyze the semantic information of the sentences in or out of the grammatical rule.

Description

Foundation is based on the method for the semantic analyzer of syntactic model
Technical field
The present invention is a kind of method of setting up semantic analyzer that relates to the intelligent information processing technology field, is specifically related to the method for a kind of foundation based on the semantic analyzer of syntactic model.
Background technology
Typical conversational system is made up of modules such as speech recognition, natural language understanding, dialogue management, natural language generation, phonetic syntheses.The research of natural language understanding module is of long duration, and studies main flow at present for to realize with rule-based language understanding method, promptly carries out syntax-semantic parsing according to the existing syntax, and its basis is a Formal Language Theory.Common natural language understanding module mainly uses context-free grammar CFG (Context Free Grammar) to describe and the analyzing and processing written word.Spoken its characteristics of having compared with written word: the sentence formula is simpler, and syntactic structure is more random, often with multiple conversational language phenomenon (repeat, revise, refer to, omission etc.).And Chinese characters spoken language is compared with Oral English Practice, and the sentence formula is more flexible, and word order is more random.Traditional CFG syntax are difficult to represent effectively the many phenomenons in Chinese characters spoken language ground.And the spoken dialogue system that relates to voice also comprises noise, ambiguous, pet phrase, eats the spoken voice of sound, change of tune or the like now, so spoken dialog is the difficult point that natural language understanding technology is realized, but also is the key point of application system applicability.In the spoken language, people's language is very random, can omit, corrigendum, flashback or the like, and these spoken phenomenons are traditional only insoluble based on the understanding system institute of lexical analysis, can solve well and introduce based on crucial semantic technology.
Because above all factors, fully the speech recognition device based on the CFG of syntax rule is fragile, find by literature search, Hacioglu, people such as K are at " Acoustics, Speech and Signal Processing, 2001 IEEE " Volume:1,2001 Page (s): " the Dialog-context dependent language modeling combining n-grams andstochastic context-free grammars Acoustics; Speech; and SignalProcessing; 2001.Proceedings (" IEEE acoustics; voice and signal Processing "; " in conjunction with the language model based on conversational system of the N unit syntax and context-free grammar ") proposes speech recognition device and will utilize bi-gram (bigram) language model and context-free grammar (CFG) simultaneously in the literary composition that delivers on the 537-540vol.1.Because can not get semantic information from the N unit syntax (N-gram) language model, a kind of solution is to utilize language model to choose the highest optimal path of score.But for the application system of specific area (as: weather, flight, lodging, traffic, tourism, air ticket, train ticket are ordered or the like), such method is not optimum, because do not make full use of the high-layer semantic information in these fields.
Summary of the invention
The objective of the invention is at the deficiencies in the prior art, be primarily aimed at the phone automatic switching system of phonetic dialing, propose a kind of method of setting up based on the semantic analyzer of syntactic model, make that the Chinese-character phonetic letter after the speech recognition is changed, and guarantee that conversion method reaches optimum.
The present invention is achieved through the following technical solutions, the present invention has made full use of the high-layer semantic information of dialing system, set up syntactic model, and this syntactic model is applied to semantic analysis, automatic segmentation phonetic stream, Chinese-character phonetic letter conversion and semantic analysis are combined, comprise two aspects of foundation, semantic analysis algorithm of syntactic model:
(1) described syntactic model, be a notion transfer network that has weight, representing the transfer between notion and notion, whole grammer is made up of syntax rule in layer, the high-layer semantic information of having represented the dialing system has constituted the semantic concept transfer network BSCTN of bigram.Transfer between notion is stipulated by the syntax rule in the syntactic model.Each notion in the syntactic model is called " grammar concept ", and each grammar concept is corresponding to grammatical attribute in each layer.Make up the grammer of getting up like this, expression is flexible, and clear concept is realized simple.
(2) described semantic analysis algorithm mainly is three row's fork rules that are applied in " dial system ":
Row's fork rule one: according to syntactic model BSCTN, use loose syntax rule (loose grammar) G0, whole sentence is analyzed, get rid of the sentence that does not meet syntax rule.
The syntax rule of " loose " is defined as: allow to connect the speech that exceeds dictionary and exceed syntax rule after each grammar concept, the grammatical attribute that defines these speech is-1, and can be transferred in other all grammar concept in the sub-grammer layer and go.
S * = Y S i ∈ G 0 S i
S iFor carry out the possible path of i bar of cutting, S according to loose syntax rule G0 *Be cutting route optimum in all cutting route.
Using loose syntax rule is in order to analyze the sentence that exceeds syntax rule neatly.With ' 0 ', ' 1 ' the life and death state of representing every paths, ' 0 ' expression ' extremely ', ' 1 ' represents ' life '.After the row's of application fork rule one,, illustrate that this sentence does not meet syntax rule if the state of all cutting route is ' 0 ' all.Then the state in all paths is changed into ' 1 ' by ' 0 ', continue application rule two, three and arrange fork.
For example: ask the qing[modal particle]-dial bo[to dial]-dial bo[to dial]-five wu[numerals]-four si[numerals]
Do not transfer to ' dialling ' owing to do not allow notion ' to dial ' in the syntax rule, so according to syntax rule G0, the state of all cutting route all is ' 0 ', as seen this sentence does not meet syntax rule.For the row's of utilization fork rule, extract relevant semantic information, the state of all cutting route is changed into ' 1 ', the rule row fork below using again.
Row's fork rule two: longest match principle.For ' 1 ', the order of the state in all the other paths is for ' 0 ' the order of the state of the cutting route that contains minimum grammar concept (minimum pinyin word).By the calculation syntax mark, choosing the minimum path of grammer mark is optimal path.
S * = arg min s score ( s ) = arg min s score ( ph 1 K ph n ) = arg min s Σ i = 1 n score ( ph i )
S is the path that cutting obtains, and score (s) is used for obtaining the grammer mark in this path, score (ph i) obtain the grammer mark of present node.If the grammatical attribute of pinyin word is-1, then the grammer mark is 10, otherwise the grammer mark is 1.10 and 1 has differed an order of magnitude, is enough to the quality in path is distinguished.
Experiment showed, that application rule two can get rid of a large amount of ambiguity paths fast.
Row's fork rule three: grammatical sentence has obtained optimum explanation, but if contain the speech that exceeds dictionary or exceed syntax rule in the statement, then may also exist many cutting route after handling through above-mentioned row's fork, further analyze.
Strict syntax rule G1 is defined as: grammatical attribute does not allow it to transfer to other notion for-1 notion is left out.
Analyze each state and be ' 1 ' cutting route, skip grammatical attribute and be-1 speech, save the pinyin word (speech of continuous identical grammatical attribute is only got) of the same syntax of figs attribute of repetition.Syntax rule G1 calculation syntax mark according to strictness.Choosing the minimum path of score then is optimal path.
S * = arg min G 1 score ( s ) = arg min G 1 Σ i = 1 n score ( ph i | ph i - 1 , K ph 1 , ph 0 )
S is the path that cutting obtains, and score (s) obtains the grammer mark in this path, score (ph i| ph I-1, K, ph 1, ph 0) obtain the grammer mark that in the past phase of history node is transferred to present node.If this shifts grammaticality G1, then the grammer mark is 1, otherwise the grammer mark is 10.10 and 1 has differed an order of magnitude, is enough to the quality in path is distinguished.
Below the inventive method is further described, step is as follows:
1,, sets up " grammar concept " of dial system according to the syntactic features of dialing system;
2,, set up the syntax rule of the high-layer semantic information that comprises dial system by " grammar concept ".
3, according to syntax rule and " grammar concept ", set up the stratificational grammar model.
(1) ground floor is total grammer layer (grammar-all), and the system that controlling is in big transfer aspect semantic.
(2) second layer is subject method (grammar), is controlling the transfer between the notion in total grammer layer.
(3) the 3rd layers is sub-grammer (sub-grammar), has stipulated the formation of the grammar concept that defines in subject method layer.
(4) the 4th layers is speech layer (phrases), and corresponding to the set of the concrete speech of notion in the sub-grammer layer, the speech in the identity set has identical semantic information.
(5) layer 5 is word (character) layer, is representing how word forms speech.
(6) layer 6 is syllable (syllable) layer, and the syllabary that each band is transferred is shown consonant, vowel structure (initial-final), and these recognition units (initials, finals) are called phoneme (phonemes).
4, set up the classificating word dictionary.
5, search the classificating word dictionary, carry out the complete trails coupling, obtain all possible cutting route of phonetic stream, the information that each node in the path comprises is pinyin word and corresponding grammatical attribute thereof.
6, write down " life and death " situation of every paths with a variable, ' 1 ' is ' life ', and ' 0 ' is ' extremely '.
7, judge whether to exist many cutting route, then forward step 8 to if exist, otherwise forward step 9 to.
8, use three divergent rules of row successively and get rid of the cutting route that violates grammar, up to only staying an optimal path;
9,, search syntactic category speech dictionary and just can successfully convert pinyin word to Chinese word according to last cutting result.
In the 8th step, need to prove: when the row's of application fork rule is got rid of the cutting route that violates grammar, because grammer is a hierarchical model, so judging that current node is whether during grammaticality, consider several historical nodes of front of present node, searching route judges whether current sub-grammer layer is covered, if cover and then get back to upper strata subject method layer, seek follow-up subject method layer notion; Otherwise the node of current sub-grammer layer moves one backward, after the set of all descendant nodes that obtain historical path, judges whether present node belongs to the node in the set.
The present invention proposes a kind of method of utilizing the high-layer semantic information in the syntactic model to come cutting phonetic stream, this is a kind of semantic analyzer that can get rid of the ambiguity partition statement.This analyzer can both analyze semantic information well to sentence in the syntax rule and the sentence that exceeds syntax rule.Utilize the syntactic model that contains high-layer semantic information proposed by the invention organically to combine Chinese-character phonetic letter conversion and semantic analysis efficiently, and the syntactic structure of this layering realizes simple, the operational efficiency height, it is strong to arrange divergent ability, can be used for simple man-machine interactive system.
Description of drawings
Below in conjunction with drawings and Examples the present invention is described in further detail:
Six layers of structural drawing in Fig. 1 syntactic model of the present invention
Among the figure, solid arrow direction (from left to right) has been represented the path direction when prediction algorithm launches in the grammer path in the speech recognition, the path direction when on behalf of the grammer path, the dotted arrow direction recall.
The searching route example of Fig. 2 semantic analysis algorithm of the present invention
Among the figure, sub-grammer node layer s1 is gone in search from subject method node layer g1 toward lower floor.Transfer to s2 again to s3 from node s1.After node s3 covers, will get back to the upper strata, g1 transfers to g2 as can be known, gets back to sub-grammer layer and obtains node s4, so forward s4 to from s3.
The effect synoptic diagram of Fig. 3 semantic analyzer of the present invention.
Embodiment
Below in conjunction with the explanation of " dialing " statement example in " dial system ", understand technical scheme of the present invention better.
The special-shaped word of unisonance of considering Chinese is a lot, and well-behaved parser is that unit flows cutting to phonetic with the speech, and the phonetic of speech is called " pinyin word ".
According to statistics, the unisonance allograph of Chinese is a lot, the individual character pinyin word is average corresponding 12 Chinese characters.But two word pinyin word are on average corresponding to 1.46 Chinese words; The speech that three words or three words are above, phonetic and Chinese word almost are one to one.As seen it is more much smaller than single unisonance allograph possibility occurring the unisonance homophone to occur, is a kind of efficient ways so utilize speech for unit changes the phonetic circulation into Chinese sentence.
Embodiment
1,, sets up " grammar concept " of dial system according to the syntactic features of dialing system.For example:, set up " grammar concept " of " dial system " according to the syntactic features of " dial system ".As [beginning], [modal particle], [dialling], [name], [place name], [phone], [numeric string] etc.
2,, set up the syntax rule of the high-layer semantic information that comprises the dialing system by " grammar concept ".For example: [beginning]-[modal particle] | [dialling], [dialling]-[name] | [place name] | [numeric string] ... ..
3, according to syntax rule and " grammar concept ", set up the stratificational grammar model.For example: the syntactic model of " dial system " of our exploitation has mainly utilized wherein three layers of the stratificational grammar model introduced previously.
Subject method layer: beginning-modal particle | dial, dial-name | numeral | phone
Sub-grammer layer: dial=dial name=surname+appellation | surname+name | surname+name+name
The speech layer: dial=[dial, change, switching is transferred to, and connects, and beats ... ]
4, set up the classificating word dictionary.As [# dial (dial, change, connect, switching ...) # modal particle (please, trouble, excuse me ...) the # numeric string (0,1,2,3...) # surname (Zhao, money, grandson, Lee ...) # appellation (sir, Ms, Miss, teacher ...) ... ]
5, search the classificating word dictionary, carry out the complete trails coupling, obtain all possible cutting route of phonetic stream, the information that each node in the path comprises is pinyin word and corresponding grammatical attribute thereof.
6, write down " life and death " situation of every paths with a variable, ' 1 ' is ' life ', and ' 0 ' is ' extremely '.
7, judge whether to exist many cutting route, then forward step 8 to if exist, otherwise forward step 9 to.
8, the divergent rule of the row of application is got rid of irrational cutting route successively, up to only staying an optimal path.
9,, search syntactic category speech dictionary and just can successfully convert pinyin word to Chinese word according to last cutting result.
For example phonetic flows: jie tong ba ba.
Connecting jie-tong[dials]-eight ba[numerals]-eight ba[numerals].
Meeting jie[dials]-logical tong[-1]-father ba-ba[appellation].
Connecting jie-tong[dials]-father ba-ba[appellation].
The row's of application fork rule one, cutting route iii. is not excluded owing to meeting G0. and path i. and ii. have been saved, and can obtain optimal path according to rule two is i..
According to above step, " dialing " statement in " dial system " is tested.7 subject method layer notions are arranged, 10 sub-grammer layer notions, 200 of entries in this model.Test set 1 is 200 of grammatical phonetic test statements, and test set 2 is 60 test statements that do not meet syntax rule.
Experiment 1: when segmentation rules 1 adopts the syntax rule G1 of loose syntax rule G0 and strictness respectively, respectively the cutting route number is added up.
Table 1: cutting route is added up after adopting different syntax rules
Test set Grammer in row's fork rule 1 The complete trails coupling After the application rule 1 After the application rule 2
Test set 1 G0 4656 327 200
G1 4656 228 200
Test set 2 G0 471 246 168
G1 471 426 218
By table 1 as seen, for grammatical test set 1, employing G1 is better than G0 work; But, adopt G0 better than G1 for the test set 2 that exceeds grammer.Taken all factors into consideration above-mentioned two kinds of situations, this method has adopted grammer G0 in row's fork rule 1.
Experiment 2: added up and arranged the effect of divergent regular 1-2-3 for row's fork, the data in the table 2 are number of path.
Table 2: row's fork effect of rule
Test set The complete trails coupling After the application rule 1 After the application rule 2 After the application rule 3
Test set 1 4656 327 200 -
Test set 2 471 246 168 66
1 of test set has been used the divergent regular 1-2 of row, and test set 2 has been used regular 1-2-3.As seen row's fork effect of rule 1 and rule 2 is very big.The good results are evident for the row of test set 2 fork for rule 3.
Experiment 3: after statistics is handled through cutting and row's fork, the speech under phonetic level and the Chinese level and the semantic tagger accuracy of sentence.
Table 3: the semantic tagger accuracy of speech and sentence
Test set The phonetic level The Chinese level
Pinyin word The phonetic sentence Chinese word Middle sentence
Test set 1 100 100 97.5 95
Test set 2 95 90 92 85
By table 3 as seen, this semantic analyzer is fine to grammatical sentence work, for the sentence that does not meet syntax rule, stronger analytic function is arranged also.

Claims (5)

1, a kind of method of setting up based on the semantic analyzer of syntactic model, it is characterized in that, utilize the high-layer semantic information of dialing system, set up syntactic model, and this syntactic model is applied to semantic analysis, automatic segmentation phonetic stream combines Chinese-character phonetic letter conversion and semantic analysis, comprises two aspects of foundation, semantic analysis algorithm of syntactic model:
(1) described syntactic model, it is a notion transfer network that has weight, representing the transfer between notion and notion, whole grammer is made up of syntax rule in layer, the high-layer semantic information of having represented the dialing system has constituted the semantic concept transfer network BSCTN of binary, and the transfer between notion is stipulated by the syntax rule in the syntactic model, each notion in the syntactic model is called " grammar concept ", and each grammar concept is corresponding to grammatical attribute in each layer;
(2) described semantic analysis algorithm mainly is three row's fork rules that are applied in " dial system ":
Row's fork rule one: according to syntactic model BSCTN, use loose syntax rule G0, whole sentence is analyzed, get rid of the sentence that violates grammar;
Row's fork rule two: longest match principle, for ' 1 ', the order of the state in all the other paths is for ' 0 ' the order of the state of the cutting route that contains minimum grammar concept, and by the calculation syntax mark, choosing the minimum path of grammer mark is optimal path;
Row's fork rule three: grammatical sentence has obtained optimum explanation, but if contain the speech that exceeds dictionary or exceed syntax rule in the statement, then may also exist many cutting route after handling through above-mentioned row's fork, analyze by hand and judge.
2, foundation according to claim 1 is based on the method for the semantic analyzer of syntactic model, it is characterized in that, described loose syntax rule, be defined as: allow to connect the speech that exceeds dictionary and exceed syntax rule after each grammar concept, the grammatical attribute that defines these speech is-1, and can transfer in other all grammar concept in the sub-grammer layer and go.
3, foundation according to claim 1 is characterized in that based on the method for the semantic analyzer of syntactic model, below by step it is done further to limit:
(1), sets up " grammar concept " of dial system according to the syntactic features of dialing system;
(2) set up the syntax rule of the high-layer semantic information comprise the dialing system by " grammar concept ";
(3) according to syntax rule and " grammar concept ", set up the stratificational grammar model;
(4) set up the classificating word dictionary;
(5) search the classificating word dictionary, carry out the complete trails coupling, obtain all possible cutting route of phonetic stream, the information that each node in the path comprises is pinyin word and corresponding grammatical attribute thereof;
(6) write down " life and death " situation of every paths with a variable, ' 1 ' is ' life ', and ' 0 ' is ' extremely ';
(7) judge whether to exist many cutting route, then change step (8), otherwise change step (9) if exist;
(8) use three divergent rules of row successively and get rid of the cutting route that violates grammar, up to only staying an optimal path;
(9) according to last cutting result, search syntactic category speech dictionary, pinyin word is converted to Chinese word.
4, foundation according to claim 3 is characterized in that based on the method for the semantic analyzer of syntactic model, in the step (3), sets up the stratificational grammar model, and is specific as follows:
(1) ground floor is total grammer layer, and the system that controlling is in big transfer aspect semantic;
(2) second layer is the subject method, is controlling the transfer between the notion in total grammer layer;
(3) the 3rd layers is sub-grammer, has stipulated the formation of the grammar concept that defines in subject method layer;
(4) the 4th layers is the speech layer, and corresponding to the set of the concrete speech of notion in the sub-grammer layer, the speech in the identity set has identical semantic information;
(5) layer 5 is the word layer, is representing how word forms speech;
(6) layer 6 is a syllablic tier, and the syllabary that each band is transferred is shown the consonant, vowel structure, and these recognition units are called phoneme.
5, foundation according to claim 3 is based on the method for the semantic analyzer of syntactic model, it is characterized in that, in the described step (8), when the row's of application fork rule is got rid of the cutting route that violates grammar, because grammer is a hierarchical model, so judging that current node is whether during grammaticality, consider several historical nodes of front of present node, searching route, judge whether current sub-grammer layer is covered, if cover and then get back to upper strata subject method layer, seek follow-up subject method layer notion; Otherwise the node of current sub-grammer layer moves one backward, after the set of all descendant nodes that obtain historical path, judges whether present node belongs to the node in the set.
CNB2004100667933A 2004-09-29 2004-09-29 Method for semantic analyzer bead on grammar model Expired - Fee Related CN1310171C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2004100667933A CN1310171C (en) 2004-09-29 2004-09-29 Method for semantic analyzer bead on grammar model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2004100667933A CN1310171C (en) 2004-09-29 2004-09-29 Method for semantic analyzer bead on grammar model

Publications (2)

Publication Number Publication Date
CN1588537A true CN1588537A (en) 2005-03-02
CN1310171C CN1310171C (en) 2007-04-11

Family

ID=34604095

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2004100667933A Expired - Fee Related CN1310171C (en) 2004-09-29 2004-09-29 Method for semantic analyzer bead on grammar model

Country Status (1)

Country Link
CN (1) CN1310171C (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100349161C (en) * 2005-07-29 2007-11-14 中国科学院声学研究所 Semantic analysis method for resolution of verb different meanings structure in sentence
CN101477798B (en) * 2009-02-17 2011-01-05 北京邮电大学 Method for analyzing and extracting audio data of set scene
CN101609671B (en) * 2009-07-21 2011-09-07 北京邮电大学 Method and device for continuous speech recognition result evaluation
CN103325370A (en) * 2013-07-01 2013-09-25 百度在线网络技术(北京)有限公司 Voice identification method and voice identification system
CN104537036A (en) * 2014-12-23 2015-04-22 华为软件技术有限公司 Language feature analyzing method and device
CN109949799A (en) * 2019-03-12 2019-06-28 广东小天才科技有限公司 A kind of semanteme analytic method and system
CN112562673A (en) * 2020-12-29 2021-03-26 苏州思必驰信息科技有限公司 Voice recognition method and device

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5966686A (en) * 1996-06-28 1999-10-12 Microsoft Corporation Method and system for computing semantic logical forms from syntax trees
CN1141660C (en) * 1998-05-18 2004-03-10 中国科学院声学研究所 Natural language statement analyzing method simulating brain's language sensing process
US7249019B2 (en) * 2002-08-06 2007-07-24 Sri International Method and apparatus for providing an integrated speech recognition and natural language understanding for a dialog system

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100349161C (en) * 2005-07-29 2007-11-14 中国科学院声学研究所 Semantic analysis method for resolution of verb different meanings structure in sentence
CN101477798B (en) * 2009-02-17 2011-01-05 北京邮电大学 Method for analyzing and extracting audio data of set scene
CN101609671B (en) * 2009-07-21 2011-09-07 北京邮电大学 Method and device for continuous speech recognition result evaluation
CN103325370A (en) * 2013-07-01 2013-09-25 百度在线网络技术(北京)有限公司 Voice identification method and voice identification system
CN103325370B (en) * 2013-07-01 2015-11-25 百度在线网络技术(北京)有限公司 Audio recognition method and speech recognition system
CN104537036A (en) * 2014-12-23 2015-04-22 华为软件技术有限公司 Language feature analyzing method and device
CN104537036B (en) * 2014-12-23 2018-11-13 华为软件技术有限公司 A kind of method and device of metalanguage feature
CN109949799A (en) * 2019-03-12 2019-06-28 广东小天才科技有限公司 A kind of semanteme analytic method and system
CN109949799B (en) * 2019-03-12 2021-02-19 广东小天才科技有限公司 Semantic parsing method and system
CN112562673A (en) * 2020-12-29 2021-03-26 苏州思必驰信息科技有限公司 Voice recognition method and device

Also Published As

Publication number Publication date
CN1310171C (en) 2007-04-11

Similar Documents

Publication Publication Date Title
CN1120436C (en) Speech recognition method and system for identifying isolated non-relative Chinese character
Bod et al. Data-oriented parsing
CN1303582C (en) Automatic speech sound classifying method
CN1140870A (en) Speech recognizing method and apparatus and speech translating system
CN1667699A (en) Generating large units of graphonemes with mutual information criterion for letter to sound conversion
CN101493812B (en) Tone-character conversion method
CN106294460B (en) A kind of Chinese speech keyword retrieval method based on word and word Hybrid language model
CN104317846A (en) Semantic analysis and marking method and system
CN100392658C (en) Body-bused subject type network reptile system configuration method
CN1310171C (en) Method for semantic analyzer bead on grammar model
CN1224954C (en) Speech recognition device comprising language model having unchangeable and changeable syntactic block
CN1831937A (en) Method and device for voice identification and language comprehension analysing
CN1851706A (en) Body learning based intelligent subject-type network reptile system configuration method
CN1801321A (en) System and method for text-to-speech
CN1901041A (en) Voice dictionary forming method and voice identifying system and its method
CN1177312C (en) Unified recognizing method for multi-speed working pattern
Tsarfaty Integrated morphological and syntactic disambiguation for modern hebrew
CN1238834C (en) Method of grammar describing and identification analyse of colloquial identification understanding
CN1928854A (en) Syntax analysis method and device for layering Chinese long sentences based on punctuation treatment
CN1290371A (en) Segmentation of Chinese text into words
JP3009636B2 (en) Spoken language analyzer
CN1315109C (en) Method for natural voice recognition based on a generative transformation/phrase structure grammar
CN1666253A (en) System and method for mandarin chinese speech recogniton using an optimized phone set
CN1271550C (en) Sentence boundary identification method in spoken language dialogue
CN107342080B (en) Conference site synchronous shorthand system and method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20070411

Termination date: 20091029