CN1512396A - Analytic method of open type natural language template - Google Patents


Info

Publication number
CN1512396A
Authority
CN
China
Prior art keywords
semantic
natural language
language template
groove
chunk
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA021592411A
Other languages
Chinese (zh)
Inventor
孙久文
任文捷
刘武
诸光
孙文彦
王楠
高建忠
王江
申江涛
王建新
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lenovo Beijing Ltd
Original Assignee
Lenovo Beijing Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lenovo Beijing Ltd
Priority to CNA021592411A
Publication of CN1512396A
Legal status: Pending

Landscapes

  • Machine Translation (AREA)

Abstract

A method for parsing an open natural language template comprises the following steps: extracting the semantic slot information from the natural language template according to a preset authoring format for open natural language templates; and, layer by layer, replacing every semantic slot in each layer of semantic chunks with the specific information values provided by the system, thereby generating natural language. The method can parse natural language templates created with the open natural language template creation method. However complex and flexible the template's grammar, the template is converted into the corresponding human-friendly natural language without the user writing any code.

Description

A method for parsing an open natural language template
Technical field
The present invention relates to natural language processing technology, and in particular to a method for parsing an open natural language template.
Background art
With the spread of computers, demand keeps growing for intelligent human-computer interaction systems, machine translation systems and other special-purpose information processing systems. Processing system information and presenting it in an intelligent, human-friendly way has become a field of wide interest, and the intelligent presentation of natural language, one of the most basic means of human communication, is a focus of that field.
Existing intelligent human-computer interaction systems mostly work in a one-to-one manner. Typically, each agreed-upon language logic is implemented in a programming language: the program must be written entirely according to the agreed language logic, and during interaction the natural language must be input according to that same logic. This approach is cumbersome to implement and cannot adapt to flexible, changing interaction-guidance requirements.
Existing machine translation systems mostly can only recombine the translated vocabulary according to the grammar and semantics described in the program to form the final target language.
A few systems do introduce some notion of a language template when forming the target language: they provide fixed language templates and parse them essentially by filling in blanks. Such fixed-template parsing is reasonably effective for the fixed language of a simple, specific business or for technical translation. However, current parsing methods can only parse the fixed templates of one particular business or translation specialty and cannot parse different language templates. A program must therefore be developed separately for each language template, which wastes a great deal of manpower and material resources.
An open natural language template creation method has recently appeared. With it, users can create and modify natural language templates on their own according to business needs. In essence, information variables extracted from the literal natural-language text serve as semantic slots; semantic slots are combined with literal text according to the business information involved; semantic slots and literal text that may repeat are combined into semantic chunks; and different semantic chunks may be nested and combined with literal text to create the natural language template. The method is simple to use, flexible and produces natural-sounding output, so it has broad application prospects. However, current fixed-template parsing methods cannot parse the open language templates it creates.
Summary of the invention
In view of this, the object of the present invention is to provide a method for parsing an open natural language template that can parse natural language templates created with the open natural language template creation method.
To achieve this object, the technical solution of the present invention is as follows.
A method for parsing an open natural language template comprises the following steps:
1) extracting the semantic slot information from the natural language template according to a predefined authoring format for open natural language templates;
2) layer by layer, replacing each semantic slot in every layer of semantic chunks with the specific information value provided by the system until all semantic slots of the current natural language template have been replaced, thereby generating natural language.
Step 2) may replace semantic slots in order from the inner layers to the outer layers, or from the outer layers to the inner layers.
Step 1) may further include: collecting the elements of the open natural language template and saving the information of each semantic slot.
Step 2) may further include: setting a loop parse counter for the semantic chunk and reading the block loop count of the semantic chunk currently being parsed; each time the semantic slots in the chunk that depend on the block loop count have been replaced with specific information values from the system, the loop parse counter is incremented by one, until it equals the block loop count.
Step 2) may further include: for a semantic slot in the chunk currently being parsed that does not depend on the block loop count, keeping the slot so that it is parsed at a later layer.
Step 2) may further include: replacing, with specific information values from the system, the semantic slots in the semantic chunks on the same layer as the chunk that was read.
Step 2) may further include: replacing, with specific information values from the system, the semantic slots located outside any semantic chunk in the open natural language template.
The method may further output to the user, in the form of a list, the natural language generated by parsing the language template the user entered.
As the above technical solution shows, the parsing method of the present invention can parse natural language templates created with the open natural language template creation method. However complex and flexible the template's grammar, the template is converted into the corresponding human-friendly natural language without the user writing any code.
Description of drawings
Fig. 1 is a flowchart of the method for parsing an open natural language template according to the present invention;
Fig. 2 is a detailed flowchart of the parsing of a basic semantic chunk in Fig. 1.
Embodiment
To make the purpose, technical solution and advantages of the present invention clearer, the invention is described in more detail below with reference to an embodiment and the accompanying drawings.
Fig. 1 is a flowchart of the parsing method. As shown in Fig. 1, the method proceeds as follows:
Step 101: first, collect the elements of the open natural language template, including information on semantic slots, semantic chunks and the various control slots. A semantic slot that can be replaced directly with the corresponding information value in the system is called a static semantic slot; a semantic slot inside a semantic chunk is called a dynamic semantic slot.
Steps 102 to 105: check the open natural language template against the authoring format and rules to determine whether it contains errors. If it does, display the error information and stop parsing. If it does not, extract the semantic slots in the template, then extract and save information such as each slot's name and position and the nesting relations among the semantic chunks.
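The format check and slot extraction of steps 101 to 105 might be sketched as follows. This is only an illustration under stated assumptions: the patent does not give an implementation, the function names `check_delimiters` and `extract_slots` are hypothetical, and the only rule checked here is that the square brackets and braces are properly paired.

```python
import re

# A slot is "[Name]" or "[@Name]"; braces delimit semantic chunks.
TOKEN_RE = re.compile(r"\[(@?\w+)\]|([{}])")


def check_delimiters(template: str) -> None:
    """Raise ValueError if '[' / ']' or '{' / '}' are not properly paired."""
    closers = {"]": "[", "}": "{"}
    stack = []
    for pos, ch in enumerate(template):
        if ch in "[{":
            stack.append((ch, pos))
        elif ch in closers:
            if not stack or stack[-1][0] != closers[ch]:
                raise ValueError(f"unmatched {ch!r} at position {pos}")
            stack.pop()
    if stack:
        ch, pos = stack[-1]
        raise ValueError(f"unclosed {ch!r} at position {pos}")


def extract_slots(template: str):
    """Return (name, position, nesting depth) for every slot in the template.

    Depth 0 means the slot lies outside every chunk (a static slot).
    """
    slots = []
    depth = 0
    for match in TOKEN_RE.finditer(template):
        if match.group(2) == "{":
            depth += 1
        elif match.group(2) == "}":
            depth -= 1
        else:
            slots.append((match.group(1), match.start(), depth))
    return slots
```

Calling `check_delimiters` first and `extract_slots` second mirrors the order in the flowchart: a template that fails the check is never scanned for slots.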
Step 106: for nested semantic chunks, find and read the innermost semantic chunk within the current outermost semantic chunk. This embodiment parses from the inner layers outward; parsing from the outer layers inward is also possible.
Step 107: according to the nesting relations among the semantic chunks, parse the chunk that was read, that is, a basic semantic chunk. A semantic chunk that contains no nested chunk is called a basic semantic chunk. Parsing replaces the dynamic semantic slots in the basic chunk with the corresponding dynamic information values in the system.
Steps 108 to 109: determine whether there are other semantic chunks on the same layer as the current one. If so, repeat step 107 until all chunks on that layer have been parsed, then perform step 109 to finish processing the layer; otherwise perform step 109 directly.
Step 110: determine whether a new semantic chunk has been formed by the parsing. If so, return to step 107; otherwise perform step 111.
Step 111: determine whether the template still contains unparsed semantic chunks. If so, return to step 106; if none remain, that is, all semantic chunks have been parsed, perform step 112.
Step 112: replace all static semantic slots outside the semantic chunks in the template with specific information values from the system, generating the human-friendly natural language and completing the parsing of the template.
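The control flow of steps 106 to 112 can be sketched as a loop that repeatedly expands the innermost chunks and finally fills the static slots. This is a minimal sketch, not the patented implementation: `parse_template`, `expand_chunk` and `static_values` are hypothetical names, and the sketch assumes that the text returned by `expand_chunk` contains no braces of its own.

```python
import re

# An innermost (basic) chunk is a brace pair with no braces inside it.
INNERMOST_RE = re.compile(r"\{([^{}]*)\}")
SLOT_RE = re.compile(r"\[(@?\w+)\]")


def parse_template(template, expand_chunk, static_values):
    """Parse from the inner layers outward, then fill the static slots.

    expand_chunk(body) is a caller-supplied function that turns the body of
    one basic chunk into text; static_values maps slot names to values.
    """
    text = template
    while True:
        # Steps 106 to 109: expand every chunk that is currently innermost,
        # in order of appearance.
        text, replaced = INNERMOST_RE.subn(
            lambda m: expand_chunk(m.group(1)), text
        )
        # Steps 110 to 111: stop once no chunk is left to parse.
        if replaced == 0:
            break
    # Step 112: replace the slots that remain outside all chunks;
    # unknown slots are left as they are.
    return SLOT_RE.sub(
        lambda m: str(static_values.get(m.group(1), m.group(0))), text
    )
```

After one pass, an enclosing chunk whose inner chunks have all been expanded becomes innermost itself, which corresponds to the "new semantic chunk" tested in step 110.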
Fig. 2 is a detailed flowchart of the parsing of a basic semantic chunk in Fig. 1. As shown in Fig. 2, a basic semantic chunk is processed as follows:
Step 201: first, set a loop parse counter, LoopCount, for the current semantic chunk.
Step 202: read the block loop count of the chunk being parsed.
Steps 203 to 204: determine whether the block loop count is valid, that is, check the legality of the loop count specified for the chunk. Two things are checked: first, whether the chunk contains a semantic slot that matches the loop count; second, whether the loop count is at least 1. A chunk whose loop count is less than 1 is replaced directly with the corresponding literal natural-language text in the system; for example, a loop count of 0 can be replaced with the literal text "none". If the checks fail, the system displays the position of the faulty chunk and the reason, and stops parsing. Otherwise the block loop count is valid and step 205 is performed.
Step 205: read the first semantic slot in the chunk.
Steps 206 to 208: determine whether this slot depends on the loop count. If not, keep the slot in its original form and go to step 209; otherwise perform step 207, replacing the slot with the specific information value from the system selected according to LoopCount, and go to step 209.
Steps 209 to 211: determine whether unprocessed slots remain. If so, read the next slot and return to step 206, until no slots remain; otherwise perform step 211, incrementing LoopCount by one.
Step 212: determine whether LoopCount equals the block loop count. If not, jump back to step 205 and repeat until they are equal; otherwise parsing of the basic chunk is complete.
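The loop of steps 201 to 212 might look like the sketch below. The function name `expand_basic_chunk` and the data layout (a dictionary mapping each loop-dependent slot to a list with one value per iteration) are assumptions made for illustration. The first validity check of step 203 is not modeled, and `empty_text` stands in for the literal text used when the loop count is below 1.

```python
import re

SLOT_RE = re.compile(r"\[(@?\w+)\]")


def expand_basic_chunk(body, count, per_iteration, empty_text="none"):
    """Expand the body of one basic chunk `count` times.

    per_iteration maps a loop-dependent slot name to a list holding one
    value per iteration. Slots missing from it are kept unchanged so that
    an outer layer can resolve them later.
    """
    if count < 1:                      # steps 203 to 204
        return empty_text
    pieces = []
    loop_count = 0                     # step 201
    while loop_count != count:         # step 212
        def fill(match, index=loop_count):
            name = match.group(1)
            if name == "@loopcount":   # 1-based position within the chunk
                return str(index + 1)
            if name in per_iteration:  # step 207: value chosen by LoopCount
                return str(per_iteration[name][index])
            return match.group(0)      # step 206: unrelated slot is kept

        pieces.append(SLOT_RE.sub(fill, body))  # steps 205 to 210
        loop_count += 1                         # step 211
    return "".join(pieces)
```

Leaving unrelated slots in place is what allows the two-step parsing described later, where a slot tied to the sender rather than to the individual mail survives the inner loop.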
As Fig. 2 shows, before a basic semantic chunk is parsed, the default loop counter LoopCount of the current chunk must be set. For a dynamic semantic slot that depends on the loop count, the value of LoopCount determines which replacement value is currently taken from the system. A dynamic semantic slot that does not depend on the loop is left untouched while the current chunk is parsed; it keeps its original form and is parsed one layer up or several layers up.
Fig. 2 also shows that parsing a basic chunk involves a loop in addition to the replacement just described. In each pass of the loop, every slot in the chunk is processed once by that replacement method, until LoopCount equals the loop count of the current chunk.
By combining replacement and looping, a basic semantic chunk is parsed into text that contains only natural-language wording and slots unrelated to the current chunk's loop, or even only natural-language text.
If this chunk is the only semantic chunk in the template, simple replacement of all remaining slots yields the final human-friendly natural language. Otherwise the result becomes part of a newly formed semantic chunk and is parsed by the method shown in Fig. 1.
As the above steps show, the method can parse any open natural language template, whether a complex one with several groups of nested chunks, one with no semantic chunk, or one with a single simple chunk, and finally generates human-friendly natural language. The basic approach takes the semantic chunk as the unit: chunks are parsed one by one, and the slots outside the chunks are parsed last. Alternatively, the slots outside the chunks may be parsed first and the chunks afterwards. Nested chunks are parsed layer by layer from the inside out; chunks on the same layer are parsed in order of appearance; the outermost chunk is parsed last. Parsing from the outside in is also possible. Parsing a layer may form a new semantic chunk, which is parsed in exactly the same way as any other chunk.
With this interleaving of vertical parsing (layer by layer) and horizontal parsing (chunk by chunk within a layer), a complex template is parsed from the inside out, each step feeding the next, until the final natural language consisting entirely of text is formed.
The parsing of an open natural language template is now described in detail using its application in a voice mail system as an example. Here, a voice mail system is a system in which the user reads and writes mail through voice interaction with the system over the telephone.
To realize voice interaction, natural language templates must be set up in the system before the voice mail system goes into use. When the system is in use, it parses a template to generate natural language and plays it to the user. This is an important step in realizing the dialogue between the user and the system over the telephone.
For example, the voice mail system needs to tell the user about the mail in the user's mailbox. The template it uses may be: "You now have [AllMailNum] mails, {[UserMailNum] from [UserName], {No. [@loopcount] has the subject [title] and is a [importencevalue] mail. [@UserMailNum]}, [@UserNum]}". The semantic slots are defined in Table 1.
Semantic slot name               Definition
[AllMailNum]                     Total number of mails in the mailbox
[@UserNum]                       Number of distinct senders
[UserName]                       Sender's name
[UserMailNum], [@UserMailNum]    Number of mails sent by each sender
[@loopcount]                     Loop counter of the current chunk
[title]                          Subject of each mail
[importencevalue]                Importance attribute (associated with each mail), or
                                 importance attribute (associated with the sender)
Table 1
Here [ ] marks a semantic slot and { } marks a semantic chunk. [@loopcount] is the loop counter slot; [@UserMailNum] and [@UserNum] are loop count slots.
This is a complex language template containing nested semantic chunks. It is parsed as shown in Fig. 1 and Fig. 2.
First, the elements of the template are collected. "[" and "]", and "{" and "}", all appear in pairs, which conforms to the authoring rules for open natural language templates.
Next, all semantic slots are extracted, and information such as each slot's name and position and the nesting relations among the chunks is extracted and saved.
Then the current innermost chunk is found and read. In this template it is {No. [@loopcount] has the subject [title] and is a [importencevalue] mail. [@UserMailNum]}. Depending on the nesting relations extracted in the previous step and on the user's actual mail, this chunk can be parsed in one of two ways.
First way: if, when creating the template, the user set the [importencevalue] slot to be associated with each mail, the chunk is parsed directly into "No. 1 has the subject meeting and is an important mail. No. 2 has the subject travel and is an important mail. No. 1 has the subject test and is an unimportant mail."
Second way: if, when creating the template, the user set the [importencevalue] slot to be associated with the sender, parsing takes two steps. The chunk is first parsed into "No. 1 has the subject meeting and is a [importencevalue] mail. No. 2 has the subject travel and is a [importencevalue] mail. No. 1 has the subject test and is a [importencevalue] mail." At this point [importencevalue] is a slot unrelated to the block loop count. The text is then parsed into "No. 1 has the subject meeting and is a private important mail. No. 2 has the subject travel and is a private important mail. No. 1 has the subject test and is an ordinary office mail."
This chunk has no other chunks on its layer, so the next outer chunk, that is, the layer {[UserMailNum] from [UserName], [] }, is parsed in the same way. The result may be "2 from Zhang San, 1 from Li Si".
The semantic chunks of the template have now all been parsed. The system finds that the [AllMailNum] slot in "You now have [AllMailNum] mails" has not yet been parsed and replaces it with the total number of the user's mails in the system, which completes the parsing of the whole template. Depending on the user's settings, the template is finally parsed into "You now have 3 mails, 2 from Zhang San, No. 1 has the subject meeting and is an important mail. No. 2 has the subject travel and is an important mail. 1 from Li Si, No. 1 has the subject test and is an unimportant mail." or into "You now have 3 mails, 2 from Zhang San, No. 1 has the subject meeting and is a private important mail. No. 2 has the subject travel and is a private important mail. 1 from Li Si, No. 1 has the subject test and is an ordinary office mail."
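The voice mail example can be reproduced end to end with a small recursive renderer. This is a sketch under several assumptions rather than the procedure of Fig. 1: it recurses into chunks instead of iterating layer by layer, the nested-dictionary data layout and the names `render`, `render_chunk`, `lookup` and `fill` are hypothetical, the loop count slot is assumed to be the last item in a chunk, and the English template wording is simplified.

```python
import re

SLOT_RE = re.compile(r"\[(@?\w+)\]")
TAIL_RE = re.compile(r"\[@(\w+)\]\s*$")  # trailing loop count slot of a chunk


def find_close(text, start):
    """Return the index of the '}' that closes the '{' at text[start]."""
    depth = 0
    for i in range(start, len(text)):
        if text[i] == "{":
            depth += 1
        elif text[i] == "}":
            depth -= 1
            if depth == 0:
                return i
    raise ValueError("unclosed '{'")


def lookup(name, scopes):
    """Search the innermost scope first, then the enclosing ones."""
    for scope in reversed(scopes):
        if name in scope:
            return scope[name]
    return None


def fill(match, scopes):
    value = lookup(match.group(1), scopes)
    if value is None:
        return match.group(0)           # unknown slot: keep it as written
    return str(len(value)) if isinstance(value, list) else str(value)


def render_chunk(body, scopes):
    tail = TAIL_RE.search(body)
    if tail is None:
        raise ValueError("chunk has no trailing loop count slot")
    records = lookup(tail.group(1), scopes)
    if records is None:
        raise ValueError("no data for loop count slot " + tail.group(1))
    if not records:
        return "none"                   # loop count below 1
    inner = body[:tail.start()]
    return "".join(
        render(inner, scopes + [record, {"@loopcount": n}])
        for n, record in enumerate(records, 1)
    )


def render(text, scopes):
    out, i = [], 0
    while i < len(text):
        if text[i] == "{":
            end = find_close(text, i)
            out.append(render_chunk(text[i + 1:end], scopes))
            i = end + 1
        else:
            j = text.find("{", i)
            j = len(text) if j == -1 else j
            out.append(SLOT_RE.sub(lambda m: fill(m, scopes), text[i:j]))
            i = j
    return "".join(out)


TEMPLATE = ("You have [AllMailNum] mails. {[UserMailNum] from [UserName]: "
            "{No. [@loopcount] is about [title] ([importencevalue]). "
            "[@UserMailNum]}[@UserNum]}")
DATA = {
    "AllMailNum": 3,
    "UserNum": [
        {"UserName": "Zhang San", "UserMailNum": [
            {"title": "meeting", "importencevalue": "important"},
            {"title": "travel", "importencevalue": "important"}]},
        {"UserName": "Li Si", "UserMailNum": [
            {"title": "test", "importencevalue": "unimportant"}]},
    ],
}
message = render(TEMPLATE, [DATA])
```

Because `lookup` falls back to enclosing scopes, moving `importencevalue` from each mail record to the sender record gives the second parsing variant without changing the template.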
As the above embodiment shows, the parsing method of the present invention can parse natural language templates created with the open natural language template creation method. However complex and flexible the template's grammar, the template is converted into the corresponding human-friendly natural language without the user writing any code.

Claims (9)

1. A method for parsing an open natural language template, characterized in that the method comprises the following steps:
1) extracting the semantic slot information from the natural language template according to a predefined authoring format for open natural language templates;
2) layer by layer, replacing each semantic slot in every layer of semantic chunks with the specific information value provided by the system until all semantic slots of the current natural language template have been replaced, thereby generating natural language.
2. The parsing method of claim 1, characterized in that step 2) replaces semantic slots in order from the inner layers to the outer layers.
3. The parsing method of claim 1, characterized in that step 2) replaces semantic slots in order from the outer layers to the inner layers.
4. The parsing method of claim 1, characterized in that step 1) further comprises: collecting the elements of the open natural language template and saving the information of each semantic slot.
5. The parsing method of claim 1, characterized in that step 2) further comprises: setting a loop parse counter for the semantic chunk and reading the block loop count of the semantic chunk currently being parsed; each time the semantic slots in the chunk that depend on the block loop count have been replaced with specific information values from the system, the loop parse counter is incremented by one, until it equals the block loop count.
6. The parsing method of claim 5, characterized in that step 2) further comprises: for a semantic slot in the chunk currently being parsed that does not depend on the block loop count, keeping the slot so that it is parsed at a later layer.
7. The parsing method of claim 1, characterized in that step 2) further comprises: replacing, with specific information values from the system, the semantic slots in the semantic chunks on the same layer as the chunk that was read.
8. The parsing method of claim 1, characterized in that step 2) further comprises: replacing, with specific information values from the system, the semantic slots located outside any semantic chunk in the open natural language template.
9. The parsing method of claim 1, characterized in that the method further comprises: outputting to the user, in the form of a list, the natural language generated by parsing the language template the user entered.
CNA021592411A 2002-12-27 2002-12-27 Analytic method of open type natural language template Pending CN1512396A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA021592411A CN1512396A (en) 2002-12-27 2002-12-27 Analytic method of open type natural language template

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA021592411A CN1512396A (en) 2002-12-27 2002-12-27 Analytic method of open type natural language template

Publications (1)

Publication Number Publication Date
CN1512396A true CN1512396A (en) 2004-07-14

Family

ID=34237381

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA021592411A Pending CN1512396A (en) 2002-12-27 2002-12-27 Analytic method of open type natural language template

Country Status (1)

Country Link
CN (1) CN1512396A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100399329C (en) * 2005-01-19 2008-07-02 结信网络技术服务(上海)有限公司 Intelligent movable guiding engine systems
CN105138575A (en) * 2015-07-29 2015-12-09 百度在线网络技术(北京)有限公司 Analysis method and device of voice text string
CN105138575B (en) * 2015-07-29 2017-09-05 百度在线网络技术(北京)有限公司 The analysis method and device of speech text string
CN107450725A (en) * 2017-07-31 2017-12-08 科大讯飞股份有限公司 Man-machine interaction application platform, method and storage medium
CN107450725B (en) * 2017-07-31 2020-09-11 科大讯飞股份有限公司 Man-machine interaction application platform, method and storage medium
CN110232189A (en) * 2019-06-11 2019-09-13 上海证大喜马拉雅网络科技有限公司 Semantic analytic method, device, equipment and storage medium
CN110232189B (en) * 2019-06-11 2023-06-02 上海喜马拉雅科技有限公司 Semantic analysis method, device, equipment and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication