CN101446942A - Semantic character labeling method of natural language sentence - Google Patents
Semantic character labeling method of natural language sentence Download PDFInfo
- Publication number
- CN101446942A CN101446942A CNA2008102436058A CN200810243605A CN101446942A CN 101446942 A CN101446942 A CN 101446942A CN A2008102436058 A CNA2008102436058 A CN A2008102436058A CN 200810243605 A CN200810243605 A CN 200810243605A CN 101446942 A CN101446942 A CN 101446942A
- Authority
- CN
- China
- Prior art keywords
- model
- semantic
- syntactic
- predicate
- mark
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Machine Translation (AREA)
Abstract
The invention discloses a semantic character labeling method of a natural language sentence, which is characterized in that Chinese syntax analysis and semantic character label are simultaneously realized by adopting a combined learning model. The invention can simultaneously output the syntax analysis result of one sentence and gives the semantic role labeling result of a predicative by using a combined model. Because semantic information is increased in a syntax analysis model in the combined learning model, a model trained is particularly suitable for a semantic role labeling task. Therefore, the semantic role label output by the model has high performance. Meanwhile, the performance between the result output by a single syntax analysis model and the syntax analysis result output by the combined model is not large. Particularly, the syntax analysis performance can also be improved by adding semantic information.
Description
Technical field
The present invention relates to a kind of method of the semantic analysis to natural language, relate in particular to the method that a kind of semantic role to natural language sentences is analyzed and marked, belong to the natural language processing field in the computational linguistics.
Background technology
Semantic analysis is a key issue of natural language processing.As one of present hot research problem, (Semantic Role Labeling is a kind of of Shallow Semantic Parsing (ShallowSemantic Parsing) SRL) to semantic character labeling, its essence is the semantic analysis of carrying out shallow-layer in sentence level.So-called semantic character labeling is exactly for given sentence, and each predicate in the distich marks out the corresponding semantic component in the sentence, and makes corresponding semantic marker, as agent, word denoting the receiver of an action, instrument or adjunct etc.Can be applied to SRL question answering system, information extraction, text snippet, text such as contain at the field, are with a wide range of applications.
Semantic character labeling based on machine learning can be divided into four-stage usually: a) pre-service filters out the syntactic constituent that can not become semantic role usually; B) semantic component identification, identifying which mark unit is the semantic role of a certain target predicate; C) semantic role classification is for carrying out the classification of semantic role in the unit of discerning semantic component; D) aftertreatment is carried out global optimization to the semantic role of mark, determines rational role's combination.Wherein, identification and sorting phase generally use local the derivation.Deriving and be meant the semantic label of each composition in the independent decision sentence in so-called part, and does not rely on the mark of other compositions, trains the model that obtains to be called partial model like this.Correspondingly, the overall situation is derived and is generally occurred in post-processing stages.The so-called overall situation is derived and is meant on the basis of deriving in the part, considers the dependence between each composition label, by integrated relevant hard restriction of world model and soft-constraint condition, thereby obtains rational semantic role combination.Usually, reasonably integrated partial model and world model can improve the performance and the robustness of system greatly.
The learning method of partial model can be divided into two classes usually: based on the method for proper vector with based on the method for kernel function.From at present, based on the method for proper vector, obtained bigger success, speed and performance all are better than the method based on kernel function greatly.
Need artificial definition to have the feature templates of discrimination in a large number based on the method for proper vector, according to this template each example is converted into then that proper vector is learnt or predicted operation.Mainly concentrate at present the research of feature engineering and machine learning model.But the local message owing to only reflected unit to be marked based on the method for proper vector can not reflect global information and structuring syntactic information well, so people's exploration is carried out semantic character labeling based on the method for kernel function.Basic thought based on the method for kernel function is that the inseparable problem of low dimensional linear is mapped to higher dimensional space, makes it to become the linear separability problem.Common this mapping can reach by calculating kernel function implicit expression, thus reduction time and space complexity.Kernel function can well incorporate learning algorithms such as support vector machine, perceptron, thereby has caused people's extensive interest.
The natural language processing task comprises part-of-speech tagging, syntactic analysis, semantic analysis, information extraction etc., normally carries out according to the order of sequence, and a promptly back task is carried out on the basis of last task, and for example the semantic role analysis usually will be based on the result of syntactic analysis.Syntactic analysis (Syntactic Parsing) is the basic problem and the gordian technique of natural language processing.Its task is according to given grammer, derives the syntactic structure of sentence automatically, i.e. relation between the sentence unit that sentence comprised and these sentence unit.The purpose of syntactic analysis mainly contains two: one is " pedigree " structure of determining that sentence is comprised; Another is the relation of determining between the composition of sentence.Usually, import a sentence, promptly the linear precedence between the word is exported a nonlinear data structure, as phrase structure tree (as syntax tree) or directed acyclic graph (as dependence figure) etc.
Therefore, in the prior art, always at first carry out syntactic analysis, obtain syntax tree, on the basis of syntax tree, carry out semantic analysis then, comprise the semantic role mark that carries out sentence.Such way can be brought a series of problems, one, the preceding paragraph task is when carrying out, can't consider the real needs of back task and can not satisfy the demands that syntactic analysis is towards follow-up a plurality of tasks usually, it is more common therefore to export the result, and semantic character labeling is except that these general informations of needs, also may need some information specific, as the probabilistic information of syntax tree or its subtree etc., these information syntactic analysis systems are shortcoming relatively usually; Its two, the performance of consequent task is subjected to the restriction of front mission performance, syntactic analysis result's quality can directly have influence on the performance of semantic character labeling.Existing studies show that, the semantic character labeling that on manual syntax tree of English and automatic syntax tree, carries out, its performance (F1 value) differs 10 percentage points, and based on the semantic character labeling that carries out on manual syntax tree of Chinese and the automatic syntax tree, its F1 value differs even reaches 30 percentage points.
Summary of the invention
The present invention seeks to the purpose of this invention is to provide a kind of semantic character labeling method of effective sentence, unite the derivation model by foundation, reduce automatic syntactic analysis result to the semantic character labeling Effect on Performance, thereby solve problem based on the semantic character labeling method poor-performing of automatic syntactic analysis.
For achieving the above object, the technical solution used in the present invention is: a kind of semantic character labeling method of natural language sentences, adopt the combination learning model, and realize Chinese syntactic analysis and semantic character labeling simultaneously, comprise the steps:
(1) generative semantics character labeling model:
Generate the training file: from tagged corpus,, generate required training file according to the feature extraction feature in the following table;
The position | The path | Centre word and part of speech thereof |
Predicate | The subclass framework | The syntactic constituent type |
Syntactic constituent head-word and tail speech | Syntactic constituent left side sibling type | The predicate syntactic frame |
The verb classification | The previous speech of syntactic constituent | Syntactic constituent father node type |
Compressed path | Whether syntactic constituent has right sibling | Syntactic constituent left side sibling centre word type |
The path of revising | Whether there is root node in the path |
Model generates: utilize the training file to the training of maximum entropy classifiers model, obtain the semantic character labeling model file; This model can obtain through behind enough mark language material training studies, uses this model can identify the semantic role of given predicate in the sentence effectively.
(2) generate the combination learning model:
In existing syntactic analysis model, though can obtain preferable performance based on the syntactic analysis model of vocabulary PCFG, it is low excessively to carry out efficient, and time complexity is 0 (n
5); In contrast to this, based on the syntactic analysis model of historical information, formulate decision-making according to mode from left to right, only need travel through once, the method is owing to only need once to travel through from left to right, so it is higher to carry out efficient, but shortcoming is only to use the chunk information of current location front when making decision, and father node generates early than its child node usually, under the situation that child node does not all generate, also unreliable according to the prediction that residing contextual information is made, certainly will cause performance decrease like this.For example, based on basic phrase recognition result shown in Figure 1, obtain correct syntactic analysis result, then preceding four decision-makings must be { Start S and No, Start VP and No}, being chunk NP (I) generates father node NP and is chunk VBD (saw) generation father node VP, and this moment, newly-generated father's node NP and other son's nodes of VP were not known without exception, i.e. only under the situation that its first son generates, producing of father's node, and this often is difficult to accomplish this point under reality.On the contrary, under the established situation of all son's nodes, predict that its father node then is more prone to relatively with reliable.
Based on this, the present invention has used a kind of level and syntax analytic model based on historical information, and this model is the advantage of above two kinds of models comprehensively, obtains preferable performance with less time complexity.Its basic thought is: in every layer of processing procedure, preferentially identify the chunk of easy identification, so just can provide abundanter contextual information to carry out complicated chunk identification; The chunk of not merged chunk and new identification generation constitutes the input of step processing down jointly, repeats this process until identifying root node.Its process is the recursive procedure that a level is handled, and termination condition is when certain layer of processing, and all chunks are merged into a new chunk, i.e. syntax tree root node.
Complete syntactic analysis process can be divided into: part-of-speech tagging, basic chunk identification and syntactic analysis.The effect of syntactic analysis model is on basic chunk base of recognition, predicts next step the decision-making of each chunk, obtains correct syntactic analysis result.The feature masterplate that generation model is commonly used has: centre word of n tree and part of speech, current syntactic constituent mark and decision-making mark, the contextual feature of current composition etc. (comprise 1 yuan, 2 yuan, 3 yuan etc. information).
On level and syntax analytic model basis, incorporate semantic character labeling information, can obtain combination learning model of the present invention, realize the combination learning of syntactic analysis and semantic character labeling.The conjunctive model that proposes is based on following observation: role's composition of predicate w is its sibling normally, or the sibling of its ancestors' node, and in fact this principle also is widely used in the beta pruning strategy of semantic character labeling.Be made up of two parts based on the conjunctive model that this principle proposes: syntax tree makes up part and character labeling part, when syntax tree structure part generates ancestors' node of a new target verb, just call the semantic character labeling model, judge and mark other son's nodes and the relation of the semantic role between the target verb of ancestors' node, increase other semantic relevant information simultaneously, make it to influence the structure of syntax tree.Because the structure of syntax tree is in the variation all the time, and the major influence factors of this variation is a semantic information, and the input of semantic character labeling model is exactly the syntax tree of current structure, and therefore the semantic role of output is also in continuous adjustment, and syntactic analysis and character labeling are interactional.Specifically describe as follows:
Given predicate w when ancestors' node n ode of a newly-generated w, calls the semantic character labeling model, and the child node of judging node concerns with the semantic role of predicate w.Fig. 2 has provided the syntactic analysis and the semantic character labeling combination learning example schematic of an english sentence.To the intermediate result among Fig. 2 .a 1, the target predicate is VBD (closed), after identifying its father node VP, shown in Fig. 2 .b, call the semantic character labeling model, judge that its sibling PP (at 2569.26) and the semantic role of verb VBD (closed) concern, shown in Fig. 2 .c; And then, after the VP node merges to node S, call the semantic character labeling model, judge the semantic role relation of sibling NP (The Dow Jones industrials) with the predicate VBD (closed) of VP.
Calling the semantic character labeling model, when the semantic role between judgement current node and the predicate concerned, if the role L1 that obtains is a non-NULL, promptly current node was certain semantic role of predicate, then the probable value with this role is increased in system's probability, as shown in Equation (1):
prob(T)=prob(T)*prob(L1) (1)
T*=argmax(Prob(T)) (2)
Wherein, prob (T) is the probability of current generation tree T syntactic analysis, and prob (L1) is the probability that current node is noted as L1.The target of syntactic analysis model then is to seek optimum syntax tree T*, and T* satisfies formula (2) here.
Except this information of semantic role, in the syntactic analysis model, also increase other semantic relevant informations, with the effect of further raising semantic information.
Thus, the method that generates the combination learning model is,
Extract corpus: from treebank, extract the syntactic analysis corpus, comprise semantic feature in the training incident of syntactic analysis;
Generate the training file: on syntactic analysis model common feature basis, add semantic feature, generate the training file;
Described syntactic analysis model common feature is represented by following table:
The centre word of n tree, composition mark and decision-making mark |
The centre word part of speech of n tree, composition mark and decision-making mark |
Composition mark of n tree and decision-making mark |
The contextual feature of n tree (1 yuan, 2 yuan, 3 yuan, 4 yuan etc.) |
Described semantic feature is represented by following table:
Predicate | Current predicate verb itself |
The predicate classification | Verb classification under the predicate |
The path | Current syntactic constituent is to the path of predicate |
The predicate role | Call the predicate semantic role that the semantic character labeling Model Identification goes out |
The semantic character labeling model that adopts step (1) that obtains of semantic feature is realized;
Model generates: utilize the training file to the training of maximum entropy classifiers model, obtain the combination learning model file;
(3) part of speech mark: call the part of speech mark module, given sentence is carried out part-of-speech tagging, the part of speech mark series result of N kind optimum before keeping;
(4) basic phrase identification: call basic phrase identification module,, carry out basic phrase identification, the basic phrase recognition result of N kind optimum before keeping at last respectively to the N kind part of speech mark result of step (3) output;
(5) syntactic analysis: call the combination learning model, with the basic phrase recognition result of N kind of step (4) output as input, syntactic analysis result and semantic character labeling result that output is optimum;
Wherein, N is 10~20 integer, and the N value is excessive, will keep too many useless intermediate result in the resolving, increases system overhead; And the N value is too small, then may lose some correct intermediate results.
Because the technique scheme utilization, the present invention compared with prior art has following advantage:
The present invention can export the syntactic analysis result of a sentence and the semantic character labeling result of given predicate simultaneously by the use of conjunctive model.In the combination learning model, owing to increased semantic information in the syntactic analysis model, the model that makes training come out is more suitable in the semantic character labeling task, and therefore the semantic character labeling performance of model output is higher thus.The syntactic analysis result of conjunctive model output simultaneously compares with the result of single syntactic analysis model, and both performances do not have very big difference, even because the adding of semantic information can also improve the performance of syntactic analysis.
Description of drawings
Fig. 1 is the basic phrase recognition result of sentence " I saw the man with the book ".
Fig. 2 is syntactic analysis and semantic character labeling combination learning example schematic.Wherein sentence is: TheDow Jones industrials closed at 2569.26.
Fig. 3 is the output result of the conjunctive model of Chinese sentence " Sino-U.S. signs an agreement in Shanghai " among the embodiment.
Embodiment
Below in conjunction with drawings and Examples the present invention is further described:
Embodiment: will be classification problem to the mark Task Switching of semantic role, and adopt maximum entropy classifiers to train, and obtain the semantic character labeling model.To the sentence structure analysis task, be divided into part of speech mark subtask, basic phrase recognin task and level and syntax analytic subtask, part of speech mark and basic phrase recognin task adopt the mature modules in the existing syntactic analysis software to finish; When syntactic analysis, call the semantic character labeling model, obtain semantic role information, with basic phrase recognition result and semantic information as input, syntactic analysis result and semantic character labeling result that output is optimum.
The generation of semantic character labeling model:
Generate the training file: from tagged corpus,, generate required training file according to the feature extraction feature in the table 1;
Model generates: adopt maximum entropy model, to the training of training file, obtain the semantic character labeling model;
Table 1
The position | The path | Centre word and part of speech thereof |
Predicate | The subclass framework | The syntactic constituent type |
Syntactic constituent head-word and tail speech | Syntactic constituent left side sibling type | The predicate syntactic frame |
The verb classification | The previous speech of syntactic constituent | Syntactic constituent father node type |
Compressed path | Whether syntactic constituent has right sibling | Syntactic constituent left side sibling centre word type |
The path of revising | Whether there is root node in the path |
The generation of combination learning model:
Extract corpus: from treebank, extract the level and syntax analytic corpus;
Generate the training file: adopt maximum entropy model, add semantic feature (table 2), generate the required tag file of training according to syntactic analysis model common feature (table 3)
Model generates: adopt maximum entropy model, to the training of training file, obtain the combination learning model;
Table 2
Predicate | Current predicate verb itself |
The predicate classification | Verb classification under the predicate |
The path | Current syntactic constituent is to the path of predicate |
The predicate role | Call the predicate semantic role that the semantic character labeling Model Identification goes out |
Table 3
The centre word of n tree, composition mark and decision-making mark |
The centre word part of speech of n tree, composition mark and decision-making mark |
Composition mark of n tree and decision-making mark |
The contextual feature of n tree |
To sentence to be analyzed, carry out successively:
Part of speech mark: call the part of speech mark module, given sentence is carried out part-of-speech tagging, the part of speech mark series result of N kind optimum before keeping;
Basic phrase identification: call basic phrase identification module, carry out basic phrase identification, the basic phrase recognition result of N kind optimum before keeping at last;
Syntactic analysis: call the combination learning model, as input, export optimum syntactic analysis result and semantic character labeling result with the basic phrase recognition result of 3.2 N kind.
Fig. 3 is the output result of the conjunctive model of Chinese sentence " Sino-U.S. signs an agreement in Shanghai ".The node of wherein drawing a circle need call the semantic character labeling model when generating, judge the child node of this node and the semantic relation between the predicate node (signing).Explanation on every limit among the figure is the concrete steps of syntactic analysis.Be described below:
(1) it is the beginning of IP composition to basic phrase NP (Sino-U.S.) mark, promptly is labeled as " S_IP "
(2) judge whether this phrase finishes,, be designated as " NO " as for not
(3) to basic phrase P () be labeled as the beginning (" S_PP ") of PP composition
(4) phrase does not finish identification, is designated as " NO "
(5) basic phrase NP (Shanghai) is labeled as the continuation (" J_PP ") of PP composition
(6) phrase end of identification is designated as " YES ", forms syntactic constituent PP;
(7) composition PP is labeled as the beginning (" S_VP ") of VP composition
(8) phrase does not finish identification, is designated as " NO "
(9) basic phrase W (signing) is labeled as the beginning (" S_VP ") of VP composition
(10) phrase does not finish identification, is designated as " NO "
(11) basic phrase NP (agreement) is labeled as the continuation (" J_VP ") of VP composition
(12) phrase end of identification is designated as " YES ", forms syntactic constituent VP; Because VP is the father node of predicate verb W (signing), therefore call the semantic character labeling model, judge another child's node NP (agreement) of this node and the relation between the predicate node W (signing), drawing NP (agreement) is the A1 role of W, revises the probability of current syntactic analysis.
(13) composition VP is labeled as the continuation (" J_VP ") of another VP composition
(14) phrase end of identification is designated as " YES ", forms syntactic constituent VP; Because VP is ancestors' node of predicate verb W (signing), therefore call the semantic character labeling model, judge another child's node PP of this node and the relation between the predicate node W, drawing PP is the AM-LOC role of W, revises the probability of current syntactic analysis.
(15) composition VP is labeled as the continuation (" J_IP ") of IP composition
(16) phrase end of identification is designated as " YES ", forms syntactic constituent IP; Because IP is ancestors' node of predicate verb W (signing), therefore call the semantic character labeling model, judge another child's node NP (Sino-U.S.) of this node and the relation between the predicate node W, drawing NP is the A0 role of W, revises the probability of current syntactic analysis.
Claims (1)
1. the semantic character labeling method of natural language sentences is characterized in that: adopt the combination learning model, realize Chinese syntactic analysis and semantic character labeling simultaneously, comprise the steps:
(1) generative semantics character labeling model:
Generate the training file: from tagged corpus,, generate required training file according to the feature extraction feature in the following table;
Model generates: utilize the training file to the training of maximum entropy classifiers model, obtain the semantic character labeling model file;
(2) generate the combination learning model:
Extract corpus: from treebank, extract the syntactic analysis corpus, comprise semantic feature in the training incident of syntactic analysis;
Generate the training file: on syntactic analysis model common feature basis, add semantic feature, generate the training file;
Described syntactic analysis model common feature is represented by following table:
Described semantic feature is represented by following table:
The semantic character labeling model that adopts step (1) that obtains of semantic feature is realized;
Model generates: utilize the training file to the training of maximum entropy classifiers model, obtain the combination learning model file;
(3) part of speech mark: call the part of speech mark module, given sentence is carried out part-of-speech tagging, the part of speech mark series result of N kind optimum before keeping;
(4) basic phrase identification: call basic phrase identification module,, carry out basic phrase identification, the basic phrase recognition result of N kind optimum before keeping at last respectively to the N kind part of speech mark result of step (3) output;
(5) syntactic analysis: call the combination learning model, with the basic phrase recognition result of N kind of step (4) output as input, syntactic analysis result and semantic character labeling result that output is optimum;
Wherein, N is 10~20 integer.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2008102436058A CN101446942A (en) | 2008-12-10 | 2008-12-10 | Semantic character labeling method of natural language sentence |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2008102436058A CN101446942A (en) | 2008-12-10 | 2008-12-10 | Semantic character labeling method of natural language sentence |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101446942A true CN101446942A (en) | 2009-06-03 |
Family
ID=40742626
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2008102436058A Pending CN101446942A (en) | 2008-12-10 | 2008-12-10 | Semantic character labeling method of natural language sentence |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101446942A (en) |
Cited By (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101908042A (en) * | 2010-08-09 | 2010-12-08 | 中国科学院自动化研究所 | Tagging method of bilingual combination semantic role |
CN101937430A (en) * | 2010-09-03 | 2011-01-05 | 清华大学 | Method for extracting event sentence pattern from Chinese sentence |
CN102375808A (en) * | 2010-08-20 | 2012-03-14 | 山西大学 | Method and device for frame disambiguation during labeling by Chinese frame net |
CN102662931A (en) * | 2012-04-13 | 2012-09-12 | 厦门大学 | Semantic role labeling method based on synergetic neural network |
CN102799577A (en) * | 2012-08-17 | 2012-11-28 | 苏州大学 | Extraction method of semantic relation between Chinese entities |
CN103150405A (en) * | 2013-03-29 | 2013-06-12 | 苏州大学 | Classification model modeling method, Chinese cross-textual reference resolution method and system |
CN103176963A (en) * | 2013-03-08 | 2013-06-26 | 北京理工大学 | Chinese sentence meaning structure model automatic labeling method based on CRF ++ |
CN106021225A (en) * | 2016-05-12 | 2016-10-12 | 大连理工大学 | Chinese maximal noun phrase (MNP) identification method based on Chinese simple noun phrases (SNPs) |
CN106776576A (en) * | 2016-12-29 | 2017-05-31 | 竹间智能科技(上海)有限公司 | Subordinate sentence and semantic role labeling method, system based on CoNLL forms |
CN107526724A (en) * | 2017-08-22 | 2017-12-29 | 北京百度网讯科技有限公司 | For marking the method and device of language material |
CN107818082A (en) * | 2017-09-25 | 2018-03-20 | 沈阳航空航天大学 | With reference to the semantic role recognition methods of phrase structure tree |
CN108604228A (en) * | 2016-02-09 | 2018-09-28 | 国际商业机器公司 | System and method for the language feature generation that multilayer word indicates |
CN108647254A (en) * | 2018-04-23 | 2018-10-12 | 苏州大学 | Automatic treebank method for transformation and system based on pattern insertion |
CN108959240A (en) * | 2017-05-26 | 2018-12-07 | 上海醇聚信息科技有限公司 | A kind of proprietary ontology automatic creation system and method |
CN109710914A (en) * | 2017-10-26 | 2019-05-03 | 饶竹一 | Semantic training system and its method based on business model |
CN110634336A (en) * | 2019-08-22 | 2019-12-31 | 北京达佳互联信息技术有限公司 | Method and device for generating audio electronic book |
CN110781685A (en) * | 2019-10-18 | 2020-02-11 | 四川长虹电器股份有限公司 | Method for automatically marking correctness of semantic analysis result based on user feedback |
US11520982B2 (en) | 2019-09-27 | 2022-12-06 | Sap Se | Generating corpus for training and validating machine learning model for natural language processing |
WO2023236240A1 (en) * | 2022-06-09 | 2023-12-14 | 深圳计算科学研究院 | Data screening method and apparatus based on reinforcement learning |
-
2008
- 2008-12-10 CN CNA2008102436058A patent/CN101446942A/en active Pending
Cited By (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012019478A1 (en) * | 2010-08-09 | 2012-02-16 | 中国科学院自动化研究所 | Method for labeling semantic role of bilingual parallel sentence pair |
CN101908042A (en) * | 2010-08-09 | 2010-12-08 | 中国科学院自动化研究所 | Tagging method of bilingual combination semantic role |
US9411802B2 (en) | 2010-08-09 | 2016-08-09 | Institute Of Automation, Chinese Academy Of Sciences | Method for labeling semantic role of bilingual parallel sentence pair |
CN101908042B (en) * | 2010-08-09 | 2016-04-13 | 中国科学院自动化研究所 | A kind of mask method of bilingual combination semantic role |
CN102375808A (en) * | 2010-08-20 | 2012-03-14 | 山西大学 | Method and device for frame disambiguation during labeling by Chinese frame net |
CN102375808B (en) * | 2010-08-20 | 2016-03-23 | 山西大学 | A kind of frame disambiguation during labeling by Chinese frame net method and device |
CN101937430B (en) * | 2010-09-03 | 2015-04-15 | 清华大学 | Method for extracting event sentence pattern from Chinese sentence |
CN101937430A (en) * | 2010-09-03 | 2011-01-05 | 清华大学 | Method for extracting event sentence pattern from Chinese sentence |
CN102662931A (en) * | 2012-04-13 | 2012-09-12 | 厦门大学 | Semantic role labeling method based on synergetic neural network |
CN102662931B (en) * | 2012-04-13 | 2015-03-25 | 厦门大学 | Semantic role labeling method based on synergetic neural network |
CN102799577B (en) * | 2012-08-17 | 2016-08-03 | 苏州大学 | A kind of Chinese inter-entity semantic relation extraction method |
CN102799577A (en) * | 2012-08-17 | 2012-11-28 | 苏州大学 | Extraction method of semantic relation between Chinese entities |
CN103176963A (en) * | 2013-03-08 | 2013-06-26 | 北京理工大学 | Chinese sentence meaning structure model automatic labeling method based on CRF ++ |
CN103150405B (en) * | 2013-03-29 | 2014-12-10 | 苏州大学 | Classification model modeling method, Chinese cross-textual reference resolution method and system |
CN103150405A (en) * | 2013-03-29 | 2013-06-12 | 苏州大学 | Classification model modeling method, Chinese cross-textual reference resolution method and system |
CN108604228A (en) * | 2016-02-09 | 2018-09-28 | 国际商业机器公司 | System and method for the language feature generation that multilayer word indicates |
CN108604228B (en) * | 2016-02-09 | 2022-12-02 | 国际商业机器公司 | System and method for linguistic feature generation for multi-layered word representations |
CN106021225A (en) * | 2016-05-12 | 2016-10-12 | 大连理工大学 | Chinese maximal noun phrase (MNP) identification method based on Chinese simple noun phrases (SNPs) |
CN106021225B (en) * | 2016-05-12 | 2018-12-21 | 大连理工大学 | A kind of Chinese Maximal noun phrase recognition methods based on the simple noun phrase of Chinese |
CN106776576A (en) * | 2016-12-29 | 2017-05-31 | 竹间智能科技(上海)有限公司 | Subordinate sentence and semantic role labeling method, system based on CoNLL forms |
CN108959240A (en) * | 2017-05-26 | 2018-12-07 | 上海醇聚信息科技有限公司 | A kind of proprietary ontology automatic creation system and method |
CN107526724A (en) * | 2017-08-22 | 2017-12-29 | 北京百度网讯科技有限公司 | For marking the method and device of language material |
CN107818082A (en) * | 2017-09-25 | 2018-03-20 | 沈阳航空航天大学 | With reference to the semantic role recognition methods of phrase structure tree |
CN107818082B (en) * | 2017-09-25 | 2020-12-04 | 沈阳航空航天大学 | Semantic role recognition method combined with phrase structure tree |
CN109710914A (en) * | 2017-10-26 | 2019-05-03 | 饶竹一 | Semantic training system and its method based on business model |
CN108647254A (en) * | 2018-04-23 | 2018-10-12 | 苏州大学 | Automatic treebank method for transformation and system based on pattern insertion |
CN108647254B (en) * | 2018-04-23 | 2021-06-22 | 苏州大学 | Automatic tree library conversion method and system based on pattern embedding |
CN110634336A (en) * | 2019-08-22 | 2019-12-31 | 北京达佳互联信息技术有限公司 | Method and device for generating audio electronic book |
US11520982B2 (en) | 2019-09-27 | 2022-12-06 | Sap Se | Generating corpus for training and validating machine learning model for natural language processing |
CN110781685A (en) * | 2019-10-18 | 2020-02-11 | 四川长虹电器股份有限公司 | Method for automatically marking correctness of semantic analysis result based on user feedback |
CN110781685B (en) * | 2019-10-18 | 2022-08-19 | 四川长虹电器股份有限公司 | Method for automatically marking correctness of semantic analysis result based on user feedback |
WO2023236240A1 (en) * | 2022-06-09 | 2023-12-14 | 深圳计算科学研究院 | Data screening method and apparatus based on reinforcement learning |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101446942A (en) | Semantic character labeling method of natural language sentence | |
CN109146610B (en) | Intelligent insurance recommendation method and device and intelligent insurance robot equipment | |
CN107315737B (en) | Semantic logic processing method and system | |
Honnibal et al. | Joint incremental disfluency detection and dependency parsing | |
US11501082B2 (en) | Sentence generation method, sentence generation apparatus, and smart device | |
US8082151B2 (en) | System and method of generating responses to text-based messages | |
CN105718586A (en) | Word division method and device | |
CN109145276A (en) | A kind of text correction method after speech-to-text based on phonetic | |
CN110197279B (en) | Transformation model training method, device, equipment and storage medium | |
CN105138507A (en) | Pattern self-learning based Chinese open relationship extraction method | |
CN105608218A (en) | Intelligent question answering knowledge base establishment method, establishment device and establishment system | |
WO2008107305A2 (en) | Search-based word segmentation method and device for language without word boundary tag | |
CN111159385A (en) | Template-free universal intelligent question-answering method based on dynamic knowledge graph | |
CN112151015B (en) | Keyword detection method, keyword detection device, electronic equipment and storage medium | |
CN110853625B (en) | Speech recognition model word segmentation training method and system, mobile terminal and storage medium | |
CN110188359B (en) | Text entity extraction method | |
CN111832299A (en) | Chinese word segmentation system | |
CN110377695B (en) | Public opinion theme data clustering method and device and storage medium | |
CN110263154A (en) | A kind of network public-opinion emotion situation quantization method, system and storage medium | |
CN110232127A (en) | File classification method and device | |
CN107256212A (en) | Chinese search word intelligence cutting method | |
CN104516874A (en) | Method and system for parsing dependency of noun phrases | |
CN108763202A (en) | Method, apparatus, equipment and the readable storage medium storing program for executing of the sensitive text of identification | |
CN115064154A (en) | Method and device for generating mixed language voice recognition model | |
CN110717316B (en) | Topic segmentation method and device for subtitle dialog flow |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20090603 |