CN113792542A - Intention understanding method fusing syntactic analysis and semantic role pruning - Google Patents

Intention understanding method fusing syntactic analysis and semantic role pruning Download PDF

Info

Publication number
CN113792542A
CN113792542A CN202111184182.9A CN202111184182A CN113792542A CN 113792542 A CN113792542 A CN 113792542A CN 202111184182 A CN202111184182 A CN 202111184182A CN 113792542 A CN113792542 A CN 113792542A
Authority
CN
China
Prior art keywords
pruning
semantic
sentence
words
semantic role
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111184182.9A
Other languages
Chinese (zh)
Inventor
张丹
董晓飞
张学强
曹峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing New Generation Artificial Intelligence Research Institute Co ltd
China Academy of Information and Communications Technology CAICT
Original Assignee
Nanjing New Generation Artificial Intelligence Research Institute Co ltd
China Academy of Information and Communications Technology CAICT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing New Generation Artificial Intelligence Research Institute Co ltd, China Academy of Information and Communications Technology CAICT filed Critical Nanjing New Generation Artificial Intelligence Research Institute Co ltd
Priority to CN202111184182.9A priority Critical patent/CN113792542A/en
Publication of CN113792542A publication Critical patent/CN113792542A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/211Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Abstract

The invention relates to the technical field of semantic analysis in natural language processing, and discloses an intention understanding method integrating syntactic analysis and semantic role pruning, which comprises a preprocessing module, a semantic matching module and a pruning module, wherein the intention understanding method comprises the steps of jointly coding and modeling a text obtained by sentence component pruning and semantic role pruning and an original input text, the sentence component pruning reserves main sentence components such as a subject, a predicate and an object of a sentence, the semantic role pruning reserves core arguments in the sentence including events, subjects, ranges, action starts, action ends and other verb-related argument roles, and the pruning has the function of eliminating other parts which are not sentence main components or core arguments and highlighting main semantics of the sentence. The invention has the advantages that the semantics are strengthened aiming at the input sentences with too long sentences, too many description words or multiple intentions of the user; the main words expressing sentence semantics through dependency syntax analysis and semantic role labeling enhance semantic information in a form of joint input coding.

Description

Intention understanding method fusing syntactic analysis and semantic role pruning
Technical Field
The invention relates to the technical field of semantic analysis in natural language processing, in particular to an intention understanding method integrating syntactic analysis and semantic role pruning.
Background
The intention understanding refers to accurately understanding the intention of a user from the perspective of semantics based on a user input text, at present, a common method for intention understanding is to match the user input with a standard question in a knowledge base through a semantic similarity model so as to determine the user semantics, but when the user input has too long sentences, too many description words or too many intentions of the user, the semantic similarity model is directly used for matching the user input text with the standard question in the knowledge base, so that the corresponding standard question is difficult to find in the knowledge base, and the intention of the user is difficult to determine, dependency syntax analysis and semantic role labeling are key bottom-layer technologies in natural language processing, the basic task of dependency syntax analysis is to identify the dependency relationship between words in the sentences, determine the syntax structure of the sentences, and the semantic label role mainly studies the relationship between each component in the sentences and predicates, and describing the relationship between the semantic characters by the semantic characters, carrying out syntactic structure analysis on a text input by a user through dependency syntactic analysis, determining the main structure of a sentence, marking the semantic characters to obtain the argument in the sentence, determining the argument roles of the affairs, the affairs and the like in the sentence, pruning through the syntactic relationship, removing redundant descriptors in the sentence, judging whether a plurality of intentions exist, inputting the result into a semantic similarity model, enhancing semantic information and further improving the accuracy of intention understanding.
Disclosure of Invention
Technical problem to be solved
Aiming at the defects of the prior art, the invention provides an intention understanding method integrating syntactic analysis and semantic role pruning, which has the advantages of strengthening semantics aiming at input sentences with too long sentences, too many description words or multiple intentions of users and strengthening semantic information in a form of joint input coding by using main words expressing the semantics of the sentences through dependency syntactic analysis and semantic role labeling, and solves the problem of difficult processing aiming at the sentences with too long sentences, too many description words or the multiple intentions of the users.
(II) technical scheme
In order to achieve the purpose, the invention provides the following technical scheme: an intention understanding method fusing syntactic analysis and semantic role pruning comprises a preprocessing module, a semantic matching module and a pruning module, wherein the pruning module comprises a pruning algorithm, and sentence component pruning comprises the following steps:
step 1: determining a central word;
step 2: determining a subject;
and step 3: determining an object;
semantic role pruning comprises the following steps:
step 1: determining sentence predicates and marking;
step 2: determining core arguments, reserving the core arguments in the sentence, including the roles of the actors relevant to the construction, the subject, the scope, the action start, the action end and other verbs, and marking;
and step 3: and pruning other arguments which are not directly related to the predicates, such as time, place, purpose, degree and the like.
Preferably, the intention understanding system fusing syntactic analysis and semantic role pruning mainly comprises a preprocessing module, a pruning module, a semantic matching module and a result output module.
Preferably, the determining the core word obtains a root node of the dependency syntax tree as an initial core word, determines whether there is a word which is dependent on the core word and has an interlocking relationship, and if so, adds the word into the core word list.
Preferably, the subject is determined so as to obtain, starting from the initial headword, a word that depends on the initial headword and on which the dependency relationship is the primary predicate relationship, as a main word.
Preferably, starting from the initial core word, obtaining a word depending on the initial core word and the dependency relationship being the motile-guest relationship, taking the word as the guest word, judging whether a word depending on the guest word and the dependency relationship being the parallel relationship exists, and if so, adding the word into the guest word list.
Preferably, the semantic matching module inputs the sentence components, semantic roles and original sentences obtained after pruning into the semantic matching model together.
Compared with the prior art, the invention provides the intention understanding method integrating the syntactic analysis and the semantic role pruning, and the intention understanding method has the following beneficial effects:
1. when the method is used, a user inputs an information text, the information text is processed and output in preprocessing, the participle in the input sentence can be processed in three links of part of speech tagging, dependency grammar analysis and semantic role tagging, sentence components are analyzed by dependency syntax, a semantic role is obtained after semantic role tagging, wherein the sentence components and the semantic role are pruning parts, and the sentence processed by a pruning algorithm is subjected to semantic matching to finally output the intention of the user.
2. The invention relates to an intention understanding method integrating syntactic analysis and semantic role pruning, which comprises the steps of carrying out dependency syntactic analysis and semantic role labeling on a user input text, pruning an input sentence according to a syntactic structure and a semantic role, respectively obtaining sentence components and semantic roles for expressing sentence semantics, inputting the sentence components and the semantic roles into a semantic similarity model together with an original sentence for intention matching, and enhancing the semantic information of the sentence through the sentence components and the semantic roles so as to improve the accuracy of intention understanding, and has the advantages of strengthening the semantics aiming at the input sentence with too long sentence, too many description words or multiple intentions of a user; the main words expressing sentence semantics through dependency syntax analysis and semantic role labeling enhance semantic information in a form of joint input coding.
Drawings
FIG. 1 is a diagram of the system framework method for understanding the intent of merging syntactic analysis and semantic role pruning according to the present invention
Schematic representation.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, an intention understanding method for merging syntactic analysis and semantic role pruning includes a preprocessing module, a semantic matching module, and a pruning module, where the pruning module includes a pruning algorithm, and the sentence component pruning includes the following steps:
step 1: determining a central word, acquiring a root node of a dependency syntax tree as an initial central word, judging whether a word which depends on the central word and is in a linkage relation exists or not, and if so, adding the word into a central word list;
step 2: determining a subject, starting from the initial central word, acquiring words which depend on the initial central word and have a dependency relationship as a main-predicate relationship, taking the words as main words, judging whether words which depend on the main words and have a dependency relationship of a parallel relationship exist, and if the words exist, adding the words into a main word list;
and step 3: determining an object, starting from the initial central word, acquiring words which depend on the initial central word and depend on the relation of the initial central word and are in the action-guest relation, taking the words as guest words, judging whether words which depend on the guest words and have the parallel relation of the dependence relations exist or not, and if the words exist, adding the words into an object word list;
semantic role pruning comprises the following steps:
step 1: determining sentence predicates and marking;
step 2: determining core arguments, reserving the core arguments in the sentence, including the roles of the actors relevant to the construction, the subject, the scope, the action start, the action end and other verbs, and marking;
and step 3: and pruning other arguments which are not directly related to the predicates, such as time, place, purpose, degree and the like.
Semantic matching: the sentence components, semantic roles and the original sentences obtained after pruning are input into the semantic matching model together
And (4) outputting a result: and obtaining the user intention according to the standard question intention matched by the semantic matching result.
The working principle is as follows: when the intention understanding method integrating syntactic analysis and semantic role pruning is used, a user inputs an information text, the information text is processed and output in preprocessing, participles in the input sentences are processed in three links of part-of-speech tagging, dependency syntactic analysis and semantic role tagging, sentence components are analyzed by dependency syntactic, semantic role tagging is obtained after semantic role tagging processing, wherein the sentence components and the semantic roles are pruning parts, sentences processed by a pruning algorithm are subjected to semantic matching, and finally the intention of the user is output. The semantic information of the sentence is enhanced through the sentence components and the semantic roles so as to improve the accuracy of intention understanding, and the method has the advantages that the semantics is enhanced aiming at the input sentence with too long sentence, too many description words or multiple intentions of the user; the main words expressing sentence semantics through dependency syntax analysis and semantic role labeling enhance semantic information in a form of joint input coding.
Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims (6)

1. An intention understanding method fusing syntactic analysis and semantic role pruning is characterized by comprising the following steps of: the system comprises a preprocessing module, a semantic matching module, semantic role pruning, sentence component pruning, a result output module and a pruning module, wherein the sentence component pruning comprises the following steps:
step 1: determining a central word;
step 2: determining a subject;
and step 3: determining an object;
the semantic role pruning comprises the following steps:
step 1: determining sentence predicates and marking;
step 2: determining core arguments, reserving the core arguments in the sentence, including the roles of the actors relevant to the construction, the subject, the scope, the action start, the action end and other verbs, and marking;
and step 3: and pruning other arguments which are not directly related to the predicates, such as time, place, purpose, degree and the like.
2. The method for understanding intent with fused syntactic analysis and semantic role pruning according to claim 1, wherein: the intention understanding system fusing syntactic analysis and semantic role pruning mainly comprises a preprocessing module, a pruning module, a semantic matching module and a result output module.
3. The method for understanding intent with fused syntactic analysis and semantic role pruning according to claim 1, wherein: and determining the central word to obtain a root node of the dependency syntax tree as an initial central word, judging whether a word which is dependent on the central word and is in an interlocking relationship exists or not, and if so, adding the word into the central word list.
4. The method for understanding intent with fused syntactic analysis and semantic role pruning according to claim 1, wherein: and the determined subject starts from the initial central word, and obtains words which depend on the initial central word and depend on the relationship as the main-meaning relationship, and the words are used as main words.
5. The method for understanding intent with fused syntactic analysis and semantic role pruning according to claim 1, wherein: and starting from the initial central word, obtaining words which depend on the initial central word and have the dependency relationship of the action guest relationship, using the words as guest words, judging whether the words which depend on the guest words and have the dependency relationship of the parallel relationship exist or not, and if the words exist, adding the words into the guest word list.
6. The method for understanding intent with fused syntactic analysis and semantic role pruning according to claim 1, wherein: and the semantic matching module inputs the sentence components, the semantic roles and the original sentences obtained after pruning into the semantic matching model together.
CN202111184182.9A 2021-10-12 2021-10-12 Intention understanding method fusing syntactic analysis and semantic role pruning Pending CN113792542A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111184182.9A CN113792542A (en) 2021-10-12 2021-10-12 Intention understanding method fusing syntactic analysis and semantic role pruning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111184182.9A CN113792542A (en) 2021-10-12 2021-10-12 Intention understanding method fusing syntactic analysis and semantic role pruning

Publications (1)

Publication Number Publication Date
CN113792542A true CN113792542A (en) 2021-12-14

Family

ID=79184929

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111184182.9A Pending CN113792542A (en) 2021-10-12 2021-10-12 Intention understanding method fusing syntactic analysis and semantic role pruning

Country Status (1)

Country Link
CN (1) CN113792542A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114662469A (en) * 2022-02-25 2022-06-24 北京百度网讯科技有限公司 Emotion analysis method and device, electronic equipment and storage medium
CN115577090A (en) * 2022-12-07 2023-01-06 北京云迹科技股份有限公司 Idiom understanding-based voice conversation method, device, equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090204605A1 (en) * 2008-02-07 2009-08-13 Nec Laboratories America, Inc. Semantic Search Via Role Labeling
CN101510221A (en) * 2009-02-17 2009-08-19 北京大学 Enquiry statement analytical method and system for information retrieval
CN104298658A (en) * 2014-10-29 2015-01-21 百度在线网络技术(北京)有限公司 Method and device for acquiring search result
KR101646159B1 (en) * 2015-04-23 2016-08-05 한림대학교 산학협력단 The method and apparatus for analyzing sentence based on semantic role labeling
CN108460015A (en) * 2018-02-08 2018-08-28 合肥工业大学 Text emotion grouped data enhances analysis method
CN111723564A (en) * 2020-05-27 2020-09-29 西安交通大学 Event extraction and processing method for case-following electronic file

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090204605A1 (en) * 2008-02-07 2009-08-13 Nec Laboratories America, Inc. Semantic Search Via Role Labeling
CN101510221A (en) * 2009-02-17 2009-08-19 北京大学 Enquiry statement analytical method and system for information retrieval
CN104298658A (en) * 2014-10-29 2015-01-21 百度在线网络技术(北京)有限公司 Method and device for acquiring search result
KR101646159B1 (en) * 2015-04-23 2016-08-05 한림대학교 산학협력단 The method and apparatus for analyzing sentence based on semantic role labeling
CN108460015A (en) * 2018-02-08 2018-08-28 合肥工业大学 Text emotion grouped data enhances analysis method
CN111723564A (en) * 2020-05-27 2020-09-29 西安交通大学 Event extraction and processing method for case-following electronic file

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114662469A (en) * 2022-02-25 2022-06-24 北京百度网讯科技有限公司 Emotion analysis method and device, electronic equipment and storage medium
CN115577090A (en) * 2022-12-07 2023-01-06 北京云迹科技股份有限公司 Idiom understanding-based voice conversation method, device, equipment and storage medium

Similar Documents

Publication Publication Date Title
US10706236B1 (en) Applied artificial intelligence technology for using natural language processing and concept expression templates to train a natural language generation system
Pust et al. Parsing English into abstract meaning representation using syntax-based machine translation
Cetto et al. Graphene: Semantically-linked propositions in open information extraction
WO2017198031A1 (en) Semantic parsing method and apparatus
KR102080081B1 (en) Automatic review program of poisonous clauses and omission clauses in international construction contracts
WO2017163346A1 (en) Text analysis system and program
CN112541337B (en) Document template automatic generation method and system based on recurrent neural network language model
AU2019265874B2 (en) Systems and methods for document deviation detection
CN113792542A (en) Intention understanding method fusing syntactic analysis and semantic role pruning
CN111382571A (en) Information extraction method, system, server and storage medium
CN112733547A (en) Chinese question semantic understanding method by utilizing semantic dependency analysis
Sibarani et al. A study of parsing process on natural language processing in bahasa Indonesia
Argamon-Engelson et al. A memory-based approach to learning shallow natural language patterns
Wax Automated grammar engineering for verbal morphology
Zamaraeva et al. Handling cross-cutting properties in automatic inference of lexical classes: A case study of Chintang
CN111178080A (en) Named entity identification method and system based on structured information
CN113609838A (en) Document information extraction and mapping method and system
CN111091009B (en) Document association auditing method based on semantic analysis
CN114417008A (en) Construction engineering field-oriented knowledge graph construction method and system
CN115374258A (en) Knowledge base query method and system combining semantic understanding with question template
Ferraro et al. NLP Techniques for Normative Mining.
Tianwen et al. Evaluate the chinese version of machine translation based on perplexity analysis
Korobkin et al. Extraction of structural elements of inventions from Russian-language patents
JP4033093B2 (en) Natural language processing system, natural language processing method, and computer program
Baracho et al. Sentiment analysis in social networks

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination