CN111858840A - Method and device for intent recognition based on concept map - Google Patents


Info

Publication number
CN111858840A
Authority
CN
China
Prior art keywords
intention
question
user
concept
keyword
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910329359.6A
Other languages
Chinese (zh)
Inventor
魏誉荧
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Genius Technology Co Ltd
Original Assignee
Guangdong Genius Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Genius Technology Co Ltd
Priority to CN201910329359.6A
Publication of CN111858840A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/3332 Query translation
    • G06F16/3334 Selection or weighting of terms from queries, including natural language queries
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36 Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367 Ontology

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Artificial Intelligence (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The invention belongs to the field of intent recognition and discloses a method and device for intent recognition based on a concept map. The method comprises: collecting a user's voice information; performing grammatical analysis on the voice information and extracting the question words and keywords in it; matching the question words against the nodes of a pre-built question concept map to obtain a matching node; and obtaining the user's intent according to the intent concept corresponding to the matching node and the keywords. The invention organizes a variety of casual question phrasings into a question concept map, so that the user's intent can be obtained more easily and accurately when the question words in the voice information are looked up in that map; and because the levels of the phrasings in the map are clearly organized, search and recognition are faster.

Figure 201910329359

Description

Method and device for intent recognition based on a concept map

Technical field

The invention belongs to the technical field of semantic recognition, and in particular relates to a method and device for intent recognition based on a concept map.

Background

With the rapid development of smart terminals and network technology, people increasingly rely on smart terminals to meet all kinds of needs. Natural language, the most convenient and natural way for people to express their thoughts, has gradually become the mainstream mode of human-computer interaction in intelligent services.

In human-computer interaction, intent recognition is an indispensable step. It analyzes the speech input by the user to understand the user's intent, converts that intent into a structured data format a machine can process, and then produces the corresponding feedback. Accurate recognition of the user's intent is therefore the basis for a correct response. Students in the lower grades, however, are still learning the language, so their utterances may be incomplete or casually phrased. If a conventional intent recognition method is used in voice-enabled electronic products for students, intent recognition is prone to error, the smart terminal cannot respond correctly, and the user experience suffers.

Summary of the invention

The purpose of the invention is to provide a method and device for intent recognition based on a concept map, which organize a variety of casual question phrasings into a question concept map so that the user's intent can be obtained more easily and more accurately.

The technical solution provided by the invention is as follows:

In one aspect, a method for intent recognition based on a concept map is provided, comprising:

collecting a user's voice information;

performing grammatical analysis on the voice information, and extracting the question words and keywords in the voice information;

matching the question words against the nodes of a question concept map to obtain a matching node, wherein the question concept map is pre-built;

obtaining the user's intent according to the intent concept corresponding to the matching node and the keywords.

Further preferably, the question concept map is constructed by:

collecting a large corpus of user utterances;

extracting the question words from the corpus;

obtaining an intent concept for the question words in the corpus that share the same intent;

establishing a mapping between the question words and the intent concepts;

constructing the question concept map according to the mapping between the question words and the intent concepts.

Further preferably, obtaining the user's intent according to the intent concept corresponding to the matching node and the keywords specifically comprises:

searching a preset lexicon for a target word that matches the keyword;

determining the semantics of the keyword according to the semantics of the target word;

obtaining the user's intent according to the intent concept corresponding to the matching node and the semantics of the keyword.

Further preferably, after performing grammatical analysis on the voice information and extracting the question words and keywords in the voice information, the method further comprises:

when the keywords contain a specific keyword, receiving a touch signal from the user;

obtaining image information of the region selected by the user according to the touch signal;

wherein obtaining the user's intent according to the intent concept corresponding to the matching node and the keywords specifically comprises:

obtaining the user's intent according to the intent concept corresponding to the matching node, the keywords, and the image information.

Further preferably, obtaining the user's intent according to the intent concept corresponding to the matching node, the keywords, and the image information specifically comprises:

recognizing the text information in the image information;

replacing the specific keyword among the keywords with the text information;

obtaining the user's intent according to the intent concept corresponding to the matching node, the keywords, and the text information.
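The substitution step can be illustrated with a short sketch. Text recognition itself is out of scope here, so the recognized text is passed in as a plain string; the function name `substitute_specific_keyword` and its signature are hypothetical, not taken from the patent.

```python
def substitute_specific_keyword(keywords, specific_keyword, recognized_text):
    """Replace the demonstrative in each keyword with text read from the image.

    `recognized_text` is assumed to come from a separate text-recognition
    step applied to the user-selected image region (not shown here).
    """
    return [kw.replace(specific_keyword, recognized_text) for kw in keywords]


# "这个字" ("this character") is replaced by the character the user selected.
print(substitute_specific_keyword(["这个字"], "这个字", "兢"))
```

Keywords that do not contain the specific keyword pass through unchanged, so the result can be combined with the intent concept in the same way as ordinary keywords.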

In another aspect, a device for intent recognition based on a concept map is also provided, comprising:

a collection module, configured to collect a user's voice information;

an extraction module, configured to perform grammatical analysis on the voice information and extract the question words and keywords in the voice information;

a matching module, configured to match the question words against the nodes of a question concept map to obtain a matching node, wherein the question concept map is pre-built;

an intent acquisition module, configured to obtain the user's intent according to the intent concept corresponding to the matching node and the keywords.

Further preferably, the device also comprises a construction module;

the construction module comprises:

a corpus collection unit, configured to collect a large corpus of user utterances;

a word extraction unit, configured to extract the question words from the corpus;

a concept acquisition unit, configured to obtain an intent concept for the question words in the corpus that share the same intent;

a relationship establishing unit, configured to establish a mapping between the question words and the intent concepts;

a construction unit, configured to construct the question concept map according to the mapping between the question words and the intent concepts.

Further preferably, the intent acquisition module comprises:

a word search unit, configured to search a preset lexicon for a target word that matches the keyword;

a semantic determination unit, configured to determine the semantics of the keyword according to the semantics of the target word;

an intent acquisition unit, configured to obtain the user's intent according to the intent concept corresponding to the matching node and the semantics of the keyword.

Further preferably, the device also comprises:

a signal receiving module, configured to receive a touch signal from the user when the keywords contain a specific keyword;

an image acquisition module, configured to obtain image information of the region selected by the user according to the touch signal;

the intent acquisition module being further configured to obtain the user's intent according to the intent concept corresponding to the matching node, the keywords, and the image information.

Further preferably, the intent acquisition module comprises:

a recognition unit, configured to recognize the text information in the image information;

a replacement unit, configured to replace the specific keyword among the keywords with the text information;

an intent acquisition unit, configured to obtain the user's intent according to the intent concept corresponding to the matching node, the keywords, and the text information.

Compared with the prior art, the method and device for intent recognition based on a concept map provided by the invention have the following beneficial effects: the invention organizes a variety of casual question phrasings into a question concept map, so that the user's intent can be obtained more easily and accurately when the question words in the voice information are looked up in that map; and because the levels of the phrasings in the map are clearly organized, search and recognition are faster.

Description of the drawings

The preferred embodiments are described below in a clear and easily understood manner with reference to the accompanying drawings, to further explain the above characteristics, technical features, advantages, and implementations of the method and device for intent recognition based on a concept map.

FIG. 1 is a schematic flowchart of a first embodiment of the concept-map-based intent recognition method of the invention;

FIG. 2 is a schematic flowchart of a second embodiment of the concept-map-based intent recognition method of the invention;

FIG. 3 is a schematic flowchart of a third embodiment of the concept-map-based intent recognition method of the invention;

FIG. 4 is a schematic flowchart of a fourth embodiment of the concept-map-based intent recognition method of the invention;

FIG. 5 is a schematic flowchart of a fifth embodiment of the concept-map-based intent recognition method of the invention;

FIG. 6 is a schematic structural block diagram of one embodiment of the concept-map-based intent recognition device of the invention;

FIG. 7 is a schematic structural block diagram of another embodiment of the concept-map-based intent recognition device of the invention.

Explanation of reference numerals

100, collection module; 200, extraction module; 300, matching module; 400, intent acquisition module; 410, word search unit; 420, semantic determination unit; 430, intent acquisition unit; 440, recognition unit; 450, replacement unit; 500, construction module; 510, corpus collection unit; 520, word extraction unit; 530, concept acquisition unit; 540, relationship establishing unit; 550, construction unit; 600, signal receiving module; 700, image acquisition module.

Detailed description

To explain the embodiments of the invention or the technical solutions of the prior art more clearly, specific embodiments of the invention are described below with reference to the accompanying drawings. Obviously, the drawings described below show only some embodiments of the invention; a person of ordinary skill in the art can derive other drawings, and other embodiments, from them without creative effort.

To keep the drawings concise, each figure shows only schematically the parts relevant to the invention; they do not represent the actual structure of a product. In addition, to keep the drawings concise and easy to understand, where a figure contains several components with the same structure or function, only one of them is drawn or labelled. In this document, "one" means not only "exactly one" but may also mean "more than one".

In the invention, intent recognition is performed with a concept map, which can improve recognition accuracy. The concept-map-based intent recognition method can be applied to smart terminal devices such as tutoring machines and children's tablets. For ease of understanding, the following embodiments all use a tutoring machine as the example, but a person skilled in the art will understand that the method can also be applied to other smart terminal devices, provided they can implement the corresponding functions.

According to a first embodiment provided by the invention, as shown in FIG. 1, a method for intent recognition based on a concept map comprises:

S100: collecting a user's voice information;

Specifically, the user's speech can be collected through a microphone or another voice collection device, which may be built into the tutoring machine or attached externally. The voice information may be speech input by the user in real time.

S200: performing grammatical analysis on the voice information, and extracting the question words and keywords in the voice information;

Specifically, once the voice information has been obtained, it can first be converted into text; word segmentation and grammatical analysis are then performed on the text to extract the question words and keywords. Speech can be converted to text by any of various existing techniques; the invention is not limited to a particular conversion method, which is therefore not described in detail here.

A question word here is a word or phrase that expresses one of the various ways of asking during voice interaction, such as "what should I do" (怎么办), "how do I solve it" (怎么解), or "I don't understand" (我不懂). A keyword here is a noun in the voice information that carries real meaning, that is, a noun with a corresponding entity. For example, in "where is the Chinese video" (语文视频在哪里) the keyword is "Chinese video" (语文视频); in "what should I do about the consecutive division problem" (连除问题怎么办) the keyword is "consecutive division problem" (连除问题); and in the utterance "兢兢业业" (an idiom meaning conscientious and diligent) the keyword is "兢兢业业" itself.
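A minimal sketch of the extraction step follows. It assumes the speech has already been transcribed to text, and it substitutes a simple longest-match lookup for the word segmentation and grammatical analysis, which the source does not specify. The list `QUESTION_WORDS` and the function `extract` are hypothetical names introduced for illustration.

```python
# Hypothetical question-word list; the patent derives these from user corpora.
QUESTION_WORDS = ["怎么办", "怎么解", "我不懂", "在哪里", "是什么"]


def extract(text):
    """Split a transcribed utterance into (question_word, keyword).

    A longest-match lookup stands in for real segmentation and parsing.
    Returns (None, text) when no known question word occurs in the text.
    """
    for qw in sorted(QUESTION_WORDS, key=len, reverse=True):
        idx = text.find(qw)
        if idx != -1:
            # Whatever remains after removing the question word is the keyword.
            keyword = (text[:idx] + text[idx + len(qw):]).strip()
            return qw, keyword
    return None, text


print(extract("连除问题怎么办"))
print(extract("语文视频在哪里"))
```

An utterance with no question word, such as "兢兢业业", is returned whole as the keyword, which matches the third example in the text.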

S300: matching the question words against the nodes of a question concept map to obtain a matching node, wherein the question concept map is pre-built;

Specifically, a concept map is a diagramming method in which nodes represent concepts and connecting lines represent the relationships between them. A concept map generally consists of nodes, links, and associated text labels. A question concept map is a concept map generated from the various question phrasings according to the relationships among them. In a question concept map, the root node has multiple child nodes, each a subordinate concept of the root; a child node may in turn have multiple child nodes of its own, each a subordinate concept of that child.

For example, the root node of the question concept map has several question concepts beneath it, such as phrasings that express "explain" (讲解), phrasings that express "find" (寻找), and phrasings that express "adjust" (调节).

Each question concept in turn contains multiple phrasings. Phrasings that express "explain" include "don't understand" (不懂), "how to understand" (如何理解), "what should I do" (怎么办), and "what is" (是什么). Phrasings that express "find" include "where is" (在哪里), "at what place" (在何处), and "whereabouts" (什么地方).

The question words extracted from the voice information are matched against the nodes of the question concept map; that is, the map is searched for the node that matches each question word.
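The hierarchy described above can be sketched as a small tree with a lookup over its leaves. This is an illustration under assumptions rather than the patent's implementation: the `Node` class, `build_example_graph`, and `match` are hypothetical, the concept labels are English glosses of 讲解, 寻找, and 调节, and exact string equality stands in for whatever matching rule a real system would use.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    label: str
    children: list = field(default_factory=list)


def build_example_graph():
    """Root -> question concepts -> individual question phrasings."""
    return Node("root", [
        Node("explain", [Node("不懂"), Node("如何理解"), Node("怎么办"), Node("是什么")]),
        Node("find", [Node("在哪里"), Node("在何处"), Node("什么地方")]),
        Node("adjust"),  # phrasings for this concept are not listed in this passage
    ])


def match(root, question_word):
    """Return (concept_label, leaf_node) for the matching phrasing, or None."""
    for concept in root.children:
        for leaf in concept.children:
            if leaf.label == question_word:
                return concept.label, leaf
    return None


graph = build_example_graph()
print(match(graph, "在哪里")[0])
```

Because each phrasing sits directly under its concept, finding the matching leaf also yields the intent concept without a second lookup.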

S400: obtaining the user's intent according to the intent concept corresponding to the matching node and the keywords.

Specifically, once the matching node has been found, the corresponding intent concept is read from it, and the user's intent is obtained by combining that intent concept with the keywords.

For example, suppose the voice information is "what is a linear equation in one variable" (一元一次方程是什么). The extracted question word is "what is" (是什么) and the extracted keyword is "linear equation in one variable" (一元一次方程). The question word matches the node "what is" in the question concept map, whose intent concept is "explain". Combining the keyword with "explain" gives the user's intent: to search for content related to linear equations in one variable, such as explanations of them or practice problems on them.
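The worked example can be written as a short sketch that combines the intent concept with the keyword into a structured result. The mapping `CONCEPT_OF`, the function `recognize_intent`, and the shape of the returned dictionary are assumptions made for illustration; the source only says the result is a structured format a machine can process.

```python
# Hypothetical mapping from question word to intent concept (English glosses).
CONCEPT_OF = {"是什么": "explain", "怎么办": "explain", "在哪里": "find"}


def recognize_intent(question_word, keyword):
    """Combine the matched intent concept with the keyword.

    Returns None when the question word has no node in the concept map.
    """
    concept = CONCEPT_OF.get(question_word)
    if concept is None:
        return None
    return {"intent": concept, "object": keyword}


print(recognize_intent("是什么", "一元一次方程"))
```

A downstream component could then use the `intent` field to choose an action (search for explanations, open content, and so on) and the `object` field as the query.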

In this embodiment, a variety of casual question phrasings are organized into a question concept map, so that the user's intent can be obtained more easily and accurately when the question words in the voice information are looked up in that map; and because the levels of the phrasings in the map are clearly organized, search and recognition are faster.

According to a second embodiment provided by the invention, as shown in FIG. 2, in a method for intent recognition based on a concept map that builds on the first embodiment, the question concept map is constructed as follows:

S010: collecting a large corpus of user utterances;

Specifically, while users use the tutoring machine, the various utterances produced during their voice interaction with it are collected.

S020: extracting the question words from the corpus;

Specifically, after a large corpus has been collected, the utterances can first be converted into text; word segmentation and grammatical analysis are then performed on the text, and the question words are extracted from it.

For example, the corpus contains "what should I do about the consecutive division problem" (连除问题怎么办), "I can't do the motion of figures" (图形的运动我不会), "I don't understand 兢兢业业" (兢兢业业我不懂), "where is the semantic video" (语义视频在哪里), "open the Chinese practice problems for me" (帮我打开语文练习题), "the volume is too low" (音量太小了), "the volume is too high" (音量太大了), "the font is too small" (字体太小了), "I want to preview Chinese" (我要预习语文), and "open the history textbook" (打开历史课本). The question words extracted from the corpus are "what should I do" (怎么办), "I can't do it" (我不会), "I don't understand" (我不懂), "where is" (在哪里), "open" (打开), "too small" (太小了), "too big" (太大了), "preview" (预习), and so on.

S030: obtaining an intent concept for the question words in the corpus that share the same intent;

Specifically, after the question words have been extracted from the corpus, they are classified so that question words with the same intent fall into one class, and an intent concept is then obtained for each class.

For example, the question words "what should I do", "I can't do it", and "I don't understand" all indicate that the user does not understand some content or cannot work out an answer, and needs the tutoring machine to search for and display the corresponding answer, worked explanation, or lecture notes. These question words can therefore be grouped into one class.

The question words "where is", "open", and "preview" all indicate that the user is looking for some content and needs the tutoring machine to open it. These question words can therefore be grouped into one class.

The question words "too small" and "too big" both indicate that something needs to be adjusted, such as the volume, the font size, or the video display size. These question words can therefore be grouped into one class.

Once all the question words have been classified, the intent concept of each class is determined from the intent its question words express. For example, the intent concept of "what should I do", "I can't do it", and "I don't understand" can be set to "explain" (讲解); that of "where is", "open", and "preview" to "find object" (寻找对象); and that of "too small" and "too big" to "adjust" (调节).

S040: establishing a mapping between the question words and the intent concepts;

Specifically, after the question words have been classified and the intent concept of each class determined, a mapping between the question words and the intent concepts is established, that is, a correspondence between each intent concept and its question words.

S050: constructing the question concept map according to the mapping between the question words and the intent concepts.

Specifically, the question concept map is constructed from the established correspondence between the question words and the intent concepts.
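Steps S030 to S050 can be sketched as follows, assuming the classification of question words has already been done by hand; the source does not say how question words are classified. The names `LABELLED`, `build_concept_graph`, and `invert` are hypothetical, and the concept labels are English glosses of 讲解, 寻找对象, and 调节.

```python
from collections import defaultdict

# Assumed hand-labelled (question word, intent concept) pairs from the corpus.
LABELLED = [
    ("怎么办", "explain"), ("我不会", "explain"), ("我不懂", "explain"),
    ("在哪里", "find"), ("打开", "find"), ("预习", "find"),
    ("太小了", "adjust"), ("太大了", "adjust"),
]


def build_concept_graph(labelled):
    """Group question words under their intent concept (concept -> phrasings)."""
    graph = defaultdict(list)
    for question_word, concept in labelled:
        if question_word not in graph[concept]:  # skip duplicates from the corpus
            graph[concept].append(question_word)
    return dict(graph)


def invert(graph):
    """Derive the lookup used at recognition time (phrasing -> concept)."""
    return {qw: concept for concept, qws in graph.items() for qw in qws}


graph = build_concept_graph(LABELLED)
print(graph["adjust"])
print(invert(graph)["打开"])
```

Keeping both directions is a design convenience: the concept-to-phrasings view mirrors the hierarchy of the map, while the inverted view gives constant-time matching of a question word.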

In this embodiment, collecting users' utterances while they use the smart terminal device yields a large number of casual, colloquial question phrasings. These phrasings are then classified and organized into a question concept map, so that intent recognition through the map achieves a relatively high recognition rate.

According to a third embodiment provided by the invention, as shown in FIG. 3, a method for intent recognition based on a concept map comprises:

S100: collecting a user's voice information;

S200: performing grammatical analysis on the voice information, and extracting the question words and keywords in the voice information;

S300: matching the question words against the nodes of a question concept map to obtain a matching node, wherein the question concept map is pre-built;

S410: searching a preset lexicon for a target word that matches the keyword;

Specifically, to obtain the meaning of a keyword, a lexicon can be built in advance. The entries in the lexicon are nominal words, which can be crawled from the web with crawler technology together with their semantics. The keywords extracted from the voice information are then matched against the entries in the lexicon to find the matching target word.

S420: determining the semantics of the keyword according to the semantics of the target word;

Specifically, once the target word has been found, the semantics of the keyword in the voice information are determined from the semantics of the target word and converted into a structured data format a machine can process.

S430: obtaining the user's intent according to the intent concept corresponding to the matching node and the semantics of the keyword.

Specifically, once the semantics of the keyword and the intent concept have been obtained, the user's intent can be obtained from the two.

For example, suppose the voice information is "what should I do about the consecutive division problem" (连除问题怎么办). The extracted keyword is "consecutive division problem" (连除问题) and the question word is "what should I do" (怎么办). According to the question concept map, the node matching "what should I do" corresponds to the question concept "explain" (讲解), and the semantics of the keyword are "consecutive division problem". From the question concept "explain" and the keyword, the user's intent is obtained as "explain the consecutive division problem".
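Steps S410 to S430 can be sketched with a dictionary standing in for the preset lexicon. The lexicon contents, the structure of each entry, and the function names are assumptions for illustration; the source says only that the lexicon holds nominal words with their semantics and that the result is a structured format.

```python
# Hypothetical lexicon: entry -> structured semantics of that entry.
LEXICON = {
    "连除问题": {"type": "math_topic", "name": "连除问题"},
    "一元一次方程": {"type": "math_topic", "name": "一元一次方程"},
}


def keyword_semantics(keyword, lexicon=LEXICON):
    """S410/S420: find the matching target word and return its semantics."""
    return lexicon.get(keyword)


def intent_from(concept, keyword, lexicon=LEXICON):
    """S430: combine the intent concept with the keyword's semantics."""
    semantics = keyword_semantics(keyword, lexicon)
    if semantics is None:
        return None  # keyword not in the lexicon
    return {"intent": concept, "target": semantics}


print(intent_from("explain", "连除问题"))
```

Exact dictionary lookup is the simplest reading of "keyword matching"; a real system might need fuzzy or partial matching, which this sketch does not attempt.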

本实施例中,通过建立词库,并通过关键词匹配的方法来获取关键词的语义,相比于其他语义识别方法,更方便快捷。In this embodiment, the semantics of the keywords are acquired by establishing a thesaurus and using the keyword matching method, which is more convenient and quicker than other semantic recognition methods.
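The lexicon lookup and steps S420 and S430 can be sketched as follows. This is only an illustrative sketch under stated assumptions: the lexicon contents, the dictionary-based "structured data format", and the names `LEXICON`, `Intent`, `keyword_semantics`, and `acquire_intent` are hypothetical and do not come from the patent, which crawls its lexicon from the web and does not specify a data format.

```python
from dataclasses import dataclass
from typing import Dict, Optional

# Hypothetical lexicon: nominal entry -> structured semantics.
# The patent crawls such entries from the web; these are hand-written.
LEXICON: Dict[str, dict] = {
    "连除问题": {"type": "math_topic", "name": "连除问题"},  # consecutive division problem
    "语文视频": {"type": "video", "subject": "语文"},        # Chinese-language video
}


@dataclass
class Intent:
    concept: str     # intent concept of the matched node, e.g. "讲解" (explain)
    semantics: dict  # structured semantics of the keyword


def keyword_semantics(keyword: str, lexicon: Dict[str, dict] = LEXICON) -> Optional[dict]:
    """Find the target word by exact match and return its semantics (S420)."""
    return lexicon.get(keyword)


def acquire_intent(concept: str, keyword: str,
                   lexicon: Dict[str, dict] = LEXICON) -> Optional[Intent]:
    """Combine the intent concept with the keyword semantics (S430)."""
    semantics = keyword_semantics(keyword, lexicon)
    if semantics is None:
        return None  # keyword not in the lexicon: no intent can be formed
    return Intent(concept, semantics)
```

For the example above, `acquire_intent("讲解", "连除问题")` pairs the concept "explain" with the structured entry for "consecutive division problem". Exact matching is the simplest possible choice; the patent does not say whether fuzzy matching is used.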

According to a fourth embodiment of the present invention, as shown in FIG. 4, a concept-graph-based intent recognition method comprises:

S100: collecting the user's voice information;

S200: performing grammatical analysis on the voice information, and extracting the question word and the keyword from the voice information;

S210: when the keyword contains a specific keyword, receiving a touch signal from the user;

Specifically, a specific keyword is a demonstrative expression such as "this character" (这个字), "this problem" (这道题), "this sentence" (这句话), "this one" (这个), or "this" (这). When the voice information contains such a specific keyword, the user's touch signal on the touch screen is received. The touch signal may be a continuous sliding touch on the touch screen, such as a straight line formed by continuous touch, or a circular frame, elliptical frame, or irregular frame traced by continuous touch.

S220: acquiring image information of the user-selected area according to the touch signal;

Specifically, after the touch signal is received, the image information of the user-selected area is obtained from it. For example, if the continuous touch forms a straight line, a rectangle is constructed with that line as its diagonal and the image inside the rectangle is captured; that image is the image information of the user-selected area.

If the continuous touch forms a circular frame, an elliptical frame, or the like, the image inside the traced frame is captured, and that image is the image information of the user-selected area.

Here, restricting the touch signal to a continuous touch guards against accidental operation and lowers the error rate. For example, it prevents two accidental point touches from being treated as the corners of a frame whose contents are then captured.
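The region selection described for S220 can be sketched as below. It is a minimal sketch under assumptions that are not in the patent: the touch path arrives as a list of `(x, y)` samples, a stroke counts as "continuous" when it has at least `min_points` samples, and a traced circle, ellipse, or irregular frame is approximated by its bounding rectangle. The names `selection_box`, `Point`, and `Box` are hypothetical.

```python
from typing import List, Optional, Tuple

Point = Tuple[int, int]
Box = Tuple[int, int, int, int]  # (left, top, right, bottom)


def selection_box(path: List[Point], min_points: int = 3) -> Optional[Box]:
    """Return the crop rectangle for a continuous touch path.

    For a straight stroke, the bounding box of its samples is the rectangle
    that has the stroke as its diagonal. For a traced frame, the bounding
    box is used as an approximation of the framed area (an assumption).
    Paths with fewer than min_points samples are treated as accidental
    point touches and rejected.
    """
    if len(path) < min_points:
        return None
    xs = [x for x, _ in path]
    ys = [y for _, y in path]
    return (min(xs), min(ys), max(xs), max(ys))
```

A stroke sampled at `(10, 40)`, `(30, 30)`, `(60, 20)` gives the box `(10, 20, 60, 40)`, while two isolated taps give `None`. A perfectly horizontal or vertical stroke would produce a zero-height or zero-width box, so a real implementation would probably need some padding; the patent does not address that case.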

When the voice information contains a specific keyword, the user's voice input is incomplete. For "How do I solve this problem" (这道题怎么解), the tutoring machine does not know which problem is meant, so semantic parsing of the voice information alone cannot yield the user's exact intent. Therefore, when the voice information is found to contain a specific keyword, the image information of the area the user points at or frames must also be obtained in order to identify the problem in question.

S300: matching the question word against the nodes in a question concept graph to obtain a matched node, wherein the question concept graph is constructed in advance;

S440: acquiring the user's intent according to the intent concept corresponding to the matched node, the keyword, and the image information.

Specifically, the acquired intent concept, the semantics of the keyword, and the acquired image information are combined to obtain the user's actual intent, so that corresponding feedback can be given.

According to a fifth embodiment of the present invention, as shown in FIG. 5, a concept-graph-based intent recognition method comprises:

S100: collecting the user's voice information;

S200: performing grammatical analysis on the voice information, and extracting the question word and the keyword from the voice information;

S210: when the keyword contains a specific keyword, receiving a touch signal from the user;

S220: acquiring image information of the user-selected area according to the touch signal;

S300: matching the question word against the nodes in a question concept graph to obtain a matched node, wherein the question concept graph is constructed in advance;

S441: recognizing the text information in the image information;

Specifically, once the image information is obtained, the image is processed and recognized to obtain the text it contains; this text is the content of the problem or the character the user referred to.

S442: replacing the specific keyword among the keywords with the text information;

S443: acquiring the user's intent according to the intent concept corresponding to the matched node, the keyword, and the text information.

Specifically, the specific keyword is replaced with the recognized text; for example, "this problem" is replaced with the text of the problem. The user's actual intent is then obtained from the intent concept corresponding to the matched node, the semantics of the remaining keywords, and the recognized text.

For example, while reading extracurricular material on the tutoring machine, the user comes across "兢兢业业", does not know how to read it, and asks for help by saying "How do you read these characters" (这几个字读什么). The tutoring machine collects the voice information, determines that it contains a specific keyword, and obtains the image information of the user-selected area, here an image of "兢兢业业", according to the touch signal. Text recognition on the image yields the text "兢兢业业"; combined with the question word "how to read" (读什么), the user's actual meaning is "How do you read 兢兢业业" (兢兢业业读什么). In this way, the user can get help from the tutoring machine with words they do not recognize or problems they cannot solve, which supports the user's learning.
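Steps S441 and S442 can be sketched as follows. The sketch makes several assumptions that are not in the patent: the text recognizer is passed in as a function (`recognize_text`, a stand-in for whatever OCR engine is used), a keyword counts as "specific" when it starts with one of the demonstratives listed in the description, and the whole keyword is replaced by the recognized text. The names `SPECIFIC_KEYWORDS`, `is_specific`, and `resolve_keywords` are hypothetical.

```python
from typing import Callable, List, Optional

# Demonstratives listed in the description ("this character", "this problem", ...).
SPECIFIC_KEYWORDS = ["这个字", "这道题", "这句话", "这个", "这"]


def is_specific(keyword: str) -> bool:
    """Assumption: a keyword is 'specific' if it starts with a listed demonstrative."""
    return any(keyword.startswith(s) for s in SPECIFIC_KEYWORDS)


def resolve_keywords(keywords: List[str], image: bytes,
                     recognize_text: Callable[[bytes], str]) -> List[str]:
    """Replace each specific keyword with the text recognized in the image.

    recognize_text is a caller-supplied OCR function (hypothetical interface);
    it is called at most once, and only if a specific keyword is present.
    """
    text: Optional[str] = None
    resolved = []
    for kw in keywords:
        if is_specific(kw):
            if text is None:
                text = recognize_text(image)  # S441: recognize text in the image
            resolved.append(text)             # S442: substitute it for the keyword
        else:
            resolved.append(kw)
    return resolved
```

With a recognizer that returns "兢兢业业", the keyword list `["这几个字"]` becomes `["兢兢业业"]`, which can then be combined with the intent concept of the question word as in S443. Keywords without a demonstrative are passed through unchanged.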

According to a sixth embodiment of the present invention, as shown in FIG. 6, a concept-graph-based intent recognition device comprises:

a collection module 100, configured to collect the user's voice information;

Specifically, the user's voice may be collected through a microphone or another voice collection device, which may be built into the tutoring machine or connected externally. The voice information may be speech input by the user in real time.

an extraction module 200, configured to perform grammatical analysis on the voice information and extract the question word and the keyword from the voice information;

Specifically, after the voice information is acquired, it may first be converted into text; word segmentation and grammatical analysis are then performed on the text to extract the question word and the keyword. Speech-to-text conversion can be implemented with any existing technique; the present invention is not limited to a particular conversion method, and the details are not described here.

Here, a question word is a word that expresses a way of asking during voice interaction, such as "what to do" (怎么办), "how to solve" (怎么解), or "I don't understand" (我不懂). A keyword is a noun in the voice information that carries actual meaning, i.e. a noun with a corresponding entity. For example, "Chinese video" (语文视频) is the keyword in "Where is the Chinese video" (语文视频在哪里), "consecutive division problem" (连除问题) is the keyword in "What do I do with consecutive division problems" (连除问题怎么办), and in "兢兢业业" the keyword is "兢兢业业" itself.
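The split into question word and keyword can be illustrated with a deliberately simplified sketch. It does not perform the word segmentation and grammatical analysis the description calls for; instead it assumes a fixed list of known question words and treats whatever remains of the utterance as the keyword. The list contents and the name `split_utterance` are hypothetical.

```python
from typing import List, Optional, Tuple

# Question words taken from the examples in the description.
QUESTION_WORDS = ["怎么办", "怎么解", "我不懂", "在哪里", "是什么"]


def split_utterance(text: str,
                    question_words: List[str] = QUESTION_WORDS
                    ) -> Optional[Tuple[str, str]]:
    """Return (question_word, keyword), or None if no known question word occurs.

    Stand-in for segmentation plus grammatical analysis: the longest known
    question word found in the text is removed, and the rest is the keyword.
    """
    for qw in sorted(question_words, key=len, reverse=True):
        idx = text.find(qw)
        if idx != -1:
            keyword = (text[:idx] + text[idx + len(qw):]).strip()
            return qw, keyword
    return None
```

Applied to "连除问题怎么办", this yields the question word "怎么办" and the keyword "连除问题". A real system would use a segmenter and part-of-speech information to confirm that the remainder is a noun with a corresponding entity.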

a matching module 300, configured to match the question word against the nodes in a question concept graph to obtain a matched node, wherein the question concept graph is constructed in advance;

Specifically, a concept graph (concept map) is a diagram in which nodes represent concepts and connecting lines represent the relationships between concepts. It generally consists of nodes, links, and text labels. A question concept graph is a concept graph generated from the various ways of asking according to the relationships among them. In the question concept graph, the root node has multiple child nodes, each a subordinate concept of the root node; a child node may in turn have multiple grandchild nodes, each a subordinate concept of that child node.

For example, the root node of the question concept graph has several question concepts beneath it, such as ways of asking that express "explain" (讲解), ways of asking that express "find" (寻找), and ways of asking that express "adjust" (调节).

Each question concept in turn contains several ways of asking. Those expressing "explain" include "don't understand" (不懂), "how to understand" (如何理解), "what to do" (怎么办), and "what is" (是什么). Those expressing "find" include "where is" (在哪里), "whereabouts" (在何处), and "what place" (什么地方).

The question word extracted from the voice information is matched against the nodes in the question concept graph, i.e. the graph is searched for the node that matches the question word.

an intent acquisition module 400, configured to acquire the user's intent according to the intent concept corresponding to the matched node and the keyword.

Specifically, once the matched node is obtained, the corresponding intent concept is read from it, and the user's intent is then obtained by combining that intent concept with the keyword.

By way of example, the voice information is "What is a linear equation in one variable" (一元一次方程是什么). The question word extracted from the voice information is "what is" (是什么) and the extracted keyword is "linear equation in one variable" (一元一次方程). The question word matches the node "what is" in the question concept graph, and the intent concept of that node is "explain". Combining the keyword with "explain" gives the user's intent: to search for content related to linear equations in one variable, such as explanations of them or practice problems on them.
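The question concept graph and the matching step can be sketched as a small tree. This is an illustrative sketch, not the patent's data structure: the `Node` class, the root label "问法", and the function names are assumptions, and the node contents are simply the examples given in the description.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    label: str
    children: List["Node"] = field(default_factory=list)


def build_example_graph() -> Node:
    """Root -> question concepts -> individual ways of asking."""
    return Node("问法", [  # root label is an assumption
        Node("讲解", [Node(w) for w in ("不懂", "如何理解", "怎么办", "是什么")]),  # explain
        Node("寻找", [Node(w) for w in ("在哪里", "在何处", "什么地方")]),          # find
        Node("调节", [Node(w) for w in ("太小了", "太大了")]),                      # adjust
    ])


def match_intent_concept(root: Node, question_word: str) -> Optional[str]:
    """Return the concept whose child node matches the question word exactly."""
    for concept in root.children:
        for leaf in concept.children:
            if leaf.label == question_word:
                return concept.label
    return None
```

For the example above, `match_intent_concept(build_example_graph(), "是什么")` returns "讲解" (explain), which is then combined with the keyword. Because the tree has only two levels below the root, the search visits each question word at most once.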

In this embodiment, the many informal ways of asking are organized into a question concept graph, so that the user's intent can be obtained from the question word in the voice information more easily and more accurately; because the ways of asking are arranged in clear levels in the graph, search and recognition are also faster.

Preferably, the device further comprises a construction module 500;

the construction module 500 comprises:

a corpus collection unit 510, configured to collect a large number of user corpora;

Specifically, while users use the tutoring machine, the utterances produced during their voice interaction with it are collected as corpora.

a word extraction unit 520, configured to extract the question words from the corpora;

Specifically, after a large number of corpora are collected, they may first be converted into text; word segmentation and grammatical analysis are performed on the text, and the question words are then extracted from it.

For example, the corpora are "What do I do with consecutive division problems" (连除问题怎么办), "I can't do the motion of figures" (图形的运动我不会), "I don't understand 兢兢业业" (兢兢业业我不懂), "Where is the semantic video" (语义视频在哪里), "Help me open the Chinese practice problems" (帮我打开语文练习题), "The volume is too low" (音量太小了), "The volume is too loud" (音量太大了), "The font is too small" (字体太小了), "I want to preview Chinese" (我要预习语文), "Open the history textbook" (打开历史课本), and so on. The question words extracted from these corpora are "what to do" (怎么办), "I can't" (我不会), "I don't understand" (我不懂), "where is" (在哪里), "open" (打开), "too small" (太小了), "too big" (太大了), "preview" (预习), and so on.

a concept acquisition unit 530, configured to acquire the intent concept of the question words in the corpora that share the same intent;

Specifically, after the question words are extracted from the corpora, they are classified so that question words with the same intent fall into one class, and the intent concept of each class is then obtained.

For example, the question words "what to do", "I can't", and "I don't understand" all indicate that the user does not understand or cannot solve some content and needs the tutoring machine to search for and display the corresponding answer, explanation, or handout; they can therefore be grouped into one class.

The question words "where is", "open", and "preview" all indicate that the user is looking for some content and needs the tutoring machine to open it; they can therefore be grouped into one class.

The question words "too small" and "too big" both indicate that something needs to be adjusted, such as the volume, the font size, or the video display size; they can therefore be grouped into one class.

After all the extracted question words are classified, the intent concept of each class is determined from the intent the question words express. For example, the intent concept of "what to do", "I can't", and "I don't understand" may be set to "explain" (讲解); the intent concept of "where is", "open", and "preview" may be set to "find object" (寻找对象); and the intent concept of "too small" and "too big" may be set to "adjust" (调节).

a relationship establishing unit 540, configured to establish a mapping between the question words and the intent concepts;

Specifically, after the question words are classified and the intent concept of each class is determined, a mapping between question words and intent concepts is established, i.e. each intent concept is associated with its question words.

a construction unit 550, configured to construct the question concept graph according to the mapping between the question words and the intent concepts.

Specifically, the question concept graph is constructed from the established correspondence between question words and intent concepts.

In this embodiment, collecting the corpora produced while users use the smart terminal device gathers a large number of informal, colloquial ways of asking. These are then classified and organized into the question concept graph, so that intent recognition through the graph achieves a higher recognition rate.
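The work of units 520 to 550 can be sketched as below. The sketch assumes that the classification of question words into intent concepts is supplied by hand as a dictionary (`CONCEPT_OF`, hypothetical); the patent does not say how the classification is performed. Question words are found by plain substring search rather than by segmentation and grammatical analysis, and the resulting graph is represented as a mapping from intent concept to its question words.

```python
from collections import defaultdict
from typing import Dict, Iterable, List

# Hand-labelled mapping from question word to intent concept (assumption),
# using the classes given in the description.
CONCEPT_OF: Dict[str, str] = {
    "怎么办": "讲解", "我不会": "讲解", "我不懂": "讲解",        # explain
    "在哪里": "寻找对象", "打开": "寻找对象", "预习": "寻找对象",  # find object
    "太小了": "调节", "太大了": "调节",                          # adjust
}


def extract_question_words(corpus: Iterable[str]) -> List[str]:
    """Unit 520 (simplified): collect known question words in order of first use."""
    found: List[str] = []
    for utterance in corpus:
        for qw in CONCEPT_OF:
            if qw in utterance and qw not in found:
                found.append(qw)
    return found


def build_concept_graph(corpus: Iterable[str]) -> Dict[str, List[str]]:
    """Units 530-550 (simplified): group question words under their intent concept."""
    graph: Dict[str, List[str]] = defaultdict(list)
    for qw in extract_question_words(corpus):
        graph[CONCEPT_OF[qw]].append(qw)
    return dict(graph)
```

Given the corpora "连除问题怎么办", "图形的运动我不会", "音量太小了", and "打开历史课本", the result groups "怎么办" and "我不会" under "讲解", "太小了" under "调节", and "打开" under "寻找对象". Each concept becomes a child of the root node and each question word a child of its concept.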

Preferably, the intent acquisition module 400 comprises:

a word search unit 410, configured to search a preset lexicon for a target word that matches the keyword;

Specifically, to obtain the meaning of a keyword, a lexicon may be built first. The entries in the lexicon are nominal words, which can be crawled from the web together with their semantics. The keyword extracted from the voice information is then matched against the entries in the lexicon to find the matching target word.

a semantic determination unit 420, configured to determine the semantics of the keyword according to the semantics of the target word;

Specifically, once the target word is found, the semantics of the keyword in the voice information are determined from the semantics of the target word and converted into a structured data format that a machine can process.

an intent acquisition unit 430, configured to acquire the user's intent according to the intent concept corresponding to the matched node and the semantics of the keyword.

Specifically, once the semantics of the keyword and the intent concept are obtained, the user's intent can be derived from the two.

For example, the voice information is "What do I do with consecutive division problems" (连除问题怎么办). The extracted keyword is "consecutive division problem" (连除问题) and the question word is "what to do" (怎么办). According to the question concept graph, the node matched by the question word "what to do" corresponds to the question concept "explain" (讲解), and the semantics of the keyword are "consecutive division problem". Combining the question concept "explain" with the keyword gives the user's intent: "explain consecutive division problems".

In this embodiment, the semantics of a keyword are obtained by building a lexicon and matching keywords against it, which is simpler and faster than other semantic recognition methods.

According to a seventh embodiment of the present invention, as shown in FIG. 7, a concept-graph-based intent recognition device comprises:

a collection module 100, configured to collect the user's voice information;

an extraction module 200, configured to perform grammatical analysis on the voice information and extract the question word and the keyword from the voice information;

a matching module 300, configured to match the question word against the nodes in a question concept graph to obtain a matched node, wherein the question concept graph is constructed in advance;

an intent acquisition module 400, configured to acquire the user's intent according to the intent concept corresponding to the matched node and the keyword.

Preferably, the device further comprises:

a signal receiving module 600, configured to receive a touch signal from the user when the keyword contains a specific keyword;

Specifically, a specific keyword is a demonstrative expression such as "this character" (这个字), "this problem" (这道题), "this sentence" (这句话), "this one" (这个), or "this" (这). When the voice information contains such a specific keyword, the user's touch signal on the touch screen is received. The touch signal may be a continuous sliding touch on the touch screen, such as a straight line formed by continuous touch, or a circular frame, elliptical frame, or irregular frame traced by continuous touch.

an image acquisition module 700, configured to acquire image information of the user-selected area according to the touch signal;

Specifically, after the touch signal is received, the image information of the user-selected area is obtained from it. For example, if the continuous touch forms a straight line, a rectangle is constructed with that line as its diagonal and the image inside the rectangle is captured; that image is the image information of the user-selected area.

If the continuous touch forms a circular frame, an elliptical frame, or the like, the image inside the traced frame is captured, and that image is the image information of the user-selected area.

Here, restricting the touch signal to a continuous touch guards against accidental operation and lowers the error rate. For example, it prevents two accidental point touches from being treated as the corners of a frame whose contents are then captured.

When the voice information contains a specific keyword, the user's voice input is incomplete. For "How do I solve this problem" (这道题怎么解), the tutoring machine does not know which problem is meant, so semantic parsing of the voice information alone cannot yield the user's exact intent. Therefore, when the voice information is found to contain a specific keyword, the image information of the area the user points at or frames must also be obtained in order to identify the problem in question.

The intent acquisition module 400 is further configured to acquire the user's intent according to the intent concept corresponding to the matched node, the keyword, and the image information.

Specifically, the acquired intent concept, the semantics of the keyword, and the acquired image information are combined to obtain the user's actual intent, so that corresponding feedback can be given.

Preferably, the intent acquisition module 400 comprises:

a recognition unit 440, configured to recognize the text information in the image information;

a replacement unit 450, configured to replace the specific keyword among the keywords with the text information;

an intent acquisition unit 430, configured to acquire the user's intent according to the intent concept corresponding to the matched node, the keyword, and the text information.

Specifically, once the image information is obtained, the image is processed and recognized to obtain the text it contains; this text is the content of the problem or the character the user referred to.

The specific keyword is replaced with the recognized text; for example, "this problem" is replaced with the text of the problem. The user's actual intent is then obtained from the intent concept corresponding to the matched node, the semantics of the remaining keywords, and the recognized text.

For example, while reading extracurricular material on the tutoring machine, the user comes across "兢兢业业", does not know how to read it, and asks for help by saying "How do you read these characters" (这几个字读什么). The tutoring machine collects the voice information, determines that it contains a specific keyword, and obtains the image information of the user-selected area, here an image of "兢兢业业", according to the touch signal. Text recognition on the image yields the text "兢兢业业"; combined with the question word "how to read" (读什么), the user's actual meaning is "How do you read 兢兢业业" (兢兢业业读什么). In this way, the user can get help from the tutoring machine with words they do not recognize or problems they cannot solve, which supports the user's learning.

It should be noted that the above embodiments can be freely combined as required. The above are only preferred embodiments of the present invention. Those of ordinary skill in the art can make several improvements and refinements without departing from the principles of the present invention, and such improvements and refinements shall also be regarded as falling within the protection scope of the present invention.

Claims (10)

1. A concept-graph-based intent recognition method, characterized by comprising the following steps:
collecting voice information of a user;
performing grammatical analysis on the voice information, and extracting a question word and a keyword from the voice information;
matching the question word against nodes in a question concept graph to obtain a matched node, wherein the question concept graph is constructed in advance;
and acquiring the intent of the user according to the intent concept corresponding to the matched node and the keyword.
2. The concept-graph-based intent recognition method according to claim 1, wherein the question concept graph is constructed by:
collecting a large number of user corpora;
extracting the question words from the corpora;
acquiring the intent concept of the question words in the corpora that share the same intent;
establishing a mapping between the question words and the intent concepts;
and constructing the question concept graph according to the mapping between the question words and the intent concepts.
3. The concept-graph-based intent recognition method according to claim 1, wherein acquiring the intent of the user according to the intent concept corresponding to the matched node and the keyword specifically comprises:
searching a preset lexicon for a target word that matches the keyword;
determining the semantics of the keyword according to the semantics of the target word;
and acquiring the intent of the user according to the intent concept corresponding to the matched node and the semantics of the keyword.
4. The concept-graph-based intent recognition method according to any one of claims 1-3, wherein after performing grammatical analysis on the voice information and extracting the question word and the keyword from the voice information, the method further comprises:
when the keyword contains a specific keyword, receiving a touch signal from the user;
acquiring image information of the user-selected area according to the touch signal;
and wherein acquiring the intent of the user according to the intent concept corresponding to the matched node and the keyword specifically comprises:
acquiring the intent of the user according to the intent concept corresponding to the matched node, the keyword, and the image information.
5. The concept-graph-based intent recognition method according to claim 4, wherein acquiring the intent of the user according to the intent concept corresponding to the matched node, the keyword, and the image information specifically comprises:
recognizing the text information in the image information;
replacing the specific keyword among the keywords with the text information;
and acquiring the intent of the user according to the intent concept corresponding to the matched node, the keyword, and the text information.
6. A concept-graph-based intent recognition device, comprising:
a collection module, configured to collect voice information of a user;
an extraction module, configured to perform grammatical analysis on the voice information and extract a question word and a keyword from the voice information;
a matching module, configured to match the question word against nodes in a question concept graph to obtain a matched node, wherein the question concept graph is constructed in advance;
and an intent acquisition module, configured to acquire the intent of the user according to the intent concept corresponding to the matched node and the keyword.
7. The concept graph-based intention recognition apparatus according to claim 6, further comprising a construction module;
the construction module comprises:
the corpus collection unit is used for collecting a large number of user corpora;
the word extraction unit is used for extracting the question words in the corpora;
the concept acquisition unit is used for acquiring the intention concepts of the same-intention question words in the corpora;
the relationship establishing unit is used for establishing a mapping relationship between the question words and the intention concepts;
and the construction unit is used for constructing a question concept graph according to the mapping relationship between the question words and the intention concepts.
8. The concept graph-based intention recognition device according to claim 6, wherein the intention acquisition module comprises:
the word searching unit is used for searching a target word matched with the keyword from a preset word bank;
the semantic determining unit is used for determining the semantics of the keywords according to the semantics of the target words;
and the intention acquisition unit is used for acquiring the intention of the user according to the intention concept corresponding to the matched node and the semantics of the keyword.
9. The concept graph-based intention recognition apparatus according to any one of claims 6 to 8, further comprising:
the signal receiving module is used for receiving a touch signal of the user when the keyword contains a specific keyword;
the image acquisition module is used for acquiring the image information of the user selection area according to the touch signal;
and the intention acquisition module is further used for acquiring the intention of the user according to the intention concept corresponding to the matched node, the keyword and the image information.
10. The concept graph-based intention recognition device according to claim 9, wherein the intention acquisition module comprises:
the identification unit is used for identifying text information in the image information;
the replacing unit is used for replacing the specific keyword among the keywords with the text information;
and the intention acquisition unit is used for acquiring the intention of the user according to the intention concept corresponding to the matched node, the keyword and the text information.
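As a rough illustration of how the modules of claims 6 and 7 might fit together, the following Python sketch builds a question concept graph from labeled corpus pairs and matches an utterance against it. Everything here is an assumption made for illustration: the graph is reduced to a dictionary from question words to intention concepts, the voice information is assumed to be already transcribed to text, and token matching stands in for real parsing.

```python
# Hypothetical stop words dropped when collecting keywords.
STOPWORDS = {"the", "a", "an", "of", "is", "are", "to"}


def build_question_concept_graph(corpus):
    """Construction module (claim 7): map each question word to its intention concept.
    `corpus` is an iterable of (question_word, intent_concept) pairs."""
    return {question_word.lower(): concept for question_word, concept in corpus}


def find_phrase(tokens, phrase_tokens):
    """Return the index where phrase_tokens occurs contiguously in tokens, or -1."""
    n = len(phrase_tokens)
    for i in range(len(tokens) - n + 1):
        if tokens[i:i + n] == phrase_tokens:
            return i
    return -1


def extract(text, graph):
    """Extraction module: return (question_word, keywords).
    The longest question word found in the utterance wins."""
    tokens = text.lower().strip("?!. ").split()
    for phrase in sorted(graph, key=len, reverse=True):
        phrase_tokens = phrase.split()
        i = find_phrase(tokens, phrase_tokens)
        if i >= 0:
            rest = tokens[:i] + tokens[i + len(phrase_tokens):]
            return phrase, [t for t in rest if t not in STOPWORDS]
    return None, [t for t in tokens if t not in STOPWORDS]


def recognize_intent(text, graph):
    """Matching and intention acquisition modules: look up the matched node's
    intention concept and pair it with the extracted keywords."""
    question_word, keywords = extract(text, graph)
    concept = graph.get(question_word)
    if concept is None:
        return None  # no node in the graph matched the utterance
    return {"concept": concept, "keywords": keywords}


if __name__ == "__main__":
    graph = build_question_concept_graph([
        ("what is", "definition"),
        ("what is the meaning of", "definition"),
        ("how to", "procedure"),
    ])
    print(recognize_intent("What is the meaning of photosynthesis?", graph))
```

Mapping several question words to one intention concept is what lets differently phrased questions resolve to the same intent; a fuller graph could also store relations between concepts, which this sketch omits.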
CN201910329359.6A 2019-04-23 2019-04-23 Method and device for intent recognition based on concept map Pending CN111858840A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910329359.6A CN111858840A (en) 2019-04-23 2019-04-23 Method and device for intent recognition based on concept map

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910329359.6A CN111858840A (en) 2019-04-23 2019-04-23 Method and device for intent recognition based on concept map

Publications (1)

Publication Number Publication Date
CN111858840A (en) 2020-10-30

Family

ID=72952187

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910329359.6A Pending CN111858840A (en) 2019-04-23 2019-04-23 Method and device for intent recognition based on concept map

Country Status (1)

Country Link
CN (1) CN111858840A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112463920A (en) * 2020-11-25 2021-03-09 联想(北京)有限公司 Information response method and device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110093479A1 (en) * 2009-10-19 2011-04-21 Vexigo, Ltd. System and method for use of semantic understanding in storage, searching and providing of data or other content information
CN108228820A (en) * 2017-12-30 2018-06-29 厦门太迪智能科技有限公司 User's query intention understanding method, system and terminal
CN109582825A (en) * 2018-12-07 2019-04-05 百度在线网络技术(北京)有限公司 Method and apparatus for generating information

Similar Documents

Publication Publication Date Title
US11379548B2 (en) Analyzing concepts over time
CN106776711B (en) Chinese medical knowledge map construction method based on deep learning
US8457947B2 (en) Hybrid translation apparatus and method thereof
CN111475623A (en) Case information semantic retrieval method and device based on knowledge graph
KR102491172B1 (en) Natural language question-answering system and learning method
US9015168B2 (en) Device and method for generating opinion pairs having sentiment orientation based impact relations
CN107301163B (en) Formula-containing text semantic parsing method and device
CN103678684A (en) Chinese word segmentation method based on navigation information retrieval
KR20150096295A (en) System and method for buinding q&as database, and search system and method using the same
CN115858758A (en) Intelligent customer service knowledge graph system with multiple unstructured data identification
CN106649778A (en) Interactive method and device based on deep questions and answers
CN109949799A (en) Semantic parsing method and system
CN116244412A (en) Multi-intention recognition method and device
CN110287405A (en) The method, apparatus and storage medium of sentiment analysis
CN115687572A (en) Data information retrieval method, device, equipment and storage medium
WO2021120979A1 (en) Method and apparatus for generating patent summary information, and electronic device and medium
KR20220074576A (en) A method and an apparatus for extracting new words based on deep learning to generate marketing knowledge graphs
CN109346108A (en) Operation checking method and system
CN111858840A (en) Method and device for intent recognition based on concept map
KR20190115721A (en) Apparatus, method and computer program for processing inquiry
CN110008314B (en) Intention analysis method and device
Kovatchev et al. Decomposing and comparing meaning relations: Paraphrasing, textual entailment, contradiction, and specificity
CN115293127A (en) Contract document information comparison method, device and system
CN114942981A (en) Question-answer query method and device, electronic equipment and computer readable storage medium
KR20220074572A (en) A method and an apparatus for extracting new words based on deep learning to generate marketing knowledge graphs

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination