CN110162637A - Information Atlas construction method, device and equipment - Google Patents

Information Atlas construction method, device and equipment


Publication number
CN110162637A
Authority
CN
China
Prior art keywords
information
node
attribute values
subgraph
map element
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910114989.1A
Other languages
Chinese (zh)
Other versions
CN110162637B (en)
Inventor
谢润泉
赵创钿
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd
Priority to CN201910114989.1A
Publication of CN110162637A
Application granted
Publication of CN110162637B
Active (current legal status)
Anticipated expiration


Abstract

An information atlas construction method, device and equipment are disclosed. The information atlas construction method includes: extracting, according to input information, map elements corresponding to the input information from multiple classes of subgraphs; and splicing the extracted map elements to generate an information atlas corresponding to the input information. At least some of the multiple classes of subgraphs have the same information node, and extracting the map elements corresponding to the input information includes: determining node attribute values of a target information node according to the input information; and extracting, from each class of subgraph, the map elements corresponding to the determined node attribute values. By extracting and splicing the map elements that correspond to the input information across the multiple classes of subgraphs, the method can associate subgraphs of several different classes on the basis of the input information and return information of multiple classes associated with it.

Description

Information Atlas construction method, device and equipment
Technical field
The present disclosure relates to the field of atlas construction, and more specifically to an information atlas construction method, an information atlas construction device, and information atlas construction equipment.
Background
With the wide application of artificial intelligence in civil and commercial fields, atlas construction plays an increasingly important role in areas such as business big data and intelligence, so atlas construction, and information atlas construction in particular, faces higher requirements. Current work on information atlases concentrates on general-purpose atlases for user search and recommendation services and on manually curated knowledge bases, for example Microsoft's Renlifang ("People Cube") and Baidu's Zhixin, which are already used in scenarios such as intelligent question answering and machine translation.
However, the information atlases constructed so far each cover a single class, and the individual atlases are independent of one another; no unified association is formed between atlases of different classes. When specific query information is entered, the result is mostly a single item or a single class of information, and not all information related to the query can be obtained.
Therefore, there is a need for an information atlas construction method that, when constructing an information atlas from input information, and a financial information atlas in particular, can effectively return multiple classes of information associated with the input information.
Summary of the invention
In view of the above problems, the present disclosure provides an information atlas construction method, an information atlas construction device, and information atlas construction equipment. With the method provided by the disclosure, an information atlas is constructed from input information and, in addition, subgraphs of several different classes are associated on the basis of that input, so that information of multiple classes associated with the input information is returned.
According to one aspect of the disclosure, an information atlas construction method is proposed, including: extracting, according to input information, map elements corresponding to the input information from multiple classes of subgraphs; and splicing the extracted map elements to generate an information atlas corresponding to the input information. At least some of the multiple classes of subgraphs have the same information node, and extracting the map elements corresponding to the input information from the multiple classes of subgraphs includes: determining node attribute values of a target information node according to the input information; and extracting, from each class of subgraph, the map elements corresponding to the determined node attribute values.
In some embodiments, each class of subgraph has at least two information nodes and includes multiple map elements; for each information node, each map element has node attribute values of that information node.
In some embodiments, splicing the extracted map elements to generate the information atlas corresponding to the input information includes: taking each determined node attribute value as a linking point, and linking the map elements corresponding to that node attribute value that were extracted from the multiple subgraphs.
In some embodiments, the multiple classes of subgraphs include at least two of the following: a concept subgraph, a company subgraph, a stock subgraph, an investment subgraph, a product subgraph, an event subgraph, and an industry subgraph.
In some embodiments, the target information node includes at least one of the following: company name, organization, person name, industry name, product name, stock code, stock name, concept name.
In some embodiments, the multiple classes of subgraphs include at least an event subgraph and the input information is event information, wherein determining the node attribute values of the target information node according to the input information includes: for the input event information, determining, based on the event subgraph, the node attribute values of the target information node corresponding to the event information.
In some embodiments, the method further includes constructing the multiple classes of subgraphs before extracting the map elements corresponding to the input information from them, wherein constructing each class of subgraph includes: setting at least two information nodes and the correspondence between the information nodes; extracting node attribute values of the information nodes from external data; associating the extracted node attribute values to form map elements; and denoising the resulting map elements to obtain the subgraph.
In some embodiments, the at least two information nodes include a company name information node, and denoising the resulting map elements includes: for each node attribute value of the company name information node, associating, based on a pre-established correspondence between company short names and full names, the multiple node attribute values that satisfy that correspondence.
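The short-name/full-name association described above can be pictured with a small sketch. It is only an illustration under stated assumptions: the lookup table `FULL_TO_SHORT`, the function name `merge_company_names`, and the sample names are hypothetical and are not taken from the disclosure, which does not specify how the pre-established correspondence is stored.

```python
from typing import Dict, List

# Hypothetical pre-established table: full company name -> short name.
FULL_TO_SHORT: Dict[str, str] = {
    "Baidu Online Network Technology (Beijing) Co., Ltd.": "Baidu",
}


def merge_company_names(values: List[str]) -> Dict[str, List[str]]:
    """Group company-name attribute values that denote the same company."""
    groups: Dict[str, List[str]] = {}
    for value in values:
        # Use the short name as the canonical key when a mapping exists.
        key = FULL_TO_SHORT.get(value, value)
        variants = groups.setdefault(key, [])
        if value not in variants:
            variants.append(value)
    return groups


merged = merge_company_names([
    "Baidu",
    "Baidu Online Network Technology (Beijing) Co., Ltd.",
    "DJI",
])
```

Here the two spellings of the same company end up under one key, while a name without an entry in the table stays on its own.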
In some embodiments, the at least two information nodes include a person name information node, and denoising the resulting map elements includes: splitting apart the multiple attribute values of the company name information node that are associated with the same person name information node, and labelling the node attribute value of the person name information node with the node attribute value of each company name information node, so as to eliminate ambiguity of the person name node.
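A minimal sketch of this person-name disambiguation follows, assuming a map element is represented as a dictionary from information node to a list of attribute values. The function `disambiguate_person`, the "name (company)" label format, and the second company name are hypothetical choices made for illustration only.

```python
from typing import Dict, List

Element = Dict[str, List[str]]


def disambiguate_person(element: Element) -> List[Element]:
    """Split one element per associated company and tag each person name
    with that company, so that namesakes become distinct attribute values."""
    companies = element.get("company_name", [])
    if len(companies) <= 1:
        return [element]  # nothing to disambiguate
    split: List[Element] = []
    for company in companies:
        split.append({
            "company_name": [company],
            "person_name": [
                f"{name} ({company})" for name in element.get("person_name", [])
            ],
        })
    return split


# Hypothetical ambiguous element: one name tied to two unrelated companies.
ambiguous = {
    "person_name": ["Ma Yun"],
    "company_name": ["Alibaba", "Example Trading Co."],
}
result = disambiguate_person(ambiguous)
```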
According to another aspect of the disclosure, an information atlas construction device is provided, including: a map element extraction module configured to extract, according to input information, map elements corresponding to the input information from multiple classes of subgraphs; and an information atlas generation module configured to splice the extracted map elements to generate an information atlas corresponding to the input information.
In some embodiments, each class of subgraph has at least two information nodes and includes multiple map elements; for each information node, each map element has node attribute values of that information node. The map element extraction module includes: a target information node determining module configured to determine the node attribute values of the target information node according to the input information; and a map element correspondence module configured to extract, from each class of subgraph, the map elements corresponding to the determined node attribute values.
In some embodiments, the information atlas generation module includes a map element linking module configured to take each determined node attribute value as a linking point and link the map elements corresponding to that node attribute value that were extracted from the multiple subgraphs.
According to another aspect of the disclosure, information atlas construction equipment is provided. The equipment includes a processor and a memory; the memory contains a set of instructions which, when executed by the processor, cause the equipment to perform operations including: extracting, according to input information, map elements corresponding to the input information from multiple classes of subgraphs; and splicing the extracted map elements to generate an information atlas corresponding to the input information. At least some of the multiple classes of subgraphs have the same information node, and extracting the map elements corresponding to the input information includes: determining node attribute values of a target information node according to the input information; and extracting, from each class of subgraph, the map elements corresponding to the determined node attribute values.
In some embodiments, each class of subgraph has at least two information nodes and includes multiple map elements; for each information node, each map element has node attribute values of that information node. With the method provided by the disclosure, an information atlas is constructed from input information and, in addition, subgraphs of several different classes are associated on the basis of that input, so that information of multiple classes associated with the input information is returned.
Brief description of the drawings
To explain the technical solutions of the embodiments of the disclosure more clearly, the drawings needed for describing the embodiments are briefly introduced below. The drawings described below show only some embodiments of the disclosure; a person of ordinary skill in the art can obtain other drawings from them without creative effort. The drawings are not deliberately drawn to scale; the emphasis is on showing the gist of the disclosure.
Fig. 1 shows an exemplary flowchart of an information atlas construction method 100 according to an embodiment of the disclosure;
Fig. 2 shows a partial schematic diagram of a company subgraph 200 according to an embodiment of the disclosure;
Fig. 3 shows a flowchart of an exemplary method 300 for extracting, according to input information, map elements corresponding to the input information from multiple classes of subgraphs according to an embodiment of the disclosure;
Fig. 4 shows a schematic diagram 400 of extracting, according to input information, map elements corresponding to the input information from multiple classes of subgraphs according to an embodiment of the disclosure;
Fig. 5 shows a schematic diagram 500 of linking, with each determined node attribute value as a linking point, the map elements corresponding to that node attribute value extracted from the multiple subgraphs;
Fig. 6 shows a schematic diagram 600 of determining, based on the event subgraph, the node attribute values of the target information node corresponding to event information according to an embodiment of the disclosure;
Fig. 7 shows an exemplary flowchart of a method 700 of constructing a subgraph according to an embodiment of the disclosure;
Fig. 8 shows an exemplary flowchart of a company name disambiguation method 800 according to an embodiment of the disclosure;
Fig. 9 shows an exemplary flowchart of a person name disambiguation method 900 according to an embodiment of the disclosure;
Fig. 10 shows an exemplary block diagram of an information atlas construction device 110 according to an embodiment of the disclosure;
Fig. 11 shows an exemplary block diagram of information atlas construction equipment 950 according to an embodiment of the disclosure.
Detailed description
The technical solutions in the embodiments of the disclosure are described clearly and completely below with reference to the drawings. The described embodiments are only some, not all, of the embodiments of the disclosure. All other embodiments obtained by a person of ordinary skill in the art from these embodiments without creative effort also fall within the scope of protection of the disclosure.
As used in this application and the claims, unless the context clearly indicates otherwise, words such as "a", "an", "one" and/or "the" do not specifically denote the singular and may also include the plural. In general, the terms "include" and "comprise" only indicate that the explicitly identified steps and elements are included; these steps and elements do not form an exclusive list, and a method or device may also contain other steps or elements.
Although this application makes various references to certain modules in a system according to its embodiments, any number of different modules may be used and run on a user terminal and/or a server. The modules are merely illustrative, and different aspects of the system and method may use different modules.
Flowcharts are used in this application to illustrate operations performed by a system according to its embodiments. It should be understood that the preceding or following operations are not necessarily performed exactly in order. Instead, the steps may be processed in reverse order or simultaneously as needed. Other operations may also be added to these processes, or one or more steps may be removed from them.
Fig. 1 shows an exemplary flowchart of the information atlas construction method 100 according to an embodiment of the disclosure.
First, in step S101, map elements corresponding to the input information are extracted from multiple classes of subgraphs according to the input information. After the map elements corresponding to the input information have been extracted, in step S102 the extracted map elements are spliced to generate an information atlas corresponding to the input information.
The input information may be query information entered directly by a user, or information that a computer system queries on its own in response to user input or control information. The embodiments of the disclosure are not limited by the source of the input information or the way it is entered. For example, it may be query information that a user types into a web search bar, or information generated after a computer pre-processes the user's query.
The multiple classes of subgraphs are subgraphs that each belong to a different class. At least some of them have the same information node. In some embodiments, the multiple classes of subgraphs may include at least two of: a concept subgraph, a company subgraph, a stock subgraph, an investment subgraph, a product subgraph, an event subgraph, and an industry subgraph.
According to the embodiments of the disclosure, each class of subgraph may, for example, have at least two information nodes and include multiple map elements; for each information node, each map element has node attribute values of that information node.
In some embodiments, a map element may have one or more node attribute values for the same information node. A node attribute value may be specific content, or it may be "default" or "empty". This application is not limited by the number or the specific content of the node attribute values of an information node in a map element.
In some embodiments, the node attribute values may be further divided into levels, for example into first-level and second-level node attribute values. Take a map element of the company subgraph as an example: it may include the node attribute values "Baidu" and "iQiyi", where "iQiyi" is a subsidiary of "Baidu"; based on this correspondence, "Baidu" is a first-level node attribute value and "iQiyi" is a second-level node attribute value. This application is not limited by the levels of the node attribute values in a map element or by the correspondence between them.
Fig. 2 shows a partial schematic diagram of the company subgraph 200 according to an embodiment of the disclosure.
The information nodes and map elements described above are explained more specifically with reference to the company subgraph shown in Fig. 2. This company subgraph includes three information nodes: the company name information node K_C, the person name information node K_H, and the industry name information node K_S. The company subgraph further includes, for example, map element E_C1 and map element E_C2, each of which contains node attribute values of several information nodes. As shown in Fig. 2, map element E_C1 includes the node attribute values "iQiyi" and "Baidu" of the company name information node, the node attribute values "Li Yanhong" and "Lu Qi" of the person name information node, and the node attribute value "Internet" of the industry name information node. Map element E_C2 includes the node attribute value "DJI" of the company name information node, the node attribute value "Wang Tao" of the person name information node, and the node attribute value "drones" of the industry name information node. Based on the correspondence between node attribute values, under the company name information node "Baidu" is a first-level node attribute value and "iQiyi" is a second-level node attribute value.
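One possible in-memory representation of the Fig. 2 company subgraph is sketched below. The disclosure does not prescribe a data structure, so the `MapElement` class, its field names, and the child-to-parent encoding of levels are assumptions made for illustration; the company, person and industry values are the ones given for map elements E_C1 and E_C2.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class MapElement:
    # information node -> node attribute values held by this element
    values: Dict[str, List[str]] = field(default_factory=dict)
    # second-level value -> its first-level parent (assumed encoding of levels)
    parent_of: Dict[str, str] = field(default_factory=dict)

    def level(self, value: str) -> int:
        """Return 1 for a first-level value, 2 for a second-level value."""
        return 2 if value in self.parent_of else 1


E_C1 = MapElement(
    values={
        "company_name": ["Baidu", "iQiyi"],
        "person_name": ["Li Yanhong", "Lu Qi"],
        "industry_name": ["Internet"],
    },
    parent_of={"iQiyi": "Baidu"},  # iQiyi is a subsidiary of Baidu
)
E_C2 = MapElement(
    values={
        "company_name": ["DJI"],
        "person_name": ["Wang Tao"],
        "industry_name": ["drones"],
    },
)

# A subgraph is modelled here simply as a list of its map elements.
company_subgraph = [E_C1, E_C2]
```

Keeping levels in a separate mapping leaves the attribute lists flat, which makes lookups by value straightforward.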
Fig. 3 shows a flowchart of the exemplary method 300 for extracting, according to input information, map elements corresponding to the input information from multiple classes of subgraphs according to an embodiment of the disclosure.
First, in step S301, the node attribute values of the target information node are determined according to the input information.
According to the embodiments of the disclosure, the target information nodes may be, for example, multiple information nodes from multiple classes of subgraphs, multiple information nodes from one class of subgraph, or a single information node in one class of subgraph. The embodiments are not limited by the type or number of the target information nodes. For example, the target information node may be set to the company name only, or to the company name, person name, and product name together.
In some embodiments, the target information node includes at least one of the following: company name, organization, person name, industry name, product name, stock code, stock name, concept name.
According to the embodiments of the disclosure, the node attribute values of the target information node may be determined by a preset strategy. When all or part of the input information is itself a node attribute value of the target information node, that value can be obtained directly. When the input information contains no node attribute value of the target information node, or contains values for only some of the target information nodes, the input information can be pre-processed and the processed data used as the node attribute values. The embodiments are not limited by the way the node attribute values of the target node are determined. For example, if the person name is set as the target information node, "Ma Yun" entered by the user can be used directly as its node attribute value. Alternatively, for the user input "Baidu joins hands with Ali to open up the shared bicycle market", processes such as entity recognition and relation extraction yield the node attribute values "Baidu" and "Ali" of the company name information node and the node attribute value "shared bicycle" of the product name information node, and the company subgraph then yields, from "Baidu" and "Ali", the corresponding node attribute values "Li Yanhong" and "Ma Yun" of the person name information node.
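The determination step can be sketched as follows. Note the substitution: a plain dictionary lookup stands in for the entity recognition and relation extraction mentioned in the text, purely to keep the example short. The table `KNOWN_VALUES`, the function name, and the English rendering of the sample sentence are assumptions.

```python
from typing import Dict, List

# Hypothetical vocabulary of known node attribute values per information node.
KNOWN_VALUES: Dict[str, List[str]] = {
    "company_name": ["Baidu", "Ali"],
    "person_name": ["Ma Yun", "Li Yanhong"],
    "product_name": ["shared bicycle"],
}


def determine_node_values(text: str, target_nodes: List[str]) -> Dict[str, List[str]]:
    """Return, per target information node, the known values found in the input."""
    found: Dict[str, List[str]] = {}
    for node in target_nodes:
        hits = [v for v in KNOWN_VALUES.get(node, []) if v in text]
        if hits:
            found[node] = hits
    return found


# Case 1: the input itself is a node attribute value.
direct = determine_node_values("Ma Yun", ["person_name"])

# Case 2: the input is a sentence that has to be processed first.
sentence = "Baidu joins hands with Ali to open up the shared bicycle market"
parsed = determine_node_values(sentence, ["company_name", "person_name", "product_name"])
```

In the second case no person name occurs in the sentence, so the person name node would have to be filled in afterwards from the company subgraph, as the text describes.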
In addition, for the same target information node, one or more node attribute values may be determined from the input information; the embodiments of the disclosure are not limited by the number of node attribute values determined for a target information node.
After the node attribute values of the target information node have been determined from the input information, in step S302 the map elements corresponding to the determined node attribute values are extracted from each class of subgraph.
In some embodiments, multiple map elements corresponding to a determined node attribute value may be extracted from a class of subgraph. For example, when several people share a name, the node attribute value "Ma Yun" of the target information node may correspond to multiple map elements in the company subgraph, each belonging to a different company name node attribute value. In some embodiments, no map element corresponding to the node attribute value of the target information node can be extracted from certain classes of subgraph; for example, when the node attribute value is "Kuaishou App", the stock subgraph may contain no map element corresponding to it. The embodiments are not limited by the number or source of the extracted map elements.
Fig. 4 shows a schematic diagram of extracting, according to input information, map elements corresponding to the input information from multiple classes of subgraphs according to an embodiment of the disclosure.
The above process is described more specifically with reference to Fig. 4. For example, if the preset target information nodes are the person name and the company name, then when the user's input information is "Lu Qi", the company subgraph is searched for the map element corresponding to "Lu Qi", yielding map element E_C1; the node attribute values of the company name information node K_C that correspond to "Lu Qi" in E_C1 can then be obtained. According to the embodiments of the disclosure, the first-level node attribute value of K_C may be taken as its node attribute value, giving "Baidu" as the company name node attribute value corresponding to the input "Lu Qi".
After the node attribute value "Lu Qi" of the person name information node and the node attribute value "Baidu" of the company name information node have been obtained, the map elements corresponding to these values are extracted from each of the constructed subgraphs. For example, if a company subgraph, a product subgraph and a stock subgraph currently exist, then based on "Baidu" and "Lu Qi" the corresponding company subgraph element E_C1, product subgraph element E_P1, and stock subgraph element E_G1 are extracted.
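The Fig. 4 walk-through can be sketched as a two-stage lookup. This is a simplified illustration: the dictionary layout, the helper `find_elements`, the convention that the first listed company value is the first-level one, and the contents of the product and stock elements (which the text does not give) are all assumptions.

```python
from typing import Dict, List

Element = Dict[str, object]
Targets = Dict[str, List[str]]


def find_elements(subgraph: List[Element], targets: Targets) -> List[Element]:
    """Return the elements that hold at least one of the target attribute values."""
    return [
        e for e in subgraph
        if any(
            v in e["values"].get(node, [])
            for node, vals in targets.items()
            for v in vals
        )
    ]


subgraphs: Dict[str, List[Element]] = {
    "company": [
        {"id": "E_C1", "values": {"company_name": ["Baidu", "iQiyi"],
                                  "person_name": ["Li Yanhong", "Lu Qi"]}},
        {"id": "E_C2", "values": {"company_name": ["DJI"],
                                  "person_name": ["Wang Tao"]}},
    ],
    # Illustrative contents only; the source does not list these values.
    "product": [{"id": "E_P1", "values": {"company_name": ["Baidu"],
                                          "product_name": ["Baidu Search"]}}],
    "stock": [{"id": "E_G1", "values": {"company_name": ["Baidu"],
                                        "stock_code": ["BIDU"]}}],
}

# Stage 1: resolve the input "Lu Qi" to a company via the company subgraph.
targets: Targets = {"person_name": ["Lu Qi"]}
hit = find_elements(subgraphs["company"], targets)[0]
targets["company_name"] = hit["values"]["company_name"][:1]  # first-level value

# Stage 2: extract matching elements from every class of subgraph.
extracted = {
    name: [e["id"] for e in find_elements(sg, targets)]
    for name, sg in subgraphs.items()
}
```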
After the map elements have been extracted, splicing them in step S102 to generate the information atlas corresponding to the input information may include: taking each determined node attribute value as a linking point, and linking the map elements corresponding to that node attribute value that were extracted from the multiple subgraphs.
Fig. 5 shows a schematic diagram of linking, with each determined node attribute value as a linking point, the map elements corresponding to that node attribute value extracted from the multiple subgraphs.
The splicing process is described more specifically with reference to Fig. 4 and Fig. 5. After multiple map elements have been extracted by the process shown in Fig. 4, the node attribute value "Baidu" of the company name information node and the node attribute value "Lu Qi" of the person name information node can be used as linking points, as shown in Fig. 5, to splice the company subgraph element E_C1, the product subgraph element E_P1, and the stock subgraph element E_G1 into an information atlas that has a company name information node, a person name information node, a stock information node, a product information node, and an industry name information node.
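The splicing step might look like the sketch below, which merges every element that shares a linking value into one node-to-values mapping. The flat dictionary output, the function name `splice`, and the product and stock values are assumptions; the disclosure only states that elements are linked at the shared node attribute values.

```python
from typing import Dict, List, Set

Element = Dict[str, List[str]]


def splice(elements: List[Element], link_values: Set[str]) -> Element:
    """Merge all elements that contain at least one linking value."""
    atlas: Element = {}
    for element in elements:
        all_values = {v for vals in element.values() for v in vals}
        if not (all_values & link_values):
            continue  # element shares no linking point, leave it out
        for node, vals in element.items():
            merged = atlas.setdefault(node, [])
            for v in vals:
                if v not in merged:
                    merged.append(v)
    return atlas


E_C1 = {"company_name": ["Baidu", "iQiyi"],
        "person_name": ["Li Yanhong", "Lu Qi"],
        "industry_name": ["Internet"]}
E_P1 = {"company_name": ["Baidu"], "product_name": ["Baidu Search"]}  # illustrative
E_G1 = {"company_name": ["Baidu"], "stock_code": ["BIDU"]}            # illustrative

atlas = splice([E_C1, E_P1, E_G1], {"Baidu", "Lu Qi"})
```

The resulting mapping carries company, person, industry, product and stock nodes, matching the five kinds of node named in the text.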
By extracting the node attribute values of target information node, being effectively associated with multiple and different classes based on input information Map element in other subgraph spectrum allows to generate the Information Atlas of the target information node with multiple classifications, thus The information of multiple classifications associated with information is inputted can be returned.
In some embodiments, the multiclass subgraph spectrum includes at least event subgraph and composes, and the input information is thing Part information, wherein the node attribute values that target information node is determined according to input information include: the event letter for being inputted Breath determines the node attribute values of target information node corresponding with the event information based on event subgraph spectrum.
For example, the input information is specific event title or the description for a certain event, such as it can be only Abbreviation including dependent event, or also may include the relevant personage of event, company, theme or concept name.The disclosure is not It is limited by the particular content of the event information inputted.It can for example input " Wei Zexi event ", or can also input " Ma Huateng takes Yang Zhenyu by the hand and initiates Science Explorations prize ".
It for example, can have at least two category information nodes in the event subgraph spectrum, such as can be entity class information section Point and theme class information node.For example including event title, Business Name, name, stock etc., theme in entity class information node Category information node is for example including subject name, concept name and semantic label etc..
According to the embodiment of the present disclosure, event subgraph spectrum includes multiple map elements, for each information node, each map Element has the node attribute values of the information node.
In some embodiments, in the map element of event subgraph spectrum, for the same information node, can have one The node attribute values of a or multiple information nodes.Wherein, which can be particular content, or be also possible to " default " or " sky ".The application is not by the number of the node attribute values of information node in the map element of event subgraph spectrum and specifically The limitation of content.
In some embodiments, for the node attribute values can further progress partition of the level, such as can be classified as First nodes attribute value and two-level node attribute value.Such as the map element of event subgraph spectrum may include corresponding to subject The node attribute values " mobile communication " and node attribute values " 5G communication " for claiming information node, wherein " 5G communication " is " mobile communication " Sub-topics concept, be based on the corresponding relationship, then " mobile communication " be first nodes attribute value, " 5G communication " be two-level node category Property value.The application is not limited by the different stage of the node attribute values in map element and its mutual corresponding relationship.
Fig. 6 shows the node category that target information node corresponding with the event information is determined based on event subgraph spectrum The schematic diagram of property value.
Referring to Fig. 6, it is specifically described and target information section corresponding with the event information is determined based on event subgraph spectrum The process of point.For the event information of " the Baidu's discussion bar Wei Zexi event " of user's input, as target information node, then Can be in event subgraph spectrum, lookup and map element corresponding to the event " Baidu's discussion bar Wei Zexi event " obtain map Element EV1;Further, dependent event " Baidu's termination and the conjunction of Putian hospital, system for obtaining the event can be composed by event subgraph Make ", and obtaining its relevant company as " Baidu is online ", relevant theme is " medical tangle ".It is based on the event as a result, Subgraph spectrum has determined the node attribute values of relevant to the event Business Name information node and subject information node.
After the node attribute values of the company name information node and of the subject information node related to the input event have been determined, these values may in turn be used as node attribute values of the target information node to search for the corresponding map elements in the company subgraph spectrum.
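The lookup described for Fig. 6 can be sketched as follows. This is a minimal illustration, assuming a map element is stored as a dictionary from information node name to a list of node attribute values; the names `EVENT_SUBGRAPH` and `target_values_from_event` and the node keys are hypothetical, and the strings are English renderings of the example values.

```python
# Hypothetical in-memory event subgraph spectrum: one dict per map element,
# mapping an information node name to its node attribute values.
EVENT_SUBGRAPH = [
    {   # map element EV1 of the Fig. 6 example
        "event": ["Baidu Tieba Wei Zexi event"],
        "related_event": ["Baidu terminates cooperation with Putian-system hospitals"],
        "company": ["Baidu Online"],
        "theme": ["medical dispute"],
    },
]


def target_values_from_event(event_name, event_subgraph, wanted=("company", "theme")):
    """Collect the node attribute values of the wanted information nodes for an event."""
    result = {node: [] for node in wanted}
    for element in event_subgraph:
        if event_name not in element.get("event", []):
            continue  # this map element does not describe the input event
        for node in wanted:
            for value in element.get(node, []):
                if value not in result[node]:
                    result[node].append(value)
    return result


targets = target_values_from_event("Baidu Tieba Wei Zexi event", EVENT_SUBGRAPH)
```

The returned company and theme values can then serve as target node attribute values for a second lookup in another class of subgraph spectrum.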
In some embodiments, before the map elements corresponding to the input information are extracted from the multiclass subgraph spectrum according to the input information, the method further includes a process of constructing the multiclass subgraph spectrum.
Fig. 7 shows an exemplary flowchart of a method 700 for constructing a subgraph spectrum according to an embodiment of the present disclosure.
According to an embodiment of the present disclosure, constructing each class of subgraph spectrum in the multiclass subgraph spectrum includes the following. First, in step S701, at least two information nodes are preset, and the correspondence between the information nodes is set. Next, in step S702, node attribute values of the information nodes are extracted from external data. After the node attribute values have been extracted, in step S703 the extracted node attribute values are associated to form map elements. Finally, in step S704, the resulting map elements are denoised to obtain the subgraph spectrum.
According to an embodiment of the present disclosure, a map element may have one or more node attribute values for the same information node. A node attribute value may be specific content, or may be "default" or "empty". The present application is not limited by the number or the specific content of the node attribute values of an information node in a map element.
In some embodiments, the node attribute values may be further divided into levels, for example into first-level node attribute values and second-level node attribute values. Taking a map element of the company subgraph spectrum as an example, it may include the node attribute values "Baidu" and "iQIYI", where "iQIYI" is a subsidiary of "Baidu". Based on this correspondence, "Baidu" is a first-level node attribute value and "iQIYI" is a second-level node attribute value. The present application is not limited by the levels of the node attribute values in a map element or by the correspondence between them.
According to an embodiment of the present disclosure, the map elements may be denoised, for example, by computing node importance: noisy node attribute values are filtered out according to the computed importance, which saves storage space for the map and speeds up queries; in addition, when a fuzzy query on the map hits several candidate nodes at once, the node with higher importance is selected preferentially. Alternatively, denoising may be based on node confidence, for example by combining the number of sources of a node with the confidence of each source to compute the node's confidence. Denoising may also be based on relation confidence, that is, the relation confidence is computed heuristically from the number of sources of the relation, the confidence of each source, the node confidences of both sides of the relation, and the number of co-occurrences of the out-node and the in-node. The present disclosure is not limited by the manner in which map elements are denoised.
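The text above names the inputs to the confidence scores but gives no formulas. The sketch below is one possible reading, not the patent's method: node confidence uses a noisy-OR combination of source confidences (an assumption), and relation confidence multiplies that evidence by both endpoint confidences and a saturating co-occurrence factor (also an assumption). The function names and the `saturation` parameter are hypothetical.

```python
from math import prod


def node_confidence(source_confidences):
    """Noisy-OR over sources: more sources and more reliable sources both raise the score."""
    if not source_confidences:
        return 0.0
    return 1.0 - prod(1.0 - c for c in source_confidences)


def relation_confidence(source_confidences, head_conf, tail_conf, cooccurrence, saturation=5):
    """Heuristic score from relation sources, endpoint confidences and co-occurrence count."""
    evidence = node_confidence(source_confidences)
    # Co-occurrence contributes between 0.5 and 1.0 and stops growing at `saturation`.
    cooc_factor = 0.5 + 0.5 * min(cooccurrence, saturation) / saturation
    return evidence * head_conf * tail_conf * cooc_factor
```

With two sources of confidence 0.5 each, `node_confidence` gives 0.75, so a second independent source raises the score without ever exceeding 1.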
Next, the process of constructing a subgraph spectrum is illustrated by constructing the company subgraph spectrum. First, as shown in step S701, company name, person name and industry may be selected as the preset information nodes, and their preset relationship may be set as follows: for each attribute value of the company information node, the corresponding attribute value of the person name information node is an individual employed at that company, and the corresponding attribute value of the industry information node is the industry in which the company operates. Further, for the different attribute values of the company name information node, a subordinate relationship between parent company and subsidiary may also be set.
Further, in step S702, the node attribute values of the information nodes are extracted from external data. For example, node attribute values of information nodes such as person name and company name may be extracted from unstructured text by a named entity recognition process.
Further, in step S703, the relationships between the extracted node attribute values may be obtained by a relation extraction process. The relationships may be obtained, for example, from structured tabular data on websites, or the relationships between node attribute values may be classified into predefined relationship categories on the basis of entity recognition. The node attribute values that satisfy the correspondence among the extracted node attribute values are then associated to form a map element. For example, the node attribute value "Baidu" of the company information node, the corresponding node attribute value "Li Yanhong" of the person name information node (an individual employed at that company), and the corresponding node attribute value "internet" of the industry information node (the industry in which the company operates) are associated with one another to obtain a map element.
The above association of node attribute values may be implemented, for example, by connecting them with line segments, or by placing them in a corresponding list; the present disclosure is not limited by the form used to associate the node attribute values. Further, when line segments are used to associate node attribute values, the relationship between the node attribute values may also be indicated on the line segment.
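The association step can be sketched as below, assuming a map element keeps both a per-node list of values and a list of labelled edges that play the role of the annotated line segments. The class name `MapElement`, the method `associate` and the relation labels are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class MapElement:
    # information node name -> list of node attribute values
    node_values: dict = field(default_factory=dict)
    # (value_a, relation_label, value_b) triples, i.e. labelled line segments
    edges: list = field(default_factory=list)

    def associate(self, node_a, value_a, relation, node_b, value_b):
        """Record both values under their information nodes and link them."""
        for node, value in ((node_a, value_a), (node_b, value_b)):
            values = self.node_values.setdefault(node, [])
            if value not in values:
                values.append(value)
        self.edges.append((value_a, relation, value_b))


element = MapElement()
element.associate("person", "Li Yanhong", "works_at", "company", "Baidu")
element.associate("company", "Baidu", "in_industry", "industry", "internet")
```

Keeping the relation label on the edge corresponds to indicating the relationship on the line segment; dropping `edges` and keeping only `node_values` would correspond to the list form.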
After multiple map elements have been generated, in step S704 the resulting map elements are denoised to obtain the subgraph spectrum. The denoising process may, for example, first compute the node importance, node confidence and relation confidence, and then take a weighted average of the three to screen out noisy, unstable data.
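A minimal sketch of the weighted-average screening follows. The weights, the threshold and the candidate scores are illustrative assumptions; the text does not specify any of them.

```python
def denoise(candidates, weights=(0.4, 0.3, 0.3), threshold=0.5):
    """Keep the values whose weighted-average score reaches the threshold.

    Each candidate is a dict with 'value', 'importance', 'node_conf' and
    'rel_conf', the three scores being in [0, 1].
    """
    w_imp, w_node, w_rel = weights
    total = w_imp + w_node + w_rel
    kept = []
    for c in candidates:
        score = (w_imp * c["importance"] + w_node * c["node_conf"] + w_rel * c["rel_conf"]) / total
        if score >= threshold:
            kept.append(c["value"])
    return kept


# Hypothetical scores for one clean value and one noisy extraction.
candidates = [
    {"value": "Baidu", "importance": 0.9, "node_conf": 0.9, "rel_conf": 0.8},
    {"value": "Baidu??", "importance": 0.1, "node_conf": 0.2, "rel_conf": 0.3},
]
kept = denoise(candidates)
```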
In some embodiments, denoising the map elements further includes disambiguating the node attribute values of the information nodes, which may include company name disambiguation and person name disambiguation.
For company name disambiguation, for each node attribute value of the company name information node, the node attribute values that satisfy a pre-established correspondence between company abbreviations and full names are associated with one another.
Fig. 8 shows an exemplary flowchart of a method 800 for company name disambiguation according to an embodiment of the present disclosure.
Referring to Fig. 8, first, in step S801, for each node attribute value of the company information node in the subgraph spectrum, the company's full name and its corresponding abbreviations are extracted from external data. Next, in step S802, based on the obtained full names and corresponding abbreviations, the correspondence between the full name and abbreviations of each company is obtained, and a company abbreviation/full-name subgraph spectrum is generated. Finally, in step S803, based on the abbreviation/full-name correspondence in that subgraph spectrum, the node attribute values that satisfy the correspondence between a full name and its abbreviations are associated.
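Steps S802 and S803 can be sketched as an alias index followed by grouping. The company names below are placeholders standing in for the external data of S801, and the function names are hypothetical.

```python
def build_alias_index(full_to_abbreviations):
    """S802: map every full name and every abbreviation to the canonical full name."""
    index = {}
    for full_name, abbreviations in full_to_abbreviations.items():
        index[full_name] = full_name
        for abbreviation in abbreviations:
            index[abbreviation] = full_name
    return index


def group_company_values(values, alias_index):
    """S803: associate node attribute values that refer to the same company."""
    groups = {}
    for value in values:
        canonical = alias_index.get(value, value)  # unknown names stay on their own
        groups.setdefault(canonical, []).append(value)
    return groups


# Placeholder full name and abbreviations (not real extracted data).
external = {"Example Technology Co., Ltd.": ["Example Tech", "ExTech"]}
index = build_alias_index(external)
groups = group_company_values(["ExTech", "Example Tech", "Other Corp"], index)
```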
In some embodiments, after the company abbreviation/full-name subgraph spectrum is obtained in step S802, the association of company full names and abbreviations need not be carried out by step S803. Instead, based on that subgraph spectrum, each company full name and its corresponding abbreviations are fed to the input of a neural network algorithm, which generates a company abbreviation/full-name relationship model. Thereafter, when the full name of a specified company is supplied at the input of the model, its corresponding abbreviations are obtained at the output, which gives the abbreviation/full-name relationship of the specified company. Based on that relationship, the node attribute values that satisfy the correspondence between the full name and its abbreviations can then be associated.
For person name disambiguation, the multiple attribute values of the company name information node associated with the same person name information node are split apart, and the node attribute value of each company name information node is used to identify the node attribute value of the person name information node, thereby eliminating the ambiguity of the person name node.
Fig. 9 shows an exemplary flowchart of a method for person name disambiguation according to an embodiment of the present disclosure.
Referring to Fig. 9, first, in step S901, for a node attribute value of the person name information node in a map element, the map element is divided with that node attribute value as the center point, yielding multiple mutually independent map sub-elements. Next, in step S902, for each of the mutually independent map sub-elements, the company name information nodes directly associated with the node attribute value of the person name information node, and their node attribute values, are extracted. Finally, in step S903, for each company name information node, the node attribute value of that company information node is used to identify the node attribute value of the person name information node.
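Steps S901 to S903 can be sketched on an edge-list view of a map element: removing the person value leaves connected components (the independent sub-elements), and each component's directly linked company value is used to tag the name. The edge-list representation, the `person@company` tag format and the company and industry placeholders are assumptions made for illustration.

```python
from collections import defaultdict


def split_around(edges, center):
    """S901: remove `center` and return the connected components that remain."""
    graph = defaultdict(set)
    for a, b in edges:
        if center in (a, b):
            other = b if a == center else a
            if other != center:
                graph.setdefault(other, set())  # keep former neighbours as vertices
            continue
        graph[a].add(b)
        graph[b].add(a)
    components, seen = [], set()
    for start in list(graph):
        if start in seen:
            continue
        component, stack = set(), [start]
        while stack:
            vertex = stack.pop()
            if vertex in component:
                continue
            component.add(vertex)
            stack.extend(graph[vertex] - component)
        seen |= component
        components.append(component)
    return components


def tag_person(edges, person, company_values):
    """S902-S903: tag the person value with the directly linked company of each component."""
    direct = {b if a == person else a for a, b in edges if person in (a, b)}
    tagged = []
    for component in split_around(edges, person):
        for company in sorted(component & direct & set(company_values)):
            tagged.append(f"{person}@{company}")
    return tagged


edges = [
    ("Ma Yun", "Company A"), ("Company A", "e-commerce"),
    ("Ma Yun", "Company B"), ("Company B", "catering"),
]
labels = tag_person(edges, "Ma Yun", ["Company A", "Company B"])
```

Each tagged value denotes a distinct person, so two people who share a name are no longer merged into one node.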
When the map element is divided with the node attribute value of the person name information node as the center point, the attribute value of the person name information node may, for example, be removed from the current map element, thereby yielding multiple mutually independent map sub-elements.
In some embodiments, extracting, in each of the mutually independent map sub-elements, the company name information nodes directly associated with the node attribute value of the person name information node and their node attribute values may include: when a map sub-element contains the node attribute values of multiple company name information nodes directly associated with the attribute value of the person name information node, computing the relation confidence between each of those company name node attribute values and the node attribute value of the person name information node; and extracting the node attribute value of the company name information node with the highest relation confidence.
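Selecting the company with the highest relation confidence reduces to an arg-max over the candidates. In this sketch the confidences are assumed to be already computed, and the values shown are made up for illustration.

```python
def most_confident_company(relation_confidences):
    """Return the company value with the highest relation confidence, or None if empty.

    relation_confidences: company node attribute value -> confidence of its
    relation to the person name node attribute value.
    """
    if not relation_confidences:
        return None
    return max(relation_confidences, key=relation_confidences.get)


best = most_confident_company({"Company A": 0.62, "Company B": 0.91})
```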
Identifying a person name by associating it with the corresponding company name avoids, when the person name information node is used as the target information node, wrong associations between subgraph spectra caused by different people sharing the same name, and avoids introducing excessive noisy and redundant subgraph spectrum elements.
Figure 10 shows an exemplary block diagram of an Information Atlas construction device 110 according to an embodiment of the present disclosure.
The Information Atlas construction device 110 shown in Figure 10 includes a map element extraction module 120 and an Information Atlas generation module 130.
The map element extraction module 120 is configured to extract, according to input information, map elements corresponding to the input information from the multiclass subgraph spectrum.
The Information Atlas generation module 130 is configured to splice the extracted map elements to generate an Information Atlas corresponding to the input information.
The input information may be query information entered directly by the user, or information that the computer system queries on its own in response to user input or control information. The embodiments of the present disclosure are not limited by the source of the input information or the manner in which it is input. For example, it may be query information entered by the user in a web search bar, or information generated after the computer preprocesses the user's query information.
The multiclass subgraph spectrum refers to multiple subgraph spectra that belong to different classes. At least some of the subgraph spectra in the multiclass subgraph spectrum have the same information node. In some embodiments, the multiclass subgraph spectrum may include at least two of: a concept subgraph spectrum, a company subgraph spectrum, a stock subgraph spectrum, an investment subgraph spectrum, a product subgraph spectrum, an event subgraph spectrum, and an industry subgraph spectrum.
In some embodiments, a map element may have one or more node attribute values for the same information node. A node attribute value may be specific content, or may be "default" or "empty". The present application is not limited by the number or the specific content of the node attribute values of an information node in a map element.
In some embodiments, the node attribute values may be further divided into levels, for example into first-level node attribute values and second-level node attribute values. Taking a map element of the company subgraph spectrum as an example, it may include the node attribute values "Baidu" and "iQIYI", where "iQIYI" is a subsidiary of "Baidu". Based on this correspondence, "Baidu" is a first-level node attribute value and "iQIYI" is a second-level node attribute value. The present application is not limited by the levels of the node attribute values in a map element or by the correspondence between them.
Fig. 2 shows a partial schematic diagram of a company subgraph spectrum 200 according to an embodiment of the present disclosure.
Referring to the company subgraph spectrum shown in Fig. 2, the above information nodes and map elements are described in more detail. The company subgraph spectrum includes three information nodes: the company name information node KC, the person name information node KH, and the industry name information node KS. The company subgraph spectrum includes, for example, map element EC1 and map element EC2, where map element EC1 further comprises node attribute values of multiple information nodes. As further shown in Fig. 2, map element EC1 includes the node attribute values "iQIYI" and "Baidu" of the company name information node, the node attribute values "Li Yanhong" and "Lu Qi" of the person name information node, and the node attribute value "internet" of the industry name information node. Map element EC2 includes the node attribute value "DJI" of the company name information node, the node attribute value "Wang Tao" of the person name information node, and the node attribute value "unmanned aerial vehicle" of the industry name information node. Further, based on the correspondence between node attribute values, "Baidu" is a first-level node attribute value and "iQIYI" is a second-level node attribute value.
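One way to hold the Fig. 2 example in memory is sketched below. The dictionary layout, the key names and the helper functions are hypothetical; the company names appear here as iQIYI and DJI, and a level of `None` marks values whose level the text does not state.

```python
# Hypothetical encoding of the company subgraph spectrum of Fig. 2.
# Per information node: node attribute value -> level (1 = first-level, 2 = second-level).
COMPANY_SUBGRAPH = {
    "EC1": {
        "company": {"Baidu": 1, "iQIYI": 2},
        "person": {"Li Yanhong": None, "Lu Qi": None},
        "industry": {"internet": None},
    },
    "EC2": {
        "company": {"DJI": None},
        "person": {"Wang Tao": None},
        "industry": {"unmanned aerial vehicle": None},
    },
}


def elements_with(subgraph, node, value):
    """Return the ids of the map elements holding `value` under information node `node`."""
    return [eid for eid, element in subgraph.items() if value in element.get(node, {})]


def values_at_level(subgraph, element_id, node, level):
    """Return the node attribute values of one information node at the given level."""
    return [v for v, lv in subgraph[element_id][node].items() if lv == level]
```

For instance, `elements_with(COMPANY_SUBGRAPH, "person", "Lu Qi")` identifies EC1 as the map element to extract when "Lu Qi" is a target node attribute value.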
In the map element extraction module 120, the process shown in Fig. 3 may be executed to extract, according to the input information, the map elements corresponding to the input information from the multiclass subgraph spectrum. The module may further include a target information node determining module 121 and a map element corresponding module 122.
The target information node determining module 121 is configured to perform the operation shown in step S301 of Fig. 3: determining the node attribute values of the target information node according to the input information.
According to an embodiment of the present disclosure, the target information node may be, for example, multiple information nodes from multiple classes of subgraph spectrum, multiple information nodes from a single class of subgraph spectrum, or a single information node in a single class of subgraph spectrum. The embodiments of the present disclosure are not limited by the type or number of the determined target information nodes. For example, the target information node may be set to company name only, or to the three nodes company name, person name and product name.
In some embodiments, the target information node includes at least one of the following: company name, organization name, person name, industry name, product name, stock code, stock name, concept name.
The node attribute values of the target information node may be determined based on a preset strategy. For example, when all or part of the content of the input information is itself a node attribute value of the target information node, the attribute value of the target information node can be obtained directly. When the content of the input information does not include a node attribute value of the target information node, or includes node attribute values of only some of the target information nodes, the input information may be preprocessed and the processed data used as the node attribute values of the target information node. The embodiments of the present disclosure are not limited by the manner of determining the node attribute values of the target node. For example, if person name is set as the target information node, "Ma Yun" input by the user may be used directly as the node attribute value of the target information node. Alternatively, for the user input "Baidu joins hands with Ali to open up the shared bicycle market", processes such as entity recognition and relation extraction may be used to obtain the node attribute values "Baidu" and "Ali" of the company name information node and the node attribute value "shared bicycle" of the product name information node, and the node attribute values "Li Yanhong" and "Ma Yun" of the corresponding person name information node may then be obtained from the company subgraph spectrum based on "Baidu" and "Ali".
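The two-stage determination in the shared-bicycle example can be sketched as follows. Plain substring matching against a small gazetteer stands in for the entity recognition process, which is a deliberate simplification; the names `GAZETTEER`, `COMPANY_TO_PERSON` and `determine_target_values` are hypothetical.

```python
# Hypothetical gazetteer standing in for an entity recognition model.
GAZETTEER = {
    "company": ["Baidu", "Ali"],
    "product": ["shared bicycle"],
}
# Hypothetical company -> person lookup derived from the company subgraph spectrum.
COMPANY_TO_PERSON = {"Baidu": ["Li Yanhong"], "Ali": ["Ma Yun"]}


def determine_target_values(text, gazetteer, company_to_person):
    """Find node attribute values mentioned in the input, then expand companies to persons."""
    lowered = text.lower()
    found = {
        node: [value for value in values if value.lower() in lowered]
        for node, values in gazetteer.items()
    }
    persons = []
    for company in found.get("company", []):
        for person in company_to_person.get(company, []):
            if person not in persons:
                persons.append(person)
    found["person"] = persons
    return found


query = "Baidu joins hands with Ali to open up the shared bicycle market"
targets = determine_target_values(query, GAZETTEER, COMPANY_TO_PERSON)
```

Substring matching will produce false hits on short names in real text, so a trained recognizer would replace the gazetteer lookup in practice.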
In addition, for the same target information node, one or more node attribute values may be determined based on the input information; the embodiments of the present disclosure are not limited by the number of node attribute values determined for the target information node.
The map element corresponding module 122 is configured to perform the operation shown in step S302 of Fig. 3: extracting, from each class of subgraph spectrum, the map elements corresponding to the determined node attribute values.
In some embodiments, multiple map elements corresponding to a determined node attribute value may be extracted from a class of subgraph spectrum. For example, when different people share a name, the node attribute value "Ma Yun" of the target information node may have multiple corresponding map elements in the company subgraph spectrum, each belonging under the node attribute value of a different company name. In some embodiments, no map element corresponding to the node attribute value of the target information node can be extracted from certain classes of subgraph spectrum. For example, when the node attribute value of the target information node is "Kuaishou App", the stock subgraph spectrum may not contain any map element corresponding to that node attribute value. The embodiments of the present disclosure are not limited by the number or the source of the extracted map elements.
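Both outcomes described above, several matches in one class and none in another, fall out of a simple per-class filter. This is a sketch under the assumption that each class of subgraph spectrum is a list of map elements stored as dictionaries; the company names and the stock code are placeholders.

```python
def extract_elements(subgraphs, targets):
    """Return, per class of subgraph spectrum, the map elements matching any target value.

    subgraphs: class name -> list of map elements (information node -> list of values).
    targets:   information node -> list of target node attribute values.
    """
    hits = {}
    for cls, elements in subgraphs.items():
        hits[cls] = [
            element for element in elements
            if any(value in element.get(node, [])
                   for node, values in targets.items()
                   for value in values)
        ]
    return hits


subgraphs = {
    "company": [
        {"company": ["Company A"], "person": ["Ma Yun"]},
        {"company": ["Company B"], "person": ["Ma Yun"]},  # a different person with the same name
    ],
    "stock": [
        {"company": ["Company C"], "stock_code": ["000000"]},
    ],
}
hits = extract_elements(subgraphs, {"person": ["Ma Yun"]})
```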
In some embodiments, the Information Atlas generation module 130 may further include a map element link module 131. The map element link module 131 may execute the process shown in Fig. 5: with each determined node attribute value as a linking point, linking the map elements corresponding to that node attribute value extracted from the multiple subgraph spectra.
Referring to Fig. 4 and Fig. 5, the map splicing process is described in more detail. After multiple map elements have been extracted by the process shown in Fig. 4, as shown in Fig. 5 the node attribute value "Baidu" of the company name information node and the node attribute value "Lu Qi" of the person name information node may be used as linking points to splice the company subgraph spectrum element EC1, the product subgraph spectrum element EP1 and the stock subgraph spectrum element EG1, generating an Information Atlas having the company name information node, the person name information node, the stock information node and the product name information node.
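Splicing on linking points can be sketched as a merge of every extracted map element that contains at least one linking value. The contents of EP1 and EG1 are not given in this passage, so the product and stock values below are placeholders, and the function name `splice` is hypothetical.

```python
def splice(elements, link_values):
    """Merge the map elements that contain a linking value into one Information Atlas."""
    links = set(link_values)
    atlas = {}
    for element in elements:
        present = {value for values in element.values() for value in values}
        if not present & links:
            continue  # not attached to any linking point
        for node, values in element.items():
            merged = atlas.setdefault(node, [])
            for value in values:
                if value not in merged:
                    merged.append(value)
    return atlas


EC1 = {"company": ["Baidu", "iQIYI"], "person": ["Li Yanhong", "Lu Qi"], "industry": ["internet"]}
EP1 = {"company": ["Baidu"], "product": ["Product X"]}   # placeholder product value
EG1 = {"company": ["Baidu"], "stock": ["TICKER"]}        # placeholder stock value
unrelated = {"company": ["DJI"], "person": ["Wang Tao"]}

atlas = splice([EC1, EP1, EG1, unrelated], ["Baidu", "Lu Qi"])
```

The result carries company, person, industry, product and stock information nodes in one structure, while the element that shares no linking value is left out.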
In some embodiments, the multiclass subgraph spectrum includes at least an event subgraph spectrum, and the input information is event information. The target information node determining module 121 may include an event information corresponding module 121', in which, for the input event information, the node attribute values of the target information node corresponding to the event information are determined based on the event subgraph spectrum.
For example, the input information is a specific event name or a description of an event. It may include only the abbreviated name of the event, or it may also include the persons, companies, themes or concept names related to the event. The present disclosure is not limited by the specific content of the input event information. For example, "Wei Zexi event" may be input, or "Ma Huateng joins hands with Yang Zhenyu to launch the Science Exploration Prize" may be input.
For example, the event subgraph spectrum may have at least two categories of information nodes, such as entity-class information nodes and theme-class information nodes. Entity-class information nodes include, for example, event name, company name, person name and stock; theme-class information nodes include, for example, subject name, concept name and semantic label.
According to an embodiment of the present disclosure, the event subgraph spectrum includes multiple map elements, and each map element has, for each information node, node attribute values of that information node.
In some embodiments, a map element of the event subgraph spectrum may have one or more node attribute values for the same information node. A node attribute value may be specific content, or may be "default" or "empty". The present application is not limited by the number or the specific content of the node attribute values of an information node in a map element of the event subgraph spectrum.
In some embodiments, the node attribute values may be further divided into levels, for example into first-level node attribute values and second-level node attribute values. For example, a map element of the event subgraph spectrum may include the node attribute values "mobile communication" and "5G communication" corresponding to the subject name information node, where "5G communication" is a sub-topic concept of "mobile communication". Based on this correspondence, "mobile communication" is a first-level node attribute value and "5G communication" is a second-level node attribute value. The present application is not limited by the levels of the node attribute values in a map element or by the correspondence between them.
Figure 11 shows an exemplary block diagram of an Information Atlas building equipment 950 according to an embodiment of the present disclosure.
The Information Atlas building equipment 950 shown in Figure 11 may be implemented as one or more dedicated or general-purpose computer system modules or components, such as a personal computer, laptop, tablet computer, mobile phone, personal digital assistant (PDA), or any smart portable device. The Information Atlas building equipment 950 may include at least one processor 960 and a memory 970.
The at least one processor is used to execute program instructions. The memory 970 may exist in the Information Atlas building equipment 950 in different forms of program storage units and data storage units, such as a hard disk, read-only memory (ROM) or random access memory (RAM), and can be used to store the various data files used while the processor processes and/or performs Information Atlas construction, as well as the program instructions executed by the processor. Although not shown in the figure, the Information Atlas building equipment 950 may also include an input/output component that supports input/output data flow between the Information Atlas building equipment 950 and other components (such as an image capture device 980). The Information Atlas building equipment 950 may also send and receive information and data over a network through a communication port.
In some embodiments, a set of instructions stored in the memory 970, when executed by the processor 960, causes the Information Atlas building equipment 950 to perform operations including: extracting, according to input information, map elements corresponding to the input information from the multiclass subgraph spectrum; and splicing the extracted map elements to generate an Information Atlas corresponding to the input information.
The input information may be query information entered directly by the user, or information that the computer system queries on its own in response to user input or control information. The embodiments of the present disclosure are not limited by the source of the input information or the manner in which it is input. For example, it may be query information entered by the user in a web search bar, or information generated after the computer preprocesses the user's query information.
The multiclass subgraph spectrum refers to multiple subgraph spectra that belong to different classes. At least some of the subgraph spectra in the multiclass subgraph spectrum have the same information node. In some embodiments, the multiclass subgraph spectrum may include at least two of: a concept subgraph spectrum, a company subgraph spectrum, a stock subgraph spectrum, an investment subgraph spectrum, a product subgraph spectrum, an event subgraph spectrum, and an industry subgraph spectrum.
According to an embodiment of the present disclosure, each class of subgraph spectrum may, for example, have at least two information nodes and include multiple map elements, and each map element has, for each information node, node attribute values of that information node.
In some embodiments, a map element may have one or more node attribute values for the same information node. A node attribute value may be specific content, or may be "default" or "empty". The present application is not limited by the number or the specific content of the node attribute values of an information node in a map element.
In some embodiments, the node attribute values may be further divided into levels, for example into first-level node attribute values and second-level node attribute values. Taking a map element of the company subgraph spectrum as an example, it may include the node attribute values "Baidu" and "iQIYI", where "iQIYI" is a subsidiary of "Baidu". Based on this correspondence, "Baidu" is a first-level node attribute value and "iQIYI" is a second-level node attribute value. The present application is not limited by the levels of the node attribute values in a map element or by the correspondence between them.
In some embodiments, extracting map elements corresponding to the input information from the multiclass subgraph spectrum according to the input information includes: determining the node attribute values of the target information node according to the input information; and extracting, from each class of subgraph spectrum, the map elements corresponding to the determined node attribute values.
According to an embodiment of the present disclosure, the target information node may be, for example, multiple information nodes from multiple classes of subgraph spectrum, multiple information nodes from a single class of subgraph spectrum, or a single information node in a single class of subgraph spectrum. The embodiments of the present disclosure are not limited by the type or number of the determined target information nodes. For example, the target information node may be set to company name only, or to the three nodes company name, person name and product name.
In some embodiments, the target information node includes at least one of the following: company name, organization name, person name, industry name, product name, stock code, stock name, concept name.
In addition, for the same target information node, one or more node attribute values may be determined based on the input information; the embodiments of the present disclosure are not limited by the number of node attribute values determined for the target information node.
In some embodiments, multiple map elements corresponding to a determined node attribute value may be extracted from a class of subgraph spectrum. For example, when different people share a name, the node attribute value "Ma Yun" of the target information node may have multiple corresponding map elements in the company subgraph spectrum, each belonging under the node attribute value of a different company name. In some embodiments, no map element corresponding to the node attribute value of the target information node can be extracted from certain classes of subgraph spectrum. For example, when the node attribute value of the target information node is "Kuaishou App", the stock subgraph spectrum may not contain any map element corresponding to that node attribute value. The embodiments of the present disclosure are not limited by the number or the source of the extracted map elements.
In some embodiments, the multiclass subgraph spectrum includes at least an event subgraph spectrum, and the input information is event information, in which case determining the node attribute values of the target information node according to the input information includes: for the input event information, determining, based on the event subgraph spectrum, the node attribute values of the target information node corresponding to the event information.
For example, the input information is a specific event name or a description of an event. It may include only the abbreviated name of the event, or it may also include the persons, companies, themes or concept names related to the event. The present disclosure is not limited by the specific content of the input event information. For example, "Wei Zexi event" may be input, or "Ma Huateng joins hands with Yang Zhenyu to launch the Science Exploration Prize" may be input.
For example, the event subgraph spectrum may have at least two categories of information nodes, such as entity-class information nodes and theme-class information nodes. Entity-class information nodes include, for example, event name, company name, person name and stock; theme-class information nodes include, for example, subject name, concept name and semantic label.
According to an embodiment of the present disclosure, the event subgraph spectrum includes multiple map elements, and each map element has, for each information node, node attribute values of that information node.
In some embodiments, in the map element of event subgraph spectrum, for the same information node, can have one The node attribute values of a or multiple information nodes.Wherein, which can be particular content, or be also possible to " default " or " sky ".The application is not by the number of the node attribute values of information node in the map element of event subgraph spectrum and specifically The limitation of content.
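The event subgraph elements described above, and the determination of target node attribute values from event information, might be sketched as follows. All names and data are hypothetical, and the substring matching rule is an assumption, since the disclosure does not specify how event information is matched against the event subgraph.

```python
from typing import Dict, List

# Hypothetical event subgraph: each map element holds, for every information
# node, a list of node attribute values; an empty list stands for "empty".
EVENT_SUBGRAPH: List[Dict[str, List[str]]] = [
    {
        "event_name": ["Example event"],
        "person_name": ["Person A", "Person B"],
        "company_name": [],
        "topic_name": ["Science prize"],
    },
]

# Assumed choice of target information nodes for this sketch.
TARGET_NODES = ("person_name", "company_name")


def target_values(event_info: str) -> Dict[str, List[str]]:
    """Collect the non-empty target-node attribute values of matching events."""
    found: Dict[str, List[str]] = {}
    for element in EVENT_SUBGRAPH:
        # Assumed matching rule: the stored event name occurs in the input text.
        if any(name in event_info for name in element["event_name"]):
            for node in TARGET_NODES:
                if element[node]:
                    found.setdefault(node, []).extend(element[node])
    return found


values = target_values("News about the Example event")
```

Here the empty company name node is skipped, so only the person name node contributes target node attribute values.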
In some embodiments, the node attribute values may be further divided into levels, for example into first-level node attribute values and second-level node attribute values. For example, a map element of the event subgraph may include the node attribute values "mobile communication" and "5G communication" for the topic name information node, where "5G communication" is a sub-topic of "mobile communication". Based on this relationship, "mobile communication" is a first-level node attribute value and "5G communication" is a second-level node attribute value. The present application is not limited by the levels of the node attribute values in a map element or by the correspondence between them.
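One possible way to represent such levels is a parent table from which the level of a value is derived. This is a sketch only; the `PARENT` table and the `level` helper are hypothetical and not part of the disclosure.

```python
from typing import Dict, Optional

# Hypothetical encoding: each topic value maps to its parent topic, or to None
# when it is a first-level node attribute value.
PARENT: Dict[str, Optional[str]] = {
    "mobile communication": None,
    "5G communication": "mobile communication",
}


def level(value: str) -> int:
    """Return 1 for a first-level value, 2 for its sub-topic, and so on."""
    depth = 1
    parent = PARENT[value]
    while parent is not None:
        depth += 1
        parent = PARENT[parent]
    return depth
```

Deriving the level from the parent relation, instead of storing it, allows more than two levels without changing the data layout.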
In some embodiments, the Information Atlas construction equipment 950 may receive input information from an input device external to the Information Atlas construction equipment 950 and, based on that input information, execute the Information Atlas construction method described above and implement the functions of the Information Atlas construction device described above.
In some embodiments, the method further includes a process of constructing the multiple classes of subgraphs before the map elements corresponding to the input information are extracted from the multiple classes of subgraphs according to the input information.
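The claims describe this construction as setting information nodes, extracting node attribute values from external data, associating them into map elements, and denoising the result. A minimal sketch of those steps follows, assuming hypothetical records and a hand-written table of company short names and full names; normalizing to the full name is a simplification of the association of name variants, not the method of the disclosure.

```python
from typing import Dict, List, Tuple

# Hypothetical short-name -> full-name correspondence, assumed pre-established.
FULL_NAME: Dict[str, str] = {"Acme": "Acme Holdings Ltd."}

# Hypothetical external data records.
RECORDS: List[Dict[str, str]] = [
    {"company": "Acme", "person": "Person A"},
    {"company": "Acme Holdings Ltd.", "person": "Person A"},
    {"company": "Other Co.", "person": "Person B"},
]


def build_subgraph(records: List[Dict[str, str]]) -> List[Tuple[str, str]]:
    """Build (company_name, person_name) map elements and merge name variants."""
    elements: List[Tuple[str, str]] = []
    for record in records:
        # Denoising step: normalize a short company name to its full name.
        company = FULL_NAME.get(record["company"], record["company"])
        element = (company, record["person"])
        if element not in elements:  # drop duplicates created by normalization
            elements.append(element)
    return elements


subgraph = build_subgraph(RECORDS)
```

In this sketch the first two records collapse into a single map element, because the short name and the full name refer to the same company.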
Although the processor 960 and the memory 970 are shown as separate modules in Figure 10, those skilled in the art will understand that these modules may be implemented as separate hardware devices or integrated into one or more hardware devices. As long as the principles described in the present disclosure can be realized, the specific implementation of the hardware devices should not be regarded as a factor limiting the scope of protection of the present disclosure.
The present application uses specific terms to describe its embodiments. Terms such as "first/second embodiment", "an embodiment", and/or "some embodiments" refer to a feature, structure, or characteristic related to at least one embodiment of the present application. Therefore, it should be emphasized and noted that two or more references to "an embodiment", "one embodiment", or "an alternative embodiment" in different places in this specification do not necessarily refer to the same embodiment. In addition, certain features, structures, or characteristics of one or more embodiments of the present application may be combined as appropriate.
In addition, those skilled in the art will understand that aspects of the present application may be illustrated and described in terms of several patentable categories or contexts, including any new and useful process, machine, product, or composition of matter, or any new and useful improvement thereof. Accordingly, aspects of the present application may be executed entirely by hardware, entirely by software (including firmware, resident software, microcode, and the like), or by a combination of hardware and software. The above hardware or software may be referred to as a "data block", "module", "engine", "unit", "component", or "system". In addition, aspects of the present application may be embodied as a computer product located in one or more computer-readable media, the product including computer-readable program code.
Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the present invention belongs. It should also be understood that terms such as those defined in commonly used dictionaries should be interpreted as having a meaning consistent with their meaning in the context of the relevant art, and should not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
The above is a description of the present invention and should not be regarded as limiting it. Although several exemplary embodiments of the present invention have been described, those skilled in the art will readily appreciate that many modifications can be made to the exemplary embodiments without departing from the teachings and advantages of the present invention. Accordingly, all such modifications are intended to be included within the scope of the present invention as defined by the claims. It should be understood that the above is a description of the present invention, which should not be regarded as limited to the specific embodiments disclosed, and that modifications to the disclosed embodiments and other embodiments are intended to be included within the scope of the appended claims. The present invention is defined by the claims and their equivalents.

Claims (14)

1. An Information Atlas construction method, comprising:
extracting, according to input information, map elements corresponding to the input information from multiple classes of subgraphs; and
splicing the extracted map elements to generate an Information Atlas corresponding to the input information;
wherein at least some of the multiple classes of subgraphs have the same information node, and extracting, according to the input information, the map elements corresponding to the input information from the multiple classes of subgraphs comprises:
determining node attribute values of a target information node according to the input information; and
extracting map elements corresponding to the determined node attribute values from each class of subgraph.
2. The Information Atlas construction method of claim 1, wherein each class of subgraph has at least two information nodes and includes multiple map elements, and each map element has a node attribute value for each information node.
3. The Information Atlas construction method of claim 2, wherein splicing the extracted map elements to generate the Information Atlas corresponding to the input information comprises:
using each determined node attribute value as a linking point, linking the map elements corresponding to that node attribute value that were extracted from the multiple subgraphs.
4. The Information Atlas construction method of claim 1, wherein the multiple classes of subgraphs include at least two of: a concept subgraph, a company subgraph, a stock subgraph, an investment subgraph, a product subgraph, an event subgraph, and an industry subgraph.
5. The Information Atlas construction method of claim 2, wherein the target information node includes at least one of: company name, organization name, person name, film name, product name, stock code, stock name, and concept name.
6. The Information Atlas construction method of claim 2, wherein the multiple classes of subgraphs include at least an event subgraph, and the input information is event information,
wherein determining the node attribute values of the target information node according to the input information comprises:
for the input event information, determining, based on the event subgraph, the node attribute values of the target information node corresponding to the event information.
7. The Information Atlas construction method of claim 1, further comprising constructing the multiple classes of subgraphs before extracting, according to the input information, the map elements corresponding to the input information from the multiple classes of subgraphs, wherein constructing each class of subgraph comprises:
setting at least two information nodes and setting the correspondence between the information nodes;
extracting node attribute values of the information nodes from external data;
associating the extracted node attribute values to form map elements; and
denoising the obtained map elements to obtain the subgraph.
8. The Information Atlas construction method of claim 7, wherein the at least two information nodes include a company name information node, and denoising the obtained map elements comprises:
for each node attribute value of the company name information node, associating, based on a pre-established correspondence between company short names and full names, the multiple node attribute values that satisfy that correspondence.
9. The Information Atlas construction method of claim 7, wherein the at least two information nodes include a person name information node, and denoising the obtained map elements comprises:
splitting the multiple attribute values of the company name information node that are associated with the same person name information node, and using the node attribute value of each company name information node to identify the node attribute value of the person name information node, so as to eliminate ambiguity of the person name node.
10. An Information Atlas construction device, comprising:
a map element extraction module configured to extract, according to input information, map elements corresponding to the input information from multiple classes of subgraphs; and
an Information Atlas generation module configured to splice the extracted map elements to generate an Information Atlas corresponding to the input information.
11. The Information Atlas construction device of claim 10, wherein each class of subgraph has at least two information nodes and includes multiple map elements, and each map element has a node attribute value for each information node;
wherein the map element extraction module comprises:
a target information node determining module configured to determine node attribute values of a target information node according to the input information; and
a map element correspondence module configured to extract map elements corresponding to the determined node attribute values from each class of subgraph.
12. The Information Atlas construction device of claim 10, wherein the Information Atlas generation module comprises:
a map element linking module configured to use each determined node attribute value as a linking point and to link the map elements corresponding to that node attribute value that were extracted from the multiple subgraphs.
13. Information Atlas construction equipment, wherein the equipment includes a processor and a memory, the memory includes a set of instructions, and the set of instructions, when executed by the processor, causes the Information Atlas construction equipment to perform operations comprising:
extracting, according to input information, map elements corresponding to the input information from multiple classes of subgraphs; and
splicing the extracted map elements to generate an Information Atlas corresponding to the input information;
wherein at least some of the multiple classes of subgraphs have the same information node, and extracting, according to the input information, the map elements corresponding to the input information from the multiple classes of subgraphs comprises:
determining node attribute values of a target information node according to the input information; and
extracting map elements corresponding to the determined node attribute values from each class of subgraph.
14. The Information Atlas construction equipment of claim 13, wherein each class of subgraph has at least two information nodes and includes multiple map elements, and each map element has a node attribute value for each information node.
CN201910114989.1A 2019-02-14 2019-02-14 Information map construction method, device and equipment Active CN110162637B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910114989.1A CN110162637B (en) 2019-02-14 2019-02-14 Information map construction method, device and equipment

Publications (2)

Publication Number Publication Date
CN110162637A true CN110162637A (en) 2019-08-23
CN110162637B CN110162637B (en) 2023-06-20

Family

ID=67644878

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910114989.1A Active CN110162637B (en) 2019-02-14 2019-02-14 Information map construction method, device and equipment

Country Status (1)

Country Link
CN (1) CN110162637B (en)

Also Published As

Publication number Publication date
CN110162637B (en) 2023-06-20

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant