CN108829858B - Data query method and device and computer readable storage medium - Google Patents

Data query method and device and computer readable storage medium Download PDF

Info

Publication number
CN108829858B
CN108829858B CN201810647001.3A CN201810647001A CN108829858B CN 108829858 B CN108829858 B CN 108829858B CN 201810647001 A CN201810647001 A CN 201810647001A CN 108829858 B CN108829858 B CN 108829858B
Authority
CN
China
Prior art keywords
data
query
user
entity
entities
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810647001.3A
Other languages
Chinese (zh)
Other versions
CN108829858A (en
Inventor
黄正元
龚杰
孙俊
李伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
JD Digital Technology Holdings Co Ltd
Jingdong Technology Holding Co Ltd
Original Assignee
JD Digital Technology Holdings Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by JD Digital Technology Holdings Co Ltd filed Critical JD Digital Technology Holdings Co Ltd
Priority to CN201810647001.3A priority Critical patent/CN108829858B/en
Publication of CN108829858A publication Critical patent/CN108829858A/en
Application granted granted Critical
Publication of CN108829858B publication Critical patent/CN108829858B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The disclosure provides a data query method, a data query device and a computer readable storage medium, and relates to the technical field of data processing. The data query method comprises the following steps: receiving query information input by a user; carrying out natural language processing on query information input by a user, and analyzing to obtain query keywords; determining a query entity corresponding to a query keyword in a knowledge graph; the data chains in the knowledge-graph associated with the query entity are returned to the user. According to the method and the device, the relevant data chain in the knowledge graph is returned to the user after the user inputs the query information, and the upstream and downstream incidence relation of the relevant data information can be fully displayed for the user, so that the range of data analysis of the user is expanded, and the more comprehensive data analysis of the user is facilitated.

Description

Data query method and device and computer readable storage medium
Technical Field
The present disclosure relates to the field of data processing technologies, and in particular, to a data query method and apparatus, and a computer-readable storage medium.
Background
Today, as the internet rapidly develops, information is abundantly present in unstructured textual data, large numbers of semi-structured forms and web pages, and structured data of production systems. The financial industry is much more sensitive to various news events than other industries and therefore relies heavily on the accuracy, comprehensiveness, and timeliness of news events. More and more data and information can be collected to the financial industry user in time, and the relationship of thousands of threads can be analyzed from the data and the information, so that the working efficiency of the user can be greatly improved, the user is assisted to make the most accurate judgment, and the maximum return value is obtained.
Financial industry users often need to go through several steps before making a final investment or decision. First, manual data collection and processing are performed, data are collected from various news sources (such as search engines, news portals, government announcements, forums, and the like), so that a large amount of news information is obtained, and data which are invalid for users are filtered out according to personal experiences of financial users. And then, analyzing and summarizing classified data, wherein after information is acquired, a user needs to extract key points of each information event, summarize and classify the key points, and form a document, a report or store the document, the report or the document into a structured data system (such as an Excel or mysql database). Because the amount of data is very large, the result of data storage may lack some information originally provided by the data for search convenience and query efficiency. And finally, making a decision, wherein after the data is classified, the user can carry out data analysis, judge the influence of the occurrence of the news event on the investment subject and the influence range, and then make a final decision according to the data analysis result.
Disclosure of Invention
One technical problem that this disclosure solves is how to achieve a more comprehensive data analysis.
According to an aspect of an embodiment of the present disclosure, there is provided a data query method including: receiving query information input by a user; carrying out natural language processing on query information input by a user, and analyzing to obtain query keywords; determining a query entity corresponding to a query keyword in a knowledge graph; the data chains in the knowledge-graph associated with the query entity are returned to the user.
In some embodiments, determining a query entity corresponding to the query keyword in the knowledge-graph comprises: determining an initial query entity corresponding to a query keyword in a knowledge graph; returning to the user the data chains in the knowledge-graph associated with the query entity includes: and returning a data chain which takes the initial query entity as an initial node in the knowledge graph to the user.
In some embodiments, determining a query entity corresponding to the query keyword in the knowledge-graph comprises: determining a starting query entity and an ending query entity corresponding to a query keyword in a knowledge graph; returning to the user the data chains in the knowledge-graph associated with the query entity includes: and returning a data chain which takes the initial query entity as an initial node and takes the end query entity as an end node in the knowledge graph to the user.
In some embodiments, the data chains associated with the query entity in the knowledge-graph are returned to the user in descending order of the number of relationships contained in the data chains.
In some embodiments, the data query method further comprises: collecting database data; carrying out natural language processing on database data, and extracting entities and relations among the entities; and generating a knowledge graph according to the entities and the relationship among the entities.
In some embodiments, natural language processing the database data to extract entities and relationships between entities includes: extracting data keywords from database data by using a natural language processing model; taking the data key words with the word frequency-reverse document frequency higher than a first threshold value as entities; relationships between entities are extracted from database data using a natural language processing model.
In some embodiments, collecting database data comprises: configuring a data source address list, a starting page number, an ending page number and acquisition time; according to the acquisition time, news data determined by a data source address list, a starting page number and an ending page number are automatically extracted; and analyzing to obtain the title and text data in the news data, and storing the title and text data in the database.
According to another aspect of the embodiments of the present disclosure, there is provided a data query apparatus including: the information receiving module is configured to receive query information input by a user; the keyword analysis module is configured to perform natural language processing on query information input by a user and analyze the query information to obtain a query keyword; an entity determination module configured to determine a query entity corresponding to the query keyword in the knowledge graph; a data return module configured to return to the user a data chain in the knowledge-graph associated with the query entity.
In some embodiments, the entity determination module is configured to: determining an initial query entity corresponding to a query keyword in a knowledge graph; the data return module is configured to: and returning a data chain which takes the initial query entity as an initial node in the knowledge graph to the user.
In some embodiments, the entity determination module is configured to: determining a starting query entity and an ending query entity corresponding to a query keyword in a knowledge graph; the data return module is configured to: and returning a data chain which takes the initial query entity as an initial node and takes the end query entity as an end node in the knowledge graph to the user.
In some embodiments, the data return module is configured to: and returning the data chains associated with the query entity in the knowledge graph to the user according to the sequence of the relationship numbers contained in the data chains from large to small.
In some embodiments, the data querying device further comprises a knowledge-graph generation module configured to: collecting database data; carrying out natural language processing on database data, and extracting entities and relations among the entities; and generating a knowledge graph according to the entities and the relationship among the entities.
In some embodiments, the knowledge-graph generation module is configured to: extracting data keywords from database data by using a natural language processing model; taking the data key words with the word frequency-reverse document frequency higher than a first threshold value as entities; relationships between entities are extracted from database data using a natural language processing model.
In some embodiments, the knowledge-graph generation module is configured to: configuring a data source address list, a starting page number, an ending page number and acquisition time; according to the acquisition time, news data determined by a data source address list, a starting page number and an ending page number are automatically extracted; and analyzing to obtain the title and text data in the news data, and storing the title and text data in the database.
According to still another aspect of the embodiments of the present disclosure, there is provided a data query apparatus including: a memory; and a processor coupled to the memory, the processor configured to execute the aforementioned data query method based on instructions stored in the memory.
According to still another aspect of the embodiments of the present disclosure, there is provided a computer-readable storage medium, wherein the computer-readable storage medium stores computer instructions, and the computer instructions, when executed by a processor, implement the aforementioned data query method.
According to the method and the device, the relevant data chain in the knowledge graph is returned to the user after the user inputs the query information, and the upstream and downstream incidence relation of the relevant data information can be fully displayed for the user, so that the range of data analysis of the user is expanded, and the more comprehensive data analysis of the user is facilitated.
Other features of the present disclosure and advantages thereof will become apparent from the following detailed description of exemplary embodiments thereof, which proceeds with reference to the accompanying drawings.
Drawings
In order to more clearly illustrate the embodiments of the present disclosure or the technical solutions in the prior art, the drawings needed to be used in the description of the embodiments or the prior art will be briefly introduced below, it is obvious that the drawings in the following description are only some embodiments of the present disclosure, and for those skilled in the art, other drawings can be obtained according to the drawings without inventive exercise.
FIG. 1 shows a flow diagram of one embodiment of a process for generating a knowledge-graph.
FIG. 2 shows a schematic diagram of a knowledge-graph.
FIG. 3 shows a flow diagram of one embodiment of the disclosed data query method.
FIG. 4 shows a complete workflow diagram for data query federated data collection.
Fig. 5 shows a schematic flow chart of a data query device according to an embodiment of the present disclosure.
Fig. 6 shows a schematic structural diagram of a data query device according to another embodiment of the present disclosure.
Detailed Description
The technical solutions in the embodiments of the present disclosure will be clearly and completely described below with reference to the drawings in the embodiments of the present disclosure, and it is obvious that the described embodiments are only a part of the embodiments of the present disclosure, and not all of the embodiments. The following description of at least one exemplary embodiment is merely illustrative in nature and is in no way intended to limit the disclosure, its application, or uses. All other embodiments, which can be derived by a person skilled in the art from the embodiments disclosed herein without making any creative effort, shall fall within the protection scope of the present disclosure.
The inventor researches and discovers that in the related art, in order to enable an end user to concentrate on event analysis, data collection personnel and the end user are not the same person or the same team generally, so that the result is that the two parties have different opinions on key points of collected information, important data or certain details are certainly lost, the final data analysis is influenced, and the risk degree of investment of the user is improved. The data analysis and summarization results are solidified into the document or the database for the convenience of the user to view, however, the document usually has a fixed template, and the database has a fixed field, so that the data processing process cannot dynamically analyze the precursor consequence of the event according to the attribute of the current information, and the data is incomplete. In addition, as data is incrementally stored, historical data may gradually accumulate, resulting in a slow data query process.
The data query method of the present disclosure is described in detail below in steps.
Generating a knowledge graph
One embodiment of a process for generating a knowledge-graph is first described in conjunction with FIG. 1.
FIG. 1 shows a flow diagram of one embodiment of a process for generating a knowledge-graph. As shown in fig. 1, the process of generating the knowledge-graph in the present embodiment includes steps S102 to S106.
In step S102, database data is collected.
In the related technology, the required information is collected from a plurality of news sources completely by manpower, so that a large amount of manpower is needed to complete complex and heavy labor every day, time waste is caused, and when a data source is newly added, certain manpower is inevitably needed to collect new data source information.
In order to reduce the difficulty degree of data acquisition, the embodiment realizes automatic acquisition of news data so as to save labor and time cost. First, a data source address list, a start page number, an end page number, and a collection time are configured. Then, according to the acquisition time, news data determined by the data source address list, the starting page number and the ending page number are automatically extracted. And finally, analyzing to obtain the title and text data in the news data, and storing the title and text data in a database for the subsequent natural language processing. If the endpoint focuses on the knowledge map generation of financial events, news data related to finance and finance can be collected.
In step S104, natural language processing is performed on the database data, and entities and relationships between the entities are extracted. In particular, a natural language processing model may be utilized to extract data keywords from database data. For example for news data: "Monday (11 days 6), Zhang III has reached Singapore, i.e., will have a historical meeting with Korean Liqu. It is known that the single meeting of zhang with lien four will be 9 am on local time 12 days: 15, act. After the session is over, Zhang III will hold the reporter and leave Singapore and return to the United states about 8 a.m. The 'deep V' market is presented in the gold day of the spot shipment, and the thousand-three major relations are broken through. However, the rare pre-negotiation in the united states before the 'individual meeting' and the upward income rate of national debt in the 10-year period in the united states lead to the dangerous emotion beginning to fall back and the gold price wandering around the $ 1300 gateway ". In this embodiment, the natural language processing model performs semantic segmentation on the text in the news data, and segments the text into terms such as zhang san, singapore, korean, and the like; then, labeling each word according to corresponding part of speech, such as: zhang san (name of person), singapore (place name); finally, the required annotated entities, including names of people, places, numbers, etc., are retrieved.
Optionally, the natural language processing model may use a word frequency-reverse document frequency algorithm, and take the data keyword whose word frequency-reverse document frequency is higher than the first threshold as an entity. When the word frequency-reverse document frequency algorithm is specifically applied, the calculation formula of the word frequency-reverse document frequency is as follows:
X=Y*lg{M/(N+1)}
wherein, X represents the word frequency of a word-the reverse document frequency, Y represents the word frequency of a word, namely the total number of occurrences of the word in the article, M represents the total number of documents in the corpus storing all news data, and N represents the number of documents containing the word in the corpus. The idea of the word frequency-inverse document frequency algorithm is that a word or phrase is considered to have a good degree of discrimination if it occurs with high frequency in one article and rarely in other articles. And taking the data key word with a high word frequency-reverse document frequency value as an entity, which is beneficial to generating a knowledge graph representing news key points.
Relationships between entities can be further extracted from database data using a natural language processing model. For example, the database data records the text information "influence of 618 on the stock market". Then, the natural language processing model first cuts the text into "618", "pair", "stock market", "influence", and then labels the part of speech of each word. "618 (noun)", "stock market (noun)", "pair (preposition)". The noun in the text is an entity, and the "influence on …" in the text can determine the relationship between the entities, i.e. the starting entity is "618" and the ending entity is "stock market". It will be understood by those skilled in the art that if the text message has only one entity, the default is the originating entity.
When an entity with the same content appears, the entity cannot be replaced, and only one relationship is added to the node corresponding to the entity to point to the next node. Finally, a complex network with a plurality of data is formed, and the knowledge graph is formed.
In step S106, a knowledge-graph is generated from the entities and relationships between the entities.
FIG. 2 shows a schematic diagram of a knowledge-graph. Neo4j, OrientDB, TITAN, preferably Neo4j, may be used in storing the knowledge graph. Neo4j is a high-performance graph database that stores structured data on a network rather than in tables. It is an embedded, disk-based Java persistence engine with full transactional properties, but it stores structured data on a graph rather than in a table. Neo4j can also be viewed as a high performance graph engine with all the features of a full database. This allows the structured information contained in the collected news data to be characterized in the Neo4j database.
In the embodiment, data source data are automatically collected, events are automatically processed through natural language processing, entities obtained through analysis and relationships among the entities are stored in the knowledge graph, so that the entities or the relationships related to the events can be displayed to users in a knowledge graph mode, and the users can be helped to quickly obtain related event information.
(II) data query
One embodiment of the disclosed data query method is described below in conjunction with fig. 3.
FIG. 3 shows a flow diagram of one embodiment of the disclosed data query method. As shown in fig. 3, the data query method in the present embodiment includes steps S302 to S308.
In step S302, query information input by a user is received.
In step S304, natural language processing is performed on the query information input by the user, and the query information is analyzed to obtain a query keyword.
In step S306, a query entity corresponding to the query keyword is determined in the knowledge-graph.
In step S308, the data chain in the knowledge-graph associated with the querying entity is returned to the user.
The query process can be roughly divided into two cases according to the query information input by the user.
The first situation is that the query information input by the user is analyzed by using natural language processing, after the query keyword is obtained, the initial query entity and the end query entity corresponding to the query keyword are determined in the knowledge graph, and then a data chain in which the initial query entity is used as the initial node and the end query entity is used as the end node in the knowledge graph is returned to the user.
For example, the query information input by the user is "three meet with lie four". Analyzing the Zhang Sanzhuilian Liqu to obtain Zhang Sanlian and Liqu. Determining a starting query entity 'Zhang three' and an ending query entity 'Liquan' in the knowledge graph. Two data chains are finally inquired, wherein the first data chain is ' three-piece ' - > ' American ' single meeting ' - > ' Korean ' - > ' LiSi four ', and the second data chain is ' three-piece ' - > ' sane ' - > ' Korean ' - > ' LiSi four '.
The second situation is that the query information input by the user is analyzed by using natural language processing, an initial query entity corresponding to the query keyword is determined in the knowledge graph after the query keyword is obtained, and then a data chain taking the initial query entity as an initial node in the knowledge graph is returned to the user.
When the data chain is determined, the related end entity starts to be traversed by the initial query entity according to the depth priority rule, and then the related end node continues to be traversed by taking the end entity as the start until the traversal is finished or a certain condition is reached.
Optionally, in step S308, the data chains associated with the query entity in the knowledge graph are returned to the user in the order from large to small of the number of relationships included in the data chains. That is, the more and higher the priority of the relationship tree existing between two hit entities, the more the relationship number will be displayed preferentially. And finally, preferentially returning node data with a large incidence relation according to the sequencing value of the traversal times.
For example, the first data chain "zhangsan" - > "usa" - > "individual meeting" - > "korean" - > "prune four" takes precedence over the second data chain "zhangsan" - > "sanctioning" - > "korean" - > "prune four"
Optionally, if an entity in any map is missed in the query process, the text information may be passed to the information collection stage through middleware (e.g., kafka), and a data collection instruction containing the specified text is executed. FIG. 4 shows a complete workflow diagram for data query federated data collection.
In the embodiment, the relevant data chain in the knowledge graph is returned to the user after the user inputs the query information, and the upstream and downstream association relation of the relevant data information can be fully displayed for the user, so that the data analysis range of the user is expanded, and the user can perform more comprehensive data analysis.
Furthermore, in graph databases, relationships are the most important elements by which entities can be associated with each other to build related complex models. Each node in the graph database model directly comprises a relationship list, and relationship records of the node and other nodes are stored in the relationship list. These relationship records are organized by type and orientation, and additional attributes may be saved. Whenever a JOIN operation similar to a relational database is run, the graph database directly accesses the connected nodes by using the list without performing recorded searching and matching calculation operations, thereby improving the efficiency and stability of information query.
A data query apparatus according to an embodiment of the present disclosure is described below with reference to fig. 5.
Fig. 5 shows a schematic flow chart of a data query device according to an embodiment of the present disclosure. As shown in fig. 5, the data query device 50 in the present embodiment includes:
an information receiving module 502 configured to receive query information input by a user.
And a keyword analysis module 504 configured to perform natural language processing on the query information input by the user and analyze the query information to obtain a query keyword.
An entity determination module 506 configured to determine a query entity corresponding to the query keyword in the knowledge-graph.
A data return module 508 configured to return data chains in the knowledge-graph associated with the query entity to the user.
In some embodiments, the entity determination module 506 is configured to: determining an initial query entity corresponding to a query keyword in a knowledge graph; the data return module 508 is configured to: and returning a data chain which takes the initial query entity as an initial node in the knowledge graph to the user.
In some embodiments, the entity determination module 506 is configured to: determining a starting query entity and an ending query entity corresponding to a query keyword in a knowledge graph; the data return module 508 is configured to: and returning a data chain which takes the initial query entity as an initial node and takes the end query entity as an end node in the knowledge graph to the user.
In some embodiments, the data return module 508 is configured to: and returning the data chains associated with the query entity in the knowledge graph to the user according to the sequence of the relationship numbers contained in the data chains from large to small.
In the embodiment, the relevant data chain in the knowledge graph is returned to the user after the user inputs the query information, and the upstream and downstream association relation of the relevant data information can be fully displayed for the user, so that the data analysis range of the user is expanded, and the user can perform more comprehensive data analysis.
In some embodiments, the data query device 50 further comprises a knowledge-graph generation module 500 configured to: collecting database data; carrying out natural language processing on database data, and extracting entities and relations among the entities; and generating a knowledge graph according to the entities and the relationship among the entities.
In some embodiments, the knowledge-graph generation module 500 is configured to: extracting data keywords from database data by using a natural language processing model; taking the data key words with the word frequency-reverse document frequency higher than a first threshold value as entities; relationships between entities are extracted from database data using a natural language processing model.
In some embodiments, the knowledge-graph generation module 500 is configured to: configuring a data source address list, a starting page number, an ending page number and acquisition time; according to the acquisition time, news data determined by a data source address list, a starting page number and an ending page number are automatically extracted; and analyzing to obtain the title and text data in the news data, and storing the title and text data in the database.
In the embodiment, data source data are automatically collected, events are automatically processed through natural language processing, entities obtained through analysis and relationships among the entities are stored in the knowledge graph, so that the entities or the relationships related to the events can be displayed to users in a knowledge graph mode, and the users can be helped to quickly obtain related event information.
Fig. 6 shows a schematic structural diagram of a data query device according to another embodiment of the present disclosure. As shown in fig. 6, the data query apparatus 60 of this embodiment includes: a memory 610 and a processor 620 coupled to the memory 610, the processor 620 being configured to execute the data query method of any of the foregoing embodiments based on instructions stored in the memory 610.
Memory 610 may include, for example, system memory, fixed non-volatile storage media, and the like. The system memory stores, for example, an operating system, an application program, a Boot Loader (Boot Loader), and other programs.
The data query apparatus 60 may further include an input-output interface 630, a network interface 640, a storage interface 650, and the like. These interfaces 630, 640, 650 and the connections between the memory 610 and the processor 620 may be through a bus 660, for example. The input/output interface 630 provides a connection interface for input/output devices such as a display, a mouse, a keyboard, and a touch screen. The network interface 640 provides a connection interface for various networking devices. The storage interface 650 provides a connection interface for external storage devices such as an SD card and a usb disk.
The present disclosure also includes a computer readable storage medium having stored thereon computer instructions which, when executed by a processor, implement the data query method in any of the foregoing embodiments.
As will be appreciated by one skilled in the art, embodiments of the present disclosure may be provided as a method, system, or computer program product. Accordingly, the present disclosure may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present disclosure may take the form of a computer program product embodied on one or more computer-usable non-transitory storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present disclosure is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the disclosure. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The above description is only exemplary of the present disclosure and is not intended to limit the present disclosure, so that any modification, equivalent replacement, or improvement made within the spirit and principle of the present disclosure should be included in the scope of the present disclosure.

Claims (6)

1. A method of data query, comprising:
collecting database data;
extracting data keywords from database data by using a natural language processing model;
taking the data key words with the word frequency-reverse document frequency higher than a first threshold value as entities;
extracting relationships between the entities from database data using a natural language processing model;
generating a knowledge graph according to entities and the relationship between the entities;
receiving query information input by a user;
carrying out natural language processing on query information input by a user, and analyzing to obtain query keywords;
determining a starting query entity and an ending query entity corresponding to a query keyword in a knowledge graph;
and returning the data chain which takes the initial query entity as an initial node and takes the end query entity as an end node in the knowledge graph to the user according to the descending order of the priority of the data chain, wherein the priority of the data chain is in positive correlation with the relation number included in the data chain.
2. The data query method of claim 1, wherein the collecting database data comprises:
configuring a data source address list, a starting page number, an ending page number and acquisition time;
according to the acquisition time, news data determined by a data source address list, a starting page number and an ending page number are automatically extracted;
and analyzing to obtain the title and text data in the news data, and storing the title and text data in a database.
3. A data query apparatus, comprising:
a knowledge-graph generation module configured to: collecting database data; extracting data keywords from database data by using a natural language processing model; taking the data key words with the word frequency-reverse document frequency higher than a first threshold value as entities; extracting relationships between the entities from database data using a natural language processing model; generating a knowledge graph according to entities and the relationship between the entities;
the information receiving module is configured to receive query information input by a user;
the keyword analysis module is configured to perform natural language processing on query information input by a user and analyze the query information to obtain a query keyword;
the entity determining module is configured to determine a starting query entity and an ending query entity corresponding to the query keyword in the knowledge graph;
and the data returning module is configured to return the data chain which takes the initial query entity as an initial node and takes the end query entity as an end node in the knowledge graph to the user according to the sequence of the priorities of the data chain from large to small, wherein the priority of the data chain is in positive correlation with the relation number included in the data chain.
4. The data query apparatus of claim 3, wherein the knowledge-graph generation module is configured to:
configuring a data source address list, a starting page number, an ending page number and acquisition time;
according to the acquisition time, news data determined by a data source address list, a starting page number and an ending page number are automatically extracted;
and analyzing to obtain the title and text data in the news data, and storing the title and text data in a database.
5. A data query apparatus, comprising:
a memory; and
a processor coupled to the memory, the processor configured to perform the data query method of claim 1 or 2 based on instructions stored in the memory.
6. A computer-readable storage medium, wherein the computer-readable storage medium stores computer instructions which, when executed by a processor, implement the data query method of claim 1 or 2.
CN201810647001.3A 2018-06-22 2018-06-22 Data query method and device and computer readable storage medium Active CN108829858B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810647001.3A CN108829858B (en) 2018-06-22 2018-06-22 Data query method and device and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810647001.3A CN108829858B (en) 2018-06-22 2018-06-22 Data query method and device and computer readable storage medium

Publications (2)

Publication Number Publication Date
CN108829858A CN108829858A (en) 2018-11-16
CN108829858B true CN108829858B (en) 2021-09-17

Family

ID=64143010

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810647001.3A Active CN108829858B (en) 2018-06-22 2018-06-22 Data query method and device and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN108829858B (en)

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109670049A (en) * 2018-11-19 2019-04-23 平安科技(深圳)有限公司 Map path query method, apparatus, computer equipment and storage medium
CN109766446A (en) * 2018-12-13 2019-05-17 平安科技(深圳)有限公司 A kind of data survey method, data survey device and computer readable storage medium
CN109710738A (en) * 2018-12-24 2019-05-03 广州天鹏计算机科技有限公司 Drug inquiry method, apparatus, system, computer equipment and storage medium
CN109658208A (en) * 2019-01-15 2019-04-19 京东方科技集团股份有限公司 Recommended method, device, medium and the electronic equipment of drug
CN109918436B (en) * 2019-03-08 2022-12-20 麦博(上海)健康科技有限公司 Medical knowledge management and query system
CN110119463A (en) * 2019-04-04 2019-08-13 厦门快商通信息咨询有限公司 Information processing method, device, equipment and storage medium
CN110187678B (en) * 2019-04-19 2021-11-05 广东省智能制造研究所 Information storage and digital application system of processing equipment in manufacturing industry
CN110209930A (en) * 2019-04-30 2019-09-06 平安科技(深圳)有限公司 The investee's analysis method and device of knowledge based map
CN110502645B (en) * 2019-08-28 2022-07-08 中国联合网络通信集团有限公司 Information query method and device
CN110765295A (en) * 2019-09-06 2020-02-07 中国平安财产保险股份有限公司 Graph database-based query method and device, computer equipment and storage medium
CN110688490A (en) * 2019-09-19 2020-01-14 上海数禾信息科技有限公司 Method and device for configuring knowledge graph
CN110674112A (en) * 2019-09-23 2020-01-10 北京百分点信息科技有限公司 Data query method and device and electronic equipment
CN110928984A (en) * 2019-09-30 2020-03-27 珠海格力电器股份有限公司 Knowledge graph construction method and device, terminal and storage medium
CN112035506A (en) * 2019-10-28 2020-12-04 竹间智能科技(上海)有限公司 Semantic recognition method and equipment
CN111061750A (en) * 2019-12-17 2020-04-24 Oppo广东移动通信有限公司 Query processing method and device and computer readable storage medium
CN111581188A (en) * 2020-01-20 2020-08-25 全息空间(深圳)智能科技有限公司 Live broadcast platform personal data tracking method, system and storage medium
CN111538863B (en) * 2020-03-19 2023-08-18 北京完美知识科技有限公司 Data display method, device, system, storage medium and electronic device
CN111475624B (en) * 2020-04-13 2023-08-04 杭州海康威视系统技术有限公司 Monitoring data retrieval method, device and equipment
CN112818092B (en) * 2020-04-20 2023-08-11 腾讯科技(深圳)有限公司 Knowledge graph query statement generation method, device, equipment and storage medium
CN111666746B (en) * 2020-06-05 2023-09-29 中国银行股份有限公司 Conference summary generation method and device, electronic equipment and storage medium
CN111859969A (en) * 2020-07-20 2020-10-30 航天科工智慧产业发展有限公司 Data analysis method and device, electronic equipment and storage medium
CN111984797A (en) * 2020-09-07 2020-11-24 中国银行股份有限公司 Customer identity recognition device and method
CN112597105A (en) * 2020-12-26 2021-04-02 中国农业银行股份有限公司 Processing method of file associated object, server side equipment and storage medium
CN112711716A (en) * 2021-01-25 2021-04-27 广东工业大学 Knowledge graph-based marine industry news pushing method and system
CN112948547B (en) * 2021-01-26 2024-04-09 中国石油大学(北京) Logging knowledge graph construction query method, device, equipment and storage medium
CN112988950B (en) * 2021-03-12 2023-10-13 成都数联铭品科技有限公司 Front-end rendering method and system of knowledge graph, electronic equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103870720A (en) * 2014-03-19 2014-06-18 中国人民解放军国防科学技术大学 Prediction method and device for protein signal transduction subnet
CN106250393A (en) * 2016-07-13 2016-12-21 广州安望信息科技有限公司 The short text understanding method of a kind of knowledge based collection of illustrative plates and device
CN107341215A (en) * 2017-06-07 2017-11-10 北京航空航天大学 A kind of vertical knowledge mapping classification ensemble querying method of multi-source based on Distributed Computing Platform
CN107368509A (en) * 2012-12-31 2017-11-21 脸谱公司 Communication means, communication system and computer-readable non-transitory storage medium
CN107545000A (en) * 2016-06-28 2018-01-05 百度在线网络技术(北京)有限公司 The information-pushing method and device of knowledge based collection of illustrative plates
CN107944025A (en) * 2017-12-12 2018-04-20 北京百度网讯科技有限公司 Information-pushing method and device

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8880521B2 (en) * 2004-09-15 2014-11-04 3Degrees Llc Collections of linked databases
KR101593720B1 (en) * 2011-12-27 2016-02-17 한국전자통신연구원 Contents search recommendation apparatus and method based on semantic network
US20130311242A1 (en) * 2012-05-21 2013-11-21 International Business Machines Corporation Business Process Analytics
BR112015001467A2 (en) * 2012-07-23 2017-07-04 Facebook Inc structured search based on social chart information
CN104462504A (en) * 2014-12-19 2015-03-25 北京奇虎科技有限公司 Method and device for providing reasoning process data in search
CN107783973B (en) * 2016-08-24 2022-02-25 慧科讯业有限公司 Method, device and system for monitoring internet media event based on industry knowledge map database
CN106713083B (en) * 2016-11-24 2020-06-26 海信集团有限公司 Intelligent household equipment control method, device and system based on knowledge graph
CN106919655B (en) * 2017-01-24 2020-05-19 网易(杭州)网络有限公司 Answer providing method and device
CN106919671B (en) * 2017-02-20 2020-06-05 广东省中医院 Intelligent system for traditional Chinese medicine text case mining and assistant decision making
CN107515951A (en) * 2017-09-20 2017-12-26 广东中标数据科技股份有限公司 A kind of searching method based on graphic data base, system and device

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107368509A (en) * 2012-12-31 2017-11-21 脸谱公司 Communication means, communication system and computer-readable non-transitory storage medium
CN103870720A (en) * 2014-03-19 2014-06-18 中国人民解放军国防科学技术大学 Prediction method and device for protein signal transduction subnet
CN107545000A (en) * 2016-06-28 2018-01-05 百度在线网络技术(北京)有限公司 The information-pushing method and device of knowledge based collection of illustrative plates
CN106250393A (en) * 2016-07-13 2016-12-21 广州安望信息科技有限公司 The short text understanding method of a kind of knowledge based collection of illustrative plates and device
CN107341215A (en) * 2017-06-07 2017-11-10 北京航空航天大学 A kind of vertical knowledge mapping classification ensemble querying method of multi-source based on Distributed Computing Platform
CN107944025A (en) * 2017-12-12 2018-04-20 北京百度网讯科技有限公司 Information-pushing method and device

Also Published As

Publication number Publication date
CN108829858A (en) 2018-11-16

Similar Documents

Publication Publication Date Title
CN108829858B (en) Data query method and device and computer readable storage medium
CN106649260B (en) Product characteristic structure tree construction method based on comment text mining
WO2019174132A1 (en) Data processing method, server and computer storage medium
JP6247292B2 (en) Query expansion
TW201514845A (en) Title and body extraction from web page
CN103544255A (en) Text semantic relativity based network public opinion information analysis method
CN111026965B (en) Hot topic tracing method and device based on knowledge graph
CN112000790B (en) Legal text accurate retrieval method, terminal system and readable storage medium
CN109783815A (en) A kind of various dimensions network public-opinion big data comparative analysis method
CN109101551B (en) Question-answer knowledge base construction method and device
WO2014114175A1 (en) Method and apparatus for providing search engine tags
CN113190687A (en) Knowledge graph determining method and device, computer equipment and storage medium
Mehta et al. DOM tree based approach for web content extraction
CN114861677A (en) Information extraction method, information extraction device, electronic equipment and storage medium
CN104484379A (en) Method and device for determining relation among musical entities and inquiry processing method and device
KR20170043365A (en) Important precedents extraction and sorting method using Big Data
KR100557874B1 (en) Method of scientific information analysis and media that can record computer program thereof
CN112711716A (en) Knowledge graph-based marine industry news pushing method and system
CN111984797A (en) Customer identity recognition device and method
CN107748755A (en) Synonym method for digging, device, equipment and computer-readable recording medium
KR102025813B1 (en) Device and method for chronological big data curation system
US9785404B2 (en) Method and system for analyzing data in artifacts and creating a modifiable data network
Kurmi et al. Text summarization using enhanced MMR technique
CN113434607A (en) Behavior analysis method and device based on graph data, electronic equipment and storage medium
CN113407678A (en) Knowledge graph construction method, device and equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: Room 221, 2nd floor, Block C, 18 Kechuang 11th Street, Beijing Daxing District, Beijing

Applicant after: JINGDONG DIGITAL TECHNOLOGY HOLDINGS Co.,Ltd.

Address before: Room 221, 2nd floor, Block C, 18 Kechuang 11th Street, Beijing Daxing District, Beijing

Applicant before: BEIJING JINGDONG FINANCIAL TECHNOLOGY HOLDING Co.,Ltd.

CB02 Change of applicant information
GR01 Patent grant
GR01 Patent grant
CP01 Change in the name or title of a patent holder

Address after: Room 221, 2 / F, block C, 18 Kechuang 11th Street, Daxing District, Beijing, 100176

Patentee after: Jingdong Technology Holding Co.,Ltd.

Address before: Room 221, 2 / F, block C, 18 Kechuang 11th Street, Daxing District, Beijing, 100176

Patentee before: Jingdong Digital Technology Holding Co.,Ltd.

Address after: Room 221, 2 / F, block C, 18 Kechuang 11th Street, Daxing District, Beijing, 100176

Patentee after: Jingdong Digital Technology Holding Co.,Ltd.

Address before: Room 221, 2 / F, block C, 18 Kechuang 11th Street, Daxing District, Beijing, 100176

Patentee before: JINGDONG DIGITAL TECHNOLOGY HOLDINGS Co.,Ltd.

CP01 Change in the name or title of a patent holder