CN103530348A - Integrated data query method based on internet - Google Patents
Integrated data query method based on internet Download PDFInfo
- Publication number
- CN103530348A CN103530348A CN201310467428.2A CN201310467428A CN103530348A CN 103530348 A CN103530348 A CN 103530348A CN 201310467428 A CN201310467428 A CN 201310467428A CN 103530348 A CN103530348 A CN 103530348A
- Authority
- CN
- China
- Prior art keywords
- navigation
- internet
- data
- user
- query result
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/954—Navigation, e.g. using categorised browsing
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Radar, Positioning & Navigation (AREA)
- Remote Sensing (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention relates to an integrated data query method based on the internet. The integrated data query method is characterized by including the following steps that (1), a navigation agent creates a foundation picture of dataset navigation; (2), a user inputs a route expression to the navigation agent; (3), the navigation agent performs transferring, skipping, seeking and other operation in an iterative mode according to the route expression; (4), the navigation agent outputs obtained internet data results to the user. Semi-automatic semantic navigation based on internet data is achieved through the method, and good navigation capacity is provided for queries. The user can limit rules and processes of navigation through the set route expression. The queries can be performed through the navigation to obtain an internet dataset wanted by the user, some reasoning work can be done, and the novel method is provided for the internet data to perform the queries and reasoning.
Description
Technical field
The present invention relates to information retrieval field, relate in particular to a kind of integrated data querying method based on internet.
Background technology
Internet data is by a large amount of interconnected RDF(resource description frameworks) scheme to form.Internet data has four rules: (1) is the name as things with URI; (2) use HTTP URI; (3) in RDF, provide Useful Information; (4) can be included in the RDF link of other URI.Internet data is seen as the distributed database of an opening on internet.Internet data can represent with RDF data set, and RDF data set is the set of a RDF name figure.RDF figure both can be expressed as the set of RDF tlv triple (s, p, o), can be expressed as again the digraph of tape label
To the internet data of magnanimity be processed and be applied, first will solve the inquiry problem of internet data.W3C recommendation SPARQL carries out the inquiry of semantic web data.The data layout of RDF is the digraph of tape label, so SPARQL is a kind of figure matching inquiry language.Its syntactic class of SPARQL is similar to SQL, and SPARQL inquiry is comprised of three parts.Pattern match part, modified result part and result output.The result type of SPARQL inquiry comprises: whether be the variate-value of match pattern and the new RDF data that are comprised of these values, the description of resource etc.
Because RDF is a kind of digraph of tape label, people more and more recognize, to tree-like and data (as XML and RDF) graph structure, homing capability is very important.Yet the support that SPARQL provides aspect navigation is considerably less, this has just caused the difficulty of some inquiries.Therefore we consider to provide with path expression the homing capability of inquiry.
Summary of the invention
For addressing the above problem, the invention provides the integrated data querying method based on internet that a kind of homing capability is strong.
In order to achieve the above object, the technical solution used in the present invention is: a kind of integrated data querying method based on internet, it is characterized in that, and step is as follows:
(1) navigation agency creates the foundation drawing of data set navigation;
(2) user wizard boat is acted on behalf of input path expression formula;
(3) navigation agency is according to path expression, the operations such as the shifting of iteration, redirect, searching, and concrete operation step is:
Connect with database broker device;
Send querying condition to described database broker device;
The result being obtained based on described querying condition respectively by database server described in each that each database server obtains in described data-base cluster that reception is returned from described database broker device, as the first Query Result;
Described the first Query Result is stored to specified database server;
Reception by specified database server, based on grouping querying condition and stored data, returned to described the first Query Result according to the Query Result imposing a condition after grouping;
(4) navigation agency exports to user by the internet data result of acquisition.
Further, described navigation agency is the data management server of programming language framework.
Invention thought of the present invention is: user is according to the internet data content of wanting inquiry, and navigation is acted on behalf of the path expression rule of submitting to RPath language description, and the constraint conditions such as node of internet data navigation beginning.
Navigation agency, is internet data inquiry and the inference engine based on path expression, and it can inquire the content that user wants automatically on internet data, and the inference rule that can provide according to user is carried out reasoning.Navigation agency is according to the given path expression of step (2), and from given starting point set, the foundation drawing of data set navigation is on the internet found the path of coupling, carries out navigation procedure, obtains inquiry and the reasoning results.
RPath language comprises following rule:
Step: in the figure of from (g, m) to (g, n), shift, make for some tlv triple t ∈ g, m, n ∈ Term (t), is denoted as (g, t).G presentation graphs wherein, m, the node in n presentation graphs, t represents a tlv triple, Term (t) represents an internet data tlv triple.Self is a kind of special Step, and its start node is same node, is the redirect of self arriving self;
Jump: from (g, n) to (g ', shifting across figure n), represents to jump to figure g ' from figure g, is denoted as (g, (n, JUMP, g ')).Wherein g and g ' represent two figure;
With p, represent reverse transition;
The regular expression of use standard is such as, Union (' | '), Sequence ('/'), and Star (' * '), represents compound expression.
Fold:folding Operations Analyst set of paths, forms a new path according to folding rule the packing of some paths, is denoted as f=(Pe=> p).Wherein Pe is nested path expression, and p is the property value in path.
By describing navigation procedure with drag: position p and one be take the set of p as starting point path.Path be a sequence path (p0, t1, p1 ..., tn, pn), p0 wherein, p1 ..., pn is position, ti is the transfer from pi1 to pi, 1≤i≤n.
Path expression Pe is used for explaining navigation procedure: (p, Pe) valuation in data set d, be denoted as Evald (p, Pe), and be from node p, the set of paths being produced by the navigation procedure of Pe appointment; P is under Pe rule, and arrived in the set in data set d, is denoted as Reachd (p, Pe), is included in the distal point in all paths in Evald (p, Pe).
Rule about RPath language can change form below into above,
(1) Step: be a transfer that another summit is arrived on summit in figure, transfer can be used
A limit in figure represents, s wherein, and o is the summit in figure, p represents a limit from s to o.S is corresponding to the subject in internet data tlv triple, and o is corresponding to object, and p is corresponding to predicate;
(2)
∩
Represent and relation,
∪
Represent or close;
(3)Fold:
, represent can release regular r from regular t, wherein t uses
∩
With
∪
A plurality of rules of link, we claim regular t fold to claim regular r.
Technical advantage of the present invention is: this method has realized the semi-automatic semantic navigation based on internet data, and provides good homing capability for inquiring about.User can retrain by given path expression formula rule and the process of navigation.By navigation, can inquire about and obtain the internet data collection that user wants, and can do some reasoning work, for inquiry and the reasoning of internet data provides a kind of new method.
Accompanying drawing explanation
Fig. 1 is the workflow schematic diagram of the present embodiment.
Embodiment
Below in conjunction with the drawings and specific embodiments, invention is described further.
With reference to figure 1, a kind of integrated data querying method based on internet, step is as follows:
(1) navigation agency creates the foundation drawing of data set navigation;
(2) user wizard boat is acted on behalf of input path expression formula;
(3) navigation agency is according to path expression, the operations such as the shifting of iteration, redirect, searching, and concrete operation step is:
Connect with database broker device;
Send querying condition to described database broker device;
The result being obtained based on described querying condition respectively by database server described in each that each database server obtains in described data-base cluster that reception is returned from described database broker device, as the first Query Result;
Described the first Query Result is stored to specified database server;
Reception by specified database server, based on grouping querying condition and stored data, returned to described the first Query Result according to the Query Result imposing a condition after grouping;
(4) navigation agency exports to user by the internet data result of acquisition.
For example, we wish according to FOAF information (friends information), and bibliography information and book review infer that who may buy the books of artificial intelligence professional domain.Concrete implementation step is as follows:
(1) scope of user's inquiry is on internet data collection M;
(2) user wishes to inquire about the book that Mia may buy.User is converted into navigation constraint rule by inquiry Inference Conditions, and sets Mia for navigation starting point;
A) we think that this this book is field D if a people writes, comments on or bought a book, and we think that he is interested in this field, arrive especially artificial intelligence (AI) field, are converted into navigation constraint rule to be:
B) we think if a people A is interested in certain field D, and this people knows that E is the expert of field D, and, on the D of field, A believes E.Be converted into path expression:
∩
C) if a people A of our hypothesis believes expert E at field D, and expert E commented on product P, and A may buy product P.Be converted into path expression:
(3) navigation agent apparatus is made as sky by result set, starts navigation procedure;
(4) limit starting for starting point Mia
, belong to navigation rule (a), add results set;
(5) each subgraph to results set, the summit book1 of subgraph is identical with summit in Bibliographical Information, carries out jump operation, and the summit book1 changing in Bibliographical Information is new starting point; To the limit starting with starting point book1
, belong to navigation rule (a), add results set.
(6) repeat the process of back, can be by
Add results set.
(7) navigation agent apparatus contrasts each bar navigation constraint rule successively.
With
Meet rule (a), release
Add results set;
With
Meet rule (b), release
Add result combination;
With
Meet rule (c), release
(8) user inquires about the book that Mia may buy, and navigation agent apparatus meets results set the result of buy condition
Submit to user, Mia may buy book1 and book2.
The present invention is not limited only to the protection domain shown in above-described embodiment, and all invention thought based on the present embodiment, all in protection scope of the present invention.
Claims (2)
1. the integrated data querying method based on internet, is characterized in that, step is as follows:
(1) navigation agency creates the foundation drawing of data set navigation;
(2) user wizard boat is acted on behalf of input path expression formula;
(3) navigation agency is according to path expression, and the operations such as the shifting of iteration, redirect, searching, specifically comprise:
Connect with database broker device;
Send querying condition to described database broker device;
The result being obtained based on described querying condition respectively by database server described in each that each database server obtains in described data-base cluster that reception is returned from described database broker device, as the first Query Result;
Described the first Query Result is stored to specified database server;
Reception by specified database server, based on grouping querying condition and stored data, returned to described the first Query Result according to the Query Result imposing a condition after grouping;
(4) navigation agency exports to user by the internet data result of acquisition.
2. the integrated data querying method based on internet according to claim 1, is characterized in that: described navigation agency is the data management server of programming language framework.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310467428.2A CN103530348A (en) | 2013-11-12 | 2013-11-12 | Integrated data query method based on internet |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310467428.2A CN103530348A (en) | 2013-11-12 | 2013-11-12 | Integrated data query method based on internet |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103530348A true CN103530348A (en) | 2014-01-22 |
Family
ID=49932357
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310467428.2A Pending CN103530348A (en) | 2013-11-12 | 2013-11-12 | Integrated data query method based on internet |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103530348A (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5943667A (en) * | 1997-06-03 | 1999-08-24 | International Business Machines Corporation | Eliminating redundancy in generation of association rules for on-line mining |
CN102142034A (en) * | 2011-03-30 | 2011-08-03 | 浙江大学 | Internet data query method |
CN102609463A (en) * | 2012-01-13 | 2012-07-25 | 广东电网公司电力科学研究院 | Data cluster management system based on quasi-realtime platform |
CN103116655A (en) * | 2013-03-06 | 2013-05-22 | 亿赞普(北京)科技有限公司 | Clustered data query method, client side and system |
-
2013
- 2013-11-12 CN CN201310467428.2A patent/CN103530348A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5943667A (en) * | 1997-06-03 | 1999-08-24 | International Business Machines Corporation | Eliminating redundancy in generation of association rules for on-line mining |
CN102142034A (en) * | 2011-03-30 | 2011-08-03 | 浙江大学 | Internet data query method |
CN102609463A (en) * | 2012-01-13 | 2012-07-25 | 广东电网公司电力科学研究院 | Data cluster management system based on quasi-realtime platform |
CN103116655A (en) * | 2013-03-06 | 2013-05-22 | 亿赞普(北京)科技有限公司 | Clustered data query method, client side and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR100815563B1 (en) | System and method for knowledge extension and inference service based on DBMS | |
Mäkelä | Survey of semantic search research | |
US20060265352A1 (en) | Methods and apparatus for information integration in accordance with web services | |
US8775356B1 (en) | Query enhancement of semantic wiki for improved searching of unstructured data | |
Sure et al. | On-To-knowledge: semantic web-enabled knowledge management | |
Ceri et al. | Search computing challenges and directions | |
Khan et al. | Categorization of knowledge graph based recommendation methods and benchmark datasets from the perspectives of application scenarios: A comprehensive survey | |
Pohorec et al. | Analysis of approaches to structured data on the web | |
CN102722569B (en) | Knowledge discovery device based on path migration of RDF (Resource Description Framework) picture and method | |
CN103605655A (en) | Shared data inquiring method based on Internet | |
CN102142034A (en) | Internet data query method | |
CN103577538A (en) | Key value data query method based on internet | |
US20150074121A1 (en) | Semantics graphs for enterprise communication networks | |
Ma et al. | RDF keyword search by query computation | |
CN103530348A (en) | Integrated data query method based on internet | |
CN103605654A (en) | Historical data inquiring method based on Internet | |
Grahne et al. | Preferentially annotated regular path queries | |
Mugnier et al. | On querying simple conceptual graphs with negation | |
Bill et al. | On Synergies between Model Transformations and Semantic Web Technologies. | |
Raji et al. | RDF approach on social network analysis | |
Wang et al. | A dataflow-pattern-based recommendation approach for data service mashups | |
Haller et al. | Enabling Exploratory Search on Manufacturing Knowledge Graphs. | |
Ranganathan et al. | Relation based Semantic web search engine | |
Xu et al. | Using social networks to organize researcher community | |
Wang et al. | Interactive recommendation of composition operators for situational data integration |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20140122 |