CN103530348A - Integrated data query method based on internet - Google Patents

Integrated data query method based on internet

Info

Publication number
CN103530348A
Authority
CN
China
Prior art keywords
navigation
internet
data
user
query result
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310467428.2A
Other languages
Chinese (zh)
Inventor
王萌
唐新来
李健军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LIUZHOU HONGYI TECHNOLOGY Co Ltd
Original Assignee
LIUZHOU HONGYI TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LIUZHOU HONGYI TECHNOLOGY Co Ltd filed Critical LIUZHOU HONGYI TECHNOLOGY Co Ltd
Priority to CN201310467428.2A priority Critical patent/CN103530348A/en
Publication of CN103530348A publication Critical patent/CN103530348A/en
Pending legal-status Critical Current

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • G06F16/954Navigation, e.g. using categorised browsing

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to an integrated data query method based on the internet. The method is characterized by the following steps: (1) a navigation agent creates the base graph for dataset navigation; (2) a user inputs a path expression to the navigation agent; (3) the navigation agent iteratively performs transfer, jump, search and other operations according to the path expression; (4) the navigation agent outputs the obtained internet data results to the user. The method achieves semi-automatic semantic navigation over internet data and gives queries good navigation capability. The user can constrain the rules and the process of navigation through the given path expression. Navigation can be used to query for the internet dataset the user wants and to perform some inference, so the method provides a new way to query and reason over internet data.

Description

An integrated data query method based on the internet
Technical field
The present invention relates to the field of information retrieval, and in particular to an integrated data query method based on the internet.
Background art
Internet data is composed of a large number of interlinked RDF (Resource Description Framework) graphs. Internet data follows four rules: (1) use URIs as names for things; (2) use HTTP URIs; (3) provide useful information in RDF; (4) include RDF links to other URIs. Internet data can therefore be regarded as an open, distributed database on the internet. It can be represented as an RDF dataset, which is a set of RDF named graphs. An RDF graph can be expressed either as a set of RDF triples (s, p, o) or as a labeled directed graph in which each triple is an edge $s \xrightarrow{p} o$.
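The two views of an RDF graph described above (a set of triples and a labeled directed graph) can be sketched as follows. This is a minimal illustration only: the triples, including the `knows` edge, and the helper name `to_labeled_digraph` are assumptions made for the example and do not come from the patent.

```python
from collections import defaultdict

# Hypothetical example triples (subject, predicate, object); illustrative only.
triples = {
    ("Mia", "knows", "Joe"),
    ("Mia", "review", "book1"),
    ("book1", "isa", "AIbook"),
}

def to_labeled_digraph(triples):
    """Build an adjacency map: subject -> list of (predicate label, object)."""
    graph = defaultdict(list)
    for s, p, o in sorted(triples):  # sorted only to make the output deterministic
        graph[s].append((p, o))
    return dict(graph)

graph = to_labeled_digraph(triples)
```

Each key of `graph` is a vertex, and each `(predicate, object)` pair is one outgoing labeled edge, so the same data can be read either triple by triple or edge by edge.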
To process and apply massive internet data, the problem of querying that data must be solved first. The W3C recommends SPARQL for querying Semantic Web data. Because the RDF data model is a labeled directed graph, SPARQL is a graph-matching query language. Its syntax resembles SQL, and a SPARQL query consists of three parts: pattern matching, solution modification, and result output. The result types of a SPARQL query include a yes/no answer to whether a pattern matches, the variable values that match the pattern, new RDF data constructed from those values, and descriptions of resources.
Because RDF is a labeled directed graph, it is increasingly recognized that navigation capability is very important for tree-structured and graph-structured data (such as XML and RDF). However, SPARQL provides very little support for navigation, which makes some queries difficult. We therefore use path expressions to give queries navigation capability.
Summary of the invention
To solve the above problem, the invention provides an integrated data query method based on the internet with strong navigation capability.
To achieve the above object, the technical solution adopted by the invention is an integrated data query method based on the internet, characterized by the following steps:
(1) the navigation agent creates the base graph for dataset navigation;
(2) the user inputs a path expression to the navigation agent;
(3) the navigation agent iteratively performs transfer, jump, search and other operations according to the path expression; the specific operations are:
establishing a connection with a database proxy device;
sending a query condition to the database proxy device;
receiving, from the database proxy device, the results that each database server in the database cluster obtained based on the query condition, as the first query result;
storing the first query result on a designated database server;
receiving the query result returned by the designated database server, in which the first query result has been grouped according to set conditions, based on a grouping query condition and the stored data;
(4) the navigation agent outputs the obtained internet data results to the user.
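The sub-steps of step (3) describe a scatter-gather query followed by a grouped query. A minimal in-memory sketch under stated assumptions: the class `DatabaseProxy`, the function `grouped_query`, the row layout and the sample rows are all hypothetical, and real database servers are modelled as plain Python lists.

```python
from collections import defaultdict

class DatabaseProxy:
    """Hypothetical stand-in for the database proxy device: fans a query out to every server."""

    def __init__(self, servers):
        self.servers = servers  # each server is modelled as a list of row dicts

    def query(self, condition):
        """Collect the rows matching `condition` from every server (the first query result)."""
        first_result = []
        for rows in self.servers:
            first_result.extend(row for row in rows if condition(row))
        return first_result

def grouped_query(stored_rows, group_key):
    """Group the stored first query result by one key (the grouping query condition)."""
    groups = defaultdict(list)
    for row in stored_rows:
        groups[row[group_key]].append(row)
    return dict(groups)

# Two hypothetical servers in the cluster, each holding part of the data.
servers = [
    [{"s": "Mia", "p": "review", "o": "book1"}, {"s": "Mia", "p": "knows", "o": "Joe"}],
    [{"s": "Joe", "p": "review", "o": "book1"}, {"s": "Joe", "p": "write", "o": "book2"}],
]
proxy = DatabaseProxy(servers)
first_result = proxy.query(lambda row: row["p"] in {"review", "write"})
designated_server = list(first_result)  # "store" the first result on the designated server
grouped = grouped_query(designated_server, "s")
```

Storing the first result before grouping mirrors the order of the sub-steps: the grouping runs on one designated server rather than on every server in the cluster.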
Further, the navigation agent is a data management server implemented on a programming language framework.
The idea of the invention is as follows: according to the internet data content to be queried, the user submits to the navigation agent path expression rules described in the RPath language, together with constraints such as the node at which navigation of the internet data begins.
The navigation agent is a query and inference engine for internet data based on path expressions. It can automatically find the content the user wants in internet data and can perform inference according to inference rules supplied by the user. Given the path expression from step (2) and a given set of starting points, the navigation agent searches the base graph of the dataset for matching paths, carries out the navigation process, and obtains the query and inference results.
The RPath language comprises the following rules:
Step: a transfer within a graph from (g, m) to (g, n) such that, for some triple t ∈ g, m, n ∈ Term(t); denoted (g, t). Here g denotes a graph, m and n denote nodes in the graph, t denotes a triple, and Term(t) denotes the terms of the internet data triple t. Self is a special Step whose start node and end node are the same node, that is, a transfer from a node to itself;
Jump: a cross-graph transfer from (g, n) to (g′, n), that is, a jump from graph g to graph g′ at node n; denoted (g, (n, JUMP, g′)), where g and g′ denote two graphs;
The inverse of p denotes the reverse transfer;
Compound expressions are formed with the standard regular-expression operators, such as Union ('|'), Sequence ('/') and Star ('*');
Fold: the folding operation analyzes a set of paths and packs several paths into one new path according to a folding rule; denoted f = (Pe ⇒ p), where Pe is a nested path expression and p is the property value of the new path.
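The Step, Jump, reverse and compound rules above can be sketched as a small evaluator over named graphs. This is an assumption-laden illustration, not the RPath implementation: the nested-tuple syntax, the function name `reach` and the graph names `reviews` and `biblio` are hypothetical, and a Step is simplified to "follow one predicate from subject to object".

```python
def reach(dataset, positions, expr):
    """Return the set of positions (graph, node) reachable from `positions` under `expr`."""
    kind = expr[0]
    if kind == "self":   # special Step from a node to itself
        return set(positions)
    if kind == "step":   # follow predicate expr[1] from subject to object
        return {(g, o) for g, n in positions
                for s, p, o in dataset[g] if s == n and p == expr[1]}
    if kind == "inv":    # reverse transfer: from object back to subject
        return {(g, s) for g, n in positions
                for s, p, o in dataset[g] if o == n and p == expr[1]}
    if kind == "jump":   # same node, different graph: (g, n) -> (g', n)
        target = expr[1]
        nodes = {t for s, _, o in dataset[target] for t in (s, o)}
        return {(target, n) for _, n in positions if n in nodes}
    if kind == "seq":    # Sequence '/'
        return reach(dataset, reach(dataset, positions, expr[1]), expr[2])
    if kind == "union":  # Union '|'
        return reach(dataset, positions, expr[1]) | reach(dataset, positions, expr[2])
    if kind == "star":   # Star '*': zero or more repetitions
        seen = set(positions)
        frontier = set(positions)
        while frontier:
            frontier = reach(dataset, frontier, expr[1]) - seen
            seen |= frontier
        return seen
    raise ValueError(f"unknown expression kind: {kind}")

# Hypothetical dataset with two named graphs.
dataset = {
    "reviews": {("Mia", "review", "book1"), ("Joe", "review", "book1")},
    "biblio": {("book1", "isa", "AIbook"), ("Joe", "write", "book2"),
               ("book2", "isa", "AIbook")},
}
expr = ("seq", ("step", "review"), ("seq", ("jump", "biblio"), ("step", "isa")))
result = reach(dataset, {("reviews", "Mia")}, expr)
reviewers = reach(dataset, {("reviews", "book1")}, ("inv", "review"))
```

Here `expr` follows `review` in one graph, jumps to the bibliographic graph at the same node, and then follows `isa`. The Star case uses a frontier loop so that cyclic graphs still terminate.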
The navigation process is described by the following model: a position p and a set of paths that start at p. A path is a sequence $(p_0, t_1, p_1, \ldots, t_n, p_n)$, where $p_0, p_1, \ldots, p_n$ are positions and $t_i$ is the transfer from $p_{i-1}$ to $p_i$, $1 \le i \le n$.
A path expression Pe specifies the navigation process. The evaluation of (p, Pe) over dataset d, denoted $Eval_d(p, Pe)$, is the set of paths produced, starting from node p, by the navigation process that Pe specifies. The set of nodes in dataset d that p can reach under Pe, denoted $Reach_d(p, Pe)$, consists of the end points of all paths in $Eval_d(p, Pe)$.
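The relationship between the path set and the reachable set can be sketched for the simplest kind of path expression, a plain sequence of predicate steps. The function names `eval_paths` and `reach_nodes` and the sample triples are assumptions for illustration; a path is stored as the alternating tuple of positions and transfers described above.

```python
def eval_paths(triples, start, predicates):
    """Eval_d(p, Pe) for a Pe that is a plain sequence of predicate steps.

    Each path is a tuple (p0, t1, p1, ..., tn, pn) where ti is the triple used
    to move from p(i-1) to pi.
    """
    paths = [(start,)]
    for pred in predicates:
        extended = []
        for path in paths:
            for t in sorted(triples):
                s, p, o = t
                if s == path[-1] and p == pred:
                    extended.append(path + (t, o))
        paths = extended
    return paths

def reach_nodes(triples, start, predicates):
    """Reach_d(p, Pe): the end points of all paths in Eval_d(p, Pe)."""
    return {path[-1] for path in eval_paths(triples, start, predicates)}

# Hypothetical sample data.
triples = {
    ("Joe", "write", "book2"),
    ("Joe", "review", "book1"),
    ("book1", "isa", "AIbook"),
    ("book2", "isa", "AIbook"),
}
paths = eval_paths(triples, "Joe", ["write", "isa"])
ends = reach_nodes(triples, "Joe", ["write", "isa"])
```

Keeping the transfers inside each path, rather than only the end points, is what later allows a folding rule to inspect how a result was reached.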
The RPath rules above can also be written in the following form:
(1) Step: a transfer from one vertex of the graph to another. A transfer can be represented by an edge $s \xrightarrow{p} o$ of the graph, where s and o are vertices of the graph and p is an edge from s to o. s corresponds to the subject of an internet data triple, o to the object, and p to the predicate;
(2) $\cap$ denotes the "and" relation, and $\cup$ denotes the "or" relation;
(3) Fold: $t \Rightarrow r$ means that rule r can be derived from rule t, where t consists of several rules linked by $\cap$ and $\cup$; we say that rule t folds into rule r.
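A Fold rule of the form $t \Rightarrow r$, where t is a conjunction of edge patterns, can be sketched as a join over triples. This is only an illustration under assumptions: the `?`-prefixed variable convention and the names `match` and `fold` are hypothetical, and only the "and" connective is handled (Python 3.8 or later is assumed for the `:=` operator).

```python
def match(premise, triple, binding):
    """Try to extend `binding` so that `premise` matches `triple`; return None on failure."""
    new = dict(binding)
    for term, value in zip(premise, triple):
        if term.startswith("?"):                 # variable: bind it or check the old binding
            if new.setdefault(term, value) != value:
                return None
        elif term != value:                      # constant: must be equal
            return None
    return new

def fold(triples, premises, conclusion):
    """Derive `conclusion` for every binding that satisfies all premises (joined by 'and')."""
    bindings = [{}]
    for premise in premises:
        bindings = [b2 for b in bindings for t in triples
                    if (b2 := match(premise, t, b)) is not None]
    # Substitute bound variables into the conclusion; constants are kept as they are.
    return {tuple(b.get(term, term) for term in conclusion) for b in bindings}

# Hypothetical sample data and rule.
triples = {("Mia", "review", "book1"), ("book1", "isa", "AIbook")}
derived = fold(
    triples,
    premises=[("?x", "review", "?b"), ("?b", "isa", "AIbook")],
    conclusion=("?x", "interest", "AIbook"),
)
```

The shared variable `?b` is what links the two premises: only a book that was reviewed and is also an AI book yields a derived `interest` edge.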
The technical advantages of the invention are as follows: the method achieves semi-automatic semantic navigation over internet data and gives queries good navigation capability. The user can constrain the rules and the process of navigation through the given path expression. Navigation can be used to query for the internet dataset the user wants and to perform some inference, so the method provides a new way to query and reason over internet data.
Brief description of the drawings
Fig. 1 is a schematic workflow diagram of the present embodiment.
Detailed description of the embodiment
The invention is further described below with reference to the drawing and a specific embodiment.
With reference to Fig. 1, an integrated data query method based on the internet comprises the following steps:
(1) the navigation agent creates the base graph for dataset navigation;
(2) the user inputs a path expression to the navigation agent;
(3) the navigation agent iteratively performs transfer, jump, search and other operations according to the path expression; the specific operations are:
establishing a connection with a database proxy device;
sending a query condition to the database proxy device;
receiving, from the database proxy device, the results that each database server in the database cluster obtained based on the query condition, as the first query result;
storing the first query result on a designated database server;
receiving the query result returned by the designated database server, in which the first query result has been grouped according to set conditions, based on a grouping query condition and the stored data;
(4) the navigation agent outputs the obtained internet data results to the user.
For example, suppose we wish to infer, from FOAF (friend-of-a-friend) information, bibliographic information and book reviews, who is likely to buy books in the field of artificial intelligence. The specific steps are as follows:
(1) The user's query ranges over the internet dataset M;
(2) The user wishes to find the books that Mia may buy. The user converts the query and inference conditions into navigation constraint rules and sets Mia as the navigation starting point;
a) If a person has written, reviewed or bought a book, and the book belongs to field D, we consider that person interested in field D. Specialized to the artificial intelligence (AI) field, this is converted into the navigation constraint rule:
$person \xrightarrow{review/buy/write} book \cap book \xrightarrow{isa} AIbook \Rightarrow person \xrightarrow{interest} AIbook$
b) If a person A is interested in a field D and knows that E is an expert in field D, then A trusts E in field D. This is converted into the path expression:
$person \xrightarrow{interest} AIbook \cap E \xrightarrow{expertOn} AIbook \Rightarrow person \xrightarrow{trustOnAIbook} E$
c) If a person A trusts expert E in field D, and expert E has reviewed or written product P, then A may buy product P. This is converted into the path expression:
$A \xrightarrow{trustOnAIbook} E \cap E \xrightarrow{review/write} P \Rightarrow A \xrightarrow{buy} P$
(3) The navigation agent initializes the result set to empty and starts the navigation process;
(4) The edge starting from the starting point Mia, $Mia \xrightarrow{review} book1$, matches navigation rule (a) and is added to the result set;
(5) For each subgraph in the result set: the vertex book1 of the subgraph is the same as a vertex in the bibliographic information, so a jump operation is performed and the vertex book1 in the bibliographic information becomes the new starting point. The edge starting from book1, $book1 \xrightarrow{isa} AIbook$, matches navigation rule (a) and is added to the result set;
(6) Repeating the preceding process adds $book2 \xrightarrow{isa} AIbook$, $Joe \xrightarrow{write} book2$, $Joe \xrightarrow{review} book1$ and $Joe \xrightarrow{expertOn} AIbook$ to the result set;
(7) The navigation agent checks each navigation constraint rule in turn:
$Mia \xrightarrow{review} book1$ and $book1 \xrightarrow{isa} AIbook$ satisfy rule (a), so $Mia \xrightarrow{interest} AIbook$ is derived and added to the result set;
$Mia \xrightarrow{interest} AIbook$ and $Joe \xrightarrow{expertOn} AIbook$ satisfy rule (b), so $Mia \xrightarrow{trustOnAIbook} Joe$ is derived and added to the result set;
$Mia \xrightarrow{trustOnAIbook} Joe$, together with $Joe \xrightarrow{review} book1$ and $Joe \xrightarrow{write} book2$, satisfies rule (c), so $Mia \xrightarrow{buy} book1 \cup book2$ is derived;
(8) The user asked which books Mia may buy, so the navigation agent submits the result in the result set that satisfies the buy condition, $Mia \xrightarrow{buy} book1 \cup book2$, to the user: Mia may buy book1 and book2.
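The walk-through above can be reproduced with a small forward-chaining sketch. It is an illustration under assumptions, not the patented navigation agent: the facts are taken from steps (4) to (6), the three rules are hard-coded rather than parsed from path expressions, the names `apply_rules` and `saturate` are hypothetical, and the condition in rule (b) that the person knows the expert is simplified to the presence of an `expertOn` fact.

```python
# Facts collected during navigation in steps (4) to (6) of the example.
facts = {
    ("Mia", "review", "book1"),
    ("book1", "isa", "AIbook"),
    ("book2", "isa", "AIbook"),
    ("Joe", "write", "book2"),
    ("Joe", "review", "book1"),
    ("Joe", "expertOn", "AIbook"),
}

def apply_rules(facts):
    """Apply rules (a), (b), (c) once; return only the newly derived triples."""
    new = set()
    for s, p, o in facts:
        # (a) reviewing, buying or writing an AI book implies interest in AIbook
        if p in {"review", "buy", "write"} and (o, "isa", "AIbook") in facts:
            new.add((s, "interest", "AIbook"))
        # (b) a person interested in AIbook trusts every expert on AIbook
        if p == "interest" and o == "AIbook":
            for e, q, f in facts:
                if q == "expertOn" and f == "AIbook":
                    new.add((s, "trustOnAIbook", e))
        # (c) if the trusted expert reviewed or wrote P, the person may buy P
        if p == "trustOnAIbook":
            for e, q, product in facts:
                if e == o and q in {"review", "write"}:
                    new.add((s, "buy", product))
    return new - facts

def saturate(facts):
    """Repeat rule application until no new triple can be derived."""
    facts = set(facts)
    while True:
        new = apply_rules(facts)
        if not new:
            return facts
        facts |= new

result = saturate(facts)
mia_buys = sorted(o for s, p, o in result if s == "Mia" and p == "buy")
```

`mia_buys` holds book1 and book2, matching step (8). Because the rules as stated apply to any person, the sketch also derives `interest`, `trustOnAIbook` and `buy` facts for Joe; filtering on the starting point Mia selects the answer the user asked for.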
The invention is not limited to the scope shown in the above embodiment; everything based on the inventive idea of this embodiment falls within the scope of protection of the invention.

Claims (2)

1. An integrated data query method based on the internet, characterized in that the steps are as follows:
(1) the navigation agent creates the base graph for dataset navigation;
(2) the user inputs a path expression to the navigation agent;
(3) the navigation agent iteratively performs transfer, jump, search and other operations according to the path expression, specifically comprising:
establishing a connection with a database proxy device;
sending a query condition to the database proxy device;
receiving, from the database proxy device, the results that each database server in the database cluster obtained based on the query condition, as the first query result;
storing the first query result on a designated database server;
receiving the query result returned by the designated database server, in which the first query result has been grouped according to set conditions, based on a grouping query condition and the stored data;
(4) the navigation agent outputs the obtained internet data results to the user.
2. The integrated data query method based on the internet according to claim 1, characterized in that the navigation agent is a data management server implemented on a programming language framework.
CN201310467428.2A 2013-11-12 2013-11-12 Integrated data query method based on internet Pending CN103530348A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310467428.2A CN103530348A (en) 2013-11-12 2013-11-12 Integrated data query method based on internet

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310467428.2A CN103530348A (en) 2013-11-12 2013-11-12 Integrated data query method based on internet

Publications (1)

Publication Number Publication Date
CN103530348A true CN103530348A (en) 2014-01-22

Family

ID=49932357

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310467428.2A Pending CN103530348A (en) 2013-11-12 2013-11-12 Integrated data query method based on internet

Country Status (1)

Country Link
CN (1) CN103530348A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5943667A (en) * 1997-06-03 1999-08-24 International Business Machines Corporation Eliminating redundancy in generation of association rules for on-line mining
CN102142034A (en) * 2011-03-30 2011-08-03 浙江大学 Internet data query method
CN102609463A (en) * 2012-01-13 2012-07-25 广东电网公司电力科学研究院 Data cluster management system based on quasi-realtime platform
CN103116655A (en) * 2013-03-06 2013-05-22 亿赞普(北京)科技有限公司 Clustered data query method, client side and system

Similar Documents

Publication Publication Date Title
KR100815563B1 (en) System and method for knowledge extension and inference service based on DBMS
Mäkelä Survey of semantic search research
US20060265352A1 (en) Methods and apparatus for information integration in accordance with web services
US8775356B1 (en) Query enhancement of semantic wiki for improved searching of unstructured data
Sure et al. On-To-knowledge: semantic web-enabled knowledge management
Ceri et al. Search computing challenges and directions
Khan et al. Categorization of knowledge graph based recommendation methods and benchmark datasets from the perspectives of application scenarios: A comprehensive survey
Pohorec et al. Analysis of approaches to structured data on the web
CN102722569B (en) Knowledge discovery device based on path migration of RDF (Resource Description Framework) picture and method
CN103605655A (en) Shared data inquiring method based on Internet
CN102142034A (en) Internet data query method
CN103577538A (en) Key value data query method based on internet
US20150074121A1 (en) Semantics graphs for enterprise communication networks
Ma et al. RDF keyword search by query computation
CN103530348A (en) Integrated data query method based on internet
CN103605654A (en) Historical data inquiring method based on Internet
Grahne et al. Preferentially annotated regular path queries
Mugnier et al. On querying simple conceptual graphs with negation
Bill et al. On Synergies between Model Transformations and Semantic Web Technologies.
Raji et al. RDF approach on social network analysis
Wang et al. A dataflow-pattern-based recommendation approach for data service mashups
Haller et al. Enabling Exploratory Search on Manufacturing Knowledge Graphs.
Ranganathan et al. Relation based Semantic web search engine
Xu et al. Using social networks to organize researcher community
Wang et al. Interactive recommendation of composition operators for situational data integration

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20140122