JP3671765B2 - Heterogeneous information source query conversion method and apparatus, and storage medium storing heterogeneous information source query conversion program - Google Patents

Heterogeneous information source query conversion method and apparatus, and storage medium storing heterogeneous information source query conversion program Download PDF

Info

Publication number
JP3671765B2
JP3671765B2 JP27114199A JP27114199A JP3671765B2 JP 3671765 B2 JP3671765 B2 JP 3671765B2 JP 27114199 A JP27114199 A JP 27114199A JP 27114199 A JP27114199 A JP 27114199A JP 3671765 B2 JP3671765 B2 JP 3671765B2
Authority
JP
Japan
Prior art keywords
information
query
information source
search condition
description
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
JP27114199A
Other languages
Japanese (ja)
Other versions
JP2001092844A (en
Inventor
紳一郎 瀬尾
光明 綱川
源吾 鈴木
裕一 飯塚
Original Assignee
日本電信電話株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 日本電信電話株式会社 filed Critical 日本電信電話株式会社
Priority to JP27114199A priority Critical patent/JP3671765B2/en
Publication of JP2001092844A publication Critical patent/JP2001092844A/en
Application granted granted Critical
Publication of JP3671765B2 publication Critical patent/JP3671765B2/en
Anticipated expiration legal-status Critical
Application status is Expired - Lifetime legal-status Critical

Links

Images

Description

[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a heterogeneous information source query conversion method and apparatus and a storage medium storing a heterogeneous information source query conversion program, and in particular, in a search for various information sources (related DB, image DB, Web page, etc.) Heterogeneous information source query conversion method that can realize query descriptions for all information sources in a simple single format, without making the user aware of differences in query description methods that are completely different for each information source, and the limitations of each information source Further, the present invention relates to a storage medium storing an apparatus and a different information source inquiry conversion program.
[0002]
[Prior art]
With the spread of open networks in recent years, the movement to connect and share various information sources accumulated so far to the network has become active. However, these information sources are used for each type (relational database, Web page, image database, XML, etc.) according to the purpose, and are managed and operated independently. If the type of information source is different, the access method is naturally different, and the query description is different. If the query description method is different, the query for each information source must be individually described and accessed individually.
[0003]
An existing technique for integrating a plurality of relational databases includes a so-called multi-database. This creates a schema that integrates multiple databases in the same way as creating a relational database view, and allows users to query the integrated schema. That's it. However, multi-database cannot solve the problems specific to new information sources as described below.
[0004]
Information sources such as Web pages and image databases have restrictions different from those of general relational databases. For example, suppose that the information source is a Web page in a form format that requests items such as “product name” and “product content” based on the “price” of the product. When this Web page is regarded as a multi-database information source and an attempt is made to search as if a table is searched, a virtual table having items such as “price”, “product name”, and “product content” is created. . However, this table has a restriction that it cannot be searched unless it is specified with “price” as a condition. In the relational database, all items can be searched in the same way, but that does not apply to Web page information sources.
[0005]
Further, in a recent commercially available image database, an image search is performed using a query that is an extension of the SQL language that is a query language of a relational database. In order to search for an image, it is possible to search by giving parameters such as “color” and “shape” as well as simply specifying an image item. Conversely, there may be a restriction that the search cannot be performed unless parameters specific to these images are specified. Such restrictions vary from product to product.
[0006]
In order to solve such a search problem for different kinds of information sources, a system architecture called mediator has been proposed. The mediator is the same as the multi-database in that it manages information for mapping data from different information sources to a data model unique to each mediator and the mapping information of the integrated schema and individual schema. However, the mapping information is characterized by using a knowledge expression technique such as rule base. When there is an inquiry request from the user, the rule processing engine interprets the mapping information expressed by the rule and the user inquiry, and generates an inquiry to an appropriate information source.
[0007]
Further, as a method for converting a query description unique to an image database, there is a query conversion method disclosed in an image database heterogeneity elimination search method (Japanese Patent Application No. 10-36351). This method implements query conversion from a single query to each heterogeneous image database by mapping the image database to a relational database and managing the representation format of each data item and the delivery method of the image data that is the search result. Is.
[0008]
[Problems to be solved by the invention]
However, the conventional method has the following problems.
In the multi-database, a method for handling different kinds of information sources such as a Web page / image database other than the relational database is not particularly prepared.
Further, in the mediator, it is necessary to use rules in order to express the constraints of the Web page / image database. However, using a general knowledge expression language such as rules requires that you learn the rule language when describing constraints and mapping, and what rules are used for certain types of constraints. The burden on the system administrator increases due to the fact that the guideline on whether or not to express it is not clear. In addition, mediators so far have been able to describe restrictions on the level of data items such as whether or not a certain item can be searched, but it is possible or not to perform comparison processing on a certain data type. The description method of such data type level (metadata level) constraints is not clear.
[0009]
Also, in the query conversion method disclosed in the image database heterogeneity elimination search method (Japanese Patent Application No. 10-36351), the method for handling the constraints of the image database is not clear.
The present invention has been made in view of the above points, and solves easy inquiry description, easy system maintenance and management of flexible heterogeneous information sources, and various information sources (relation DB, image DB, Web In the search for pages, etc., query descriptions for all information sources are realized in a simple single format without making the user aware of the differences in query statement descriptions that are completely different for each information source. It is an object of the present invention to provide a heterogeneous information source query conversion method and apparatus for converting into a data source, and a storage medium storing a heterogeneous information source query conversion program.
[0010]
[Means for Solving the Problems]
  FIG. 1 is a diagram for explaining the principle of the present invention.
  The present invention (Claim 1) converts a single-format query description from a user into various information source-specific query descriptions.In the heterogeneous information source inquiry conversion device,In the heterogeneous information source query conversion method,
  Conversion meansInformation source type query description capability information for managing query description capability for each information source type of various information sources, data type query description capability information for managing query description capability for each data type of information source type, information source type Information source information that manages access information, information source query description capability information that manages query capability of data items in which the information source is inherent, and essential search condition control information that manages the processing method of essential conditions for each table in which the information source is inherent And each information of the mandatory search condition group information for managing the mandatory search condition for each table in which the information source is inherent.Using the information of the storage means to store,Inquiries from users, queries specific to various information sourcesDescriptionConvert toPerform the conversion step ( Step 1, 2 ) ,
  The conversion step is
  When converting to queries specific to various sources,
  Parse the query syntax for the query in single format
  Generate temporary query candidates using information source query description capability information,
  Delete primary query candidates that cannot be queried using information source type query description capability information and data type query description capability information,
  Those that can be queried using the query description capability information that cannot be queried using the mandatory search condition control information and the mandatory search condition group information, and that lack the mandatory condition data item complement the search condition,
  Use each remaining inquiry primary candidate as an inquiry candidate,
  It passes to the query statement generation function for each information source type, generates a query description specific to the information source, and returns the result.
[0013]
  FIG. 2 is a principle configuration diagram of the present invention.
  The present invention (Claim 2) converts a single-format query description from a user into various information source-specific query descriptions.The differenceSpecies information source inquiry conversion device,
  Information source type query description capability information for managing query description capability for each information source type of various information sources, data type query description capability information for managing query description capability for each data type of information source type, information source type Information source information that manages access information, information source query description capability information that manages query capability of data items in which the information source is inherent, and essential search condition control information that manages the processing method of essential conditions for each table in which the information source is inherent And storage means 150 for storing each information of the essential search condition group information for managing the essential search condition for each table in which the information source is inherent,
  Conversion means 100 for converting the inquiry description from the user into inquiry descriptions specific to various information sources using the information in the storage means 150.
  The conversion means 100
  A query syntax analysis means 120 for analyzing the syntax of the query for the acquired query in a single format;
  Means for generating temporary query candidates using information source query description capability information; means for deleting primary query candidates that cannot be queried using information source type query description capability information and data type query description capability information; and essential search conditions A query candidate generating means 130 having means for complementing the search condition that can be queried using the query description capability information that cannot be queried using the control information and the essential search condition group information;
  Each of the remaining inquiry primary candidates is used as a query candidate, is passed to a query statement generation function for each information source type, generates a query description specific to the information source, and includes a query syntax generation unit 140 that returns a result.
[0016]
  The present invention (claims)3) To convert a single-format query description from the user into query descriptions specific to various sourcesIn the heterogeneous information source inquiry conversion device,A storage medium storing a heterogeneous information source inquiry conversion program,
  A storage medium storing a program for causing a computer to execute processing for realizing the heterogeneous information source inquiry conversion method according to claim 1.
[0019]
As described above, the present invention is information source type query description capability information, data type query description capability information, information source information, information source query description capability information, essential search condition control information, which is information specific to various information sources. By defining the required search condition group information and defining it in the storage means, and managing it independently for each information source, by creating a query description including the search items and their conditions, a simple single Query description can be realized using the format.
[0020]
DETAILED DESCRIPTION OF THE INVENTION
In the present invention, first, for different information sources, (1) information source type query description capability information for managing query description capability for each information source type, and (2) query description capability for each data type of information source type are managed. Data type query description capability information, (3) information source information for managing the type of information source and access information, (4) information source query description capability information for managing the query capability for each data item in which the information source exists, 5) Stores essential search condition control information for managing the processing method of the essential condition for each table in which the information source exists, and [6] Required search condition group information for managing the essential search condition for each table in which the information source exists. I shall keep it.
[0021]
The contents of each of the above information are shown below.
(1) Information source type inquiry description capability information for managing inquiry description capability for each information source type:
・ Source type name
-Query statement generation function name
-Query statement generation library name
・ AND condition specification flag
・ OR condition specification flag
(2) Data type query description capability information for managing query description capability for each data type of information source type:
・ Source type name
Data type name
-= Condition specification flag
-<> Condition availability flag
・ <Condition specification availability flag
-> Condition specification flag
-<= Condition specification flag
-> = Condition specification flag
・ LIKE condition specification flag
(3) Information source information for managing information source types and access information:
・ Source name
・ Source type name
・ Source access information
(4) Information source query description capability information for managing the query capability for each data item in which the information source is inherent:
・ Source name
・ Table name
・ Data item name
Data type name
-Data item value acquisition flag
・ Specifiable flag for search conditions
・ Default value of search condition
(5) Mandatory search condition control information for managing the processing method of the mandatory condition for each table in which the information source is inherent:
・ Source name
・ Table name
・ Required search condition control flag
(6) Mandatory search condition group information for managing the mandatory search condition for each table in which the information source exists:
・ Source name
・ Table name
・ Required search condition group number
・ Data item name
The operation when the above information is defined and stored and the inquiry description from the user is converted into inquiries specific to various information sources will be described.
[0022]
3 and 4 are flowcharts of the heterogeneous information source query conversion process of the present invention.
Step 101) First, for a query in a single format, the query syntax is analyzed.
Step 102) An information source including all data items designated by the inquiry is searched using the information source inquiry description capability information.
[0023]
Step 103) For each data item, it is checked whether or not the data item value can be acquired and whether or not the search condition can be designated, and a combination of inquiry destinations satisfying each availability specification is generated as a primary inquiry candidate.
Step 104) It is checked whether or not the AND / OR designation managed by the information source type query description capability information is checked for each query primary candidate, and also managed by the data type query description capability information = / </> / </> /, <= /> = Check whether LINK designation is possible, and delete query candidates that do not satisfy any of the designations from the primary query candidates.
[0024]
Step 105) For each primary inquiry candidate, it is determined whether there is mandatory search condition group information. If there is, the process proceeds to Step 106, and if not, nothing is done.
Step 106) It is checked whether a data item belonging to any of the essential search condition groups is designated.
[0025]
Step 107) If specified, the process proceeds to step 108, and if not, the process proceeds to step 111.
Step 108) If it is specified, it is checked whether all data items belonging to the group are specified as search conditions or whether the essential search condition control flag is “candidate using default value”.
[0026]
Step 109) If the condition in Step 108 is satisfied, the essential search condition is complemented and the candidate is left.
Step 110) If the condition of Step 108 is not satisfied, the query is deleted from the primary inquiry candidate.
Step 111) If there is no designation in Step 107, it is checked whether the mandatory search condition control flag is “not a candidate when the mandatory condition is not satisfied”. If so, the process proceeds to Step 112. If not, the process proceeds to step 113.
[0027]
Step 112) Delete the candidate from the inquiry primary candidate.
Step 114) Each remaining primary inquiry candidate is set as an inquiry candidate.
Step 115) A query description unique to the information source is generated by passing it to the query statement generation function managed by the information source type query description capability information together with the access information for the information source.
[0028]
Next, the configuration of the heterogeneous information source inquiry conversion apparatus of the present invention will be described.
FIG. 5 shows the configuration of the heterogeneous information source query conversion apparatus of the present invention.
The heterogeneous information source query conversion apparatus 100 shown in FIG. 1 includes a user interface unit 110, a query syntax analysis unit 120, a query candidate generation unit 130, a query syntax generation unit 140, an information source query information storage unit 150, and an information source query information management unit. 160, and communicates with the application program 200 via the user interface unit 110.
[0029]
Below, each said component is demonstrated.
The user interface unit 110 receives an inquiry including only the search item and the search condition input from the user application program 200, and returns the converted inquiry to the application program 200.
The query syntax analysis unit 120 analyzes the syntax of the query received by the user interface unit 110.
[0030]
The inquiry candidate generation unit 130 includes a data item search unit 131, an inquiry description capability check unit 132, and an essential condition data item processing unit 133.
The data item search unit 131 searches the location of the designated data item in the query analyzed by the query syntax analysis unit 120, and generates a primary query candidate.
[0031]
The query description capability check unit 132 checks the query description capability according to the information source type and data type of the data item specified in the query against the query primary candidate generated by the data item search unit 131, and the query primary candidate that cannot be queried. Is deleted.
The essential condition data item processing unit 133 deletes the inquiry primary candidates that cannot be inquired for the inquiry primary candidates remaining in the inquiry description ability check unit 132 using the essential condition control information and the essential search condition group information, and can inquire. If the required condition data item is missing, the condition is complemented.
[0032]
The query syntax generation unit 140 includes a query syntax generation control unit 141 and a query statement generation function library 142.
The query syntax generation control unit 141 sets the query primary candidate remaining in the query candidate generation unit 130 as a query candidate, passes it to the query statement generation function library 142 for each information source type, and returns the converted query to the user interface unit 110. .
[0033]
The query statement generation function library 142 generates a query language specific to the information source for the query candidate received from the query syntax generation control unit 141 and returns it to the query syntax generation control unit 141.
The information source query information storage unit 150 stores information source type query description capability information, data type query description capability information, information source information, information source query description capability information, essential search condition control information, and essential search condition group information. to manage.
[0034]
The information source inquiry information management unit 160 inputs / deletes / changes various information to the information source inquiry information storage unit 150.
In the heterogeneous information source inquiry conversion apparatus configured as described above, a preparation phase for preparing various types of information before conversion and a conversion phase for performing actual conversion will be described.
[0035]
FIG. 6 is a flowchart showing the operation of the preparation phase of the present invention.
It is determined whether or not the required information source type has been registered (step 200). If the required information source type has not been registered, the query description capability information definition for each information source type (step 210) and the information source Definition of query description capability information for each type of data type (step 220), definition of information source information for managing information source type and access information (step 230), and query capability for each data item in which the information source resides Definition of information (step 240), definition of essential condition control information for each table in which the information source exists (step 250), and definition of essential search condition group information for each table in which the information source exists (step 260). Do. In the above description, Step 200 to Step 260 are defined by a series of operations. However, in the above information, undefined information is defined as needed.
[0036]
FIG. 7 is a flowchart showing the operation of the conversion phase of the present invention.
In response to a query in a single format acquired via the user interface unit 110, the query syntax analysis unit 120 analyzes the query syntax, and the data item search unit 131 of the query candidate generation unit 130 stores the information source query information storage unit 150. A query primary candidate is generated using the information source query description capability information (step 300), and a query primary candidate that cannot be queried using the information source type query description capability information and data type query description capability information in the query description capability check unit 132. Is deleted (step 310). The essential condition data item processing unit 133 deletes the inquiry primary candidate that cannot be inquired using the essential search condition control information and the essential search condition group information (step 320), and can be inquired using the information source inquiry description capability information. Those lacking data items complement the search criteria (step 330). The query syntax generation unit 140 sets each remaining query primary candidate as a query candidate and passes it to the query statement generation function library 142 for each information source type (step 340). The query syntax generation control unit 141 generates a query description specific to the information source in the query syntax generation control unit 141, and returns the result (step 350).
[0037]
【Example】
Embodiments of the present invention will be described below with reference to the drawings.
FIG. 8 is an example of the contents stored in the information source inquiry information storage unit of one embodiment of the present invention.
FIG. 6A shows a company A PC product RDB, which is a relational database storing information related to the company A PC product. SQL is accepted as a search request, and the search result is returned in a table format.
[0038]
FIG. 5B is a Web page in which information related to company B's PC product is described using HTML. Assume that the following syntax is accepted as a search request, the data in the part delimited by the table tag in the Web page is handled as tabular data, and accessed via an access driver that returns the search result in tabular format.
......
<Search request>: == <URL specification> "|" <Search item group> "|" <Search condition group>
<URL specification> :: = <URL> ["?" <CGI variable group>]
<CGI variable group> :: = <CGI variable> ["&" <CGI variable>] ...
<CGI variable> :: = <CGI variable name> ”=” <CGI variable value>
<CGI variable name> :: = any character string
<CGI variable value> :: = any character string
<Search item group> :: = <search item> ["," <search item>] ...
<Search condition group> :: = <Search condition> ["," <Search condition>] ...
<Search condition> :: = <Search item> <Comparison operator> Arbitrary character string
Example)
http://www.b-shop.co.jp/products.html | Product name, price, type | Price = 10000 ......
FIG. 10C is a shop handling PC search engine, which gives information on PC products handled at C shop on the condition of manufacturer name, and the search results are shown in table format as shown in FIG. Is displayed. Search engines and manufacturer names on the Web must be specified as conditions and cannot be omitted. The search engine is assumed to be accessed through an access driver in the same manner as the above Web page.
[0039]
Hereinafter, based on the above example, the preparation phase and the search phase will be described in accordance with FIG. 6 and FIG.
First, the preparation phase will be described.
(1) Definition of inquiry description capability information for each information source type (step 210):
A query that is the name of a function that generates a query statement in a format suitable for each information source type based on the result of analyzing the query from the user for each type of information source, such as RDB / Web page, image DB, etc. Inquiry description capability information such as a statement generation function name, a query statement generation library name that is the name of the library that contains the query generation function, and a flag indicating whether or not AND or OR can be included as a conditional expression Set as follows. An example of the setting is shown in FIG.
[0040]
(1) Relational database:
-Information source type name: RDB
-Query statement generation function name: createSQL
-Query statement generation library name: createQuery.dll
-AND condition designation flag: yes
-OR condition specification flag: Yes
(2) Web page:
-Information source type name: WebPage
-Query statement generation function name: createURL
-Query statement generation library name: createQuery.dll
-AND condition designation flag: yes
-OR condition specification flag: Not possible
Here, an example is shown in which an OR cannot be specified on a Web page.
[0041]
(2) Definition of inquiry description capability information for each data type of information source type (step 220):
For each data type of each information source type, query description capability information, which is a flag indicating whether or not a comparison operator such as = <> can be specified as a condition, is set as follows. A setting example is shown in FIG.
[0042]
(1) Related database:
-Information source type name: RDB
-Data type name: CHAR
-= Condition specification flag: Yes
-<> Condition specification flag: Yes
・ <Condition specification availability flag: Possible
-> Condition specification flag: Yes
-<= Condition specification flag: Yes
-> = Condition specification flag: Yes
・ LIKE condition specification flag: Yes

-Information source type name: RDB
-Data type name: INT
-= Condition specification flag: Yes
-<> Condition specification flag: Yes
・ <Condition specification availability flag: Possible
-> Condition specification flag: Yes
-<= Condition specification flag: Yes
-> = Condition specification flag: Yes
・ LIKE condition specification flag: Not possible
▲ 2 ▼ Web page
-Information source type name: WebPage
-Data type name: CHAR
-= Condition specification flag: Yes
-<> Condition specification flag: Yes
・ <Condition specification availability flag: Possible
-> Condition specification flag: Yes
-<= Condition specification flag: Yes
-> = Condition specification flag: Yes
・ LIKE condition specification flag: Yes

-Information source type name: WebPage
-Data type name: LONG
-= Condition specification flag: Yes
-<> Condition specification flag: Yes
・ <Condition specification availability flag: Possible
-> Condition specification flag: Yes
-<= Condition specification flag: Yes
-> = Condition specification flag: Yes
・ LIKE condition specification flag: Not possible
For the RDB INT type and the Web page LONG type, the IKE operator cannot be specified, but other conditions can be specified.
[0043]
(3) Information source information for managing information source type and access information (step 230):
For each information source name, the type of information source and access information (database name and Web page URL) for accessing the information source are set. In the example, set as follows. A setting example is shown in FIG.
[0044]
(1) Company A PC product RDB: • Information source name: Company A PC
-Information source type name: RDB
・ Source access information: ashop
(2) Company B PC product web page:
・ Source name: Company B PC
-Information source type name: WebPage
・ Source access information: http://www.b-shop.co.jp/product.html
▲ 3 ▼ C shop handling PC search engine:
・ Source name: Company C PC
-Information source type name: WebPage
・ Source access information: http://www.c-shop.co.jp/products.cgi
(4) Definition of inquiry description capability information for each data item in which the information source is inherent (step 240):
The query description capability information is set for each data item in which each information source is inherent. The inquiry description capability information includes the following information.
[0045]
Acquisition availability flag: A flag indicating whether or not the value of the data item can be acquired from the information source. Items such as “keyword” on the Web page cannot acquire a value from the information source.
Specification flag: A flag indicating whether or not a search condition can be specified for an information source in the data item. For example, for a data item returned as a result in a Web page in a form format The search condition cannot be specified.
[0046]
Mandatory search condition (required condition): A condition that must be specified when searching a certain table. For example, if a search can be performed without specifying a condition on a Web page in the form format, the condition Is such an example. When a user tries to perform a search without specifying the essential search condition, the conversion method of the present invention can automatically add the search condition when generating an inquiry sentence for the information source. The default value of the search condition is a value used as the search condition at that time.
[0047]
These pieces of information are set as follows. A setting example is shown in FIG.
(1) Company A PC product RDB:
・ Source name: Company A PC
-Table name: Company A PC product table
-Data item name: Manufacturer name
-Data type name: CHAR
-Data item value acquisition flag: Yes
-Flag for specifying search conditions: Yes
-Default value of search condition: <NULL>

・ Source name: Company A PC
-Table name: Company A PC product table
-Data item name: Product name
-Data type name: CHAR
-Data item value acquisition flag: Yes
-Flag for specifying search conditions: Yes
-Default value of search condition: <NULL>

・ Source name: Company A PC
-Table name: Company A PC product table
-Data item name: Price
-Data type name: INT
-Data item value acquisition flag: Yes
-Flag for specifying search conditions: Yes
-Default value of search condition: <NULL>
▲ 2 ▼ Company B PC product web page
・ Source name: Company B PC
-Table name: Company B PC product table
-Data item name: Manufacturer name
-Data type name: CHAR
-Data item value acquisition flag: Yes
-Flag for specifying search conditions: Yes
-Default value of search condition: <NULL>

・ Source name: Company B PC
-Table name: Company B PC product table
-Data item name: Product name
-Data type name: CHAR
-Data item value acquisition flag: Yes
-Flag for specifying search conditions: Yes
-Default value of search condition: <NULL>

・ Source name: Company B PC
-Table name: Company B PC product table
-Data item name: Price
-Data type name: LONG
-Data item value acquisition flag: Yes
-Flag for specifying search conditions: Yes
-Default value of search condition: <NULL>

・ Source name: Company B PC
-Table name: Company B PC product table
-Data item name: Delivery date
-Data type name: CHAR
-Data item value acquisition flag: Yes
-Flag for specifying search conditions: Yes
-Default value of search condition: <NULL>

▲ 4 ▼ C shop handling PC search engine
・ Source name: C Shop PC
・ Table name: PC handling C shop
-Data item name: Manufacturer name
-Data type name: CHAR
-Data item value acquisition flag: Yes
-Flag for specifying search conditions: Yes
-Default value of search condition: <NULL>

・ Source name: C Shop PC
・ Table name: PC handling C shop
-Data item name: Product name
-Data type name: CHAR
-Data item value acquisition flag: Yes
-Flag for specifying search conditions: Yes
-Default value of search condition: <NULL>

・ Source name: C Shop PC
・ Table name: PC handling C shop
-Data item name: Price
-Data type name: LONG
-Data item value acquisition flag: Yes
-Flag for specifying search conditions: Yes
-Default value of search condition: <NULL>
In the C shop PC page, the data item “maker name” can be specified only for the search condition, and therefore whether or not the data item value can be acquired is set to “No”.
[0048]
(5) Definition of essential search condition control information for each table in which the information source exists (step 250):
The essential search condition control information is set as follows for each table in which each information source exists. A setting example is shown in FIG.
(1) Company A PC product RDB:
Since there is no need to set mandatory search conditions, nothing is set.
[0049]
(2) Company B PC product web page:
Since there is no need to set mandatory search conditions, nothing is set.
▲ 3 ▼ C shop handling PC search engine
When searching with the C shop handling PC search engine, the search is not possible unless the manufacturer name is set as a search condition. This indispensable item “maker name” needs to be set as an essential search condition. The operation when the essential search condition is not included in the inquiry from the user is set by the essential search condition control flag.
[0050]
There are two setting values for the mandatory search condition control flag: “not a candidate” and “use a default value as a candidate”. “Not a candidate” indicates that a user does not specify a required search condition and is excluded from search candidates. “Making a candidate using a default value” means that when a user does not specify an essential search condition, the search condition is added using the default value and is created as a search candidate.
[0051]
In this example, it is assumed that a manufacturer name must be included as a value. In this case, when the essential search condition is not specified during the inquiry from the user, it is interpreted that the information of all the manufacturers is required, so that a selfish value cannot be entered as the default value. Therefore, specifically, it sets as follows.
・ Source name: C Shop PC
・ Table name: PC handling C shop
-Mandatory search condition control flag: Not a candidate when the mandatory condition is not met.
[0052]
For example, if the reserved word “all manufacturers” is specified as the manufacturer name, and there is a Web page that retrieves information on all manufacturers, “all manufacturers” is searched for information source query description capability information. It is effective to set the default value of the condition, and set “candidate using default value” as the essential search condition control flag of the essential search condition control information. When the user does not specify the manufacturer name as the search condition, the search condition for “all manufacturers” is complemented by the conversion method of the present invention.
[0053]
(6) Definition of mandatory search condition group information for each table in which the information source is inherent (step 260):
As for an essential condition, a plurality of items may be essential for a certain table. A collection of such data items is called an essential item group. The required search condition group information for each table in which each information source exists is set as follows. A setting example is shown in FIG.
[0054]
(1) Company A PC product RDB
Do not set anything because there is no need to set mandatory search conditions.
▲ 2 ▼ Company B PC product web page
Do not set anything because there is no need to set mandatory search conditions.
▲ 4 ▼ C shop handling PC search engine
When searching with the C shop handling PC search engine, the search is not possible unless the manufacturer name is set as a search condition. Therefore, set as follows.
・ Source name: C Shop PC
・ Table name: PC handling C shop
・ Required search condition group number: 1
-Data item name: Manufacturer name
Next, the conversion phase will be described.
[0055]
The processing for a simple syntax search request consisting of the following desired items and conditions will be described.

Search item: Price
Search condition: Product name = A-PC1

(1) The query syntax is analyzed, and a primary query candidate is generated using the information source query description capability information (step 300):
Information source query description for an information source that has both a data item “price” for which the data item value acquisition flag is “enabled” and a data item name “product name” for which the flag for specifying search conditions is “enabled” Search from the ability table. As a result, we get:
[0056]
(Candidate 1)
Information source name: Company A PC
Information source type name: RDB
Table name: Company A PC product table
Data item name: price, data type name: INT, search condition default value: <NULL>
Data item name: product name, data type name: CHAR, search condition default value: <NULL>
(Candidate 2)
Information source name: Company B PC
Information source type name: WebPage
Table name: Company B PC product table
Data item name: price, data type name: LONG, default value of search condition: <NULL>
Data item name: product name, data type name: CHAR, search condition default value: <NULL>
(Candidate 3)
Information source name: Company C PC
Information source type name: WebPage
Table name: PC handling C shop
Data item name: price, data type name: LONG, default value of search condition: <NULL>
Data item name: product name, data type name: CHAR, search condition default value: <NULL>
(2) Using the information source type query description capability information and the data type query description capability information, delete the query primary candidate with the query added (step 310):
Although the OR condition designation availability flag of the information source type name “WebPage” is “impossible”, since OR is not designated as the search request, there is no problem in checking the information source type query description capability.
[0057]
In the data type query description capability table, “=” is designated as the conditional expression for the query, so the = condition designation availability flag is checked. However, since it is “possible”, there is no problem.
(3) Delete the inquiry primary candidates that cannot be inquired using the essential search condition control information and the essential search condition group information (step 320):
For the created candidates, it is checked whether the table to be accessed is included in the required search condition group table. In this example, the C shop PC information source is registered as an essential search condition group, and the data item “maker name” is an essential item. In the above inquiry, “maker name” is not specified. As described above, for an inquiry that does not include the essential search condition, the essential search condition control table is checked, the contents of the essential search condition control flag are referred to, and the handling of the candidate is determined. In this example, “cannot be a candidate”, and (candidate 3) is deleted from the candidates.
[0058]
(4) If the information source query description capability information can be queried and the essential condition data item is missing, the search condition is complemented (step 330):
In this example, (candidate 3) corresponding to the essential search condition is deleted in the process (3) above, and thus the search condition is not complemented.
(5) Each remaining primary inquiry candidate is set as an inquiry candidate and passed to an inquiry sentence generation function for each information source type (step 340):
The remaining query primary candidates (candidate 1) and (candidate 2) are passed to the query generation function.
[0059]
(6) Generate a query description specific to the information source and return the result (step 350):
Each of the following information source specific query descriptions is generated.
(Candidate 1) select price from Company A's PC product table where product name = 'A-PC1'
(Candidate 2) http://www.b-shop.co.jp/products.html | Price | Product Name = 'A-PC1'
Further, the above-described operation of the preparation phase shown in FIG. 6 and the operation of the conversion phase shown in FIG. The present invention can be easily realized by storing it in a portable storage medium such as a ROM and installing it when carrying out the present invention.
[0060]
The present invention is not limited to the above-described embodiments, and various modifications and applications are possible within the scope of the claims.
[0061]
【The invention's effect】
As described above, according to the present invention, it is only necessary to manage various information independently for each information source with respect to information sources with different and increasing query descriptions on an open network. Easy and extensible, the query description for the present invention uses a simple syntax consisting only of the search item and its conditions, so without making the user aware of the difference in query statement description that differs for each information source, Since query descriptions for all information sources can be realized in a simple single format, the query description efficiency for different types of information sources can be improved epoch-makingly.
[0062]
Further, the present invention includes, for example, “keyword search in extranet composed of various information source types of each company”, “multi-media encyclopedia composed of a plurality of image databases”, “customer database and product Web page” It can be used for query conversion such as “Help Desk”, and can contribute to the establishment of seamless information source access technology.
[Brief description of the drawings]
FIG. 1 is a diagram for explaining the principle of the present invention.
FIG. 2 is a principle configuration diagram of the present invention.
FIG. 3 is a flowchart (part 1) of a heterogeneous information source query conversion process of the present invention.
FIG. 4 is a flowchart (part 2) of the heterogeneous information source query conversion process of the present invention.
FIG. 5 is a configuration diagram of a heterogeneous information source query conversion apparatus according to the present invention.
FIG. 6 is a flowchart of the operation of the preparation phase of the present invention.
FIG. 7 is a flowchart of the operation of the conversion phase of the present invention.
FIG. 8 is an example of contents stored in an information source inquiry information accumulation unit according to an embodiment of the present invention.
FIG. 9 is an example of a search result by a PC search engine according to an embodiment of the present invention.
FIG. 10 is an example of information source type query description capability information according to an embodiment of the present invention;
FIG. 11 is an example of information source type data type inquiry description capability information according to an embodiment of the present invention;
FIG. 12 is an example of information source information according to an embodiment of the present invention.
FIG. 13 is an example of information source query description capability information according to an embodiment of the present invention;
FIG. 14 is an example of essential search condition control information according to an embodiment of the present invention.
FIG. 15 is an example of essential search condition group information according to an embodiment of the present invention;
[Explanation of symbols]
100 conversion means
110 User interface part
120 Query syntax analysis means, query syntax analysis unit
130 Inquiry candidate generation means, inquiry candidate generation unit
131 Data item search part
132 Inquiry description ability check part
133 Mandatory condition data item processing part
140 Query syntax generator
141 Query syntax generation control unit
142 Query Statement Generation Function Library
150 Storage means, information source inquiry information storage unit
160 Information Source Inquiry Information Management Department
200 Application program
300 Information definition means

Claims (3)

  1. In a heterogeneous information source query conversion method in a heterogeneous information source query conversion apparatus for converting a query description in a single format from a user into a query description specific to various information sources,
    Conversion means, information source type query description capability information for managing query description capability for each information source type of the various information sources, data type query description capability information for managing query description capability for each data type of information source type, Information source information that manages the type of information source and access information, information source query description capability information that manages the query capability of the data item that contains the information source, and the processing method of the essential conditions for each table that contains the information source Using the information of the storage means for storing each information of the essential search condition control information and the essential search condition group information for managing the essential search condition for each table in which the information source is inherent, the inquiry description from the user is Perform a conversion step to convert the query description specific to various sources ,
    The converting step includes
    In converting to the various information source specific queries,
    Parse the query syntax for the query in single format
    Generate temporary query candidates using the information source query description capability information ,
    Delete primary query candidates that cannot be queried using the information source type query description capability information and the data type query description capability information,
    Those that can be queried using the query description capability information that cannot be queried using the essential search condition control information and the essential search condition group information, and those lacking the essential condition data item complement the search condition,
    Use each remaining inquiry primary candidate as an inquiry candidate,
    A heterogeneous information source query conversion method, wherein a query description unique to an information source is generated and returned to a query statement generation function for each information source type, and the result is returned .
  2. A single format for query descriptor from the user, a heterologous source query converter for converting a variety of information sources specific query descriptor,
    Information source type query description capability information for managing query description capability for each information source type of the various information sources, data type query description capability information for managing query description capability for each data type of information source type, information source type Information source information that manages information and access information, information source query description capability information that manages query capability of data items that contain information sources, and mandatory search condition control that manages the processing method of essential conditions for each table that contains information sources Storage means for storing information and each information of essential search condition group information for managing the essential search condition for each table in which the information source is inherent;
    Conversion means for converting an inquiry description from the user into an inquiry description specific to the various information sources using information in the storage means, the conversion means,
    Query parsing means for analyzing the syntax of the query for the obtained single format query,
    Means for generating temporary query candidates using the information source query description capability information; means for deleting primary query candidates that cannot be queried using the information source type query description capability information and the data type query description capability information; Inquiry candidates that can be queried using the query description capability information that cannot be queried using the essential search condition control information and the essential search condition group information, and have means for complementing the search condition if there is no essential condition data item Generating means;
    A query syntax generation means for generating a query description specific to the information source and returning a result as a query candidate for each remaining query primary candidate and passing it to a query statement generation function for each information source type Heterogeneous information source inquiry conversion device.
  3. A storage medium storing a heterogeneous information source query conversion program in a heterogeneous information source query conversion device for converting a query description in a single format from a user into a query description unique to various information sources,
    A storage medium storing a heterogeneous information source query conversion program, wherein a program for causing a computer to execute processing for realizing the heterogeneous information source query conversion method according to claim 1 is stored.
JP27114199A 1999-09-24 1999-09-24 Heterogeneous information source query conversion method and apparatus, and storage medium storing heterogeneous information source query conversion program Expired - Lifetime JP3671765B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP27114199A JP3671765B2 (en) 1999-09-24 1999-09-24 Heterogeneous information source query conversion method and apparatus, and storage medium storing heterogeneous information source query conversion program

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP27114199A JP3671765B2 (en) 1999-09-24 1999-09-24 Heterogeneous information source query conversion method and apparatus, and storage medium storing heterogeneous information source query conversion program

Publications (2)

Publication Number Publication Date
JP2001092844A JP2001092844A (en) 2001-04-06
JP3671765B2 true JP3671765B2 (en) 2005-07-13

Family

ID=17495902

Family Applications (1)

Application Number Title Priority Date Filing Date
JP27114199A Expired - Lifetime JP3671765B2 (en) 1999-09-24 1999-09-24 Heterogeneous information source query conversion method and apparatus, and storage medium storing heterogeneous information source query conversion program

Country Status (1)

Country Link
JP (1) JP3671765B2 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7698441B2 (en) * 2002-10-03 2010-04-13 International Business Machines Corporation Intelligent use of user data to pre-emptively prevent execution of a query violating access controls
AU2003299837B2 (en) * 2002-12-23 2010-03-25 Antenna Dexterra, Inc. Mobile data and software update system and method
US7383255B2 (en) * 2003-06-23 2008-06-03 Microsoft Corporation Common query runtime system and application programming interface
JP4704769B2 (en) * 2005-02-21 2011-06-22 エスアーペー アーゲーSap Ag Repair estimate output device, repair estimate output method, repair estimate output program, and repair estimate output system
JP5326303B2 (en) * 2008-03-10 2013-10-30 富士通株式会社 Integration device, integration program, and integration method
JP5100820B2 (en) 2010-11-25 2012-12-19 株式会社東芝 Query expression conversion apparatus, method and program
CN102571720B (en) * 2010-12-27 2015-02-04 中国移动通信集团辽宁有限公司 Method and device for processing heterogeneous information contents
JP5843965B2 (en) * 2012-07-13 2016-01-13 株式会社日立ソリューションズ Search device, search device control method, and recording medium
JP6523923B2 (en) * 2015-11-06 2019-06-05 三菱電機株式会社 Search control apparatus and search control method

Also Published As

Publication number Publication date
JP2001092844A (en) 2001-04-06

Similar Documents

Publication Publication Date Title
Bohannon et al. From XML schema to relations: A cost-based approach to XML storage
He et al. Relational databases for querying XML documents: Limitations and opportunities
McHugh et al. Indexing semistructured data
US6643640B1 (en) Method for performing a data query
US7650357B2 (en) Translation of object queries involving inheritence
US6240407B1 (en) Method and apparatus for creating an index in a database system
US7082433B2 (en) Translation of object queries involving inheritence
US7644066B2 (en) Techniques of efficient XML meta-data query using XML table index
US6611838B1 (en) Metadata exchange
US7047242B1 (en) Weighted term ranking for on-line query tool
Deutsch et al. Querying XML data
US6519597B1 (en) Method and apparatus for indexing structured documents with rich data types
Carey et al. XPERANTO: Publishing Object-Relational Data as XML.
US5913214A (en) Data extraction from world wide web pages
Atzeni et al. Semistructured and structured data in the web: Going back and forth
Abiteboul Querying semi-structured data
US5799310A (en) Relational database extenders for handling complex data types
US7668806B2 (en) Processing queries against one or more markup language sources
US7162469B2 (en) Querying an object for properties
Walmsley XQuery
US8412746B2 (en) Method and system for federated querying of data sources
US7461053B2 (en) System and interface for manipulating a database
US7370061B2 (en) Method for querying XML documents using a weighted navigational index
US6553367B2 (en) Method for obtaining a unified information graph from multiple information resources
US7305613B2 (en) Indexing structured documents

Legal Events

Date Code Title Description
A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20041012

A521 Written amendment

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20041213

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20050111

A521 Written amendment

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20050304

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20050329

R150 Certificate of patent or registration of utility model

Ref document number: 3671765

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

Free format text: JAPANESE INTERMEDIATE CODE: R150

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20050411

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20090428

Year of fee payment: 4

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20090428

Year of fee payment: 4

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20100428

Year of fee payment: 5

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20100428

Year of fee payment: 5

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20110428

Year of fee payment: 6

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20120428

Year of fee payment: 7

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20130428

Year of fee payment: 8

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20140428

Year of fee payment: 9

S531 Written request for registration of change of domicile

Free format text: JAPANESE INTERMEDIATE CODE: R313531

R350 Written notification of registration of transfer

Free format text: JAPANESE INTERMEDIATE CODE: R350

EXPY Cancellation because of completion of term