WO2009058696A1 - Method and apparatus for searching a hierarchical database and an unstructured database with a single search query - Google Patents

Method and apparatus for searching a hierarchical database and an unstructured database with a single search query Download PDF

Info

Publication number
WO2009058696A1
WO2009058696A1 PCT/US2008/081220 US2008081220W WO2009058696A1 WO 2009058696 A1 WO2009058696 A1 WO 2009058696A1 US 2008081220 W US2008081220 W US 2008081220W WO 2009058696 A1 WO2009058696 A1 WO 2009058696A1
Authority
WO
WIPO (PCT)
Prior art keywords
search
hierarchical database
data
database
hierarchical
Prior art date
Application number
PCT/US2008/081220
Other languages
French (fr)
Inventor
Christopher Waters
Original Assignee
Paglo Labs Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Paglo Labs Inc. filed Critical Paglo Labs Inc.
Priority to EP20080845213 priority Critical patent/EP2220577A4/en
Publication of WO2009058696A1 publication Critical patent/WO2009058696A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/289Object oriented databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • G06F16/2246Trees, e.g. B+trees
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • G06F16/2272Management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2453Query optimisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Definitions

  • Embodiments of the invention relate to the field of database searching; and more specifically, to the searching of a hierarchical database and an unstructured database with a single search query.
  • Structured data is used to refer to data that has some structure associated with the data.
  • a relational database contains structured data as the data within the relational database is structured into tables, columns, and rows.
  • searching structured data requires knowledge of the underlying structure.
  • searching the relational database requires knowledge of the table names.
  • searching a relational database requires knowledge of a rigid searching syntax, such as SQL.
  • Structured data may also be stored in a hierarchical database.
  • the hierarchical database can be a tree, where each data element can be considered a node of the tree.
  • unstructured data is used to refer to data that does not have structure associated with the data.
  • a common example of unstructured data is data stored in virtual documents in an inverted index.
  • virtual document is used to refer to representation of data as textual data that may be indexed.
  • searching the inverted index typically consists of entering in keywords.
  • keyword is used to refer to a search string.
  • Relational databases have a limited text searching feature. Relational databases are commonly made up of multiple relations (often called tables), which may or may not be connected. Each relation typically represents a different data domain. For example, one relation may represent product suppliers and another relation may represent clients. In order to maintain the structure of the relations within a search result, text searching is performed on a per relation basis. In other words, as the relations represent different data domains, text searching across the multiple data sets would not result in meaningful results as there would not be an indication of which relation the result belongs to. Thus, prior art relational database text searching has the disadvantage that knowledge of a particular relation is required. Additionally, when there are multiple relations a separate text search must be performed on each relation.
  • Prior art techniques exist that convert structured data into unstructured data to allow for full text searching. For example, data within a relational database may be converted to a format suitable for unstructured searching (e.g., converted into an inverted index to allow for keyword searching).
  • a disadvantage of converting data stored in a structured manner into data stored in an unstructured manner is that while searching may be easier for a user (e.g., the user does need to know the structure or special syntax) the results of the search will not include the associated structure.
  • the keyword search acts as a hint as to where in the relational database the information is located.
  • this prior art technique has the disadvantage that if there are multiple identifiers, the user is required to manually search each tuple for each identifier (i.e., the user must manually form a structured query for each identifier). Additionally, if the identifiers correspond to different relations, the user is required to manually search each relation for each identifier (i.e, the user must manually form a structured query for each identifier).
  • Figure 1 is a data flow diagram illustrating an exemplary system to search a hierarchical database and an inverted index with a single search query according to one embodiment of the invention
  • Figure 2A is a block diagram illustrating exemplary single search query syntaxes configured to search a hierarchical database and an inverted index with the single query according to one embodiment of the invention.
  • Figure 2B is a data flow diagram illustrating an exemplary searching of a hierarchical database and an inverted index with a single search query according to one embodiment of the invention.
  • Figure 2C is a block diagram illustrating an exemplary results screen of a single search query configured to search a hierarchical database and an inverted index according to one embodiment of the invention.
  • Figure 3 is a data flow diagram illustrating an exemplary system for generating virtual documents from a hierarchical database and indexing those virtual documents into an inverted index according to one embodiment of the invention.
  • Figure 4 is a data flow diagram illustrating an exemplary system for generating virtual document(s) from a hierarchical database and indexing those virtual documents into an inverted index upon receipt of data according to one embodiment of the invention.
  • Figure 5 is a block diagram illustrating an exemplary hierarchical structure according to one embodiment of the invention.
  • Figure 6 is an exemplary search screen graphical user interface configured to allow a user to generate a single search query to search a hierarchical database and an inverted index by selecting items returned as a result from a previous unstructured search according to one embodiment of the invention.
  • Figure 7 is an exemplary results screen in response to a user generating a single search query to search a hierarchical database and an inverted index from selecting item(s) returned as a result from a previous unstructured search according to one embodiment of the invention.
  • references in the specification to "one embodiment”, “an embodiment”, “an example embodiment”, etc., indicate that the embodiment described may include a particular feature, structure, or characteristic, but every embodiment may not necessarily include the particular feature, structure, or characteristic. Moreover, such phrases are not necessarily referring to the same embodiment. Further, when a particular feature, structure, or characteristic is described in connection with an embodiment, it is submitted that it is within the knowledge of one skilled in the art to effect such feature, structure, or characteristic in connection with other embodiments whether or not explicitly described.
  • Coupled may mean that two or more elements are in direct physical or electrical contact. However, “coupled” may also mean that two or more elements are not in direct contact with each other, but yet still co-operate or interact with each other.
  • Such computers store and communicate (internally and with other computers over a network) code and data using machine -readable media, such as machine storage media (e.g., magnetic disks; optical disks; random access memory; read only memory; flash memory devices) and machine communication media (e.g., electrical, optical, acoustical or other form of propagated signals - such as carrier waves, infrared signals, digital signals, etc.).
  • machine storage media e.g., magnetic disks; optical disks; random access memory; read only memory; flash memory devices
  • machine communication media e.g., electrical, optical, acoustical or other form of propagated signals - such as carrier waves, infrared signals, digital signals, etc.
  • such computers typically include a set of one or more processors coupled to one or more other components, such as a storage device, a number of user input/output devices (e.g., a keyboard and a display), and a network connection.
  • the coupling of the set of processors and other components is typically through one or more busses and bridges (also termed as bus controllers).
  • the storage device and network traffic respectively represent one or more machine storage media and machine communication media.
  • the storage device of a given computer system typically stores code and data for execution on the set of one or more processors of that computer.
  • one or more parts of an embodiment of the invention may be implemented using different combinations of software, firmware, and/or hardware.
  • a method and apparatus for searching a hierarchical database and an unstructured database with a single search query is described.
  • virtual documents are generated from the hierarchical database and are indexed into an inverted index along with associated identifiers of the hierarchical database.
  • a single search query searches the inverted index and from that result automatically searches the hierarchical database.
  • Figure 1 is a data flow diagram illustrating an exemplary system to search a hierarchical database and an unstructured database with a single search query according to one embodiment of the invention. It should be understood that while this data flow diagram and other data flow diagrams illustrate steps to be performed at a time, the order in which they can be performed is exemplary and the order may be performed differently by certain embodiments. [0025] Referring to Figure 1, at a time 1, a single search query with an unstructured search string within a structured search query is received by search server user interface 180 to automatically cause a search of inverted index 120 and use of the result to automatically search hierarchical database 110.
  • an inverted index as an unstructured database is exemplary and other unstructured databases may be used (e.g., a forward index, a trie, a vector space model, etc.).
  • the single search query originates from a user entering in the query from a graphical user interface (e.g., a web browser)
  • the query originates from different sources (e.g., from an application, from a command line interface, etc.). Examples illustrating a single search query will be discussed with reference to Figure 2A.
  • parser 140 extracts the unstructured search string from within the structured search query and forwards the unstructured search string to inverted index engine 135 at a time 2.
  • Inverted index engine 135 accepts the unstructured search string and at a time 3 searches inverted index 120 according to the unstructured search string.
  • Inverted index 120 includes virtual documents that were selectively generated from hierarchical database 110.
  • Each virtual document is associated with metadata that includes a unique identifier from the hierarchical database 110 used to designate the data in the hierarchical database 110 from which that virtual document was created.
  • metadata also can include path information from the hierarchical database 110.
  • Each unique identifier represents a point in hierarchical database 110.
  • a point in hierarchical database 110 may be any data element in the hierarchal database that is not a value according to one embodiment (e.g., a node in hierarchical database may be a point).
  • each point in hierarchical database 110 includes a unique identifier. Note that values of hierarchical database 110 do not include a unique identifier.
  • Each virtual document includes information from a starting point and all points and values beneath that starting point.
  • the virtual document may be overlapping data indexed in inverted index 120 depending on the manner in which the virtual documents are generated.
  • inverted index engine 135 searches inverted index 120 according to the unstructured search string. At a time 4, inverted index engine 135 receives the results of that search, which include one or more unique identifiers associated with the virtual documents that match the search. According to another embodiment, inverted index engine 135 also receives path information instead of or in addition to the unique identifiers. Inverted index engine 135 at a time 5 forwards the results of the unstructured search string search to structured search query generator 150 within search server user interface 180.
  • structured search query generator 150 At a time 6, for each of the unique identifiers returned from the unstructured search string search, structured search query generator 150 generates a separate search query from the single search query by replacing the unstructured search string in the structured search query with that unique identifier. In one embodiment structured search query generator 150 separately forwards each generated separate search query to hierarchical database engine 130 to allow a search of the hierarchical database. In another embodiment, structured search query generator 150 forwards the separate search queries as a group and hierarchical database engine 130 determines an order that the separate search queries will be processed and used for the search.
  • Hierarchal database 110 is searched according to the separate search query at a time 7. Examples of syntax of the separate search query will be discussed with reference to Figure 2B.
  • hierarchical database receives the returned results and forwards the returned results to hierarchical search results module 160. While in one embodiment of the invention hierarchical search results module 160 formats the results of the separate search queries in a tree format, in alternative embodiments of the invention hierarchical search results module 160 formats the results of the separate search queries in different formats (e.g., table, list, graph, chart, etc.). Hierarchical search results module 160 may also be configured to allow the formatting of the results of the separate search queries to be user configurable and selectable.
  • hierarchical search results module 160 may convert the results from one format to another format. For example, a user originally selected the results to be formatted in a tree format and later selects the results to be converted into a table, list, graph, chart, etc
  • FIG. 2A is a block diagram illustrating exemplary single search query syntaxes configured to search a hierarchical database and an inverted index with the single query according to one embodiment of the invention.
  • the single search query with an unstructured search string within a structured search query uses features that are familiar to SQL users.
  • a simple single search query syntax may take the form of SELECT * FROM %Search String% .
  • the SELECT clause in the single search query is used to specify the data returned subject to the FROM clause.
  • the asterisk indicates that everything in the hierarchical database is to be returned subject to the FROM clause.
  • the FROM clause describes where in the database information should be returned from.
  • the FROM clause represents which sub-tree the data will be searched from.
  • a simple exemplary single search query with an unstructured search string within a structured search query syntax may take the following syntax: SELECT * FROM %MAC% .
  • a WHERE clause may be used to specify the selection.
  • the WHERE clause restricts or filters the data returned.
  • Figure 2B is a data flow diagram illustrating an exemplary searching of a hierarchical database and an inverted index with a single search query according to one embodiment of the invention.
  • search server user interface receives the single query SELECT * FROM %MAC%.
  • This single query has syntax identifying an unstructured search string ( %MAC%) within a structured search query to automatically cause a search of inverted index 120 and use of that result to automatically search hierarchical database 110.
  • parser 140 extracts the unstructured search string MAC from the structured search query.
  • Parser 140 forwards the extracted unstructured search string 'MAC' to inverted index engine 135 to allow a search of the inverted index 120.
  • Inverted index 135 accepts the unstructured search string MAC and searches the inverted index 120 for the unstructured search string MAC.
  • inverted index 135 searches each virtual document in the inverted index for the occurrence of the search string 'MAC.
  • Hierarchical database engine 130 searches the hierarchical database 110 according to this separate search query at a time 7.
  • Hierarchical database engine 130 forwards the result to hierarchical search results module 160 at a time 9.
  • Hierarchical database engine 130 searches the hierarchical database 110 according to this separate search query at a time 11.
  • Hierarchical database engine 130 forwards the result to hierarchical search results module 160 at a time 13.
  • hierarchical search results module 160 formats the results of the separate search queries in a tree format
  • hierarchical search results module 160 formats the results of the separate search queries in different formats (e.g., table, list, graph, chart, etc.).
  • Hierarchical search results module 160 may also be configured to allow the formatting of the results of the separate search queries to be user configurable and selectable. That is, a user may select the format in which the results are outputted.
  • hierarchical search results module 160 may convert the results from one format to another format. For example, a user originally selected the results to be formatted in a tree format and later selects the results to be converted into a table, list, graph, chart, etc.
  • results of the separate search queries in the above example were across multiple relations of hierarchical database 110. That is, the results of the separate search queries included information from different data domains, in this case a device domain and a user domain.
  • a single search query in our example SELECT * FROM %MAC% searched multiple data domains in the hierarchical database and results from the multiple data domains retain the structure associated with the data (e.g., the tree structure) and were returned from that single search query.
  • searching of the hierarchical database with the results of the unstructured search string search was performed automatically without any user action required. Thus a user is not required to manually form a structured search query for each of the results received from the unstructured search string search
  • a single search including a search string may be performed over a large number of data domains in a hierarchical database where the results retain the structure associated with the data. While the example single search query and the example hierarchical database were both rather simple, it should be understood that a typical database may include a large number of data domains. [0041] Although the result of the above single search query does not include partial duplicative results, partial duplicative results are possible depending on the single search query and the virtual documents generated.
  • An example of a single search query that would return partial duplicative results is SELECT * FROM %10 %.
  • hierarchical search results module determines whether there is partial duplicative data and handles this in one of a number of ways (e.g., keep only the most narrow results (i.e., the furthest nested data), keep only the most broad results (i.e., the data that includes the most information), a combination based on user selection, etc.).
  • partial duplicative results are displayed to the user so as to allow the user to fine tune the query or to view a broader result set.
  • the values stored in hierarchical database 110 are associated with timestamp values. These timestamp values may identify the historical record of the values. While in one embodiment of the invention a different timestamp is associated with a value each time the value is added or changed, in an alternative embodiment of the invention a different timestamp is associated with a value at certain predefined periods of time (e.g., hourly, daily, weekly, monthly, etc.). These timestamp values may be displayed along with the values according to certain embodiments of the invention. Additionally, in one embodiment of the invention hierarchical search results module 160 determines whether values returned from the search are stale.
  • Hierarchical search results module 160 hides stale values (e.g., does not display the stale values) from the user. A user may optionally configure hierarchical search results module 160 to display the hidden stale values.
  • Figure 2C is a block diagram illustrating an exemplary results screen of a single search query configured to search a hierarchical database and an inverted index according to one embodiment of the invention.
  • the results screen can be displayed on any web browser or displayed from any stand alone application.
  • Included in the results screen is single search query box 250 which is configured to accept the single search query.
  • the single search query is displayed along with the results to remind the user of what the particular single search query was.
  • the results screen in Figure 2C corresponds to the single search query as described in Figure 2B.
  • the single search query SELECT * FROM %MAC% is displayed in single search query box 250.
  • results from the search performed as described in Figure 2B are results from the search performed as described in Figure 2B.
  • the results are formatted as a tree in Figure 2C, however in alternative embodiments of the invention results may be formatted differently (e.g., as a list, as a table, as a chart, as a graph, etc.).
  • the results screen in Figure 2C is configured to allow a user to format the results in different formats. For example, the user may convert the results from a tree format to a table format by selecting the Table function included in Figure 2C.
  • the results shown in Figure 2C of the single search query used in Figure 2B included multiple relations of hierarchical database 110. That is, the results of the single search query included information from different data domains, in this case a device domain and a user domain.
  • a single search query in our example SELECT * FROM %MAC% searched multiple data domains in the hierarchical database and results from the multiple data domains retain the structure associated with the data (e.g., the tree structure) and were returned from that single search query.
  • the searching of the hierarchical database with the results of the unstructured search string search is performed automatically without any user action required. Thus a user is not required to manually form a structured search query for each of the results received from the unstructured search string search
  • FIG. 3 is a data flow diagram illustrating an exemplary system for selectively generating virtual documents from a hierarchical database and indexing those virtual documents into an inverted index according to one embodiment of the invention.
  • virtual documents are selectively generated from data stored in hierarchical database 110.
  • document generator 170 receives input that identifies point(s) in hierarchical database 110 to selectively generate the virtual document(s) from.
  • a point in hierarchical database 110 may be the sub-tree root node of any of the sub-trees in hierarchical database 110.
  • a sub- tree begins at a node that is a child of the root node and is not a value.
  • the sub-tree root node is the top node of the sub-tree.
  • the sub-tree includes information starting at the sub-tree root node and traversing through each child node of the sub-tree and ending with at least one value.
  • device 140, user 142, name 144, 1_F 146, name 148, and IP_addr 152 are each sub-tree root nodes and may be identified as a point in hierarchical database 110 where a virtual document is selectively generated from. Note that it is possible for one sub-tree to include another sub-tree. Thus, each virtual document generated represents a sub-tree in hierarchical database 110.
  • the input that identifies point(s) in hierarchical database 110 may originate from numerous entities or modules.
  • the input is received from a user selecting point(s) in hierarchical database 110 by browsing a visual representation of hierarchical database 110 where the user decides the point(s) from which to generate virtual document(s) from.
  • the input is received from a user using a command line interface to identify the point(s) in hierarchical database 110 to selectively generate the virtual document(s) from.
  • the input that identifies point(s) in hierarchical database 110 to generate the virtual document(s) from is received automatically as a result of an algorithm.
  • an algorithm may select as a point to generate virtual documents from each node in the hierarchical database that includes at least one child node.
  • an algorithm may select as a point to generate virtual documents from every node in the hierarchical database.
  • an algorithm may select as a point to generate virtual documents from all nodes of a certain data domain (e.g., all nodes of the type DEVICE).
  • the input that identifies point(s) in hierarchical database 110 to selectively generate virtual document(s) from may originate from various sources and/or combination of sources.
  • FIG. 3 input has been received that identifies three points in hierarchical database 110 to generate virtual document(s) from: sub-tree root node device 140, sub-tree root node I_F 146, and sub-tree root node user 142.
  • the virtual documents to be generated are illustrated by dashed lines within hierarchical database 110.
  • the virtual document corresponding to sub-tree root node device 140 includes the information in the virtual document corresponding to sub-tree root node I_F 146.
  • document generator 170 sends appropriate query/queries to hierarchical database engine 130 to obtain the data required for the virtual document(s) at a time 2.
  • An example query syntax may take the form of SELECT * FROM "point”.
  • each point identified represents a sub-tree in hierarchical database 110.
  • Hierarchical database engine 130 queries hierarchical database 110 according to the received queries and receives the sub-tree results of those queries, including the sub-tree root node identifier at a time 3. At a time 4, hierarchical database engine 130 returns the sub-tree results of the query/queries to document generator 170.
  • Document generator 170 forms a virtual document for each of the sub-tree results of the queries and sends these virtual documents to inverted index engine 135 at a time 5.
  • Inverted index engine 135 indexes each virtual document into inverted index 120 and causes the storage of the indexed virtual documents with the sub-tree root node identifiers at a time 6.
  • three virtual documents were created as a result of the input received at document generator 170 and each of the virtual documents represents a sub-tree (and are associated with the sub-tree root node) of hierarchical database 110.
  • FIG 4 is a data flow diagram illustrating an exemplary system for generating virtual document(s) from a hierarchical database and indexing those virtual documents into an inverted index upon receipt of data according to one embodiment of the invention.
  • data stored in hierarchical database 110 is the same as described in Figure 3.
  • a user or module e.g., a crawler traversing information
  • data receiving module 190 does not know whether the data received is already included in hierarchical database 110.
  • data receiving module sends a query to hierarchical database engine 130 that is configured to add new data to hierarchical database 110, update existing data in hierarchical database 110, or take no action.
  • the query will neither update nor add data to the hierarchical database.
  • document generator 170 receives input that identifies point(s) in hierarchical database 110 to selectively generate the virtual document(s) from. The input that is received to identify point(s) is described with reference to Figure 3. Once the point(s) are identified, document generator 170 sends appropriate query/queries to hierarchical database engine 130 to obtain the data required for the virtual document(s) at a time 5.
  • An example query syntax may take the form of SELECT * FROM "point".
  • Hierarchical database engine 130 queries hierarchical database 110 according to the received queries and receives the sub-tree results of those queries, including the sub-tree root node identifier at a time 6. At a time 7, hierarchical database engine 130 returns the sub-tree results of the query/queries to document generator 170. Document generator 170 forms a virtual document for each of the sub-tree results of the queries and sends these virtual documents to inverted index engine 135 at a time 8.
  • Inverted index engine 135 indexes each virtual document into inverted index 120 and causes the storage of the indexed virtual documents with the sub-tree root node identifiers at a time 9. While in one embodiment of the invention inverted index engine 135 replaces each virtual document stored in inverted index 120 with the corresponding virtual documents it has received from document generator 170, in alternative embodiments of the invention inverted index engine 135 replaces virtual documents stored in inverted index 120 only if the virtual document received from document generator 170 is different from the corresponding virtual document stored in the inverted index.
  • inverted index engine 135 causes the replacement of each of the originally stored virtual documents in inverted index 120 with the newly received virtual documents.
  • inverted index engine 135 causes the replacement of only the virtual document associated with node identifier two as this is the only virtual document that has been modified.
  • FIG. 5 is a block diagram illustrating an exemplary hierarchical structure of hierarchical database 110 according to one embodiment of the invention.
  • the data of hierarchical database 110 is organized into a tree structure.
  • Each data element (i.e., not a value) on the tree is a node of the tree.
  • Each node on the tree has a corresponding unique identifier (e.g., a node identifier).
  • network node 502 has a unique identifier of two.
  • the root node of the tree 500 which is represented by the symbol / . Directly below the root node exists two child nodes, network 502 and directory 572.
  • a child node is a node, not a value, that itself descends from a node (e.g., a parent node or root node).
  • Each parent node can have many child nodes, but each child node only has one parent.
  • a child node may also be a parent node.
  • network 502 and directory 572 each are parent nodes in addition to being child nodes because they include one or more child nodes.
  • network 502 and directory 572 are each root nodes of a subtree.
  • a sub-tree is a subset of the tree.
  • a sub-tree includes information starting at the sub-tree root node and traversing through each child node of the sub-tree root node and ending with at least one value. Any node on the tree that itself has nodes below it (e.g., a parent node) can be referred to as a sub-tree root node.
  • each sub-tree may include other sub-trees (i.e., the sub-trees may be nested within a subtree). There are many sub-trees in Figure 5.
  • a sub-tree where network 502 is the sub-tree root node includes all the information, including values, from the nodes device 504, device 506, and device 508.
  • device 504 is a sub-tree root node for the sub-tree that includes all the information, including values, from the nodes manufacturer 510, interface 512, and interface 514.
  • the directory 572 includes the nodes users 574 and users 576, which include the nodes name 578 and names 580, respectively.
  • Values are associated with leaf nodes.
  • the node manufacturer 510 is a leaf node because it is associated with the value 510 'Dell Corporation' . While in one embodiment of the invention values are only associated with leaf nodes, in alternative embodiments of the invention any node in the hierarchy can have values associated with that node.
  • the data stored in hierarchical database 110 as shown in Figure 5 is exemplary as many other types of data may be stored.
  • the data includes technical data that IT professionals may find useful when fulfilling their duties.
  • the data stored in hierarchical database includes information regarding substantially all devices within a LAN, a list of software installed on those devices, and a list of users authorized to use those devices.
  • the data stored may include information regarding the operating system version installed on substantially all devices within the LAN, the software which is running on substantially all devices within the LAN, and a configuration file from at least one router, switch, or firewall within the LAN.
  • the devices may include substantially all workstations within a LAN, substantially all routers within the LAN, substantially all switches within the LAN, substantially all servers within the LAN, substantially all firewalls within the LAN, and substantially all directory servers within the LAN.
  • the data stored in hierarchical database includes information regarding the existence of devices within one or more LANs (e.g., devices including one or more routers, one or more switches, one or more servers, one or more directory servers, and one or more workstations), existence of a plurality of hardware modules within each of the devices, states of the hardware modules, properties of the hardware modules, history of the hardware modules, existence of a peripheral coupled with at least one of the devices, states of the peripheral, properties of the peripheral, configuration of the peripheral, history of the peripheral, existence of at least one operating system operating within each of the devices, state of the operating systems, properties of the operating systems, configuration of the operating systems, history of the operating systems, existence of software within each of the devices, state of the software, properties of the software, configuration
  • each node existing directly below the tree root node i.e., the child nodes directly below the root node
  • each node existing directly below the tree root node represents a private sub-tree where values and subsequent child nodes are private to an organization.
  • multiple organizations may share the same tree data structure yet each organization can access only their data.
  • More complex single search queries will now be described with reference to Figure 5.
  • the virtual documents that have been generated and stored in the inverted index are represented by dashed lines.
  • a WHERE clause is used to specify the selection.
  • the WHERE clause restricts or filters the data returned. For example, if one wants to find information regarding interfaces on Dell devices where the interface status is 'up', the following single search query may be used:
  • the unstructured search string 'dell' is extracted from the query and the inverted index is searched according to 'dell'.
  • the search finds two virtual documents that include the unstructured search string 'dell' (virtual document associated with device node 504 (node identifier of 3) and the virtual document associated with device node 508 (node identifier of 5)).
  • These two queries produce the below result which includes nodes interface 512, MAC_address 516, name 518, and status 520; interface 536, name 538 and status 540 respectively:
  • the single search query will display everything about all interfaces in a Dell device that have a status of 'up'.
  • the WHERE clause may include paths. For example, if one wants to find information regarding devices that include the string 'Dell' where the interface status is 'up', the following single search query may be used:
  • the unstructured search string 'dell' is extracted from the query and the inverted index is searched according to 'dell'.
  • the search finds two virtual documents that include the unstructured search string 'dell' (virtual document associated with device node 504 (node identifier of 3) and the virtual document associated with device node 508 (node identifier of 5)).
  • the WHERE clause may include more than one path. For example, if a user would like to find information about all devices made by Dell that have an interface named 'ethO' and the status of that interface is 'up', the user may enter in the following single search query:
  • the unstructured search string 'dell' is extracted from the query and the inverted index is searched according to 'dell'.
  • the search finds two virtual documents that include the unstructured search string 'dell' (virtual document associated with device node 504 (node identifier of 3) and the virtual document associated with device node 508 (node identifier of 5)).
  • This syntax correlates the two paths in the WHERE clause.
  • this query outputs all information about Dell devices that have an interface named ethO that is up.
  • this query produces the following output:
  • a single search query may include paths in the SELECT clause. For example, if a user would like to find the interface name, and interface MAC address for all dell devices the following single search query may be used:
  • the unstructured search string 'dell' is extracted from the query and the inverted index is searched according to 'dell'.
  • the search finds two virtual documents that include the unstructured search string 'dell' (virtual document associated with device node 504 (node identifier of 3) and the virtual document associated with device node 508 (node identifier of 5)).
  • two structured search queries are generated, SELECT interf ace/mac_address , interface/name FROM id 3, and SELECT interf ace/mac_address , interface/name FROM id 5.
  • the second separate structured query does not contain any data regarding the interface MAC Address. While in one embodiment of the invention a null value is returned if a node returned in a query result does not include a value, in alternative embodiments of the invention different results may be returned (e.g., error messages, the data is skipped, etc.). Note that the results are not correlated for each interface. In other words, the output does not reflect the relationship between the MAC address and the interface name as the paths are not correlated. [0067] To correlate the paths the following syntax may be used:
  • the unstructured search string 'dell' is extracted from the query and the inverted index is searched according to 'dell'.
  • the search finds two virtual documents that include the unstructured search string 'dell' (virtual document associated with device node 504 (node identifier of 3) and the virtual document associated with device node 508 (node identifier of 5)).
  • two structured search queries are generated, SELECT interface/ (mac_address , name ) FROM id 3, and SELECT interface/ (mac_address , name ) FROM id 5. These two queries produce the following output:
  • interface mac_address: '00:01:02:03:04:05' name: y eth0' interface : mac_address: ' 00 : Al : A2 : A3 : A4 : A5 : name : ' ethl ' row : interface : mac_address: null name: y eth0' interface : mac_address: null name : : ' ethl '
  • this output reflects the relationship between the MAC address and the interface name. This is because the single search query included correlation in the paths.
  • Figure 6 is an exemplary search screen graphical user interface configured to allow a user to generate a single search query to search a hierarchical database and an inverted index by selecting item(s) returned as a result from a previous unstructured search according to one embodiment of the invention.
  • Search screen 600 includes search box 610 and structured search generator box 620. While in one embodiment of the invention search box 610 accepts only unstructured search queries, in alternative embodiments of the invention search box 610 accepts structured queries and queries with an unstructured search string within a structured search query.
  • a user has searched the inverted index for the search string 'Dell'.
  • the database described in Figure 5 will be used.
  • the results outputted to search screen 600 correspond to the virtual documents defined in Figure 5 that include the string 'Dell'.
  • two results have been returned.
  • the user may construct a structured search by selecting certain items in the result. The user may select items by any known methods (e.g., using a cursor to select, using a mouse to select, using a touch screen to select, etc.).
  • a user has selected two paths in which to generate a structured search query from (/device/interface/name, and /device/interface/status) .
  • Figure 7 is an exemplary results screen in response to a user generating a single search query to search a hierarchical database and an inverted index from selecting item(s) returned as a result from a previous unstructured search according to one embodiment of the invention.
  • the items selected by a user in Figure 6 have been converted into a single search query to search the hierarchical database and an inverted index.
  • results are formatted as a tree
  • results may be formatted differently (e.g., as a list, as a table, as a chart, as a graph, etc.).
  • the results screen in Figure 7 is configured to allow a user to format the results in different formats. For example, a user may convert the results from a tree format to a table format by selecting the Table function included in Figure 7.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Techniques for searching a hierarchical database and an unstructured database with a single search query, including an embodiment wherein, a single search query is received that has syntax identifying an unstructured search string within a structured search query to automatically cause a search of the inverted index and use of the result to automatically search the hierarchical database, wherein the inverted index includes virtual documents created from data stored in the hierarchical database, wherein each virtual document includes a unique identifier from the hierarchical database used to designate the data in the hierarchical database from which that virtual document was created, wherein a result of the inverted index search includes the unique identifiers of the virtual documents that meet the search criteria.

Description

Method and Apparatus for Searching a Hierarchical Database and an Unstructured Database with a Single Search Query
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] Not Applicable.
BACKGROUND
Field
[0002] Embodiments of the invention relate to the field of database searching; and more specifically, to the searching of a hierarchical database and an unstructured database with a single search query. Background
[0003] Data may be stored in numerous fashions both unstructured and structured. The term "structured data" is used to refer to data that has some structure associated with the data. For example, a relational database contains structured data as the data within the relational database is structured into tables, columns, and rows. Typically, searching structured data requires knowledge of the underlying structure. For example in the case of the relational database, searching the relational database requires knowledge of the table names. Additionally, searching a relational database requires knowledge of a rigid searching syntax, such as SQL. [0004] Structured data may also be stored in a hierarchical database. The hierarchical database can be a tree, where each data element can be considered a node of the tree. Similarly as with relational databases, searching the structured data in the hierarchical database requires knowledge of the hierarchical structure (e.g., nodes of the tree) and also requires knowledge of a searching syntax. [0005] The term "unstructured data" is used to refer to data that does not have structure associated with the data. A common example of unstructured data is data stored in virtual documents in an inverted index. The term "virtual document" is used to refer to representation of data as textual data that may be indexed. As the data in an inverted index is unstructured, searching the inverted index typically consists of entering in keywords. The term "keyword" is used to refer to a search string. Thus, unlike searching structured data, searching unstructured data does not require knowledge of a rigid searching syntax. However, a disadvantage of searching unstructured data is that the results may not be accurate as keywords may be shared across numerous data sets.
[0006] Relational databases have a limited text searching feature. Relational databases are commonly made up of multiple relations (often called tables), which may or may not be connected. Each relation typically represents a different data domain. For example, one relation may represent product suppliers and another relation may represent clients. In order to maintain the structure of the relations within a search result, text searching is performed on a per relation basis. In other words, as the relations represent different data domains, text searching across the multiple data sets would not result in meaningful results as there would not be an indication of which relation the result belongs to. Thus, prior art relational database text searching has the disadvantage that knowledge of a particular relation is required. Additionally, when there are multiple relations a separate text search must be performed on each relation.
[0007] Prior art techniques exist that convert structured data into unstructured data to allow for full text searching. For example, data within a relational database may be converted to a format suitable for unstructured searching (e.g., converted into an inverted index to allow for keyword searching). However, a disadvantage of converting data stored in a structured manner into data stored in an unstructured manner is that while searching may be easier for a user (e.g., the user does need to know the structure or special syntax) the results of the search will not include the associated structure.
[0008] Other prior art techniques exist that support keyword based searches in association with manual relational database searching. In these techniques, virtual documents are built from a relational database and are indexed into an inverted index. The virtual documents are associated with relation tuples of the relational database (e.g., by using identifiers). Keyword based searches can be performed on the inverted index where the returned results are the identifiers to the relations matching the search. The returned results may contain multiple identifiers in the case where the keyword search term matches multiple virtual documents, and thus multiple tuples. For each identifier that is returned in the result, a user is required to manually search the relational database relation corresponding to that identifier. Thus, in this prior art technique, the keyword search acts as a hint as to where in the relational database the information is located. However, this prior art technique has the disadvantage that if there are multiple identifiers, the user is required to manually search each tuple for each identifier (i.e., the user must manually form a structured query for each identifier). Additionally, if the identifiers correspond to different relations, the user is required to manually search each relation for each identifier (i.e, the user must manually form a structured query for each identifier).
BRIEF DESCRIPTION OF THE DRAWINGS
[0009] The invention may best be understood by referring to the following description and accompanying drawings that are used to illustrate embodiments of the invention. In the drawings:
[0010] Figure 1 is a data flow diagram illustrating an exemplary system to search a hierarchical database and an inverted index with a single search query according to one embodiment of the invention;
[0011] Figure 2A is a block diagram illustrating exemplary single search query syntaxes configured to search a hierarchical database and an inverted index with the single query according to one embodiment of the invention.
[0012] Figure 2B is a data flow diagram illustrating an exemplary searching of a hierarchical database and an inverted index with a single search query according to one embodiment of the invention.
[0013] Figure 2C is a block diagram illustrating an exemplary results screen of a single search query configured to search a hierarchical database and an inverted index according to one embodiment of the invention.
[0014] Figure 3 is a data flow diagram illustrating an exemplary system for generating virtual documents from a hierarchical database and indexing those virtual documents into an inverted index according to one embodiment of the invention.
[0015] Figure 4 is a data flow diagram illustrating an exemplary system for generating virtual document(s) from a hierarchical database and indexing those virtual documents into an inverted index upon receipt of data according to one embodiment of the invention.
[0016] Figure 5 is a block diagram illustrating an exemplary hierarchical structure according to one embodiment of the invention. [0017] Figure 6 is an exemplary search screen graphical user interface configured to allow a user to generate a single search query to search a hierarchical database and an inverted index by selecting items returned as a result from a previous unstructured search according to one embodiment of the invention. [0018] Figure 7 is an exemplary results screen in response to a user generating a single search query to search a hierarchical database and an inverted index from selecting item(s) returned as a result from a previous unstructured search according to one embodiment of the invention. DETAILED DESCRIPTION
[0019] In the following description, numerous specific details are set forth. However, it is understood that embodiments of the invention may be practiced without these specific details. In other instances, well-known circuits, structures and techniques have not been shown in detail in order not to obscure the understanding of this description. Those of ordinary skill in the art, with the included descriptions, will be able to implement appropriate functionality without undue experimentation.
[0020] References in the specification to "one embodiment", "an embodiment", "an example embodiment", etc., indicate that the embodiment described may include a particular feature, structure, or characteristic, but every embodiment may not necessarily include the particular feature, structure, or characteristic. Moreover, such phrases are not necessarily referring to the same embodiment. Further, when a particular feature, structure, or characteristic is described in connection with an embodiment, it is submitted that it is within the knowledge of one skilled in the art to effect such feature, structure, or characteristic in connection with other embodiments whether or not explicitly described.
[0021] In the following description and claims, the terms "coupled" and "connected," along with their derivatives, may be used. It should be understood that these terms are not intended as synonyms for each other. Rather, in particular embodiments, "connected" may be used to indicate that two or more elements are in direct physical or electrical contact with each other. "Coupled" may mean that two or more elements are in direct physical or electrical contact. However, "coupled" may also mean that two or more elements are not in direct contact with each other, but yet still co-operate or interact with each other.
[0022] The techniques shown in the figures can be implemented using code and data stored and executed on one or more computers. Such computers store and communicate (internally and with other computers over a network) code and data using machine -readable media, such as machine storage media (e.g., magnetic disks; optical disks; random access memory; read only memory; flash memory devices) and machine communication media (e.g., electrical, optical, acoustical or other form of propagated signals - such as carrier waves, infrared signals, digital signals, etc.). In addition, such computers typically include a set of one or more processors coupled to one or more other components, such as a storage device, a number of user input/output devices (e.g., a keyboard and a display), and a network connection. The coupling of the set of processors and other components is typically through one or more busses and bridges (also termed as bus controllers). The storage device and network traffic respectively represent one or more machine storage media and machine communication media. Thus, the storage device of a given computer system typically stores code and data for execution on the set of one or more processors of that computer. Of course, one or more parts of an embodiment of the invention may be implemented using different combinations of software, firmware, and/or hardware.
[0023] A method and apparatus for searching a hierarchical database and an unstructured database with a single search query is described. In one embodiment virtual documents are generated from the hierarchical database and are indexed into an inverted index along with associated identifiers of the hierarchical database. A single search query searches the inverted index and from that result automatically searches the hierarchical database.
[0024] Figure 1 is a data flow diagram illustrating an exemplary system to search a hierarchical database and an unstructured database with a single search query according to one embodiment of the invention. It should be understood that while this data flow diagram and other data flow diagrams illustrate steps to be performed at a time, the order in which they can be performed is exemplary and the order may be performed differently by certain embodiments. [0025] Referring to Figure 1, at a time 1, a single search query with an unstructured search string within a structured search query is received by search server user interface 180 to automatically cause a search of inverted index 120 and use of the result to automatically search hierarchical database 110. It should be understood that the use of an inverted index as an unstructured database is exemplary and other unstructured databases may be used (e.g., a forward index, a trie, a vector space model, etc.). While in one embodiment of the invention the single search query originates from a user entering in the query from a graphical user interface (e.g., a web browser), in alternative embodiments of the invention the query originates from different sources (e.g., from an application, from a command line interface, etc.). Examples illustrating a single search query will be discussed with reference to Figure 2A.
[0026] Within search server user interface, parser 140 extracts the unstructured search string from within the structured search query and forwards the unstructured search string to inverted index engine 135 at a time 2. Inverted index engine 135 accepts the unstructured search string and at a time 3 searches inverted index 120 according to the unstructured search string.
[0027] Inverted index 120 includes virtual documents that were selectively generated from hierarchical database 110. Each virtual document is associated with metadata that includes a unique identifier from the hierarchical database 110 used to designate the data in the hierarchical database 110 from which that virtual document was created. According to another embodiment, metadata also can include path information from the hierarchical database 110. Each unique identifier represents a point in hierarchical database 110. A point in hierarchical database 110 may be any data element in the hierarchal database that is not a value according to one embodiment (e.g., a node in hierarchical database may be a point). For example, in Figure 1 each point in hierarchical database 110 includes a unique identifier. Note that values of hierarchical database 110 do not include a unique identifier. [0028] Each virtual document includes information from a starting point and all points and values beneath that starting point. For example, in Figure 1, the virtual document NAME = ' MAC DENNI S ' is associated with unique identifier two (ID=2) and includes information from point NAME (ID=5) and the value 'MAC DENNIS'. The virtual document was designated to be generated from the point ID=2 and below. Thus, unique identifier two (ID=2) represents the point of /USER in hierarchical database 110. There may be overlapping data indexed in inverted index 120 depending on the manner in which the virtual documents are generated. For example, the virtual document
NAME = 'MAC
I_F/IP_ADDR = '10.10.1.1'
that is associated with unique identifier one (ID=I) includes information that is also included in the virtual document IP_ADDR = ' 10 . 10 . 1 . 1 ' that is associated with unique identifier four (ID=4). A more detailed description of generating virtual documents from hierarchical database 110 will be discussed in reference to Figure 3.
[0029] As previously described, inverted index engine 135 searches inverted index 120 according to the unstructured search string. At a time 4, inverted index engine 135 receives the results of that search, which include one or more unique identifiers associated with the virtual documents that match the search. According to another embodiment, inverted index engine 135 also receives path information instead of or in addition to the unique identifiers. Inverted index engine 135 at a time 5 forwards the results of the unstructured search string search to structured search query generator 150 within search server user interface 180. [0030] At a time 6, for each of the unique identifiers returned from the unstructured search string search, structured search query generator 150 generates a separate search query from the single search query by replacing the unstructured search string in the structured search query with that unique identifier. In one embodiment structured search query generator 150 separately forwards each generated separate search query to hierarchical database engine 130 to allow a search of the hierarchical database. In another embodiment, structured search query generator 150 forwards the separate search queries as a group and hierarchical database engine 130 determines an order that the separate search queries will be processed and used for the search.
[0031] Hierarchal database 110 is searched according to the separate search query at a time 7. Examples of syntax of the separate search query will be discussed with reference to Figure 2B. At a time 8, hierarchical database receives the returned results and forwards the returned results to hierarchical search results module 160. While in one embodiment of the invention hierarchical search results module 160 formats the results of the separate search queries in a tree format, in alternative embodiments of the invention hierarchical search results module hierarchical search results module 160 formats the results of the separate search queries in different formats (e.g., table, list, graph, chart, etc.). Hierarchical search results module 160 may also be configured to allow the formatting of the results of the separate search queries to be user configurable and selectable. That is, a user may select the format in which the results are outputted. Furthermore, hierarchical search results module 160 may convert the results from one format to another format. For example, a user originally selected the results to be formatted in a tree format and later selects the results to be converted into a table, list, graph, chart, etc
[0032] An exemplary search of hierarchical database 110 and inverted index 120 with a single search query that includes an unstructured search string within a structured search query will be described with reference to Figure 2B. [0033] Figure 2A is a block diagram illustrating exemplary single search query syntaxes configured to search a hierarchical database and an inverted index with the single query according to one embodiment of the invention. As many database users are familiar with SQL, according to one embodiment of the invention the single search query with an unstructured search string within a structured search query uses features that are familiar to SQL users. For example, a simple single search query syntax may take the form of SELECT * FROM %Search String% . Similarly to SQL, the SELECT clause in the single search query is used to specify the data returned subject to the FROM clause. In the above simple example, the asterisk indicates that everything in the hierarchical database is to be returned subject to the FROM clause. In the case of a tree, everything in the tree will be returned subject to the FROM clause. Also similar to SQL, the FROM clause describes where in the database information should be returned from. In the case of a tree, the FROM clause represents which sub-tree the data will be searched from. A simple exemplary single search query with an unstructured search string within a structured search query syntax may take the following syntax: SELECT * FROM %MAC% . Also similar to SQL, a WHERE clause may be used to specify the selection. In other words, the WHERE clause restricts or filters the data returned. An example of a single search query with a WHERE clause is SELECT * FROM %MAC% WHERE I_F/ IP_ADDR = ' 10 . 10 . 1 . 1 ' .
[0034] Figure 2B is a data flow diagram illustrating an exemplary searching of a hierarchical database and an inverted index with a single search query according to one embodiment of the invention. In Figure 2B, at a time 1 search server user interface receives the single query SELECT * FROM %MAC%. This single query has syntax identifying an unstructured search string ( %MAC%) within a structured search query to automatically cause a search of inverted index 120 and use of that result to automatically search hierarchical database 110. At a time 2, parser 140 extracts the unstructured search string MAC from the structured search query. While in one embodiment of the invention a leading symbol % represents the beginning of a search string and a closing % represents the closing of the search string, in alternative embodiments of the invention different symbols or words or any combination of symbols and words may be used (e.g., "", ", ** **, && &&, $$, etc.). Parser 140 forwards the extracted unstructured search string 'MAC' to inverted index engine 135 to allow a search of the inverted index 120. [0035] Inverted index 135 accepts the unstructured search string MAC and searches the inverted index 120 for the unstructured search string MAC. Thus inverted index 135 searches each virtual document in the inverted index for the occurrence of the search string 'MAC. As the search string 'MAC appears in two separate virtual documents (the virtual document associated with unique identifier one (ID=I) and the virtual document associated with unique identifier two (ID=2)), inverted index 135 will receive those unique identifiers (i.e., ID=I and ID=2) as a result of the search. While in one embodiment of the invention only the unique identifiers are returned as a result of the search, in alternative embodiments of the invention path information is returned in addition to or in place of the unique identifiers. Inverted index engine 135 forwards the result including the unique identifiers to structured search query generator 150 at a time 5. [0036] Structured search query generator 150 generates a separate search query by replacing the unstructured search string in the structured search query (%MAC%) with the first unique identifier received (ID=I) at a time 6. Thus, the structured search query generator 150 forwards the separate search query SELECT * FROM ID=I to hierarchical database engine 130. Hierarchical database engine 130 searches the hierarchical database 110 according to this separate search query at a time 7. Thus, the hierarchical database engine searches everything in the tree starting at device node (ID=I) 140. Therefore device node 140 and everything below device node 140 is returned as a result of the search to hierarchical database engine 130 at a time 8. Hierarchical database engine 130 forwards the result to hierarchical search results module 160 at a time 9.
[0037] As there were two unique identifiers returned from the unstructured search string search, structured search query generator 150 generates another separate search query for the second unique identifier received (ID=2) at a time 10. Thus, the structured search query generator 150 forwards the separate search query SELECT * FROM ID=2 to hierarchical database engine 130. Hierarchical database engine 130 searches the hierarchical database 110 according to this separate search query at a time 11. Thus, the hierarchical database engine searches everything in the tree starting at user node (ID=2) 142. Therefore user node 142 and everything below user node 142 is returned as a result of the search to hierarchical database engine 130 at a time 12. Hierarchical database engine 130 forwards the result to hierarchical search results module 160 at a time 13. [0038] While in one embodiment of the invention hierarchical search results module 160 formats the results of the separate search queries in a tree format, in alternative embodiments of the invention hierarchical search results module hierarchical search results module 160 formats the results of the separate search queries in different formats (e.g., table, list, graph, chart, etc.). Hierarchical search results module 160 may also be configured to allow the formatting of the results of the separate search queries to be user configurable and selectable. That is, a user may select the format in which the results are outputted. Furthermore, hierarchical search results module 160 may convert the results from one format to another format. For example, a user originally selected the results to be formatted in a tree format and later selects the results to be converted into a table, list, graph, chart, etc. [0039] Note that the results of the separate search queries in the above example were across multiple relations of hierarchical database 110. That is, the results of the separate search queries included information from different data domains, in this case a device domain and a user domain. Thus, a single search query (in our example SELECT * FROM %MAC%) searched multiple data domains in the hierarchical database and results from the multiple data domains retain the structure associated with the data (e.g., the tree structure) and were returned from that single search query. Furthermore the searching of the hierarchical database with the results of the unstructured search string search was performed automatically without any user action required. Thus a user is not required to manually form a structured search query for each of the results received from the unstructured search string search
[0040] Thus a single search including a search string may be performed over a large number of data domains in a hierarchical database where the results retain the structure associated with the data. While the example single search query and the example hierarchical database were both rather simple, it should be understood that a typical database may include a large number of data domains. [0041] Although the result of the above single search query does not include partial duplicative results, partial duplicative results are possible depending on the single search query and the virtual documents generated. For example, in Figure 2B, if the unstructured search string search returns the unique identifiers associated with device 140 (id= 1 ) and I_F 146 (id=4), the results of the separate search queries may include partial duplicative results as the data described by the virtual document associated with I_F 146 (id=4) is completely within the data described by the virtual document associated with device 140 (id=l). In other words, the data returned from node I_F 146 (id=4) is nested within the data from the node device 140 (id=l). An example of a single search query that would return partial duplicative results is SELECT * FROM %10 %. As a user may not want such partial duplicative results, in one embodiment of the invention hierarchical search results module determines whether there is partial duplicative data and handles this in one of a number of ways (e.g., keep only the most narrow results (i.e., the furthest nested data), keep only the most broad results (i.e., the data that includes the most information), a combination based on user selection, etc.). In another embodiment of the invention, partial duplicative results are displayed to the user so as to allow the user to fine tune the query or to view a broader result set.
[0042] Although not shown in Figure 2B, in certain embodiments of the invention the values stored in hierarchical database 110 are associated with timestamp values. These timestamp values may identify the historical record of the values. While in one embodiment of the invention a different timestamp is associated with a value each time the value is added or changed, in an alternative embodiment of the invention a different timestamp is associated with a value at certain predefined periods of time (e.g., hourly, daily, weekly, monthly, etc.). These timestamp values may be displayed along with the values according to certain embodiments of the invention. Additionally, in one embodiment of the invention hierarchical search results module 160 determines whether values returned from the search are stale. Values are stale if the values are associated with a timestamp that is excessively old (i.e., the timestamp should have been updated but has not). If the value is stale, it is likely that the value is not current and should not be automatically displayed to the user. Thus, in one embodiment of the invention hierarchical search results module 160 hides stale values (e.g., does not display the stale values) from the user. A user may optionally configure hierarchical search results module 160 to display the hidden stale values.
[0043] Figure 2C is a block diagram illustrating an exemplary results screen of a single search query configured to search a hierarchical database and an inverted index according to one embodiment of the invention. The results screen can be displayed on any web browser or displayed from any stand alone application. Included in the results screen is single search query box 250 which is configured to accept the single search query. The single search query is displayed along with the results to remind the user of what the particular single search query was. To illustrate, the results screen in Figure 2C corresponds to the single search query as described in Figure 2B. Thus, the single search query SELECT * FROM %MAC% is displayed in single search query box 250.
[0044] Included in Figure 2C are results from the search performed as described in Figure 2B. The results are formatted as a tree in Figure 2C, however in alternative embodiments of the invention results may be formatted differently (e.g., as a list, as a table, as a chart, as a graph, etc.). Furthermore, the results screen in Figure 2C is configured to allow a user to format the results in different formats. For example, the user may convert the results from a tree format to a table format by selecting the Table function included in Figure 2C.
[0045] Note that the results shown in Figure 2C of the single search query used in Figure 2B included multiple relations of hierarchical database 110. That is, the results of the single search query included information from different data domains, in this case a device domain and a user domain. Thus, a single search query (in our example SELECT * FROM %MAC%) searched multiple data domains in the hierarchical database and results from the multiple data domains retain the structure associated with the data (e.g., the tree structure) and were returned from that single search query. Furthermore the searching of the hierarchical database with the results of the unstructured search string search is performed automatically without any user action required. Thus a user is not required to manually form a structured search query for each of the results received from the unstructured search string search
[0046] Figure 3 is a data flow diagram illustrating an exemplary system for selectively generating virtual documents from a hierarchical database and indexing those virtual documents into an inverted index according to one embodiment of the invention. As previously described, virtual documents are selectively generated from data stored in hierarchical database 110. At a time 1, document generator 170 receives input that identifies point(s) in hierarchical database 110 to selectively generate the virtual document(s) from. A point in hierarchical database 110 may be the sub-tree root node of any of the sub-trees in hierarchical database 110. A sub- tree begins at a node that is a child of the root node and is not a value. The sub-tree root node is the top node of the sub-tree. The sub-tree includes information starting at the sub-tree root node and traversing through each child node of the sub-tree and ending with at least one value. For example in Figure 3, device 140, user 142, name 144, 1_F 146, name 148, and IP_addr 152 are each sub-tree root nodes and may be identified as a point in hierarchical database 110 where a virtual document is selectively generated from. Note that it is possible for one sub-tree to include another sub-tree. Thus, each virtual document generated represents a sub-tree in hierarchical database 110.
[0047] The input that identifies point(s) in hierarchical database 110 may originate from numerous entities or modules. In one embodiment of the invention the input is received from a user selecting point(s) in hierarchical database 110 by browsing a visual representation of hierarchical database 110 where the user decides the point(s) from which to generate virtual document(s) from. In another embodiment of the invention the input is received from a user using a command line interface to identify the point(s) in hierarchical database 110 to selectively generate the virtual document(s) from. In another embodiment of the invention the input that identifies point(s) in hierarchical database 110 to generate the virtual document(s) from is received automatically as a result of an algorithm. For example, an algorithm may select as a point to generate virtual documents from each node in the hierarchical database that includes at least one child node. As another example, an algorithm may select as a point to generate virtual documents from every node in the hierarchical database. As yet another example, an algorithm may select as a point to generate virtual documents from all nodes of a certain data domain (e.g., all nodes of the type DEVICE). Thus, it should be understood that the input that identifies point(s) in hierarchical database 110 to selectively generate virtual document(s) from may originate from various sources and/or combination of sources.
[0048] In Figure 3, input has been received that identifies three points in hierarchical database 110 to generate virtual document(s) from: sub-tree root node device 140, sub-tree root node I_F 146, and sub-tree root node user 142. The virtual documents to be generated are illustrated by dashed lines within hierarchical database 110. As can be seen in Figure 3, the virtual document corresponding to sub-tree root node device 140 includes the information in the virtual document corresponding to sub-tree root node I_F 146.
[0049] Once the point(s) are identified, document generator 170 sends appropriate query/queries to hierarchical database engine 130 to obtain the data required for the virtual document(s) at a time 2. An example query syntax may take the form of SELECT * FROM "point". As previously described, each point identified represents a sub-tree in hierarchical database 110. Hierarchical database engine 130 queries hierarchical database 110 according to the received queries and receives the sub-tree results of those queries, including the sub-tree root node identifier at a time 3. At a time 4, hierarchical database engine 130 returns the sub-tree results of the query/queries to document generator 170.
[0050] Document generator 170 forms a virtual document for each of the sub-tree results of the queries and sends these virtual documents to inverted index engine 135 at a time 5. Inverted index engine 135 indexes each virtual document into inverted index 120 and causes the storage of the indexed virtual documents with the sub-tree root node identifiers at a time 6. As shown in Figure 3, three virtual documents were created as a result of the input received at document generator 170 and each of the virtual documents represents a sub-tree (and are associated with the sub-tree root node) of hierarchical database 110.
[0051] Figure 4 is a data flow diagram illustrating an exemplary system for generating virtual document(s) from a hierarchical database and indexing those virtual documents into an inverted index upon receipt of data according to one embodiment of the invention. Originally, the data stored in hierarchical database 110 is the same as described in Figure 3. At a time 1, data receiving module 190 receives data User /Name = ' Smith ' . While in one embodiment of the invention the received data originates from a user manually requesting data to be added to hierarchical database, in alternative embodiments of the invention the received data originates from a user or module (e.g., a crawler traversing information) and it is unclear whether the data is already included in hierarchical database 110. Regardless from where the received data originated from, data receiving module 190 does not know whether the data received is already included in hierarchical database 110. As a result, at a time 2, data receiving module sends a query to hierarchical database engine 130 that is configured to add new data to hierarchical database 110, update existing data in hierarchical database 110, or take no action. For example, the query MERGE INTO / VALUES { user [ id=2 ] => { name => ' Smith ' } } is configured such that if a user node with an id=2 exists in the tree the value associated with the leaf node name is updated with the value 'Smith'. If a user node with an id=2 does not exist, then it is created along with the leaf node name and the value 'Smith'. According to one embodiment of the invention, if the information in the query is already included in the hierarchical database (e.g., the path, nodes, and values currently exist) the query will neither update nor add data to the hierarchical database.
[0052] Thus, in our example, at a time 3, hierarchical database engine causes the stored data User /Name = ' Mac Dennis ' to be updated to User /Name = ' Smith ' . At a time 4, document generator 170 receives input that identifies point(s) in hierarchical database 110 to selectively generate the virtual document(s) from. The input that is received to identify point(s) is described with reference to Figure 3. Once the point(s) are identified, document generator 170 sends appropriate query/queries to hierarchical database engine 130 to obtain the data required for the virtual document(s) at a time 5. An example query syntax may take the form of SELECT * FROM "point".
[0053] Hierarchical database engine 130 queries hierarchical database 110 according to the received queries and receives the sub-tree results of those queries, including the sub-tree root node identifier at a time 6. At a time 7, hierarchical database engine 130 returns the sub-tree results of the query/queries to document generator 170. Document generator 170 forms a virtual document for each of the sub-tree results of the queries and sends these virtual documents to inverted index engine 135 at a time 8.
[0054] Inverted index engine 135 indexes each virtual document into inverted index 120 and causes the storage of the indexed virtual documents with the sub-tree root node identifiers at a time 9. While in one embodiment of the invention inverted index engine 135 replaces each virtual document stored in inverted index 120 with the corresponding virtual documents it has received from document generator 170, in alternative embodiments of the invention inverted index engine 135 replaces virtual documents stored in inverted index 120 only if the virtual document received from document generator 170 is different from the corresponding virtual document stored in the inverted index. For example, if inverted index 120 included the virtual documents as described in Figure 3, and document generator sends inverted index engine 135 three virtual documents (corresponding to the dashed lines in hierarchical database 110 in Figure 4), in one embodiment of the invention inverted index engine 135 causes the replacement of each of the originally stored virtual documents in inverted index 120 with the newly received virtual documents. In an alternative embodiment, inverted index engine 135 causes the replacement of only the virtual document associated with node identifier two as this is the only virtual document that has been modified.
[0055] Figure 5 is a block diagram illustrating an exemplary hierarchical structure of hierarchical database 110 according to one embodiment of the invention. In Figure 5, the data of hierarchical database 110 is organized into a tree structure. Each data element (i.e., not a value) on the tree is a node of the tree. Each node on the tree has a corresponding unique identifier (e.g., a node identifier). For example, network node 502 has a unique identifier of two. At the top of the tree structure is the root node of the tree 500, which is represented by the symbol / . Directly below the root node exists two child nodes, network 502 and directory 572. A child node is a node, not a value, that itself descends from a node (e.g., a parent node or root node). Each parent node can have many child nodes, but each child node only has one parent. A child node may also be a parent node. For example, network 502 and directory 572 each are parent nodes in addition to being child nodes because they include one or more child nodes.
[0056] In addition, network 502 and directory 572 are each root nodes of a subtree. A sub-tree is a subset of the tree. A sub-tree includes information starting at the sub-tree root node and traversing through each child node of the sub-tree root node and ending with at least one value. Any node on the tree that itself has nodes below it (e.g., a parent node) can be referred to as a sub-tree root node. Thus, each sub-tree may include other sub-trees (i.e., the sub-trees may be nested within a subtree). There are many sub-trees in Figure 5. For example as previously described, a sub-tree where network 502 is the sub-tree root node includes all the information, including values, from the nodes device 504, device 506, and device 508. As an example of a nested sub-tree, device 504 is a sub-tree root node for the sub-tree that includes all the information, including values, from the nodes manufacturer 510, interface 512, and interface 514. As another example of a nested sub-tree, the directory 572 includes the nodes users 574 and users 576, which include the nodes name 578 and names 580, respectively.
[0057] Values are associated with leaf nodes. For example, the node manufacturer 510 is a leaf node because it is associated with the value 510 'Dell Corporation' . While in one embodiment of the invention values are only associated with leaf nodes, in alternative embodiments of the invention any node in the hierarchy can have values associated with that node.
[0058] It should be understood that the data stored in hierarchical database 110 as shown in Figure 5 is exemplary as many other types of data may be stored. As one example, the data includes technical data that IT professionals may find useful when fulfilling their duties. For example in one embodiment of the invention the data stored in hierarchical database includes information regarding substantially all devices within a LAN, a list of software installed on those devices, and a list of users authorized to use those devices. Additionally, the data stored may include information regarding the operating system version installed on substantially all devices within the LAN, the software which is running on substantially all devices within the LAN, and a configuration file from at least one router, switch, or firewall within the LAN. The devices may include substantially all workstations within a LAN, substantially all routers within the LAN, substantially all switches within the LAN, substantially all servers within the LAN, substantially all firewalls within the LAN, and substantially all directory servers within the LAN. [0059] In another embodiment of the invention, the data stored in hierarchical database includes information regarding the existence of devices within one or more LANs (e.g., devices including one or more routers, one or more switches, one or more servers, one or more directory servers, and one or more workstations), existence of a plurality of hardware modules within each of the devices, states of the hardware modules, properties of the hardware modules, history of the hardware modules, existence of a peripheral coupled with at least one of the devices, states of the peripheral, properties of the peripheral, configuration of the peripheral, history of the peripheral, existence of at least one operating system operating within each of the devices, state of the operating systems, properties of the operating systems, configuration of the operating systems, history of the operating systems, existence of software within each of the devices, state of the software, properties of the software, configuration of the software, history of the software, and presence of users using each of the devices, an inventory of users that are authorized to use each of the devices, policies assigned to the users for each of the devices, and history of each users' actions regarding each of the devices.
[0060] While in one embodiment the database stored in the tree belongs to a single organization, in alternative embodiments of the invention each node existing directly below the tree root node (i.e., the child nodes directly below the root node) represents a private sub-tree where values and subsequent child nodes are private to an organization. Thus, while not illustrated in Figure 5, multiple organizations may share the same tree data structure yet each organization can access only their data. [0061] More complex single search queries will now be described with reference to Figure 5. In Figure 5, the virtual documents that have been generated and stored in the inverted index are represented by dashed lines. Similarly to SQL, in one embodiment of the invention a WHERE clause is used to specify the selection. In other words, the WHERE clause restricts or filters the data returned. For example, if one wants to find information regarding interfaces on Dell devices where the interface status is 'up', the following single search query may be used:
SELECT interface FROM %dell% WHERE interface/status='up'
The unstructured search string 'dell' is extracted from the query and the inverted index is searched according to 'dell'. The search finds two virtual documents that include the unstructured search string 'dell' (virtual document associated with device node 504 (node identifier of 3) and the virtual document associated with device node 508 (node identifier of 5)). Using the node identifiers associated with the virtual documents that matched the unstructured search string, two structured search queries are generated, SELECT interface FROM id 3 WHERE interface/ status = ' up ' , and SELECT interface FROM id 5 WHERE interface/ status = y up ' . These two queries produce the below result which includes nodes interface 512, MAC_address 516, name 518, and status 520; interface 536, name 538 and status 540 respectively:
interface : mac_address: '00:01:02:03:04:05' name: yeth0' status: yup' interface : name : ' ethl ' status: yup'
Thus the single search query will display everything about all interfaces in a Dell device that have a status of 'up'.
[0062] Additionally, the WHERE clause may include paths. For example, if one wants to find information regarding devices that include the string 'Dell' where the interface status is 'up', the following single search query may be used:
SELECT * FROM %dell%
WHERE interface/status = yup'
The unstructured search string 'dell' is extracted from the query and the inverted index is searched according to 'dell'. The search finds two virtual documents that include the unstructured search string 'dell' (virtual document associated with device node 504 (node identifier of 3) and the virtual document associated with device node 508 (node identifier of 5)). Using the node identifiers associated with the virtual documents that matched the unstructured search string, two structured search queries are generated, SELECT * FROM id 3 WHERE interface/ status = ' up ' , and SELECT * FROM id 5 WHERE interface/ status = y up ' . These two queries produce the below result which includes nodes device 504, manufacturer 510, interface 512, MAC_address 516, name 518, status 520, interface 514, MAC_address 522, name 524, and status 526; device 508, interface 530, name 532, status 534, interface 536, name 538 and status 540 respectively:
device : manufacturer: 'Dell Corporation' interface : mac_address: '00:01:02:03:04:05' name: yeth0' status: yup' interface : mac_address: ' 00 : Al : A2 : A3 : A4 : A5 ' name: yethl' status: 'down' device : manufacturer: 'Dell Corporation' interface : name: yeth0' status: 'down' interface : name: yethl' status: yup'
Note that information regarding both interfaces on both Dell devices were displayed. The reason is that the SELECT clause asked for information about Dell devices where an interface status was 'up'. That is, if a device has multiple interfaces, the query returns information regarding all interfaces if at least one interface has a status of 'up'.
[0063] Additionally, the WHERE clause may include more than one path. For example, if a user would like to find information about all devices made by Dell that have an interface named 'ethO' and the status of that interface is 'up', the user may enter in the following single search query:
SELECT * FROM %dell% WHERE interface/name = yeth0' and interface/status = yup'
The unstructured search string 'dell' is extracted from the query and the inverted index is searched according to 'dell'. The search finds two virtual documents that include the unstructured search string 'dell' (virtual document associated with device node 504 (node identifier of 3) and the virtual document associated with device node 508 (node identifier of 5)). Using the node identifiers associated with the virtual documents that matched the unstructured search string, two structured search queries are generated, SELECT * FROM id 3 WHERE interface/name = y eth0 ' and interface/ status = ' up ' , and SELECT * FROM id 5 WHERE interface/name = y eth0 ' and interface/ status = y up ' . These two queries produce the following result:
device : manufacturer: 'Dell Corporation' interface : mac_address: '00:01:02:03:04:05' name: yeth0' status: yup' interface : mac_address: ' 00 : Al : A2 : A3 : A4 : A5 ' name: yethl' status: 'down' device : manufacturer: Dell Corporation interface : name: yeth0' status: 'down' interface : name: yethl' status: yup'
Note that this query generates results for device 504 (ID=3) and device 508 (ID=5) and not results only with Dell devices that have an interface named 'ethO' that is 'up' . This is because the query is asking for Dell devices that have an interface named 'ethO' and have an interface that has a status of 'up'. The query does not specify that the interface named 'ethO' be 'up'. In other words, the two paths in the WHERE clause are not correlated. [0064] To correlate the paths the following single search query may be used:
SELECT * FROM %dell% WHERE interface/ (name = yeth0' and status = yup').
This syntax correlates the two paths in the WHERE clause. Thus, this query outputs all information about Dell devices that have an interface named ethO that is up. Thus, this query produces the following output:
device : manufacturer: 'Dell Corporation' interface : mac_address: '00:01:02:03:04:05' name: yeth0' status: yup' interface : mac_address: '00 : Al : A2 : A3 : A4 : A5 ' name: yethl' status: 'down'
[0065] In addition, a single search query may include paths in the SELECT clause. For example, if a user would like to find the interface name, and interface MAC address for all dell devices the following single search query may be used:
SELECT interface/mac_address, interface/name FROM %dell%
The unstructured search string 'dell' is extracted from the query and the inverted index is searched according to 'dell'. The search finds two virtual documents that include the unstructured search string 'dell' (virtual document associated with device node 504 (node identifier of 3) and the virtual document associated with device node 508 (node identifier of 5)). Using the node identifiers associated with the virtual documents that matched the unstructured search string, two structured search queries are generated, SELECT interf ace/mac_address , interface/name FROM id 3, and SELECT interf ace/mac_address , interface/name FROM id 5. These two queries produces the following output:
row: mac_address: '00:01:02:03:04:05' mac_address: ' 00 : Al : A2 : A3 : A4 : A5 ' name: yeth0' name: yethl' row: mac_address: null mac_address: null name: yeth0' name : ' eth l '
[0066] Note that the second separate structured query does not contain any data regarding the interface MAC Address. While in one embodiment of the invention a null value is returned if a node returned in a query result does not include a value, in alternative embodiments of the invention different results may be returned (e.g., error messages, the data is skipped, etc.). Note that the results are not correlated for each interface. In other words, the output does not reflect the relationship between the MAC address and the interface name as the paths are not correlated. [0067] To correlate the paths the following syntax may be used:
SELECT interface/ (mac_address, name) FROM %dell%
The unstructured search string 'dell' is extracted from the query and the inverted index is searched according to 'dell'. The search finds two virtual documents that include the unstructured search string 'dell' (virtual document associated with device node 504 (node identifier of 3) and the virtual document associated with device node 508 (node identifier of 5)). Using the node identifiers associated with the virtual documents that matched the unstructured search string, two structured search queries are generated, SELECT interface/ (mac_address , name ) FROM id 3, and SELECT interface/ (mac_address , name ) FROM id 5. These two queries produce the following output:
row : interface : mac_address: '00:01:02:03:04:05' name: yeth0' interface : mac_address: ' 00 : Al : A2 : A3 : A4 : A5 : name : ' ethl ' row : interface : mac_address: null name: yeth0' interface : mac_address: null name : ' ethl '
Note that this output reflects the relationship between the MAC address and the interface name. This is because the single search query included correlation in the paths.
[0068] Figure 6 is an exemplary search screen graphical user interface configured to allow a user to generate a single search query to search a hierarchical database and an inverted index by selecting item(s) returned as a result from a previous unstructured search according to one embodiment of the invention. Search screen 600 includes search box 610 and structured search generator box 620. While in one embodiment of the invention search box 610 accepts only unstructured search queries, in alternative embodiments of the invention search box 610 accepts structured queries and queries with an unstructured search string within a structured search query.
[0069] In the example of Figure 6, a user has searched the inverted index for the search string 'Dell'. In this example, the database described in Figure 5 will be used. Thus, the results outputted to search screen 600 correspond to the virtual documents defined in Figure 5 that include the string 'Dell'. As can be seen, two results have been returned. Once a user has searched the inverted index with an unstructured search string, the user may construct a structured search by selecting certain items in the result. The user may select items by any known methods (e.g., using a cursor to select, using a mouse to select, using a touch screen to select, etc.). As an example of a selection, in Figure 6 a user has selected two paths in which to generate a structured search query from (/device/interface/name, and /device/interface/status) .
[0070] Figure 7 is an exemplary results screen in response to a user generating a single search query to search a hierarchical database and an inverted index from selecting item(s) returned as a result from a previous unstructured search according to one embodiment of the invention. The items selected by a user in Figure 6 have been converted into a single search query to search the hierarchical database and an inverted index.
[0071] The single search query has produced four results as can be seen in Figure 7. While in one embodiment the results are formatted as a tree, in alternative embodiments of the invention results may be formatted differently (e.g., as a list, as a table, as a chart, as a graph, etc.). Furthermore, the results screen in Figure 7 is configured to allow a user to format the results in different formats. For example, a user may convert the results from a tree format to a table format by selecting the Table function included in Figure 7.
[0072] While the invention has been described in terms of several embodiments, those skilled in the art will recognize that the invention is not limited to the embodiments described, can be practiced with modification and alteration within the spirit and scope of the appended claims. The description is thus to be regarded as illustrative instead of limiting.

Claims

CLAIMSWhat is claimed is:
1. A method of searching a hierarchical database and an inverted index, comprising: receiving a single search query that has syntax identifying an unstructured search string within a structured search query to automatically cause a search of the inverted index and use of the result to automatically search the hierarchical database; extracting the unstructured search string from the single search query; searching the inverted index according to the unstructured search string, wherein the inverted index includes virtual documents created from data stored in the hierarchical database, wherein each virtual document includes a unique identifier from the hierarchical database used to designate the data in the hierarchical database from which that virtual document was created, wherein a result of the inverted index search includes the unique identifiers of the virtual documents that meet the search; and for each of the unique identifiers in the result, generating a separate search query from the single search query by replacing the unstructured search string in the structured search query with that unique identifier, and searching the hierarchical database according to the separate search query
2. The method of claim 1, wherein at least one unique identifier in the result corresponds to a virtual document that is associated to a first type of data in the hierarchical database, wherein the first type of data belongs to a first data domain, and wherein at least one other unique identifier in the result corresponds to a different virtual document that is associated to a second type of data in the hierarchical database, wherein the second type of data belongs to a different second data domain.
3. The method of claim 1, wherein the hierarchical database includes one or more sub-trees branching from a tree root node, wherein each sub-tree includes one or more nodes starting at a sub-tree root node, and wherein each node has a unique identifier, and wherein creating the virtual documents includes selectively generating the virtual documents from the one or more sub-trees, wherein each of the virtual documents corresponds to one of the one or more sub-trees and includes all nodes of that sub-tree.
4. The method of claim 1, wherein the hierarchical database has a tree structure and the unique identifiers in the result correspond to identifiers of nodes of the tree.
5. The method of claim 1, wherein the hierarchical database includes collected information from across disparate information sources stored in a plurality of devices of a single LAN, wherein the collected information is organized by items of interest, and wherein the hierarchical database is not organized by documents located on the plurality of devices of the LAN.
6. The method of claim 1 wherein the single search query is formed in response to a selection of at least one item returned as a result from a previous unstructured search, wherein the structured search query includes the item selected, and the unstructured search string within the structured search query corresponds to the previous unstructured search string query.
7. The method of claim 1 wherein the hierarchical database having stored therein information regarding, substantially all devices within a LAN, a list of software installed on those devices, and a list of users authorized to use those devices.
8. The method of claim 1 further comprising: receiving data to be merged in the hierarchical database; and merging the received data with the data stored in the hierarchical database upon receiving a single merge query that has syntax for, inserting the received data into the hierarchical database, and updating the data in the hierarchical database with the received data, wherein the received data is updated upon determining that the received data changes values of the data already stored in the hierarchical database.
9. The method of claim 3, wherein each node existing directly below the tree root node represents a private sub-tree, wherein values and node information in the private sub-tree are private to an organization.
10. The method of claim 3 wherein the syntax for the single search query includes a SELECT clause and a FROM clause, wherein the SELECT clause includes syntax to identify a path in the hierarchical database starting at the tree root node, and wherein the FROM clause includes the unstructured search string.
11. The method of claim 3, wherein each of the unique identifiers corresponds to one of the one or more sub-tree root nodes, and wherein each virtual document includes path information starting at one of the one or more sub-tree root nodes and ending with a value.
12. The method of claim 11 wherein a result of each of the separate search queries is formatted in a hierarchical manner starting from the sub-tree root node corresponding to the unique identifier.
13. A search database system, comprising: a hierarchical database to store a set of data in a hierarchical manner, wherein each of a plurality of points in the hierarchy has a unique identifier; a hierarchical database engine coupled with the hierarchical database, the hierarchical database engine to search the set of data stored in the hierarchical database; a document generator coupled with the hierarchical database engine, the document generator to create a different document from the data stored under each of the plurality of points in the hierarchical database; an inverted index; an inverted index engine coupled with the document generator, the inverted index engine to, index each document and the associated unique identifiers in the inverted index, and search the inverted index; a search server user interface, coupled with the hierarchical database engine and the inverted index engine, the search server user interface to receive a single search query that has syntax identifying an unstructured search string within a structured search query to automatically cause a search of the inverted index and use of the result to automatically search the hierarchical database, the search server user interface including, a parser to extract the unstructured search string from the single search query and forward the extracted unstructured search string to the inverted index engine to cause a search of the inverted index; a structured query generator to receive a result of the inverted index search that includes the one or more unique identifiers of the documents that meet the search, and for each of the unique identifiers in the result, to generate a separate search query from the single search query by replacing the unstructured search string in the structured search query with that unique identifier, and forward the separate search query to the hierarchical database engine to cause a search of the hierarchical database according to the separate search query.
14. The search database system of claim 13, wherein the result of the inverted index search includes a plurality of unique identifiers, where at least one unique identifier in the result corresponds to a virtual document that is associated to a first type of data in the hierarchical database, wherein the first type of data belongs to a first data domain, and where at least one other unique identifier in the result corresponds to a different virtual document that is associated to a second type of data in the hierarchical database, wherein the second type of data belongs to a different second data domain.
15. The search database system of claim 13, wherein the hierarchical database has a tree structure and the unique identifiers in the result correspond to identifiers of nodes of the tree.
16. The search database system of claim 13, wherein the document generator creating the plurality of documents includes selectively generating documents from the one or more sub-trees, wherein each document includes path information starting at the root node of the sub-tree and ending at a value of the sub-tree, and wherein each document is associated with the unique identifier of the root node of the sub-tree.
17. The search database system of claim 13, wherein the data stored in the hierarchical database includes collected information from across disparate information sources stored in a plurality of devices of a single LAN, wherein the collected information is organized by items of interest, and wherein the hierarchical database is not organized by documents located on the plurality of devices of the LAN.
18. The search database system of claim 13, wherein the single search query is formed in response to a selection of at least one item returned as a result from a previous unstructured search, wherein the structured search query includes the item selected, and the unstructured search string within the structured search query corresponds to the previous unstructured search string query.
19. The search database system of claim 13 wherein the set of data stored in the hierarchical manner includes information regarding, substantially all devices within a LAN, a list of software installed on those devices, and a list of users authorized to use those devices.
20. The search database system of claim 13, wherein the hierarchical database includes one or more sub-trees branching from a tree root node, wherein each subtree includes one or more nodes starting at a sub-tree root node and includes at least one value, and wherein the unique identifier associated with each of the plurality of documents corresponds to the sub-tree root node.
21. The search database system of 20, wherein each node existing directly below the tree root node represents a private sub-tree, wherein values and node information in the private sub-tree are private to an organization.
22. The search database system of claim 13, further comprising: a data receiving module coupled with the hierarchical database engine, the data receiving module to, receive data that is to be merged into the hierarchical database, and forward the received data to the hierarchical database engine to allow the received data to be merged into the hierarchical database.
23. The search database system of claim 21 wherein the syntax for the single search query includes a SELECT clause and a FROM clause, wherein the SELECT clause includes syntax to identify a path in the hierarchical database starting at the tree root node, and wherein the FROM clause includes the unstructured search string.
24. The search database system of claim 21, further comprising: a hierarchical search results module to receive a result for each of the separate search queries and to format the results in a hierarchical manner starting from the sub-tree root node corresponding to the unique identifier.
25. The search database system of claim 22, wherein the data receiving module is to merge data upon receipt of a single merge query that has syntax configured to, insert the received data into the hierarchical database, and update the data in the hierarchical database with the received data, wherein the received data is updated upon determining that the received data changes values of the data stored in the hierarchical database.
PCT/US2008/081220 2007-11-02 2008-10-24 Method and apparatus for searching a hierarchical database and an unstructured database with a single search query WO2009058696A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP20080845213 EP2220577A4 (en) 2007-11-02 2008-10-24 Method and apparatus for searching a hierarchical database and an unstructured database with a single search query

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/982,699 US8046353B2 (en) 2007-11-02 2007-11-02 Method and apparatus for searching a hierarchical database and an unstructured database with a single search query
US11/982,699 2007-11-02

Publications (1)

Publication Number Publication Date
WO2009058696A1 true WO2009058696A1 (en) 2009-05-07

Family

ID=40589210

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2008/081220 WO2009058696A1 (en) 2007-11-02 2008-10-24 Method and apparatus for searching a hierarchical database and an unstructured database with a single search query

Country Status (3)

Country Link
US (2) US8046353B2 (en)
EP (1) EP2220577A4 (en)
WO (1) WO2009058696A1 (en)

Families Citing this family (42)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3493074A1 (en) 2006-10-05 2019-06-05 Splunk Inc. Time series search engine
US20090187552A1 (en) * 2008-01-17 2009-07-23 International Business Machine Corporation System and Methods for Generating Data Analysis Queries from Modeling Constructs
TW201013431A (en) * 2008-09-17 2010-04-01 Mitac Int Corp Local search method, local search system, program product, portable miniature electronic device, and input interface
US8271557B1 (en) * 2009-04-20 2012-09-18 Xilinx, Inc. Configuration of a large-scale reconfigurable computing arrangement using a virtual file system interface
US20110051174A1 (en) * 2009-08-25 2011-03-03 Tomoki Hattori Method of querying image output devices on a network
US20110137886A1 (en) * 2009-12-08 2011-06-09 Microsoft Corporation Data-Centric Search Engine Architecture
US9218390B2 (en) 2011-07-29 2015-12-22 Yellowpages.Com Llc Query parser derivation computing device and method for making a query parser for parsing unstructured search queries
US10872082B1 (en) * 2011-10-24 2020-12-22 NetBase Solutions, Inc. Methods and apparatuses for clustered storage of information
US9198010B2 (en) 2012-05-08 2015-11-24 24/7 Customer, Inc. Data assistance application for mobile devices
US8516008B1 (en) 2012-05-18 2013-08-20 Splunk Inc. Flexible schema column store
US9626445B2 (en) * 2015-06-12 2017-04-18 Bublup, Inc. Search results modulator
US20150261837A1 (en) * 2012-08-29 2015-09-17 Vinay Avasthi Querying Structured And Unstructured Databases
US9230011B1 (en) 2012-11-30 2016-01-05 Amazon Technologies, Inc. Index-based querying of archived data sets
GB2517122A (en) * 2013-04-30 2015-02-18 Giovanni Tummarello Method and system for navigating complex data sets
US10614132B2 (en) 2013-04-30 2020-04-07 Splunk Inc. GUI-triggered processing of performance data and log data from an information technology environment
US10353957B2 (en) 2013-04-30 2019-07-16 Splunk Inc. Processing of performance data and raw log data from an information technology environment
US10997191B2 (en) * 2013-04-30 2021-05-04 Splunk Inc. Query-triggered processing of performance data and log data from an information technology environment
US10019496B2 (en) 2013-04-30 2018-07-10 Splunk Inc. Processing of performance data and log data from an information technology environment by using diverse data stores
US10346357B2 (en) 2013-04-30 2019-07-09 Splunk Inc. Processing of performance data and structure data from an information technology environment
US10318541B2 (en) 2013-04-30 2019-06-11 Splunk Inc. Correlating log data with performance measurements having a specified relationship to a threshold value
US10225136B2 (en) 2013-04-30 2019-03-05 Splunk Inc. Processing of log data and performance data obtained via an application programming interface (API)
US8751486B1 (en) 2013-07-31 2014-06-10 Splunk Inc. Executing structured queries on unstructured data
US20150095842A1 (en) 2013-09-30 2015-04-02 Microsoft Corporation Extendable blade sequence along pannable canvas direction
US11921715B2 (en) 2014-01-27 2024-03-05 Microstrategy Incorporated Search integration
US10095759B1 (en) 2014-01-27 2018-10-09 Microstrategy Incorporated Data engine integration and data refinement
US11386085B2 (en) 2014-01-27 2022-07-12 Microstrategy Incorporated Deriving metrics from queries
US10255320B1 (en) 2014-01-27 2019-04-09 Microstrategy Incorporated Search integration
US9952894B1 (en) 2014-01-27 2018-04-24 Microstrategy Incorporated Parallel query processing
US11379530B2 (en) * 2017-01-31 2022-07-05 Splunk Inc. Leveraging references values in inverted indexes to retrieve associated event records comprising raw machine data
US10474674B2 (en) * 2017-01-31 2019-11-12 Splunk Inc. Using an inverted index in a pipelined search query to determine a set of event data that is further limited by filtering and/or processing of subsequent query pipestages
US10846318B1 (en) 2017-04-18 2020-11-24 Microstrategy Incorporated Natural language visualizations
US10216823B2 (en) 2017-05-31 2019-02-26 HarperDB, Inc. Systems, methods, and apparatus for hierarchical database
AU2018427622B2 (en) * 2018-06-13 2021-12-02 Fujitsu Limited Acquiring method, generating method acquiring program, generating program, and information processing apparatus
US11500909B1 (en) * 2018-06-28 2022-11-15 Coupa Software Incorporated Non-structured data oriented communication with a database
US11195050B2 (en) 2019-02-05 2021-12-07 Microstrategy Incorporated Machine learning to generate and evaluate visualizations
CN110046236B (en) * 2019-03-20 2022-12-20 腾讯科技(深圳)有限公司 Unstructured data retrieval method and device
CN110471916B (en) * 2019-07-03 2023-05-26 平安科技(深圳)有限公司 Database query method, device, server and medium
US11614970B2 (en) 2019-12-06 2023-03-28 Microstrategy Incorporated High-throughput parallel data transmission
US11567965B2 (en) 2020-01-23 2023-01-31 Microstrategy Incorporated Enhanced preparation and integration of data sets
US11263195B2 (en) * 2020-05-11 2022-03-01 Servicenow, Inc. Text-based search of tree-structured tables
CN117355827A (en) * 2022-03-16 2024-01-05 库尔马甘贝托夫·阿努阿尔·莱哈诺维奇 Method for organizing document search in unstructured database of application program
TWI840784B (en) * 2022-04-07 2024-05-01 創鑫智慧股份有限公司 Generation method and index condensation method of embedding table

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5845278A (en) * 1997-09-12 1998-12-01 Inioseek Corporation Method for automatically selecting collections to search in full text searches
US20030033275A1 (en) * 2001-08-13 2003-02-13 Alpha Shamim A. Combined database index of unstructured and structured columns
US20070156677A1 (en) * 1999-07-21 2007-07-05 Alberti Anemometer Llc Database access system

Family Cites Families (72)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5918225A (en) * 1993-04-16 1999-06-29 Sybase, Inc. SQL-based database system with improved indexing methodology
US5913214A (en) * 1996-05-30 1999-06-15 Massachusetts Inst Technology Data extraction from world wide web pages
US5974407A (en) * 1997-09-29 1999-10-26 Sacks; Jerome E. Method and apparatus for implementing a hierarchical database management system (HDBMS) using a relational database management system (RDBMS) as the implementing apparatus
US6078917A (en) * 1997-12-18 2000-06-20 International Business Machines Corporation System for searching internet using automatic relevance feedback
US6085188A (en) * 1998-03-30 2000-07-04 International Business Machines Corporation Method of hierarchical LDAP searching with relational tables
US6119126A (en) * 1998-05-29 2000-09-12 Oracle Corporation Object-relational query builder which determines existence of structures from information loaded from the server and cached locally on the client computing system
US6397221B1 (en) * 1998-09-12 2002-05-28 International Business Machines Corp. Method for creating and maintaining a frame-based hierarchically organized databases with tabularly organized data
US6804663B1 (en) * 1998-09-21 2004-10-12 Microsoft Corporation Methods for optimizing the installation of a software product onto a target computer system
JP3698242B2 (en) 1999-08-20 2005-09-21 日本電気株式会社 Information set importance determination system and method, and recording medium recording information set importance determination program
US6598058B2 (en) * 1999-09-22 2003-07-22 International Business Machines Corporation Method and apparatus for cross-node sharing of cached dynamic SQL in a multiple relational database management system environment
US6438539B1 (en) 2000-02-25 2002-08-20 Agents-4All.Com, Inc. Method for retrieving data from an information network through linking search criteria to search strategy
US20060173873A1 (en) * 2000-03-03 2006-08-03 Michel Prompt System and method for providing access to databases via directories and other hierarchical structures and interfaces
US6611835B1 (en) 2000-05-04 2003-08-26 International Business Machines Corporation System and method for maintaining up-to-date link information in the metadata repository of a search engine
US6581072B1 (en) 2000-05-18 2003-06-17 Rakesh Mathur Techniques for identifying and accessing information of interest to a user in a network environment without compromising the user's privacy
US6880086B2 (en) * 2000-05-20 2005-04-12 Ciena Corporation Signatures for facilitating hot upgrades of modular software components
US6463430B1 (en) 2000-07-10 2002-10-08 Mohomine, Inc. Devices and methods for generating and managing a database
US7630959B2 (en) * 2000-09-06 2009-12-08 Imagitas, Inc. System and method for processing database queries
US7925967B2 (en) 2000-11-21 2011-04-12 Aol Inc. Metadata quality improvement
US6804677B2 (en) * 2001-02-26 2004-10-12 Ori Software Development Ltd. Encoding semi-structured data for efficient search and browsing
JP3842573B2 (en) * 2001-03-30 2006-11-08 株式会社東芝 Structured document search method, structured document management apparatus and program
US6882996B2 (en) * 2001-05-31 2005-04-19 International Business Machines Corporation System, method, and computer program product for reformatting non-XML data for use with internet based systems
US6799184B2 (en) * 2001-06-21 2004-09-28 Sybase, Inc. Relational database system providing XML query support
CA2451208A1 (en) * 2001-06-21 2003-01-03 Paul P. Vagnozzi Database indexing method and apparatus
US7398201B2 (en) * 2001-08-14 2008-07-08 Evri Inc. Method and system for enhanced data searching
US6931408B2 (en) * 2001-08-17 2005-08-16 E.C. Outlook, Inc. Method of storing, maintaining and distributing computer intelligible electronic data
US6901410B2 (en) * 2001-09-10 2005-05-31 Marron Pedro Jose LDAP-based distributed cache technology for XML
US6801904B2 (en) * 2001-10-19 2004-10-05 Microsoft Corporation System for keyword based searching over relational databases
WO2004038528A2 (en) 2002-10-23 2004-05-06 Medialingua Group Method of digital certificate (dc) composition, issuance and management providing multitier dc distribution model and multiple accounts access based on the use of dc and public key infrastructure (pki)
US7308454B2 (en) * 2001-11-09 2007-12-11 British Telecommunications Public Limited Company Data integration
US7552135B2 (en) * 2001-11-15 2009-06-23 Siebel Systems, Inc. SQL adapter business service
JP2003281191A (en) 2002-03-20 2003-10-03 Fujitsu Ltd Retrieval server and retrieval result providing method
US6965903B1 (en) * 2002-05-07 2005-11-15 Oracle International Corporation Techniques for managing hierarchical data with link attributes in a relational database
EP1532542A1 (en) * 2002-05-14 2005-05-25 Verity, Inc. Apparatus and method for region sensitive dynamically configurable document relevance ranking
WO2003107321A1 (en) * 2002-06-12 2003-12-24 Jena Jordahl Data storage, retrieval, manipulation and display tools enabling multiple hierarchical points of view
SE523930C3 (en) * 2002-09-03 2004-08-18 Sixsteps Ab Ideon Science Park Computer software product with associated methods for searching a database of objects, connecting objects in such a database and exporting data from at least one arbitrary database
GB2394800A (en) * 2002-10-30 2004-05-05 Hewlett Packard Co Storing hierarchical documents in a relational database
US7447686B2 (en) * 2002-11-22 2008-11-04 Sas Institute Inc. Computer-implemented system and method for handling database statements
US7146356B2 (en) * 2003-03-21 2006-12-05 International Business Machines Corporation Real-time aggregation of unstructured data into structured data for SQL processing by a relational database engine
US20040230571A1 (en) * 2003-04-22 2004-11-18 Gavin Robertson Index and query processor for data and information retrieval, integration and sharing from multiple disparate data sources
US7013311B2 (en) * 2003-09-05 2006-03-14 International Business Machines Corporation Providing XML cursor support on an XML repository built on top of a relational database system
US7478100B2 (en) * 2003-09-05 2009-01-13 Oracle International Corporation Method and mechanism for efficient storage and query of XML documents based on paths
US20050076003A1 (en) * 2003-10-06 2005-04-07 Dubose Paul A. Method and apparatus for delivering personalized search results
US7956742B2 (en) * 2003-10-30 2011-06-07 Motedata Inc. Method and system for storing, retrieving, and managing data for tags
EP1544749B1 (en) * 2003-12-16 2018-11-14 Software AG Method for searching a database and database
US7412444B2 (en) * 2004-02-11 2008-08-12 Idx Systems Corporation Efficient indexing of hierarchical relational database records
FR2870023B1 (en) * 2004-03-23 2007-02-23 Alain Nicolas Piaton INFORMATION SEARCHING METHOD, SEARCH ENGINE AND MICROPROCESSOR FOR IMPLEMENTING THE METHOD
US7376642B2 (en) * 2004-03-30 2008-05-20 Microsoft Corporation Integrated full text search system and method
US7136851B2 (en) * 2004-05-14 2006-11-14 Microsoft Corporation Method and system for indexing and searching databases
US20060047636A1 (en) * 2004-08-26 2006-03-02 Mohania Mukesh K Method and system for context-oriented association of unstructured content with the result of a structured database query
US20080040342A1 (en) * 2004-09-07 2008-02-14 Hust Robert M Data processing apparatus and methods
US7685136B2 (en) * 2005-01-12 2010-03-23 International Business Machines Corporation Method, system and program product for managing document summary information
US7650320B2 (en) * 2005-02-24 2010-01-19 Nahava Inc. Method and system for efficient indexed storage for unstructured content
US7685203B2 (en) * 2005-03-21 2010-03-23 Oracle International Corporation Mechanism for multi-domain indexes on XML documents
US20060235820A1 (en) * 2005-04-14 2006-10-19 International Business Machines Corporation Relational query of a hierarchical database
US20060253423A1 (en) * 2005-05-07 2006-11-09 Mclane Mark Information retrieval system and method
US7739104B2 (en) * 2005-05-27 2010-06-15 Hakia, Inc. System and method for natural language processing and using ontological searches
US7627564B2 (en) * 2005-06-21 2009-12-01 Microsoft Corporation High scale adaptive search systems and methods
US7774361B1 (en) * 2005-07-08 2010-08-10 Symantec Corporation Effective aggregation and presentation of database intrusion incidents
US7937344B2 (en) 2005-07-25 2011-05-03 Splunk Inc. Machine data web
US7743066B2 (en) * 2005-07-29 2010-06-22 Microsoft Corporation Anonymous types for statically typed queries
US7664742B2 (en) * 2005-11-14 2010-02-16 Pettovello Primo M Index data structure for a peer-to-peer network
US8954426B2 (en) * 2006-02-17 2015-02-10 Google Inc. Query language
US20070203869A1 (en) 2006-02-28 2007-08-30 Microsoft Corporation Adaptive semantic platform architecture
US8843467B2 (en) * 2007-05-15 2014-09-23 Samsung Electronics Co., Ltd. Method and system for providing relevant information to a user of a device in a local network
US20070244865A1 (en) * 2006-04-17 2007-10-18 International Business Machines Corporation Method and system for data retrieval using a product information search engine
US8695031B2 (en) * 2006-08-02 2014-04-08 Concurrent Computer Corporation System, device, and method for delivering multimedia
JP4343206B2 (en) * 2006-09-27 2009-10-14 株式会社東芝 Structured document search support apparatus and program
US9009133B2 (en) * 2006-10-02 2015-04-14 Leidos, Inc. Methods and systems for formulating and executing concept-structured queries of unorganized data
US9817902B2 (en) * 2006-10-27 2017-11-14 Netseer Acquisition, Inc. Methods and apparatus for matching relevant content to user intention
US20080263006A1 (en) * 2007-04-20 2008-10-23 Sap Ag Concurrent searching of structured and unstructured data
US7840556B1 (en) * 2007-07-31 2010-11-23 Hewlett-Packard Development Company, L.P. Managing performance of a database query
US8301633B2 (en) * 2007-10-01 2012-10-30 Palo Alto Research Center Incorporated System and method for semantic search

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5845278A (en) * 1997-09-12 1998-12-01 Inioseek Corporation Method for automatically selecting collections to search in full text searches
US20070156677A1 (en) * 1999-07-21 2007-07-05 Alberti Anemometer Llc Database access system
US20030033275A1 (en) * 2001-08-13 2003-02-13 Alpha Shamim A. Combined database index of unstructured and structured columns

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"EPICENTER Concepts and Solutions Guide, Version 6.0", EXTREME NETWORKS, November 2006 (2006-11-01), XP008135938, Retrieved from the Internet <URL:http://www.extremenetworks.com/libraries/services/EPICenter60_SolutionsGuide.pdf> [retrieved on 20081212] *

Also Published As

Publication number Publication date
US20090119257A1 (en) 2009-05-07
EP2220577A4 (en) 2011-11-09
US8046353B2 (en) 2011-10-25
US20120084296A1 (en) 2012-04-05
EP2220577A1 (en) 2010-08-25
US9129005B2 (en) 2015-09-08

Similar Documents

Publication Publication Date Title
US8046353B2 (en) Method and apparatus for searching a hierarchical database and an unstructured database with a single search query
CN101436192B (en) Method and apparatus for optimizing inquiry aiming at vertical storage type database
US7664742B2 (en) Index data structure for a peer-to-peer network
Cai et al. RDFPeers: a scalable distributed RDF repository based on a structured peer-to-peer network
De Virgilio et al. Converting relational to graph databases
US7043472B2 (en) File system with access and retrieval of XML documents
US8938459B2 (en) System and method for distributed index searching of electronic content
US7707168B2 (en) Method and system for data retrieval from heterogeneous data sources
US9075881B2 (en) System and method for identifying the owner of a document on the world-wide web
RU2507574C2 (en) Page-by-page breakdown of hierarchical data
US9020951B2 (en) Methods for indexing and searching based on language locale
JP2006012125A (en) Method and system for indexing and searching databases
EP2545465A1 (en) Data integration system
CN102768674B (en) A kind of XML data based on path structure storage method
WO2007132342A1 (en) Documentary search procedure in a distributed information system
CA2461871A1 (en) An efficient index structure to access hierarchical data in a relational database system
CN101324887B (en) Method and apparatus for searching information resource
Schlegel et al. Balloon fusion: SPARQL rewriting based on unified co-reference information
US6978269B1 (en) Apparatus and method for generating and displaying a schema diagram for a database
JP2016521889A (en) Cross model filtering
CN111159285A (en) Enterprise cross-system retrieval method based on distributed index service deployment
Lo et al. Flexible user interface for converting relational data into XML
Barashev et al. Indexing XML to Support Path Expressions.
US8745030B2 (en) Fast searching of directories
US10929396B1 (en) Multi-type attribute index for a document database

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08845213

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2008845213

Country of ref document: EP