EP1360607A2 - Systeme de collecte de donnees collaboratives sur le web - Google Patents
Systeme de collecte de donnees collaboratives sur le webInfo
- Publication number
- EP1360607A2 EP1360607A2 EP01946682A EP01946682A EP1360607A2 EP 1360607 A2 EP1360607 A2 EP 1360607A2 EP 01946682 A EP01946682 A EP 01946682A EP 01946682 A EP01946682 A EP 01946682A EP 1360607 A2 EP1360607 A2 EP 1360607A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- data
- web
- format
- markup language
- structured
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000013480 data collection Methods 0.000 title claims description 10
- 238000000034 method Methods 0.000 claims abstract description 48
- 230000006870 function Effects 0.000 claims abstract description 23
- 238000013499 data model Methods 0.000 claims abstract description 19
- 238000013506 data mapping Methods 0.000 claims abstract description 6
- 238000013500 data storage Methods 0.000 claims description 32
- 238000012545 processing Methods 0.000 claims description 27
- 238000013507 mapping Methods 0.000 claims description 22
- 230000009471 action Effects 0.000 claims description 10
- 238000013515 script Methods 0.000 claims description 10
- 230000001960 triggered effect Effects 0.000 claims description 5
- 230000008569 process Effects 0.000 abstract description 18
- 238000006243 chemical reaction Methods 0.000 abstract description 4
- 239000003795 chemical substances by application Substances 0.000 description 58
- 238000004891 communication Methods 0.000 description 18
- 230000004044 response Effects 0.000 description 9
- 230000003213 activating effect Effects 0.000 description 6
- 230000005540 biological transmission Effects 0.000 description 6
- 230000008520 organization Effects 0.000 description 6
- 238000012546 transfer Methods 0.000 description 6
- 238000010586 diagram Methods 0.000 description 5
- 230000003287 optical effect Effects 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 230000003245 working effect Effects 0.000 description 3
- 230000002860 competitive effect Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000000737 periodic effect Effects 0.000 description 2
- 230000003068 static effect Effects 0.000 description 2
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 241000347391 Umbrina cirrosa Species 0.000 description 1
- 230000001447 compensatory effect Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000010339 medical test Methods 0.000 description 1
- 229920001690 polydopamine Polymers 0.000 description 1
Definitions
- This invention relates to computer systems, and more particularly to a system for gathering data from a server, and storing and processing the data on the client.
- B2B refers to a wide variety of information exchanges between different independent organizations. For example, B2B can imply transfer of patient records from one hospital to another, or transfer of pricing data to an independent distributor's point of sales system. B2B can also refer to the transfer of sensitive information, such as financial records between different banks.
- B2B is often used ambiguously. For example, it is not appropriate to refer to B2B in the context of the internal workings of a company. Although internal workflow within an organization may be similar to a B2B system, transactions within the same organizations are easy to direct, since a governing authority can establish policies and protocols. In contrast, B2B systems have no "boss" that can mandate how transactions occur. [0005] Also, B2B is distinctly different from Business to Consumer (“B2C”) exchanges. Although both B2B and B2C share common aspects, B2B is motivated primarily by profitability and competitiveness, whereas B2C includes aspects such as glamour and mass appeal. B2B requires a high degree of standardization, while B2C does not.
- B2B e-commerce attempts to correct these deficiencies by using machine-readable electronic product catalogs.
- These electronic catalogs provide real-time information to e- commerce buying programs. This allows product information such as part numbers, product descriptions, and pricing to be available and automatically updated by suppliers.
- the formats of the purchase order and product catalog are potential trouble spots for B2B systems. In most cases, the buyer will require the supplier to present its electronic product catalog in a format that can be used by the buyer's client software. This is important when there is more than one supplier, because the client program must compare pricing, delivery, and other parameters between the various suppliers. Also, the buyer will require that the supplier recognizes, and correctly process, purchase orders generated by the buyer's client program.
- the overriding difficulty of implementing the B2B e-commerce system is the issue of product catalog and purchase order compatibility. Specifically, the problem lies in obtaining consensus of format and protocol among many dissimilar and autonomous organizations.
- B2B e-commerce systems will work only if they can be inexpensively layered upon the private policies of a company. Any B2B system that attempts to make drastic changes in the internal workings of the organization runs a high risk of being rejected for the simple reason that participants view their carefully crafted internal systems as part of their "competitive edge.” Furthemiore, although buyers may want e-commerce, suppliers will resist it in the absence of any mandating authority until the cost of e- commerce is compensatory.
- the present invention is directed to a system for gathering data from a web-based server, transmitting the data to a web-based client, and storing the data on the web-based client.
- an enabler agent translates data from a first data model to a second data model using a data mapping function.
- an enabler agent determines which data mapping function to apply by referencing source-identifying information contained within a data request.
- an enabler agent converts data from a structured data format to a markup language format.
- an enabler agent is entirely web-based.
- a data collector periodically polls a list of URLs to obtain updated data from data servers pointed to by the URLs.
- a data collector accesses data stored on a data server using a pre-defined query stored on the server.
- a data collector is entirely web-based.
- a data collector converts data from a markup language format to a structured data format.
- a data collector is a modular component, by using a second enabler agent to access a database associated with a data collector
- FIG. 1 is a depiction of a client/server relationship in accordance with an embodiment of the invention.
- FIG. 2A-2B represent an exemplary data model for a data supplier.
- FIG. 3 is a flowchart of a first method for providing data to a data collector, in accordance with an embodiment of the invention.
- FIG. 4 is a flowchart of a method of converting data from a structured data storage format to a markup language format, in accordance with an embodiment of the invention.
- FIG. 5 is an exemplary XML data document constructed using the method of FIG.
- FIG. 6 is a flowchart of a second method for providing data to a data collector, in accordance with an embodiment of the invention.
- FIG. 7 is a depiction of a query viewer screen, in accordance with an embodiment of the invention.
- FIG. 8 is a depiction of a query editor screen, in accordance with an embodiment of the invention.
- FIG. 9 is a depiction of a parameter editor screen, in accordance with an embodiment of the invention.
- FIG. 10 is a representation of a computer system in accordance with an embodiment of the invention.
- FIG. 11 is a representation of a processing unit in accordance with an embodiment of the invention.
- FIG. 12 is a flowchart of a method of parsing data using the optional Emily
- FIG. 13 is an exemplary data model for a buyer, in accordance with an embodiment of the invention.
- FIG. 14 is a flowchart of a method for collecting data, in accordance with an embodiment of the invention.
- FIG. 15 is an exemplary URL table, in accordance with an embodiment of the invention.
- FIG. 16 is a flowchart of a method for polling data sources, in accordance with an embodiment of the invention.
- FIG. 17 is a flowchart of a method for a method of converting data from a markup language format into a structured data storage format.
- XML Extensible Markup Language
- HTML Hypertext Markup Language
- HTML Hypertext Markup Language
- HTML Hypertext Markup Language
- XML can be pulled from a web site using HTTP (Hypertext Transport Protocol).
- HTTP Hypertext Transport Protocol
- XML provides a small advantage to HTML in that it is more readable by programs (although slightly less readable by humans).
- XML has become a hot topic, mainly because it simplifies the construction of programs that pull data from the web.
- XML is one of the few standards that provide assistance in the construction of B2B systems.
- a simple Server / Client architecture can be established that uses HTTP as the communications protocol, and XML as the underlying data format.
- the two basic components of such a system are an XML server and an XML client.
- the XML server resides at a supplier's site, and allows machine-readable product catalogs to be fetched in real-time by an XML client program at a buyer's site.
- the server program "advertises" products to potential buyers.
- the server may also have components to enable online buying of products.
- the XML client component resides at a buyer's site, and fetches product catalogs from various XML servers on the network.
- the client program's job is to scan through the list of suppliers, and determine the best price and delivery options advertised.
- the client may also be responsible for actually performing a purchase of the products via delivery of an electronic purchase order to the supplier.
- XML servers provide product catalogs, which are read by XML clients.
- XML server programs are HTTP servers that provide XML documents rather than HTML documents.
- XML client programs are programs that read the XML data, similar to the way web browsers read HTML, but with a special graphical user interface.
- FIG. 1 A depiction of the XML server / client relationship is provided in FIG. 1.
- the supplier's XML server 110 includes a server database 112, which provides electronic product catalog information describing the organization's products, and an enabler agent 102 which processes this data for transmission to the buyer.
- a data collector 104 on the buyer's XML client 120 receives this information, and stores it in a client database 122.
- a database tool 130 such as a purchase order program, on the buyer's client 120 can access the data for various functions, such as initiating purchase orders to the supplier's order processor (not shown).
- a web-based data communication system in accordance with an embodiment of the invention includes two basic functional units.
- the first functional unit is an enabler agent 102 that resides on a server, such as the XML server 110 described above.
- the second functional unit is a data collector 104 that resides on a client, such as the XML client 120 described above.
- the enabler agent 102 works in tandem with a server database 112, to provide data from the server database in a standard, platform independent format.
- the data collector 104 works in tandem with a client database 122, to receive data in a standard, platform independent format and store the data in the client database 122.
- the data communication system works in either a one-to-one, one-to-many, many-to-one or many-to-many environment.
- Each server communicates data to one or more clients.
- Each client receives data from one or more servers.
- the enabler agent provides patient records for doctors, indicating the current patient health. In general, the enabler agent serves data to those hospital client programs requiring information on patient status.
- Doctor Client The "Doctor Portal” software allows the doctor to place orders for new tests and medication. The orders are similar to purchase orders, however these orders initiate actions by hospital nurses and technicians rather than actions by a supplier's shipping department.
- Administrator Client The "Hospital Administration Portal” allows the patient insurance forms (including tests and medication prescribed by the Doctor) to be submitted to the insurance company.
- Insurance Company Server At each of the various insurance companies used by the hospital and patients, the enabler agent provides insurance information for the hospital, for use by the Administrator Client. This allows the administrator to inspect insurance data and submit insurance claims on line.
- the only major changes to the B2B model are in the particular data catalogs (which refer to patients rather than products), and the types of orders initiated by the system (which refer to medical tests and prescriptions rather than purchases.) Otherwise, the system functions much as the B2B e-commerce system described in preceding sections.
- the first component of the data collection system is the enabler agent 102.
- the enabler agent 102 runs on the server 110, which is controlled by the data supplier.
- the enabler agent 102 is a web-based application that works in tandem with the server database 112.
- the server database 112 is a database under the control of the data supplier.
- the server database 112 stores data in a particular data storage format. Exemplary data storage formats include SQL, flat files of varying format, CORBA databases, XML documents, and HTML documents.
- the data is stored in accordance with a particular data model. The details of the data model are determined by the needs and desires of the data supplier, and are not critical to the invention.
- FIGS. 2A-2B are an example of how a particular data supplier, in this case a manufacturer of bicycles, would model their data describing bicycle parts.
- the data model includes a products table 210 and a parts table 220.
- the data contained within the products table 210 includes a list of all of the fully assembled products that the bicycle manufacturer sells.
- the products table 210 includes a series of product records 211, which store the data for each product the bicycle manufacturer sells.
- the products table 210 contains a products key field 212, which uniquely identifies each record in the table.
- the products key field 212 is used internally to associate the products table 210 with other tables within the server database.
- the products table 210 also includes a model number field 214, a description field 216, and a price field 218, which contain the various data items which describe each product record in the products table 210.
- the data contained within the parts table 220 includes a list of all of the various parts the bicycle manufacturer sells.
- the parts table 220 includes a series of parts record 221, which store the data for each part the bicycle manufacturer sells.
- the parts table 220 has a parts key field 222, which uniquely identifies each record in the parts table 220.
- the parts key field 222 is used internally to associate the parts table 220 with other tables within the server database.
- the parts table 220 also includes a foreign key field 224, which refers back to the products key field 212 of the products table 210.
- the enabler agent 102 retrieves data from the server database 112 in response to a data request from a user.
- the user may be a human being, or the user may be a computer process controlled by a human being, or the user may be an automated computer process.
- the user is the data collector 104 residing on the client 120.
- the request contains information that identifies the data to be retrieved.
- the request can be transmitted in a variety of ways.
- the request is a URL transmitted from the data collector 104.
- the request is an electronic mail message.
- the request is an application-specific message transmitted by the data collector 104 or by another computer process.
- the request comprises information that identifies a pre- generated database query, the query being stored on the server 110.
- An exemplary request comprises a URL taking the following form:
- (hostspec) is a value that identifies the particular server that the request is directed to.
- An exemplary (hostspec) value is www.supplier.com.
- the portion of the URL between the (hostspec) and the (queryname) is information that identifies a particular location on the server where the pre-generated database queries are stored. This information points to, for example, a file or directory containing the pre-generated query.
- (queryname) is a value that uniquely identifies the particular database query that the request seeks to invoke.
- the (queryname) value is an arbitrary identifier or keyword that is mapped to a query stored on the server.
- An exemplary mapping scheme comprises storing each query in a separate file, and referencing the query in the URL by providing the name the file it is stored in the (queryname) section of the URL.
- the bicycle manufacturer data model discussed above would have queries named "PartsByPrice", which retrieves the entire parts table sorted by price, and "BicycleParts", which retrieves all records in the parts table that correspond to parts used in bicycles.
- an administrator pre-generates a set of database queries and stores those queries on the server. The administrator can use a variety of ways to create these queries. The administrator can manually write the queries. The administrator can also use a query design tool to generate the query in the query language, from a higher level model.
- the query design tool may be a component associated with the particular database software used to create the server database, or it may be a third-party or stand- alone package.
- the administrator uses a web-based editor accompanying the enabler agent software to create the queries.
- the queries are stored on the server.
- the queries are stored in separate files on the server. This allows the enabler agent to locate the proper query quickly and easily, when the enabler agent is presented with a URL containing the keyword associated with the query, as discussed above. Queries can also be stored in other formats; for example, a single file could contain a library of related queries. The enabler agent in this example would parse the library file to locate the particular query specified in the request.
- a method for providing data to a data collector in response to a pre-defined query begins at step 305, with a data requestor generating an URL.
- An HTTP server on the requestor's machine routes the URL over the network to the supplier's server, at step 310.
- the supplier's HTTP server determines, based upon information contained in the URL, that the URL is a request for the enabler agent, and the URL is routed to the enabler agent at step 315.
- This routing is a function of the HTTP server, which maps URLs to specific disk files and directories on the server platform.
- the Apache HTTP server uses an "Alias" directory in the Apache configuration file.
- the enabler agent parses the URL, at step 320, to extract the query identifier. Using this query identifier, at step 325 the enabler agent locates the query associated with the query identifier, the query being stored on the supplier's server. At step 330, the enabler agent queries the database using the query identified above. The server database generates a result set based on the contents of the query, at step 335. This result set comprises the rows from the various tables of the server database that correspond to the parameters of the query.
- the result set is sent back form the server database to the enabler agent, at step 340.
- the result set is still represented in the structured data storage format used by the server database.
- the enabler agent converts the structured data storage formatted data into markup language data at step 345, and creates a markup language formatted document.
- the enabler agent then sends this document to the HTTP server at the supplier's site, at step 350.
- the supplier's HTTP server routes the markup language document to the requestor, at step 355.
- the request comprises a database query generated at the time of the request.
- An exemplary request in accordance with this embodiment is the following URL:
- the URL is one way of sending a request to the HTTP server, using an HTTP POST command.
- This allows arguments contained in an HTML document (using HTML ⁇ LNPUT> directives) to be sent to the enabler agent.
- the ⁇ INPUT> directive is a standard component of HTML, which allows input by forms to an HTTP server.
- the ⁇ INPUT> command contains a SQL query, or some other way of directly specifying a request for data, the data to be converted to an XML document.
- the arguments are sent to the enabler agent by incorporating them as part of a second URL.
- the server uses this additional information to ensure that the query is run properly, and that the requesting user has permission to access the server database. If this information is not provided, the request is processed using default values configured by the administrator of the enabler agent.
- the administrator may configure the enabler agent using the web-based interface discussed below, or the administrator may manually edit a configuration file.
- a method for retrieving data from a server in response to a user- generated query begins at step 605, with a data requestor composing a query.
- the query is then submitted to the supplier's HTTP server by sending an HTTP "post" command to the server, at step 610.
- the post command contains the query and any additional parameters necessary to process the query, such as a user name or a password.
- the supplier's HTTP server parses the URL and routes the post command to the enabler agent, at step 615.
- the enabler agent executes a CGI script that extracts the query from the incoming URL, and sends the query to the supplier's database, at step 620.
- the server database generates a result set based on the contents of the query, at step 625.
- This result set comprises the rows from the various tables of the server database that correspond to the parameters of the query.
- the result set is sent back form the server database to the enabler agent, at step 630.
- the result set is still represented in the structured data storage format used by the server database.
- the enabler agent converts the structured data storage formatted data into markup language data at step 635, and creates a markup language formatted document.
- the enabler agent then sends this document to the HTTP server at the supplier's site, at step 640.
- the supplier's HTTP server routes the markup language document to the requestor, at step 645.
- the enabler agent converts the structured data formatted data to XML format using the method of FIG. 5.
- An exemplary XML document generated by applying this method to a query of the database shown in FIGS. 2A-2B is shown in FIG. 6.
- the enabler agent receives the result set from the server database, and generates a query identifier tag at step 405.
- An exemplary query identifier tag 502 is shown for the query "BicycleParts" discussed above.
- the enabler agent examines the result set and obtains the next database record to be processed.
- the enabler agent selects the next record to process, and creates a record container tag for the record, at step 415.
- the record container tag is assigned a value that uniquely identifies which record of the client database is contained in the query.
- An exemplary record container tag 504 is shown for the first record in the result set generated by the "BicycleParts" query discussed above.
- the record container tag 504 has a value of one (1), which corresponds to the value of the key field 222 of the record, as shown in FIGS. 2A-2B.
- the enabler agent then parses the fields of the record being processed, at step 420. Assuming that fields remain to be processed, the enabler agent selects the next field to process, and creates a field entry tag for the field, at step 425.
- the field entry tag is given the name of the corresponding table column matched by the specified query. An administrator using a web-based setup screen can override these default tag names, and the administrator can assign different names.
- the field value is inserted into the field entry tag, and the field entry tag is then closed.
- An exemplary field entry tag 506 is shown for the "Key" field of the first record in the result set generated by the "BicycleParts" query discussed above.
- the field entry tag 506 has a value of one (1), corresponding to the value of the key column 222 of the first record of the parts table 220, as shown in FIGS. 2A-2B.
- the enabler agent creates an end of record container tag, at step 340.
- An exemplary end of record container tag 508 is shown for the first record of the result set for the query "BicycleParts" discussed above.
- Control then returns to step 410, for processing of the next record in the result set.
- the final XML tags are created and the XML document is closed.
- An exemplary closing tag 510 is shown for the result set of the "BicycleParts" query discussed above.
- a data type diagram is also generated for the result set. This data type diagram specifies the type of data being stored in each field of the records of the result set. Table 1 shows an exemplary data type diagram for the "BicycleParts" query. Table 1
- the enabler agent also contains a collection of utility programs to facilitate the setup, operation and debugging of the data collection system.
- Exemplary utility programs include: a query viewer, to display the results of a query in HTML format; a query editor, to assist an administrator in constructing the pre-defined queries discussed above; a database parameter editor, to allow the user to configure the basic login parameters of the enabler agent, such as the default server name of the server containing the server database, the default login name to supply to a user-generated query, the default password to supply to a user-generated query, and other such parameters; a security parameter program, to assist the administrator in configuring the security parameters of the enabler agent, such as a list of trusted hosts, an authentication mechanism to be used in verifying the identity of users, and a list of the allowed operating modes of the system, which specify whether the enabler agent will accept either pre-defined queries, user-defined queries, or both; and a tag browser program, which provides a simple web-based XML and HTML client that can be used to test the operation of
- a query viewer screen 70 in accordance with an embodiment of the enabler agent is implemented as a web-based CGI script.
- the query viewer screen 70 is displayed by activating a first URL 71 , inside a web browser program 72.
- the query viewer screen 70 allows the user to select a particular query to be viewed in HTML format.
- the screen shows the queried data in a tabular format, and allows the user to select the particular query via a pull-down menu 74.
- the various queries available to the user are provided in the pull-down menu 74. Queries are constructed via the query editor tool shown in FIG. 8.
- the query viewer screen 70 provides a simple way to obtain visibility into a system for debug and analysis purposes.
- the table 76 displayed by this screen is automatically created from the results of the query.
- the table 76 is generated from the query results by parsing the query results into components that are formatted into an HTML document that is returned to the user.
- the top of the table 76 indicates the various fields 78 specified in the query, which also corresponds to the various markup language format tags read by the data collector.
- the query viewer screen 70 in addition to providing a view of queried information, also provides the user with a list of the various queries created by the query editor screen from the pull-down menu 74. Additionally, database parameters are configurable for the query by activating a button 77 that calls up a query parameter screen. A button 79, when activated, submits the query to the database.
- a user of the query viewer screen 70 views a particular query by: (1) selecting the query from the pull-down menu 74, (2) providing any required database parameters by activating the button 77 and entering the values into the form that pops up, and then (3) activating the button 79, thus submitting the query to the database.
- the query results are returned, automatically converted into HTML format, and displayed in the query viewer screen 70.
- a query editor screen 80 in accordance with an embodiment of the enabler agent is implemented as a web-based CGI script.
- the query editor screen 80 is displayed inside the web browser 72, by activating a second URL 81.
- the query editor screen 80 allows the user to compose queries that are associated with particular items in the pull-down menu 74 of the query viewer screen 70 (of FIG. 7).
- the query editor screen 80 allows the user to create, modify, or delete queries, and associate these queries with items in the pull-down menu 74 of the query viewer screen 70 (of FIG. 7).
- the query editor screen 80 contains a text window 82 where the user can enter the query, in a structured data storage format, such as SQL.
- the query editor screen 80 also contains a palette of buttons 83 that provide functionality to the screen 80. These buttons include; a save button 84, which when activated cause the contents of the text window 82 to be saved to long-term storage; a save_as button 85, which when activated causes the user to be prompted to enter an identifier to be associated with the contents of the text window 82, and then saves the contents into long-term storage; a reload button 86, which when activated causes the user to be prompted for a query identifier, and then retrieves the query associated with the query identifier and displays it on the text window 82; and finally a cancel button 87, which when cancels any operation in progress. For example, if the user activates a long query, and then wishes to halt operation of the query, the user clicks the cancel button 87. These are exemplary members of the button palette 83.
- buttons could be included in the button palette 83.
- the query editor screen 80 facilitates associations of queries (which are not visible to clients or unprivileged users) and the query results. Each query resides in its own file on the server. The files can be modified by the query editor screen 80, or by a standard text file editor.
- a database parameter screen 90 in accordance with an embodiment of the enabler agent is implemented as a web-based CGI script.
- the database parameter screen 90 is displayed inside the web browser 72, by activating a third URL 91.
- the database parameter screen 90 provides general utility in configuring the various parameters of the system. This screen provides a central place for specifying values needed to make queries, such as driver programs, time-outs, passwords, and security items. This screen also allows easy modification of an existing mapping of a query to a data source. This screen will typically be available to privileged users.
- the database parameter screen 90 has text boxes 92, where a user enters the necessary parameters for the database.
- the database parameter screen 90 also has a button palette 92, which provides functionality to the screen. These buttons include: a commit button 93 that when activated causes the changes made by the user to be stored to the database; an edit query button 94 that when activated pops up a window containing the query, and allowing the query to be edited; and a cancel button 95 that when activated closes the database parameter screen 90 without making any changes. These are exemplary members of the button palette 92. Other buttons can be used in the button palette 92, depending on the particular functionality desired. [0088] The enabler agent is intended to be a comprehensive system containing various support tools and facilities.
- Exemplary support tools include: an embedded HTTP server, to allow users to run the enabler agent without needing a third-party HTTP server; database insert and update utilities, which allow privileged users to insert and update tables on the server database and provide a programmatic interface to allow a table to be loaded by external software, such as the Emily Framework language discussed below, or an HTTP client; the Emily Framework scripting language and development kit, to allow users to create new functionality and capabilities for the system; and online documentation in the form of markup language documents and PDF files, the documentation being sufficient to install, configure, operate and maintain the system.
- Emily Framework Scripting Language to allow users to run the enabler agent without needing a third-party HTTP server
- database insert and update utilities which allow privileged users to insert and update tables on the server database and provide a programmatic interface to allow a table to be loaded by external software, such as the Emily Framework language discussed below, or an HTTP client
- the Emily Framework scripting language and development kit to allow users to create new functionality and capabilities for the system
- online documentation in the form of markup language documents and PDF
- the Emily scripting language may be used for processing a markup language file having one or more tagged portions.
- the method comprises opening a first markup language file, step 1200, and parsing the first markup language file for one or more portions, step 1210.
- the language interpreter then stores each portion of the first markup language file into one or more objects in an electronic memory, at step 1202.
- Part of the Emily language may include a CAT or DIR command. If such a command is received by the language interpreter, step 1206, then the interpreter causes the one or more objects to be presented for selection, viewing or other processing in one more corresponding folders and sub-folders on-screen, step 1208.
- the one or more folders may be presented in a hierarchical list having sub-folders according to an arrangement of sub- portions of the first markup language file, each sub-folder representing an arrangement, sub-arrangement, or object containing a portion of the first markup language file.
- the Emily language comprises a command language set allowing selection, viewing and other processing of the one or more objects.
- the command language set comprises a plurality of commands for selection, viewing and other processing.
- a subset of commands may comprise one or more commands for processing one or more folders, subfolders, portions, or sub-portions of the first markup language file, the subset comprising one or more executable batch files containing a subset of the set of commands.
- One or more executable batch files may be included within a second markup language file, the subset of commands in the executable batch file comprising commands for including one or more of the objects containing portions of the first markup language file in the second markup language file.
- the language interpreter may receive one or more SET or UPDATE commands for setting or updating objects in the first or second markup language file, step 1210. If such a command is received, the language interpreter updates the objects in memory according to the commands received, step 1212. [0093] If a POST or SAVE command is received, step 1214, the following steps may be performed as part of said other processing for updating the respective markup language file, step 1216: receiving a subset of commands comprising one or more commands, said one or more commands comprising instructions for updating one or more objects based on the received one or more commands; updating the one or more portions contained in the one or more objects according to the received one or more commands; and saving the portions contained within the one or more objects to the first respective language file that have been updated.
- the following steps may be performed as part of said other processing for providing a second markup language file to a network node identified by a uniform resource locator: receiving a subset of commands comprising one or more commands, said one or more commands comprising instructions for updating one or more objects based on the received one or more commands; updating the one or more portions contained in the one or more objects according to the received one or more commands; and providing the second markup language file to the network node identified by the uniform resource locator.
- the first markup language file may comprise one or more portions receptive to data input
- said other processing comprises receiving and sending said data input by performing the steps of: receiving said data input, storing said data input into in said objects containing said one or more portions receptive to data input; and posting said data input to said markup language file. At least a portion of the data input may then be processed after the step of receiving and before the step of storing.
- This alternative processing may be used, for example, to receive SQL commands for performing a query on a remote database, or posting to a remote database.
- Data Collector [0096] Returning to Fig. 1, the second functional unit of the B2B system is the data collector 104.
- the data collector 104 is a computer program that runs on a server 120 controlled by the buyer.
- the data collector 104 works in tandem with a client database 122 under the control of the buyer.
- the client database stores data in a particular data storage format. Exemplary data storage formats include SQL, flat files of varying format, CORB A databases, XML documents, and HTML documents.
- the data is stored in accordance with a proprietary data model, determined by the needs and desires of the buyer. This proprietary data model is typically different from the proprietary data model used by the supplier, as discussed above.
- the client database 122 is also associated with a suite of database tools 130, used to manipulate the data stored in the client database 122, and to provide the data to users. [0097] FIG.
- the 13 is an example of how a particular buyer, in this case a distributor of wheels, would model the data describing its inventory.
- the data is stored in a table 1400.
- the wheel distributor orders products from many different suppliers, so it has created a supplier ID column 1410, which contains information identifying the source of the product.
- a supplier record number field 1420 is provided in order to properly link the records stored in the buyer's database with the catalog data coming from the supplier.
- the key field 1430 is a locally maintained field that uniquely identifies each record in the buyer's database.
- the supplier part number field 1440 contains information from the supplier that is used by the supplier to uniquely identify the part.
- the description field 1450 and wholesale price field 1460 contain additional information about the catalog items stored in the buyer's database.
- the fields 1420, 1440, 1450, and 1460 all contain data downloaded from the supplier's database.
- the supplier ID field 1410 is populated with the name of the supplier. This value can be downloaded from the supplier, or it can be supplied locally by the buyer, as will be discussed below.
- the values for the key field 1430 are determined locally.
- the exemplary buyer of FIG. 14 has loaded its inventory ordering table 1400 with data from three suppliers, Car Co., Truck Co., and Bike Co., the supplier discussed above.
- the data collector 104 transmits a data request to the server 110, receives a response from the server 110, the response containing the requested data, and stores the requested data in the client database 122.
- the data request can be transmitted in a variety of ways.
- the data request is an URL transmitted by the data collector 104 to the server 110.
- the data request is an e-mail message sent to the server 110 or an application-specific message sent to the server 110 by the data collector or another computer process.
- the server 110 initiates the data request, and the results are pushed across the network to the data collector 104.
- the data request contains information that identifies the data to be retrieved.
- the request comprises information that identifies a pre-generated database query, the query being stored on the server 110.
- the details of the pre-defined queries of this embodiment are discussed above in the enabler agent section.
- the request comprises a database query generated at the time of the request. Details of the request of this embodiment are also discussed above in the enabler agent section.
- the data collector receives the markup language formatted result set from the server, and converts the result set into the particular structured database format used by the client database, at step 1540.
- the final step comprises the data collector updating the client database with the information from the result set, at step 1550.
- the data collector can generate the URL of step 1510 in several different ways.
- the data collector in a preferred embodiment contains a listing of URLs of the various suppliers that the data collector communicates with. Alternatively, the data collector prompts a user to provide a URL.
- steps 1510 and 1520 of the method of FIG. 14 are omitted.
- a scheduling system is used to automatically provide URLs for querying supplier databases.
- the list of URLs is maintained in a database table, containing a series of records, one for each URL to be queried.
- An exemplary URL storage table is shown in FIG. 15.
- the URL column 1610 contains the URL of the server to be polled by the data collector.
- the Sys Descr column 1620 contains optional additional information about the server.
- the Last Polled column 1630 contains the date and time the server was last polled for new data.
- the Sys Status column 1640 contains a status code indicating the status of the data server as of the last polled time. In this example, the allowable status codes are listed in Table 2.
- the key field 1650 is a unique identifier used to link records to other data in the client system, such as expanded supplier-related data. The values typically increase monotonically from one.
- a URL data table is created, the table is populated with URLs that the buyer wishes to automatically poll for data. The user can either supply the URL directly to the table, or preferably the user uses a web-based form or CGI screen to enter the URL into the URL table.
- a polling schedule is created. This schedule minimally determines the rate at which URLs are polled for new data. The polling schedule can also be used to perform other periodic actions such as executing external programs. The polling schedule is set up by the administrator of the data collector based upon the particular needs of the buyer. This setup is preferably done using a web-based CGI screen.
- the polling schedule is defined.
- An automated process is periodically triggered by the polling schedule, based upon the parameters defined by the administrator.
- the polling function steps through the list of URLs stored in the URL table, and transmits the URLs to the HTTP server on the client, thus generating the URL as specified in step 1510 of FIG 14.
- a method of polling the servers identified by the URLs in the URL list is shown in FIG. 16.
- the method commences at step 1700, when the data collector opens the URL table.
- the data collector typically opens the URL table in response to a command from the polling schedule.
- the data collector can open the URL table in response to a command from a user, or from another process running on the client.
- step 1710 the poller steps through the entries in the URL table.
- step 1710 a check is made to see if any rows remain to be processed. If all the URLs have been polled, the method exits at step 1790.
- step 1720 the URL value is read from the current row of the URL table.
- step 1730 this value is routed to the HTTP server on the client, then over the network, and then to the HTTP server at the supplier. Depending on the status of the supplier's server, one of the different responses, shown in Table 2, is received back from the server, at step 1740.
- the HTTP server residing on the client handles the actual data transfers. Methods of transferring data using HTTP servers are well known to those skilled in the art and are not critical to the invention.
- step 1540 of FIG. 14 follows the method shown in FIG. 17, as applied to the exemplary XML data file of FIG. 6.
- the data conversion method of FIG. 17 begins at step 1800, where the data collector receives the XML data file.
- the data collector parses the query row 502 and identifies the data supplier for the data document.
- the data collector determines the proper target database for the data contained in the incoming document. Where there are multiple possible targets, the data collector determines which table to store the data in by, for example, checking the value contained in the query row 502 against values stored in a URL table stored on the client.
- the data collector identifies the data map to use in associating the contents of the various XML tags with the corresponding database items.
- this data map can be a simple correlation of field tags to database fields, or it can be a complex mapping of record tags to records in one table or many different tables within a large multi-table relational database system. In simpler systems, the data map is omitted, and the various tags are directly mapped to database items bearing the same name. If data type information is provided in the XML document, this information is also parsed and stored for future use. Data type information would typically be used where a new table is being created to store incoming data. [0117] Once the initial setup information has been processed, the data collector then steps through the file, unpacking the records and fields stored in the file. At step 1820, a check is made to see if any records remain to be converted.
- step 1863 the method proceeds to step 1863, where any necessary final cleanup is done, such as deleting from the client database records that no longer appear in the XML data document, closing database tables, storing backup copies of files, etc.
- the method then terminates at step 1865.
- step 1825 the current record identifier is read by the data collector.
- the data map is also checked at this step, to see if there is a mapping defined for this tag.
- the proper target database item is then searched, at step 1830, for the record corresponding to the current record tag. A check is made at step 1835 to see if a corresponding record was found. If no record corresponds to the current record tag, then at step 1840 the data collector issues a command to the database instructing the database to create a new record for storing the current record tag.
- step 1845 a check is made to see if any field tags remain to be parsed. If all field tags associated with the record have been parsed, control passes back to step 1820, where the next record is processed. Assuming there are fields remaining to be parsed, at step 1850 the field tag is read by the data collector. At step 1855, the data collector maps the field tag in the XML data document to the proper field in the database record. Where a data map is available, the data collector applies the mapping defined in the data map to make the like between the field tag and the database field.
- the data collector has an alarm system.
- This alarm system triggers an alarm when the content of a database item changes.
- the administrator can set up an alarm that is triggered when, for example, the value of the price field for a record changes, or a new record is added to the database.
- the alarm system is invoked either at step 1840 when a new record is created, at step 1860 when a field value is stored, assuming the newly stored value is not the same as the old value, and/or at step 1803 when obsolete data is deleted from the system.
- the alarm system interacts with an action and notification system.
- the action and notification system allows the administrator to define one or more actions that are performed when a particular alarm is triggered. For example, when a new record is added to the database, an e-mail message is sent to selected users in the buyer's organization, advising them of the newly available goods. Other actions include running an external program, performing an HTTP post operation, or running an Emily or third-party script.
- the data collector also contains a collection of utility programs to facilitate the setup and operate of the data collector. In a preferred embodiment, these programs are implemented as web-based CGI scripts.
- Exemplary utility programs include screens which allow a user to: specify URLs to be polled, define a data map for data from a particular URL, set up alarms for the various data items stored in the client database, set up notifications that will be triggered by the alarms, set up system login accounts for other users of the data collection system, and perform other activities needed to configure and maintain the data collector.
- Other setup and configuration screens can also be provided, depending on the particular requirements of a given embodiment.
- the data collection system optionally includes a suite of database manipulation tools.
- these tools include: a web-based search engine, whereby a user can search the database by running a pre-defined query on the client database; a web-based screen creator, whereby a user can make simple markup language screens that display data from the client database; a script editor, whereby a user can create scripts in a scripting language, to execute more complex operations on the database or to create dynamic screens; and/or a form creator, whereby a user can create markup language forms for updating the client database or for posting client database items to other HTTP servers.
- These tools can be provided as part of the data collection system, or they can be third-party application programs provided by the user.
- Exemplary third-party application programs include ASP, Cold Fusion, or ISQL.
- the particular database manipulation tools chosen are design choices for those skilled in the art and are not critical to the invention.
- the data collector also includes security features, to limit usage to authorized personnel and to safeguard the database. These features include an administrative/security console program.
- This console program is preferably a non-web- based application that is only available to selected administrative personnel.
- the console program could be program maintained only on a special machine, or located in a protected directory on the client machine.
- the console program may also be a web- based screen or series of screens.
- the console program allows administrators to set passwords and permissions on the various screens of the data collector and associated toolkits, as well as on any embedded HTTP server.
- the console program permits the administrator to use HTTP authentication, securing any screen or directory associated with the data collector with a pop-up login interface.
- the console program also includes a utility to set up and maintain a trusted host list (by IP address or by network).
- the trusted host list is a list of host machines that are allowed to access various files and screens within the client.
- a computer system 1020 includes a host computer 1022 connected to a plurality of individual user stations 1024.
- the user stations 1024 each comprise suitable data terminals, for example, but not limited to, e.g., personal computers, portable laptop computers, or personal data assistants ("PDAs"), which can store and independently run one or more applications, i.e., programs.
- PDAs personal data assistants
- some of the user stations 1024 are connected to the host computer 1022 via a local area network (“LAN”) 1025.
- LAN local area network
- Other user stations 1024 are remotely connected to the host computer 1022 via a public telephone switched network (“PSTN”) 1028 and/or a wireless network 1030.
- PSTN public telephone switched network
- the host computer 1022 operates in conjunction with a data storage system 1031, wherein the data storage system 1031 contains a database 1032 that is readily accessible by the host computer 1022.
- the database 1032 may be resident on the host computer, stored, e.g., in the host computer's ROM, PROM, EPROM, or any other memory chip, and/or its hard disk. In yet alternative embodiments, the database 1032 may be read by the host computer 1022 from one. or more floppy disks, flexible disks, magnetic tapes, any other magnetic medium, CD-ROMs, any other optical medium, punchcards, papertape, or any other physical medium with patterns of holes, or any other medium from which a computer can read.
- the host computer 1022 can access two or more databases 1032, stored in a variety of mediums, as previously discussed.
- each user station 1024 and the host computer 1022 each referred to generally as a processing unit, embodies a general architecture 1102.
- a processing unit includes a bus 1103 or other communication mechanism for communicating instructions, messages and data, collectively, information, and one or more processors 1104 coupled with the bus 1103 for processing information.
- a processing unit also includes a main memory 1108, such as a random access memory (RAM) or other dynamic storage device, coupled to the bus 1103 for storing dynamic data and instructions to be executed by the processor(s) 1104.
- the main memory 1108 also may be used for storing temporary data, i.e., variables, or other intermediate information during execution of instructions by the processor(s) 1104.
- a processing unit may further include a read only memory (ROM) 1109 or other static storage device coupled to the bus 1103 for storing static data and instructions for the processor(s) 1104.
- ROM read only memory
- a storage device 1110 such as a magnetic disk or optical disk, may also be provided and coupled to the bus 1103 for storing data and instructions for the processor(s) 1104.
- a processing unit may be coupled via the bus 1103 to a display device 1111, such as, but not limited to, a cathode ray tube (CRT), for displaying information to a user.
- a display device 1111 such as, but not limited to, a cathode ray tube (CRT)
- An input device 1112 is coupled to the bus 1103 for communicating information and command selections to the processor(s) 1104.
- Another type of user input device may include a cursor control 1113, such as, but not limited to, a mouse, a trackball, a fingerpad, or cursor direction keys, for communicating direction information and command selections to the processor(s) 1104 and for controlling cursor movement on the display 1111.
- the individual processing units perform specific operations by their respective processor(s) 1104 executing one or more sequences of one or more instructions contained in the main memory 1108.
- Such instructions may be read into the main memory 1108 from another computer-usable medium, such as the ROM 1109 or the storage device 1110.
- Execution of the sequences of instructions contained in the main memory 1108 causes the processor(s) 1104 to perform the processes described herein.
- hard-wired circuitry may be used in place of or in combination with software instructions to implement the invention.
- embodiments of the invention are not limited to any specific combination of hardware circuitry and/or software.
- Non-volatile media i.e., media that can retain information in the absence of power
- Volatile media i.e., media that can not retain information in the absence of power
- Transmission media includes coaxial cables, copper wire and fiber optics, including the wires that comprise the bus 1103. Transmission media can also take the form of carrier waves; i.e., electromagnetic waves that can be modulated, as in frequency, amplitude or phase, to transmit information signals.
- transmission media can take the form of acoustic or light waves, such as those generated during radio wave and infrared data communications.
- Common forms of computer-usable media include, for example: a floppy disk, flexible disk, hard disk, magnetic tape, any other magnetic medium, CD-ROM, any other optical medium, punchcards, papertape, any other physical medium with patterns of holes, RAM, ROM, PROM (i.e., programmable read only memory), EPROM (i.e., erasable programmable read only memory), including FLASH-EPROM, any other memory chip or cartridge, carrier waves, or any other medium from which a processor 1104 can retrieve information.
- Various forms of computer-usable media may be involved in providing one or more sequences of one or more instructions to the processor(s) 1104 for execution.
- the instructions may initially be provided on a magnetic disk of a remote computer (not shown).
- the remote computer may load the instructions into its dynamic memory and then transit them over a telephone line, using a modem.
- a modem local to the processing unit may receive the instructions on a telephone line and use an infrared transmitter to convert the instruction signals transmitted over the telephone line to corresponding infrared signals.
- An infrared detector (not shown) coupled to the bus 1103 may receive the infrared signals and place the instructions therein on the bus 1103.
- the bus 1103 may carry the instructions to the main memory 1108, from which the processor(s) 1104 thereafter retrieves and executes the instructions.
- the instructions received by the main memory 1108 may optionally be stored on the storage device 1110, either before or after their execution by the processor(s) 1104.
- Each processing unit may also include a communication interface 1114 coupled to the bus 1103.
- the communication interface 1114 provides two-way communication between the respective user stations 1024 and the host computer 1022.
- the communication interface 1114 of a respective processing unit transmits and receives electrical, electromagnetic or optical signals that include data streams representing various types of information, including instructions, messages and data.
- a communication link 1115 links a respective user station 1024 and a host computer 1022.
- the communication link 1115 may be a LAN 1025, in which case the communication interface 1114 may be a LAN card.
- the communication link 1115 may be a PSTN 1028, in which case the communication interface 1114 may be an integrated services digital network (ISDN) card or a modem.
- ISDN integrated services digital network
- the communication link 1115 may be a wireless network 1030.
- a processing unit may transmit and receive messages, data, and instructions, including program, i.e., application, code, through its respective communication link 1115 and communication interface 1114. Received program code may be executed by the respective processor(s) 1104 as it is received, and/or stored in the storage device 1110, or other associated non-volatile media, for later execution. In this manner, a processing unit may receive messages, data and/or program code in the form of a carrier wave.
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
La présente invention concerne un système qui permet de collecter des données à partir d'un serveur Web, de transmettre ces données à un client sur le Web, et de les stocker chez le client en question. A l'aide d'une fonction de mappage de données, le serveur Web traduit les données d'un modèle de données privées de fournisseur de données en modèle de données privées de consommateur de données. Le serveur Web convertit également les données d'un format de données structuré en format de langage de balisage. Le client sur le Web interroge périodiquement un ou plusieurs serveurs de données en vue d'obtenir des données. Lorsque ce client reçoit des données dans un format de langage de balisage, il les traduit en un format de données structuré, puis les stocke dans une base de données. Le client et le serveur Web peuvent collaborer ensemble pour rationaliser la conversion de données et le processus de traduction.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US21406700P | 2000-06-26 | 2000-06-26 | |
US214067P | 2000-06-26 | ||
US23545800P | 2000-09-26 | 2000-09-26 | |
US235458P | 2000-09-26 | ||
PCT/US2001/020047 WO2002001389A2 (fr) | 2000-06-26 | 2001-06-21 | Systeme de collecte de donnees collaboratives sur le web |
Publications (1)
Publication Number | Publication Date |
---|---|
EP1360607A2 true EP1360607A2 (fr) | 2003-11-12 |
Family
ID=29254145
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP01946682A Withdrawn EP1360607A2 (fr) | 2000-06-26 | 2001-06-21 | Systeme de collecte de donnees collaboratives sur le web |
Country Status (2)
Country | Link |
---|---|
EP (1) | EP1360607A2 (fr) |
BR (1) | BR0112357A (fr) |
-
2001
- 2001-06-21 EP EP01946682A patent/EP1360607A2/fr not_active Withdrawn
- 2001-06-21 BR BR0112357-2A patent/BR0112357A/pt not_active Application Discontinuation
Non-Patent Citations (1)
Title |
---|
See references of WO0201389A3 * |
Also Published As
Publication number | Publication date |
---|---|
BR0112357A (pt) | 2004-07-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7076521B2 (en) | Web-based collaborative data collection system | |
US9405736B1 (en) | Method and system for automatically downloading and storing markup language documents into a folder based data structure | |
CA2397907C (fr) | Systeme d'interface electronique dispensateur de soins medicaux-patient | |
US8095497B2 (en) | Process for data driven application integration for B2B | |
US7536323B2 (en) | Online intelligent multilingual comparison-shop agents for wireless networks | |
US6816865B2 (en) | Process for data driven application integration for B2B | |
US6981222B2 (en) | End-to-end transaction processing and statusing system and method | |
US8671113B2 (en) | Internet delivery system | |
US6662199B1 (en) | Method and apparatus for customized hosted applications | |
US20020069081A1 (en) | Methods and systems for providing employment management services over a network | |
US20030074271A1 (en) | Customizable two step mapping of extensible markup language data in an e-procurement system and method | |
US20080281899A1 (en) | Method for managing commerce contexts | |
US20060218164A1 (en) | Document management device and document management program | |
US20080059429A1 (en) | Integrated search processing method and device | |
TWI280488B (en) | Online intelligent information comparison agent of multilingual electronic data sources over inter-connected computer networks | |
EP1360607A2 (fr) | Systeme de collecte de donnees collaboratives sur le web | |
WO2002027604A2 (fr) | Procede et systeme pour le commerce electronique | |
Kodali | the design and implementation of an e-commerce Site for online book sales | |
CA2360906C (fr) | Methode de mappage d'information a partir d'une source de donnees basee sur reseau | |
Ilapogu | XML-based e-commerce shopping cart application | |
Bohne-Lang et al. | PMD2HD–A web tool aligning a PubMed search results page with the local German Cancer Research Centre library collection |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20030127 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20050104 |