WO2001004775A2 - Procédé pour construire un catalogue électronique homogène - Google Patents
Procédé pour construire un catalogue électronique homogène Download PDFInfo
- Publication number
- WO2001004775A2 WO2001004775A2 PCT/IL2000/000417 IL0000417W WO0104775A2 WO 2001004775 A2 WO2001004775 A2 WO 2001004775A2 IL 0000417 W IL0000417 W IL 0000417W WO 0104775 A2 WO0104775 A2 WO 0104775A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- database
- catalog
- supplier
- fields
- homogenous
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
Definitions
- This invention relates to the generation and use of electronic catalogs.
- a typical concept list includes: item id (standing for a key field that uniquely identifies each item) item name (e.g. Air Jordan shoe); item info (e.g. sport shoe for running).
- item id standing for a key field that uniquely identifies each item
- item info e.g. sport shoe for running
- the concepts include also categories such as sport shoe which include properties, e.g. size, color etc.
- the user has no knowledge on the identity of the relevant suppliers (or at least not on all the relevant suppliers), and hence he/she must conduct the search across the
- the user invokes a query using a search engine utility (e.g. the known Yahoo) and provides query parameters that identify the desired product, category and/or property of interest.
- a search engine utility e.g. the known Yahoo
- the search engine finds a site or sites that include data that match the sought parameters, the data (e.g. relevant html pages) are retrieved and downloaded to the user.
- the so retrieved data are often sorted by some relevance ranking, which is intended to approximate the degree of relevance of the resulting data to the query.
- An exemplary (yet not exclusive) Al-based concept harmonization technique includes automatic learning of the "logic" which governs the classification of category.
- Methods belonging to this approach utilize a set of training data, for which the correct categories are known in advance (usually as the result of manual classification of these categories).
- a learning method may then include a learning phase, in which some model of the category is constructed.
- a model may include terms that are highly associated with the category, and possibly some weights that quantify the degree of correlation between each term and the category.
- a learning method may be memory based, in which case the learning method simply stores the training data in some useful format. Then, when a new item (say product) is given for classification, the method classifies it automatically by consulting or applying the category model (or by simply comparing the new data item to the training data, in case of a memory based approach).
- Relational Model (or Database): The relational model, introduced by Codd, is a landmark in the history of database development.
- an abstract concept has been introduced, according to which the data is represented by tables (referred to as entity or relationship “relations") in which the columns represent the fields and rows represent the records.
- Field A column in a table of a relational database which represents an attribute of a data record (standing for a product) in the table, for example color, size, price in a table that represents clothing.
- a product is represented by some or all of its fields.
- Category Hierarchical structure of concepts represented as category values, to which products or group of products are classified. In the present invention, products are, typically (although not necessarily) classified to category values taken from the leaf nodes of the hierarchy;
- Property A specific field type which signifies a characteristic of given product or products and which is normally not common to all the products in the catalog. As will be shown below, a property is assigned with property values.
- a technique to construct a homogeneous knowledge base (constituting a homogeneous catalog) from a plurality of dispersed knowledge bases, each of which constitutes a separate catalog.
- all or some of the fields are mapped to corresponding fields in the homogeneous catalog (constituting homogenous catalog field structure).
- the catalog field structure which, as will be explained in greater detail below, includes, preferably, fields of "field type", "category type” and "property type"
- the category values and property values of the separate catalogs are mapped to the homogeneous catalog.
- a 'catalog import' step in which the contents of the supplier database is mapped to the homogenous catalog.
- mapping selected fields in the supplier catalog database to corresponding fields in the homogenous catalog database include “field type” fields, "property type” fields and “category type” fields;
- mapping category values in said supplier catalog database to corresponding category values in the homogenous catalog database include "field type” fields, "property type” fields and "category type” fields;
- mapping property values in said supplier catalog database to corresponding property values in the homogenous catalog database mapping property values in said supplier catalog database to corresponding property values in the homogenous catalog database; and transferring data contained in said fields from said supplier catalog database to said homogenous catalog database.
- the category values of the source supplier database are mapped to respective category values in the homogenous catalog using, preferably a "group by" function.
- additional separate catalogs are mapped to the same homogeneous catalog.
- the catalog stores in a unified and homogenous manner, data originated from said separate catalogs.
- the knowledge base is arranged in accordance with the relational model database.
- a typical, yet not exclusive example of a communication network being the Internet.
- the catalog may be subject to queries, utilizing e.g. conventional query languages such as SQL.
- the query that is applied to the homogeneous catalog uses terms which are identical (or substantially identical) to those that constitute the homogeneous catalog, and therefore the prospects of missing data due to inconsistent definitions are substantially reduced or even eliminated.
- the specified terms may be chosen to be field names - (of field, category and/or property type), category values, and /or property values). Other terms may also be used, all as required and appropriate.
- Fig. 1 is a generalized system architecture in accordance with the invention
- Fig. 2 is a flow chart illustrating a generalized sequence of operation, in accordance with the invention
- Fig. 3 is a generalized flow chart illustrating a field mapping sequence in accordance with one embodiment of the invention
- FIG. 4A-B illustrate exemplary user interface screens for realizing the field mapping sequence of Fig. 3
- Fig. 5 is a generalized flow chart illustrating category values mapping sequence in accordance with one embodiment of the invention
- Figs. 6A-B illustrate exemplary user interface screens for realizing the category values mapping sequence of Fig. 5;
- Fig. 7 illustrates a typical hierarchy of categories
- Fig. 8 is a generalized flow chart illustrating a property mapping sequence in accordance with one embodiment of the invention.
- Figs. 9A-C illustrates the resulting homogeneous catalog after mapping the fields, category values and property values, represented in an efficient manner, in accordance with one embodiment of the invention.
- the system (10) includes a homogenous catalog site (12) and a supplier catalog site (14), inter-linked by means of Internet network (16).
- the separate catalog (18) (coupled to conventional desktop 19) at the supplier site (14) is mapped, in a convenient semi-automatic procedure, into homogenous catalog (20) (coupled to desktop 21) at the remote site (12), using a mapping protocol over the internet (16).
- each catalog is held in one physical storage medium, the invention is not bound to any particular physical and/or logical representation of the catalog sites.
- Fig. 2 there is shown a flow chart illustrating a generalized sequence of operation, in accordance with the invention.
- the supplier (14) logs into the homogenous catalog (standing for the server) site (12) and after undergoing known per se admittance control steps (30 to 32), the server catalog "accesses" the supplier's site (standing for the client), and by means of e.g. known per se ODBC driver (33) links to the client's database.
- the catalog database at the client's site is arranged in accordance with the relational model.
- the invention is by no means bound by any particular high-level or low-level model for representing data.
- a flat model where all the data is held in one table is utilized.
- the fields (34) are mapped (including field type, category type and property type). Having mapped the fields (to thereby constitute a field structure), there follows category values mapping (35) followed by property values mapping (36). Having mapped category values and property values, the contents of the supplier catalog is mapped to the homogenous catalog (referred to as catalog import step (37), and thereafter integrity checking steps (38) and (39) are performed, in which errors are rejected (38) and an optional manual modification step is provided (39) (e.g. for inputting missing data, such as contents of fields, say the value of color field for the product jeans trouser). The process terminates by providing a status summary report (39') (e.g. success or error, and in the latter case also indicating the error type).
- a status summary report 39'
- Fig. 3 there is shown a generalized flow chart illustrating a field mapping sequence in accordance with one embodiment of the invention.
- the supplier identifies the tables which are subject to mapping (and which constitute the supplier's catalog).
- the fields of the catalog table or tables and also the contents thereof can be easily identified, all as known er se.
- the fields of the client's catalog are mapped into fields at the server's catalog (41).
- the fields mapping is then tested (42 and 43) and thereafter the catalog and properties (44 and 45) are mapped (see below).
- Figs. 4A-B illustrate an exemplary user interface screens for realizing the field mapping sequence of Fig. 3.
- the communication protocol program that links the client and server sites identifies the appropriate relational tables in the server catalog and client catalog that are subject to mapping
- ItemName is mapped to the corresponding client catalog field name ProdName.
- the homogenous catalog field SizeName is mapped to the corresponding client catalog field name CategorName
- the homogenous catalog field SmallPicture is mapped to the corresponding client catalog field name SmallPic.
- the client is not obliged to map all the existing fields of the catalog into corresponding fields in his local catalog. Put differently, only those fields that are of interest are mapped.
- fields of all types are mapped, normally, "fields”, “category” and “property” (and if desired possibly also others, all as required and appropriate, depending upon the particular application).
- H-Catalog stands for the homogenous catalog side and Data Source Field stands for the supplier's fields as appearing in his/her local database. Focusing now on the h-catalog side, the fields 1,2 and 3 stand for "fields", since they are common attributes to all products. Put differently, every product must have a name (field no. 1), a catalog number (field number 2) and a price (field number 3). Field number 4 stands for "category” (as will be explained in greater detail below).
- Fields 5 and 6 stand for "property".
- property is, as a rule, an attribute of one or more products, but not of all of them.
- size and color are attributes of some products such as shoes and shirts, but not of others, such as tyres (for cars).
- the latter may have other properties such as (tyre) width and (tyre) diameter.
- Category value mapping is illustrated in Figs. 6A and 6B.
- Fig. 6A the category list as extracted from the table of the client catalog is displayed (in the left column under the title Supplier Category), and is mapped manually to corresponding category value (in the right column under the title h- category - designated also as e-FES category ) in the homogenous catalog at the server site.
- the resulting mapping is shown in Fig. 6B.
- the h-category category value Teamclothing trousers/men is mapped to the supplier (client) category value Trousers long Unisex/men.
- Fig. 7 illustrating the hierarchy (tree) of categories (70), of the supplier end.
- the hierarchy tree is a non-limiting form of representing categories.
- the most generalized definition of categories resides (at the top of the hierarchy -referred to also as root node) and more specific definitions reside in lower levels of the tree.
- root (71) represents the general category definition clothing.
- Nodes (72) and (73), lower in the hierarchy, represent more specific category definition (youth clothing and kids' clothing).
- the category values residing at the lowest level of the hierarchy represent the most specific category definition.
- (74) represents sport pants for youth
- category value (75) represents elegant pants for youth.
- a category value may be viewed as concatenation of the nodes from root to leaf.
- clothing -> youth -> pants -> sport corresponds to category value (74).
- the category values In order to map the category values, it is first required to "flatten" the hierarchical representation of categories in order to obtain a list of category values that are subject to mapping (from the supplier catalog to the h-catalog database).
- the flattening results in extracting only the category values of interest, and in the specific example of Fig. 7, this means category values (74 to 78). Whilst in the specific example of Fig. 7 only leaf nodes were extracted (for category value mapping purposes), this is not necessarily always the case. Thus, by an alternative embodiment, higher levels in the hierarchy of categories may also or alternatively be used.
- the flattening procedure is implemented manually, or in a semi-automatic manner.
- the mapping of category values is implemented basically in the same manner as mapping the fields.
- mapping results of some of the category values is illustrated in table 2 below:
- a preliminary "group by" function may be utilized in order to improve the efficiency of the category values mapping.
- a catalog at the supplier end that holds X items all classified to the Sport pants for youth category value and additional Y items all classified to the Jeans trousers for youth category value.
- the table includes X repetitions of the Sport pants for youth category value and Y repetitions of the Jeans trousers for youth category value.
- the property mapping may involve a preliminary "group by" function in order to extract unique property values and avoid repetitions. Having mapped the fields, the category values and the property values, the contents of the catalog at the supplier end may now be imported to the h-catalog database (step 37 in Fig. 2) so as to construct the h-catalog catalog database.
- the data import is realized using known er se data transfer techniques. Of course, the data in the server catalog are organized under the field names of the homogeneous catalog.
- the original catalog (at the supplier site, Table 6), and the mapped catalog (at the h-catalog site, Table 5) are, accordingly, as follows:
- Figs. 9A-C illustrate one out of many possible variants for representing data in the h-catalog site in accordance with one embodiment of the invention.
- the category field is represented as an integral part of table (91).
- Table (92) (Fig. 9B) stands for category table and it includes the key field Category Id and category name. The contents of the category table is, as shown, the distinct category values, i.e. "Sport pants for youth”, “Jeans trousers for youth”, and "Swimsuits and light summer clothing for youth”.
- Fig. 9C stands for "property” having "color” and “size” properties and their respective values green, blue and yellow (for color) and 40, 38 and 32 for size.
- the representation of data in accordance with Fig. 9A-C is, of course, only one out of many known per se manners of representing data and by way of alternative non limiting embodiment the known ERD model may be used. As is well known, the latter enables efficient 1 :N relationship (e.g. a category can be assigned to more than one product) and N:M representation.
- the procedure described with reference to a supplier catalog database is not bound to any specific order or scope.
- the entire database may be mapped in one time or, if desired, the procedure described above may be applied successively to database portions, e.g. applied to each database table separately.
- the procedure described with reference to Figs. 1 to 9 is repeated for each supplier who wishes to subscribe to the homogenous catalog.
- the data of all the separate catalogs are represented in a unified manner in the homogenous catalog and, accordingly, querying the homogenous catalog using the common field, category and/or property nomenclature, will bring about consistent results as compared to the alternative of querying the inconsistent separate catalogs of the suppliers.
- the actual representation of data in the h-catalog and suppler may be one in any known per se manner taking in account depending on e.g. volume and performance considerations.
- Alphabetic characters and roman symbols used to designate method steps are used for convenience of explanation only and do not necessarily imply any particular order steps.
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU58431/00A AU5843100A (en) | 1999-07-14 | 2000-07-14 | A method for constructing a homogeneous electronic catalog |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US35345299A | 1999-07-14 | 1999-07-14 | |
US09/353,452 | 1999-07-14 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2001004775A2 true WO2001004775A2 (fr) | 2001-01-18 |
WO2001004775A3 WO2001004775A3 (fr) | 2003-01-09 |
Family
ID=23389163
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/IL2000/000417 WO2001004775A2 (fr) | 1999-07-14 | 2000-07-14 | Procédé pour construire un catalogue électronique homogène |
Country Status (2)
Country | Link |
---|---|
AU (1) | AU5843100A (fr) |
WO (1) | WO2001004775A2 (fr) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1241597A2 (fr) * | 2001-03-14 | 2002-09-18 | Aktiebolaget SKF | Produit logiciel d'assistance à un utilisateur dans son choix d'unités d'information parmi plusieurs unités d'information structurées concernant des roulements et des joints |
US6871198B2 (en) | 2001-12-21 | 2005-03-22 | Requisite Technology, Inc. | Composing and cataloging item configuration data |
US7039645B1 (en) | 2002-09-26 | 2006-05-02 | Requisite Technology, Inc. | Managing content of an electronic catalog by collaboration with another electronic catalog |
US7979324B2 (en) * | 2007-02-27 | 2011-07-12 | Microsoft Corporation | Virtual catalog |
US8838592B2 (en) | 2007-06-13 | 2014-09-16 | Mlslistings Inc. | Methods and systems for developing a data repository for heterogeneous MLS systems |
CN113986362A (zh) * | 2021-10-22 | 2022-01-28 | 山东云海国创云计算装备产业创新中心有限公司 | 一种raid卡及其控制方法、服务器主机 |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5560005A (en) * | 1994-02-25 | 1996-09-24 | Actamed Corp. | Methods and systems for object-based relational distributed databases |
-
2000
- 2000-07-14 WO PCT/IL2000/000417 patent/WO2001004775A2/fr active Application Filing
- 2000-07-14 AU AU58431/00A patent/AU5843100A/en not_active Abandoned
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5560005A (en) * | 1994-02-25 | 1996-09-24 | Actamed Corp. | Methods and systems for object-based relational distributed databases |
Non-Patent Citations (3)
Title |
---|
CHOY D M ET AL: "A distributed catalog for heterogeneous distributed database resources" PARALLEL AND DISTRIBUTED INFORMATION SYSTEMS, 1991., PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON MIAMI BEACH, FL, USA 4-6 DEC. 1991, LOS ALAMITOS, CA, USA,IEEE COMPUT. SOC, US, 4 December 1991 (1991-12-04), pages 236-244, XP010025430 ISBN: 0-8186-2295-4 * |
KELLER A M ET AL: "MULTI-VENDOR CATALOGS: SMART CATALOGS AND VIRTUAL CATALOGS" EDI FORUM, EDI GROUP, OAK PARK, IL, US, vol. 9, no. 3, September 1996 (1996-09), pages 87-93, XP001056432 ISSN: 1048-3047 * |
SANG-GOO LEE ET AL: "Digital catalog library: a shared repository of online catalogs for electronic commerce" ADVANCE ISSUES OF E-COMMERCE AND WEB-BASED INFORMATION SYSTEMS, WECWIS, 1999. INTERNATIONAL CONFERENCE ON SANTA CLARA, CA, USA 8-9 APRIL 1999, PISCATAWAY, NJ, USA,IEEE, US, 8 April 1999 (1999-04-08), pages 84-86, XP010348777 ISBN: 0-7695-0334-9 * |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1241597A2 (fr) * | 2001-03-14 | 2002-09-18 | Aktiebolaget SKF | Produit logiciel d'assistance à un utilisateur dans son choix d'unités d'information parmi plusieurs unités d'information structurées concernant des roulements et des joints |
EP1241597A3 (fr) * | 2001-03-14 | 2005-12-21 | Aktiebolaget SKF | Produit logiciel d'assistance à un utilisateur dans son choix d'unités d'information parmi plusieurs unités d'information structurées concernant des roulements et des joints |
US6871198B2 (en) | 2001-12-21 | 2005-03-22 | Requisite Technology, Inc. | Composing and cataloging item configuration data |
US7039645B1 (en) | 2002-09-26 | 2006-05-02 | Requisite Technology, Inc. | Managing content of an electronic catalog by collaboration with another electronic catalog |
US7979324B2 (en) * | 2007-02-27 | 2011-07-12 | Microsoft Corporation | Virtual catalog |
US8838592B2 (en) | 2007-06-13 | 2014-09-16 | Mlslistings Inc. | Methods and systems for developing a data repository for heterogeneous MLS systems |
CN113986362A (zh) * | 2021-10-22 | 2022-01-28 | 山东云海国创云计算装备产业创新中心有限公司 | 一种raid卡及其控制方法、服务器主机 |
CN113986362B (zh) * | 2021-10-22 | 2024-01-23 | 山东云海国创云计算装备产业创新中心有限公司 | 一种raid卡及其控制方法、服务器主机 |
Also Published As
Publication number | Publication date |
---|---|
AU5843100A (en) | 2001-01-30 |
WO2001004775A3 (fr) | 2003-01-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6708166B1 (en) | Method and apparatus for storing data as objects, constructing customized data retrieval and data processing requests, and performing householding queries | |
US6199059B1 (en) | System and method for classifying and retrieving information with virtual object hierarchy | |
US7917549B2 (en) | Database interface generator | |
US6665677B1 (en) | System and method for transforming a relational database to a hierarchical database | |
US6944619B2 (en) | System and method for organizing data | |
US7925658B2 (en) | Methods and apparatus for mapping a hierarchical data structure to a flat data structure for use in generating a report | |
US8051102B2 (en) | Data base and knowledge operating system | |
US7725460B2 (en) | Method and system for a transparent application of multiple queries across multiple data sources | |
US8700494B2 (en) | Identifying product variants | |
US8862557B2 (en) | System and method for rule-driven constraint-based generation of domain-specific data sets | |
Li | Knowledge gathering and matching in heterogeneous databases | |
US10127313B2 (en) | Method of retrieving attributes from at least two data sources | |
CN101853295B (zh) | 一种图像检索方法 | |
US20020161757A1 (en) | Simultaneous searching across multiple data sets | |
US20020055932A1 (en) | System and method for comparing heterogeneous data sources | |
US20090106286A1 (en) | Method of Hybrid Searching for Extensible Markup Language (XML) Documents | |
CN103559189B (zh) | 基于元数据集成模型的电力仿真培训资源管理系统及方法 | |
JP5410514B2 (ja) | X500データモデルをリレーショナル・データベースにマッピングするための方法 | |
US7860903B2 (en) | Techniques for generic data extraction | |
CN105930174B (zh) | 一种图形化页面程序差异比较方法与系统 | |
WO2001004775A2 (fr) | Procédé pour construire un catalogue électronique homogène | |
Hansen et al. | DWStar-automated star schema generation | |
Lim et al. | Integrating HTML tables using semantic hierarchies and meta-data sets | |
Nair et al. | A conceptual query-driven design framework for data warehouse | |
Tompa et al. | The Application of Current Database Technology to Videotex |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
122 | Ep: pct application non-entry in european phase | ||
NENP | Non-entry into the national phase in: |
Ref country code: JP |