NL1014288C2

NL1014288C2 - Computerized system for selecting and buying number of objects via electronic network, e.g. Internet, is flexible and easy to program

Info

Publication number: NL1014288C2
Application number: NL1014288A
Authority: NL
Inventors: Harald Kikkers
Original assignee: It Consultancy Group B V
Priority date: 2000-02-04
Filing date: 2000-02-04
Publication date: 2001-08-07

Abstract

The program operates in two phases. In the first phase, the user examines the product data and description (1, 2) and selects a product code. In the second phase, data about the number required, price and availability are added to the results held in the memory (6, 7). All the data is assembled (13) and output as a result (3).

Description

Titel: Gegevenstransformatie 5 De uitvinding heeft betrekking op een werkwijze vol gens de aanhef van conclusie 1.Title: Data transformation The invention relates to a method according to the preamble of claim 1.

Dergelijke werkwijzen zijn uit de praktijk bekend en worden toegepast om uit verschillende brongegevensbestanden een doelgegevensbestand samen te stellen, bijvoorbeeld om 10 binnen een bedrijf de bestanden van verkoop, productie en administratie samen te voegen tot een enkel bestand voor een managementinformatiesysteem, of bijvoorbeeld voor data warehousing.Such methods are known in practice and are used to compile a target database from different source databases, for example to merge the files of sales, production and administration within a company into a single file for a management information system, or for example for data warehousing. .

Hierbij wordt gebruik gemaakt van een gegevenstrans-15 formatieprogramma, dat speciaal is geschreven om de betreffende brongegevens te transformeren naar doelgegevens. Dit transformatieprogramma wordt geregeld, bijvoorbeeld dagelijks of wekelijks, gedraaid om de doelgegevens te vernieuwen en daarmee te actualiseren.It uses a data transformation program specially written to transform the respective source data into target data. This transformation program is run regularly, for example daily or weekly, in order to refresh and update the target data.

2 0 De bekende gegevenstransformatieprogramma's hebben als nadeel dat voor een bepaalde toepassing telkens een speciaal maat gemaakt programma moet worden geschreven. Daarbij komt dat het schrijven van het programma geheel plaats heeft door middel van vrije programmering of door 25 middel van rule-based programmering in combinatie met vrije programmering. Nadelig aan vrije programmering is dat het tijdrovend is en de voor het schrijven van het programma benodigde tijd zeer slecht te voorspellen is. Rule-based programmering geeft als nadeel dat hiermee maar enkele, 30 triviale transformaties kunnen worden uitgevoerd. Daarnaast zijn dergelijke programma's niet flexibel en dienen ingrijpende aanpassingen gemaakt te worden bij veranderingen zoals bijvoorbeeld het toevoegen van andersoortige brongegevens en gewijzigde gewenste doelgegevens.2 0 The known data transformation programs have the drawback that for a particular application a specially tailored program has to be written each time. In addition, the writing of the program takes place entirely by means of free programming or by means of rule-based programming in combination with free programming. The disadvantage of free programming is that it is time-consuming and the time required for writing the program is very difficult to predict. Rule-based programming has the disadvantage that it can only perform a few, 30 trivial transformations. In addition, such programs are not flexible and drastic adjustments have to be made in the event of changes, such as the addition of other types of source data and changed desired target data.

35 De uitvinding heeft tot doel een werkwijze voor het genereren van een computerprogramma voor het uitvoeren van een gegevenstransformatie. te verschaffen waarmee op een systematische en voorspelbare wijze een transformatiepro- 1014288 2 gramma wordt verkregen, waarbij vrije programmering geheel of althans grotendeels achterwege kan blijven.The object of the invention is a method of generating a computer program for performing a data transformation. to provide a transformation program in a systematic and predictable manner, whereby free programming can be completely or at least largely omitted.

Hiertoe voorziet de uitvinding in een werkwijze volgens conclusie 1.To this end, the invention provides a method according to claim 1.

5 Volgens de uitvinding worden de gegevens uit de brongegevensstructuur door middel van een samenvoegingsview in een vorm gebracht die los staat van de brongegevensstructuur. In deze vorm kunnen de brongegevens op een standaard manier verder verwerkt worden. De aldus verkregen en-10 titeiten worden door middel van een samenvoegingsbewerking op logische wijze samengevoegd, gegroepeerd en voorzien van een unieke groepsaanduiding. Op basis van deze gegroepeerde gegevens kan een geheugenstructuur worden opgesteld. Op basis van de geheugenstructuur en de door middel van de sa-15 menvoegingsview geprojecteerde brongegevens kan dan de gewenste doelgegevensstructuur worden samengesteld. Naast sa-menvoegingsviews worden ook de veldwaarden en relaties tussen gegevens uit de brongegevensstructuren door middel van respectievelijk veldwaardeviews en relatieviews getransfor-20 meerd en op de voorgaande wijze verwerkt in de doelgegevensstructuur.According to the invention, the data from the source data structure is brought into a form separate from the source data structure by means of a merging view. In this form, the source data can be further processed in a standard manner. The entities thus obtained are logically combined, grouped and provided with a unique group designation by means of a merging operation. A memory structure can be drawn up on the basis of this grouped data. The desired target data structure can then be compiled on the basis of the memory structure and the source data projected by means of the merging view. In addition to aggregation views, the field values and relationships between data from the source data structures are also transformed by means of field value views and relationship views, respectively, and are processed in the target data structure in the above manner.

Hierdoor is de gegevenstransformatie volgens de uitvinding een proces, bestaande uit de stappen van projectie van de brongegevens, en het uitvoeren van standaard uitvoe-25 ringsregels. Het opstellen van de transformatieregels voor gebruik in de projectiefase is een zeer gestructureerd en daardoor voorspelbaar proces, waarvoor de benodigde tijd en inspanning goed is in te schatten. Na het bepalen van de transformatieregels kan ook goed worden ingeschat hoe snel 30 het transformatieproces zal verlopen. Voor een gegeven bronstructuur en doelstructuur hoeven in principe slechts eenmalig de transformatieregels bepaald te worden. Bij veranderingen in de bron- en/of doelentiteiten kunnen aanpassingen snel en eenvoudig worden aangebracht door de regels 35 voor de projectiefase overeenkomstig aan te passen.Therefore, the data transformation according to the invention is a process consisting of the steps of projection of the source data, and the execution of standard execution rules. Drawing up the transformation rules for use in the projection phase is a highly structured and therefore predictable process, for which the required time and effort can be accurately estimated. After determining the transformation rules, it is also possible to estimate how fast the transformation process will proceed. In principle, the transformation rules need only be determined once for a given source structure and target structure. In the event of changes in the source and / or target entities, adjustments can be made quickly and easily by adjusting the projection phase rules 35 accordingly.

1Dl42êê 31Dl42êê 3

De uitvinding voorziet tevens in een computerprogramma dat is ingericht voor het uitvoeren van de werkwijze volgens de uitvinding.The invention also provides a computer program which is arranged for carrying out the method according to the invention.

De uitvinding zal nader worden toegelicht aan de 5 hand van een uitvoerige beschrijving van een uitvoerings-voorbeeld van de uitvinding aan de hand van de fig. 1. Deze toont een schema van een transformatieproces volgens de uitvinding.The invention will be further elucidated on the basis of a detailed description of an exemplary embodiment of the invention with reference to Fig. 1. This shows a diagram of a transformation process according to the invention.

In een uitvoeringsvoorbeeld volgens de uitvinding is 10 binnen een bedrijf een tweetal brongegevensstructuren aanwezig. De eerste brongegevensstructuur bevat gegevens over het assortiment aan producten dat geleverd wordt door het bedrijf, en wordt bijvoorbeeld beheerd door de inkoopafdeling. De tweede brongegevensstructuur bevat gegevens over 15 de verkooptransacties die zijn uitgevoerd door het bedrijf, bijvoorbeeld afkomstig van de het computersysteem dat de kassa's bestuurt.In an exemplary embodiment according to the invention, two source data structures are present within a company. The first source data structure contains data about the range of products supplied by the company and is managed, for example, by the purchasing department. The second source data structure contains data on the sales transactions carried out by the company, for example from the computer system that controls the cash registers.

In dit uitvoeringsvoorbeeld wordt onder een gegevensstructuur verstaan een verzameling entiteiten van een 20 of meerdere soorten die elk een of meer attributen omvatten, waarbij de attributen voorzien zijn van attribuutwaarden. Dergelijke structuren zijn bekend uit de praktijk, bijvoorbeeld op het gebied van de relationele databases. De fysieke vorm waarin de brongegevens beschikbaar zijn kan op 25 elke gewenste wijze plaats hebben, bijvoorbeeld binnen een of meerdere computers of een netwerk, op één of meerdere harddisks op één of meerdere fysieke locaties; de uitvinding is niet beperkt tot een bepaalde fysieke implementatie van de gegevens.In this exemplary embodiment, a data structure is understood to mean a collection of entities of one or more types, each comprising one or more attributes, the attributes being provided with attribute values. Such structures are known from practice, for example in the field of relational databases. The physical form in which the source data is available can take place in any desired manner, for instance within one or more computers or a network, on one or more hard disks at one or more physical locations; the invention is not limited to any particular physical implementation of the data.

30 Om inzicht te verkrijgen in de verkoopgegevens van het bedrijf dienen gegevens uit beide brongegevensstructuren te worden gecombineerd tot een doelgegevensstructuur. Deze doelgegevensstructuur kan bijvoorbeeld door middel van een managementinformatiesysteem gebruikt worden om informa-35 tie over de verkoop-performance van het bedrijf te verkrijgen.30 To gain insight into the company's sales data, data from both source data structures must be combined into a target data structure. This target data structure can be used, for example, by means of a management information system to obtain information about the sales performance of the company.

. i30l4il§ 4. i30l4il§ 4

Hiertoe wordt eerst bepaald welke gegevens uit de bronnen in de doelgegevensstructuur moeten worden opgenomen, om de gewenste informatie in de doelgegevensstructuur te verkrijgen.To this end, it is first determined which data from the sources must be included in the target data structure, in order to obtain the desired information in the target data structure.

5 Volgens de uitvinding wordt hiertoe de samenstelling van de eerste brongegevensstructuur, in dit geval de pro-ductgegevensstructuur, geanalyseerd waarbij gekeken wordt naar de aanwezige brongegevens en op welke wijze deze gegevens zijn opgeslagen. In het bijzonder is hierbij van be-10 lang in welke structuur, bijvoorbeeld entiteit-typen en attributen, de gegevens beschikbaar zijn.According to the invention, the composition of the first source data structure, in this case the product data structure, is analyzed for this purpose, taking into account the source data present and how this data is stored. In particular, it is important here in which structure, for example entity types and attributes, the data are available.

Op gelijke wijze wordt tevens de samenstelling van de tweede brongegevensstructuur, in dit geval de verkoop-transactiegegevensstructuur, geanalyseerd.Likewise, the composition of the second source data structure, in this case the sales-transaction data structure, is also analyzed.

15 De productgegevensstructuur is opgebouwd uit enti teiten van het in de onderstaande tabel 1 beschreven enti-teit-type PRODUCT, dat voorzien is van een tweetal attributen te weten PRODUCT_CODE en PRODUCTOMSCHRIJVING. Hierbij geeft PRODUCT_CODE een codering weer van een product dat in 20 het bedrijf aanwezig is. PRODUCTOMSCHRIJVING geeft een beschrijving van het product in woorden en leestekens.15 The product data structure is made up of entities of the entity type PRODUCT described in table 1 below, which has two attributes, namely PRODUCT_CODE and PRODUCT DESCRIPTION. PRODUCT_CODE represents an encoding of a product that is present in the company. PRODUCT DESCRIPTION provides a description of the product in words and punctuation marks.

Tabel 1. Entiteit-type van productgegevensstructuur (brongegevensstructuur 1) _Entiteit-type__Attributen_ _PRODUCT___PRODUCT CODE_Table 1. Entity type of product data structure (source data structure 1) _Entity type__Attributes_ _PRODUCT___PRODUCT CODE_

___PRODUCT OMSCHRIJVING___PRODUCT DESCRIPTION

2525

De verkooptransactiegegevensstructuur (brongegevensstructuur 2) is opgebouwd uit entiteiten van het in de onderstaande tabel 2 beschreven entiteit-type VERKOOPJTRANSACTIE, dat voorzien is van een vijftal attri-30 buten, te weten PRODUCT_COE)E, AANTAL_VERKOCHT, VERKOOP_DATUM, VERKOOP_Tijd- en VERKOOPBEDRAG. Hierbij geeft PRODUCT_C°DE een codering weer van een product dat in het bedrijf aanwezig is, overigens komt deze code niet *014288 5 noodzakelijkerwijs overeen met de code van het PRODUCT-type. De attributen AANTAL_VERKOCHT, VERKOOP_DATUM, VERKOOP_TIJD, en VERKOOP_BEDRAG beschrijven elk een variabele die met de respectieve aanduiding overeenkomt.The sales transaction data structure (source data structure 2) is composed of entities of the entity type SALES TRANSACTION described in Table 2 below, which includes five attributes, namely PRODUCT_COE) E, NUMBER_SOLD, SALES_DATE, SALES_TIME and SALES AMOUNT. PRODUCT_C ° DE represents an encoding of a product that is present in the company, incidentally, this code does not * 014288 5 necessarily correspond to the PRODUCT-type code. The attributes COUNT_SOLD, SALES_DATE, SALES_TIME, and SALES_Amount each describe a variable corresponding to the respective designation.

55

Tabel 2. Entiteit-type van verkooptransactiegege-vensstructuur (brongegevensstructuur 2) -_Enteit-type__Attributen_ _ VERKOOP TRANSACTIE__PRODUCT CODE_ __AANTAL VERKOCHT_ __VERKOOP DATUM_ __VERKOOP TIJD_ '_ VERKOOP BEDRAG_ 10 In dit uitvoeringsvoorbeeld is de gewenste doelgege- vensstructuur opgebouwd uit entiteiten van het type zoals beschreven in tabel 3 om de gewenste gegevens aan het informatiesysteem te leveren.Table 2. Entity-type of sales transaction data structure (source data structure 2) -_Enity type__Attributes_ _ SALES TRANSACTION_PRODUCT CODE_ __NUMBER OF SOLD_ __SALES DATE_ __SALES TIME_ '_ SALES AMOUNT_ 10 In this implementation example, the type of target data is built up from the entity data in Table 3 to provide the desired data to the information system.

15 Tabel 3. Entiteit-typen van doelgegevensstructuur15 Table 3. Entity types of target data structure

Entiteit-type : AttributenEntity type: Attributes

PRODUCT PRODUCT SLEUTELPRODUCT PRODUCT KEY

__PRODUCT CODE_ __PRODUCT OMSCHRIJVING _ __AANTAL VERKOCHT___PRODUCT CODE_ __PRODUCT DESCRIPTION _ __NUMBER SOLD_

OMZETREVENUE

TRANSACTIE PRODUCT SLEUTELTRANSACTION PRODUCT KEY

AANTAL VERKOCHT VERKOOP DATUMNUMBER OF SOLD SALE DATE

VERKOPP~BEDRAG ~~SELL ~ AMOUNT ~~

De doelgegevensstructuur heeft twee entiteit-typen, te weten PRODUCT en TRANSACTIE. De attributen van de enti-20 text-typen PRODUCT en TRANSACTIE (te weten AANTAL_VERKOCHT, VERKOOPJDATUM, PRODUCT_CODE, PRODUCTOMSCHRIJVING, en VERKOOP_BEDRAG) zijn gelijk aan de respectieve attributen uit de brongegevensstructuren. Het attribuut OMZET wordt bepaald tijdens de transformatie uit attributen uit de i * d, —-The target data structure has two entity types, PRODUCT and TRANSACTION. The attributes of the enti-20 text types PRODUCT and TRANSACTION (namely NUMBER_SOLD, SALESDATE, PRODUCT_CODE, PRODUCT DESCRIPTION, and SALES_Amount) are the same as the attributes from the source data structures. The attribute TURNOVER is determined during the transformation from attributes from i * d, -

i ; : i ’· ,· . ; · Qi; : i ’·, ·. ; Q

J ^ 3 “f· |L ö QJ ^ 3 “f · L ö Q

6 brongegevensstructuur, in dit geval uit het attribuut AANTAL_VERKOCHT en VERKOOP_BEDRAG. Het attribuut PRODUCT_SLEUTEL wordt tijdens de transformatie bepaald, waarbij rekening wordt gehouden met mogelijke eerder 5 (tijdens een mogelijke vorige uitvoering van het transformatieproces) toegekende waarden.6 source data structure, in this case from the attribute COUNT_SOLD and SALES_Amount. The PRODUCT_KEY attribute is determined during the transformation, taking into account possible values previously assigned (during a possible previous execution of the transformation process).

Uit het voorafgaande is de structuur bekend van beide brongegevensstructuren en de gewenste doelgegevensstruc-tuur. Hieruit kunnen de vereiste transformatieregels be-10 paald worden. Voor de gegevenstransformatie volgens de uitvinding zijn de volgende vereisten opgesteld om in dit uit-voeringsvoorbeeld uit de brongegevens de gewenste doelgege-vens te verkrijgen: - Bij de gegevenstransformatie dienen gegenereerde 15 identificaties in het attribuut PRODUCT_SLEUTEL persistent te blijven. Door dit vereiste blijven eenmaal toegekende identificaties gelijk bij opeenvolgende door transformatie verkregen doelstructuren. Hierdoor is het mogelijk op verschillende tijdstippen bepaalde doelstructuren met elkaar 20 te vergelijken.From the foregoing, the structure of both source data structures and the desired target data structure is known. The required transformation rules can be determined from this. For the data transformation according to the invention, the following requirements have been set up in order to obtain the desired target data from the source data in this exemplary embodiment: - In the data transformation, generated identifications in the PRODUCT_Key attribute must remain persistent. Due to this requirement, once assigned identifications remain the same on successive transformation-derived target structures. This makes it possible to compare certain target structures with each other at different times.

- Producten met een verschillende PR0DUCT_C0DE, maar met overeenkomstige PRODUCTOMSCHRIJVING dienen samengevoegd te worden, waarbij het resultaat in hoofdletters zonder leestekens wordt geschreven. Hiertoe wordt een niet na- 25 der uitgewerkte functie "unifyT^schr()" gebruikt. Door dit vereiste wordt bereikt dat producten die logischerwijs gelijk zijn, maar een verschillende codering hebben, in de doelstructuur onder eenzelfde aanduiding voorkomen.- Products with different PR0DUCT_C0DE, but with corresponding PRODUCT DESCRIPTION should be merged, with the result written in capital letters without punctuation marks. For this purpose a function "unifyT ^ schr ()", which is not further elaborated, is used. This requirement ensures that products that are logically identical but have different coding appear in the target structure under the same designation.

- Bij de hiervoor genoemde samenvoeging van produc-30 ten dienen geen transactie-gegevens verloren te gaan.- In the aforementioned combination of products, no transaction data should be lost.

- De attributen AANTAL_VERKOCHT en OMZET worden berekend uit de betreffende gegevens van TRANSACTIE.- The attributes NUMBER_SOLD and SALES are calculated from the relevant data of TRANSACTION.

- Fictieve PRODUCT attributen dienen te worden aangemaakt voor elke unieke PR0DUCT_C0DE in VERKOOP_TRANSACTIE- Fictitious PRODUCT attributes must be created for each unique PR0DUCT_C0DE in SALES_TRANSACTION

35 die niet voorkomt in PRODUCT. Door dit aanvullen van missende attributen wordt bereikt dat de bepaalde gegevens 1014288 7 uniform zijn zodat deze binnen het transformatieproces eenvoudig op een standaard wijze verwerkt kunnen worden.35 that does not appear in PRODUCT. This addition of missing attributes ensures that the determined data 1014288 7 are uniform, so that they can easily be processed in a standard manner within the transformation process.

- VERKOOP_TRANSACTIES dienen opgeteld te worden per PRODUCT_CODE per VERKOOP_DATUM.- SALES_TRANSACTIONS must be added per PRODUCT_CODE per SALES_DATE.

5 - Bij samenvoegingen dient voorrang te worden gege ven aan de PRODUCT_CODE-s en PRODUCTOMSCHRIJVING-en uit de productgegevensstructuur. In geval van meerdere alternatieven binnen de productgegevensstructuur dient voor de laagste waarde te worden gekozen. Hierdoor wordt op een eenvou-10 dige wijze een eenduidige bepaling verkregen.5 - In the case of aggregations, priority must be given to the PRODUCT_CODEs and PRODUCT DESCRIPTION and the product data structure. In case of several alternatives within the product data structure, the lowest value should be chosen. This provides an unambiguous determination in a simple manner.

Bij deze eisen wordt aangenomen dat de PRODUCT_C°DE in de productgegevensstructuur uniek is.These requirements assume that the PRODUCT_C ° DE is unique in the product data structure.

De bovenvermelde eisen worden volgens de uitvinding beschreven in transformatieregels, waarvan er drie typen 15 bestaan te weten samenvoeging, veldwaarde en relatie. De voor de gegevenstransformatie vereiste bewerkingen kunnen volgens de uitvinding beschreven worden in deze drie typen transformatieregels.The above requirements are described according to the invention in transformation rules, of which three types exist, namely aggregation, field value and relationship. The operations required for the data transformation can be described according to the invention in these three types of transformation rules.

In dit uitvoeringsvoorbeeld zijn de transformatiere-20 gels uitgewerkt in Structured Query Language (SQL). SQL is een algemene taal voor het uitvoeren van bewerkingen op relationele databases. De uitvinding is niet beperkt tot uitvoering in deze taal; de uitvinding kan tevens zijn belichaamd in een andere computertaal die geschikt is voor het 25 bewerken van gegevensstructuren.In this exemplary embodiment, the transformation rows are elaborated in Structured Query Language (SQL). SQL is a common language for performing operations on relational databases. The invention is not limited to implementation in this language; the invention may also be embodied in another computer language suitable for manipulating data structures.

In dit voorbeeld zijn de transformatieregels uitgevoerd als zogenaamde views. Een view omvat een of meerdere selectiecriteria of instructies om uit een verzameling gegevens een deelverzameling gegevens te vormen die aan de 30 selectiecriteria voldoen. Daarnaast kunnen ook instructies zijn opgenomen die de geselecteerde gegevens bewerken of gegevens daaraan toevoegen. In de context van dit uitvoeringsvoorbeeld levert de toepassing van een view op een bronentiteitenverzameling, zoals een brongegevensstructuur, 35 een verzameling entiteiten op.In this example, the transformation rules are implemented as so-called views. A view contains one or more selection criteria or instructions to form a subset of data that meets the selection criteria from a collection of data. In addition, instructions may also be included to edit or add data to the selected data. In the context of this exemplary embodiment, the application of a view to a source entity set, such as a source data structure, yields a set of entities.

1014288 81014288 8

In dit voorbeeld heeft de gewenste doelstructuur twee entiteittypen. Hiertoe worden volgens de uitvinding twee, met de respectieve doelentiteittypen overeenkomende, samenvoegingsviews bepaald. De samenvoegingsviews worden zo 5 ingericht dat toepassing van de samenvoegingsviews op de bronstructuren de voor de betreffende doelentiteit benodigde informatie uit de bronstructuren wordt verkregen en in een generiek formaat wordt gebracht.In this example, the desired target structure has two entity types. According to the invention, two aggregation views corresponding to the respective target entity types are determined for this purpose. The merge views are arranged in such a way that application of the merge views to the source structures, the information required for the relevant target entity is obtained from the source structures and is brought into a generic format.

Toepassing van een samenvoegingsview volgens de uit- 10 vinding op een of meerdere brongegevensstructuren levert een entiteitenverzameling op van een entiteit-type met tenminste de volgende attributen: - SID (System IDentification): een unieke identificatie van de brongegevensstructuur, in dit voorbeeld 1 of 15 2 .Application of a merging view according to the invention to one or more source data structures yields an entity set of an entity type with at least the following attributes: - SID (System IDentification): a unique identification of the source data structure, in this example 1 or 15 2.

- DID (Data IDentification): een unieke identificatie van entiteit binnen een brongegevensstructuur.- DID (Data IDentification): a unique identification of entity within a source data structure.

Gecombineerd duiden SID en DID eenduidig een bepaalde entiteit aan.Combined, SID and DID uniquely identify a particular entity.

20 - typ: aanduiding van het samenvoegingscriterium, bijvoorbeeld in de vorm van een naam.20 - type: designation of the combination criterion, for example in the form of a name.

- val: waarde uit de bron behorende bij het samenvoegingscriterium, zoals bijvoorbeeld een bepaald attribuut uit de brongegevens, of het resultaat van een bewerking 25 uitgevoerd op een of meerdere attributen van de brongegevens .val: value from the source associated with the aggregation criterion, such as, for example, a certain attribute from the source data, or the result of an operation performed on one or more attributes of the source data.

Hierbij komen de attributen typ en val in combinatie voor; volgens de uitvinding kunnen meerdere combinaties van attributen typ en val voorkomen per samenvoegingsview.The attributes typ and fall occur in combination; according to the invention, multiple combinations of attributes type and trap can occur per merge view.

30 Met samenvoegings-views volgens de uitvinding kunnen onder meer de volgende functies worden uitgevoerd: - persistentie van identificatie van doelgegevens in de doelgegevens-structuren, waarbij een door het samenvoegingsview bepaalde entiteit die reeds in een eerder trans- 35 formatieproces was bepaald voorzien wordt van een gelijke identificatie, en 10 14288 9 - ontdubbelingsfaciliteiten op entiteit-niveau, waarmee redundante entiteiten uit de brongegevenstructuren kunnen worden verwijderd.With merge views according to the invention, the following functions can be performed, among others: - persistence of identification of target data in the target data structures, whereby an entity determined by the merge view that was already determined in a previous transformation process is provided with Equal identification, and 10 14288 9 - Entity-level deduplication facilities, which allow redundant entities to be removed from source data structures.

Voor de bovengenoemde gegevenstransformatie volgens 5 het uitvoeringsvoorbeeld zijn de volgende twee voorbeelden van samenvoegingsviews volgens de uitvinding opgesteld waarin een deel van de transformatieregels is verwerkt. De views zijn weergegeven in SQL, en betreffen respectievelijk de samenvoegingsview SW_PRODUCT en de samenvoegingsview 10 SW_TRANSACTIE. Hierbij dient de samenvoegingsview SW_PRODUCT te worden toegepast op de brongegevensstructuur 1 (PRODUCT) en de samenvoegingsview SW_TRANSACTIE te worden toegepast op de brongegevensstructuur 2 (TRANSACTIE). Resultaat van toepassing van beide samenvoegingsviews is 15 dat een entiteitenverzameling wordt bepaald die een te voren bepaalde, generieke vorm heeft die los staat van de vorm waarin de brongegevens in eerste instantie beschikbaar waren. Hierdoor wordt de verdere bewerking van de gegevens aanzienlijk vereenvoudigd.For the aforementioned data transformation according to the exemplary embodiment, the following two examples of aggregation views according to the invention have been prepared incorporating part of the transformation rules. The views are presented in SQL, and are respectively the SW_PRODUCT merge view and the SW_TRANSACTIE merge view. Here, the merge view SW_PRODUCT must be applied to the source data structure 1 (PRODUCT) and the merge view SW_TRANSACTIE must be applied to the source data structure 2 (TRANSACTION). As a result of applying both merge views, an entity set is determined that has a predetermined generic form that is separate from the form in which the source data was initially available. This further simplifies the further processing of the data.

20 Code-fragment 1. Samenvoegings-view SW_PRODUCT: create view SW_PRODUCT ( sid, did, 25 typ, val ) as select 1 sid, 3 0 PRODUCTJ30DE did, 'CODE' typ, PRODUCT_CODE val from PRODUCT union all 35 select distinct 1 sid, PRODUCT_CODE did, 'OMSCHR' typ, unify_omschr (PRODUCTOMSCHRIJVING) val from PRODUCT 40 union all select distinct 2 sid, PRODUCT_CODE did, "CODE1 typ, PRODUCT_CODE val 1014288 10 I from VERKOOPJTRANSACTIE;20 Code snippet 1. Merge view SW_PRODUCT: create view SW_PRODUCT (sid, did, 25 typ, val) as select 1 sid, 3 0 PRODUCTJ30DE did, 'CODE' typ, PRODUCT_CODE val from PRODUCT union all 35 select distinct 1 sid , PRODUCT_CODE did, 'DESCRIBE' typ, unify_omschr (PRODUCT DESCRIPTION) val from PRODUCT 40 union all select distinct 2 sid, PRODUCT_CODE did, "CODE1 typ, PRODUCT_CODE val 1014288 10 I from SALES TRANSACTION;

Pseudo-code fragment 2. Samenvoegings-view SW_TRANSACTIE:Pseudo-code fragment 2. Merge view SW_TRANSACTIE:

5 create view SW_TRANSACTIE5 create view SW_TRANSACTION

( sid, did, typ, val 10 ) as select distinct 2 sid, PRODUCT_CODEI|VERKOOP_DATUM did, 15 " CODE_DAG1 typ, PRODUCT_CODEI|VERKOOP_DATUM val from VERKOOPJTRANSACTIE;(sid, did, typ, val 10) as select distinct 2 sid, PRODUCT_CODEI | SALES_DATE did, 15 "CODE_DAY1 typ, PRODUCT_CODEI | SALES_DATE val from SALES TRANSACTION;

In dit voorbeeld heeft de gewenste doelstructuur 20 twee entiteittypen. Hiertoe worden volgens de uitvinding twee, met de respectieve doelentiteittypen overeenkomende, veldwaardeviews bepaald. De veldwaardeviews worden zo ingericht dat toepassing van de veldwaardeviews op de bron-structuren de voor de betreffende doelentiteit benodigde 25 informatie uit de bronstructuren wordt verkregen en in een generiek formaat wordt gebracht. In het bij zonder worden de attribuutwaarden voor de attributen van de doelentiteiten bepaald.In this example, the desired target structure 20 has two entity types. According to the invention two field value views corresponding to the respective target entity types are determined for this purpose. The field value views are arranged in such a way that application of the field value views to the source structures, the information required for the relevant target entity is obtained from the source structures and is brought into a generic format. In particular, the attribute values for the attributes of the target entities are determined.

Toepassing van een veldwaardeview volgens de uitvin-30 ding op een of meerdere gegevensstructuren levert een entiteitenverzameling op van een entiteit-type met tenminste de volgende attributen: - SID (System IDentification) unieke identificatie van de brongegevensstructuur, in dit voorbeeld 1 of 2.Application of a field value view according to the invention to one or more data structures yields an entity set of an entity type with at least the following attributes: - SID (System IDentification) unique identification of the source data structure, in this example 1 or 2.

35 - DID (Data IDentification) unieke identificatie van entiteit binnen een brongegevensstructuur.35 - DID (Data IDentification) unique identification of entity within a source data structure.

Gecombineerd duiden SID en DID eenduidig een bepaalde entiteit aan. Overigens kan de uitvinding ook worden toegepast met een enkel attribuut ter eenduidige aanduiding 40 van een entiteit.Combined, SID and DID uniquely identify a particular entity. Incidentally, the invention can also be applied with a single attribute to unambiguously designate an entity.

1014288 11 - doelattributen; één of meerdere attributen uit de brongegevensstructuur die elk overeenkomen met een attribuut van een doelentiteit.1014288 11 - target attributes; one or more attributes from the source data structure, each corresponding to an attribute of a target entity.

Met veldwaardeviews volgens de uitvinding kunnen onder 5 meer de volgende functies worden uitgevoerd: - formaat-uniformering, waardoor gegevens die eenzelfde betekenis hebben, maar in de bron(nen) op verschillende wijzen zijn weergegeven, in de doelgegevensstructuur op gelijke wijze worden weergegeven, 10 - keuzefaciliteiten op attribuut en attribuut- corabinatie-niveau, waardoor tegenstrijdige gegevens uit de gegevensbronnen kunnen worden verwijderd.Field value views according to the invention can, among other things, perform the following functions: - format uniformity, whereby data that have the same meaning, but are represented in different ways in the source (s), are displayed in the same way in the target data structure, 10 - facilities at attribute and attribute corbination level, through which contradictory data can be removed from the data sources.

Voor de bovengenoemde gegevens trans format ie volgens het uitvoeringsvoorbeeld zijn de volgende twee voorbeelden 15 van veldwaardeviews volgens de uitvinding opgesteld en weergegeven in SQL, te weten de veldwaardeview VWV_PRODUCT en de veldwaardeview VWV_TRANSACTIE. Hierbij dient de veldwaardeview VWV_PRODUCT te worden toegepast op de bronstruc-turen en de veldwaardeview VWV_TRANSACTIE te worden toege-20 past op de bronstructuren. Resultaat van toepassing van beide views is dat een entitéitenverzameling wordt bepaald die een te voren bepaalde, generieke vorm heeft die los staat van de vorm waarin de brongegevens in eerste instantie beschikbaar waren. Hierdoor wordt de verdere bewerking 25 van de gegevens aanzienlijk vereenvoudigd. Hierdoor kunnen later de waarden van de entiteiten in de doelgegevensstructuur worden ingevuld samen met de door samenvoegingsviews bepaalde entiteiten.For the above-mentioned data transformation according to the exemplary embodiment, the following two examples of field value views according to the invention are drawn up and shown in SQL, namely the field value view VWV_PRODUCT and the field value view VWV_TRANSACTIE. The VWV_PRODUCT field value view should be applied to the source structures and the VWV_TRANSACTIE field value view should be applied to the source structures. The result of applying both views is that an entity set is determined that has a predetermined, generic form that is separate from the form in which the source data was initially available. This further simplifies the further processing of the data. This allows the values of the entities in the target data structure to be populated later along with the entities determined by merge views.

Pseudo-code fragment 3. Veldwaarde-view VWV_PRODUCTPseudo-code fragment 3. Field value view VWV_PRODUCT

30 create view VWV_PRODUCT ( sid, did, PRODUCT_CODE, 3 5 PRODUCT_OMSCHRIJVING, AANTAL_VERKOCHT,30 create view VWV_PRODUCT (sid, did, PRODUCT_CODE, 3 5 PRODUCT_DESCRIPTION, NUMBER_SOLD,

OMZETREVENUE

) as 1014288 12 select 1 sid, PRODUCT_CODE did, PRODUCTOMSCHRIJVING PRODUCTOMSCHRIJVING, 0 aantal_verkocht , 5 0 ΟΜΖΕΤ from PRODUCT union all select 2 sid, PRODUCT_CODe did,) as 1014288 12 select 1 sid, PRODUCT_CODE did, PRODUCT DESCRIPTION PRODUCT DESCRIPTION, 0 qty. sold, 5 0 ΟΜΖΕΤ from PRODUCT union all select 2 sid, PRODUCT_CODe did,

10 'fictief ' I IPRODUCT_CODE10 'fictional' I IPRODUCT_CODE

PRODUCT_Omschri'JV!NG, sum (AANTAL_VERKOCHT) AANTAL_VERKOCHT,PRODUCT_DescriptionJV! NG, sum (NUMBER_SOLD) NUMBER_SOLD,

sum(OMZET) OMZETsum (TURNOVER) TURNOVER

from TRANSACTIEfrom TRANSACTION

15 group by sid, did, PRODUCTOMSCHRIJVING ;15 group by sid, did, PRODUCT DESCRIPTION;

Pseudo-code fragment 4. Veldwaarde-view VWVORANSACTIEPseudo-code fragment 4. Field value view VWVORANSACTIE

20 create view VWJTRANSACTIE ( sid, did, AANTAL_VERKOCHT, 25 VERKOOPOATUM/ verkoop_bedrag ) as select 2 sid, 3 0 PRODUCTCODE] |VERKOOP_DATUM did, sum (AANTAL_VERKOCHT) AANTAL_VERKOCHT, VERKOOP_DATUM VERKOOPJDATUM,20 create view VWJTRANSACTIE (sid, did, NUMBER_SOLD, 25 SALES_ATUM / sales_amount) as select 2 sid, 3 0 PRODUCT CODE] | SALES_DATE did, sum (NUMBER_SOLD) NUMBER_SOLD, SALES_DATE SALESDATE,

sum(OMZET) OMZETsum (TURNOVER) TURNOVER

from TRANSACTIEfrom TRANSACTION

35 group by sid, did, verkoopoatum ;35 group by sid, did, sales date;

In dit voorbeeld heeft de gewenste doelstructuur 40 twee entiteittypen. Hiertoe wordt volgens de uitvinding een relatieview bepaald die de relatie tussen de twee doelenti-teittypen legt, die van verschillende bronstructuren afkomstig kunnen zijn. De relatieview wordt zo ingericht dat toepassing van de hiervan op de bronstructuren de voor het 45 samenstellen van de doelentiteiten benodigde informatie betreffende de relatie tussen de entiteiten van de bronstructuren wordt verkregen.In this example, the desired target structure 40 has two entity types. To this end, according to the invention, a relationship view is determined which establishes the relationship between the two target types, which may come from different source structures. The relationship view is arranged in such a way that application of the information thereof to the source structures obtains the information required for the composition of the target entities regarding the relationship between the entities of the source structures.

i w ; v v 0 13i w; v v 0 13

Toepassing van een relatieview volgens de uitvinding op een of meerdere gegevensstructuren levert een entiteitenverzameling op van een entiteit-type met tenminste de volgende attributen: 5 - from_sid: (System IDentification) unieke identifi catie van de brongegevensstructuur, in dit voorbeeld 1 of 2.Application of a relationship view according to the invention to one or more data structures yields an entity set of an entity type with at least the following attributes: 5 - from_sid: (System IDentification) unique identification of the source data structure, in this example 1 or 2.

- from_did: (Data IDentification) unieke identificatie van een eerste entiteit.- from_did: (Data IDentification) unique identification of a first entity.

10 - to_did: unieke identificatie van een tweede enti teit waarnaar door de eerste entiteit wordt verwezen.10 - to_did: unique identification of a second entity referenced by the first entity.

- to_sid: unieke identificatie van een tweede brongegevensstructuur waarnaar door de eerste entiteit wordt 15 verwezen.- to_sid: unique identification of a second source data structure referenced by the first entity.

De combinatie from_sid en from_did duiden eenduidig een bepaalde eerste, startentiteit aan, van de door de relatieview gespecificeerde relatie. Overeenkomstig geeft de combinatie van to_sid en to_did eenduidig een bepaalde 20 tweede, doelentiteit aan van de door de relatieview gespecificeerde relatie. Overigens kunnen volgens de uitvinding ook andere geschikte identificaties gebruikt worden voor de start- en doelentiteit.The combination from_sid and from_did unambiguously indicate a specific first, starting entity, of the relationship specified by the relationship view. Correspondingly, the combination of to_sid and to_did unambiguously indicates a certain second, target entity of the relationship specified by the relationship view. Incidentally, according to the invention other suitable identifications can also be used for the start and target entity.

Met relatieviews volgens de uitvinding kunnen onder 25 meer relaterings-faciliteiten worden uitgevoerd, waarmee relaties tussen entiteiten in de doelgegevens-structuren worden samengesteld en onderhouden, door middel van als zodanig voor relationele gegevensbestand bekende methoden, waarbij entiteiten via verwijzende sleutels (zoals bijvoor-30 beeld een foreign key) wijzen naar de uniek identificerende sleutel (zoals bijvoorbeeld een primary key) van een andere entiteit.Relationship views according to the invention may, inter alia, provide relationship facilities, with which relationships between entities in the target data structures are constructed and maintained, using methods known per se for relational databases, whereby entities are provided via referring keys (such as e.g. image a foreign key) point to the uniquely identifying key (such as a primary key) of another entity.

Pseudo-code fragment 5. Relatie-view RLV_TRANSACTIE_PRODUCTPseudo-code fragment 5. Relation view RLV_TRANSACTIE_PRODUCT

35 create view RLV_TRANSACTIE_PRODUCT35 create view RLV_TRANSACTIE_PRODUCT

( from_sid, from_did, 1014288 14 to_sid, to_did ) as 5 select distinct 2 from_sid; PRODUCT_CODEI IVERKOOP_DATUM from_did, PRODUCT_CODE to_did 2 to_sid, f rom TRANSACTIE; 10(from_sid, from_did, 1014288 14 to_sid, to_did) as 5 select distinct 2 from_sid; PRODUCT_CODEI SALE_DATE from_did, PRODUCT_CODE to_did 2 to_sid, f rom TRANSACTION; 10

In het navolgende zal een voorbeeld van een transformatieproces volgens de uitvinding worden beschreven aan de hand van een voorbeeld onder verwijzing naar fig 1.In the following, an example of a transformation process according to the invention will be described with reference to an example with reference to Fig. 1.

Hierbij wordt gebruik gemaakt van de hiervoor beschreven 15 views. In het voorbeeld zijn twee brongegevensstructuren 1 en 2 (zie tabel 4 en 5, en fig. 1) van de in de tabellen 1 en 2 getoonde typen voorzien van entiteiten met van waarden voorziene attributen. Hierbij vormt een rij in de tabel 4 en 5 een entiteit, en zijn de kolommen attributen, waarbij 20 op elke betreffende positie een attribuutwaarde is ingevuld.Use is made of the 15 views described above. In the example, two source data structures 1 and 2 (see Tables 4 and 5, and Figure 1) of the types shown in Tables 1 and 2 are provided with entities with valued attributes. A row in Tables 4 and 5 forms an entity, and the columns are attributes, with an attribute value being entered at each relevant position.

Tabel 4. PRODUCT (brongegevensstructuur 1) -PRODUCT”'CODE ('FRÖDUCTOMSCHRIJVING- _MYOGIO__1 liter magere yoghurt_ _HMLK10__1 liter halfvolle melk_ _VMLK05_ volle melk, halve literTable 4. PRODUCT (source data structure 1) -PRODUCT ”'CODE (' FRÖDUCT DESCRIPTION- _MYOGIO__1 liter of low-fat yogurt_ _HMLK10__1 liter of semi-skimmed milk_ _VMLK05_ whole milk, half a liter

VMLKHL VOLLE MELK HALVE LITERVMLKHL FULL MILK HALF LITER

25 tabel 5. VERKOOP_TRANSACTIE (brongegevensstructuur 2)25 table 5. SALES_TRANSACTION (source data structure 2)

PKÜBUCT_CODE AANTAirVERKOCHT j VERKOOP DATUM VERKÖÜ1TTÏJD VERKOOP BEDRAGPKÜBUCT_CODE SUMMER SOLD j SALES DATE SOLD SALES AMOUNT

' MifOGlO 3 10-07-1999 13 :17 6,30 HM1.K10 2 10-07-1999 14:28 3,9Ö -VMLK05--3--10-07-1999--14:56--3750 -VMLKHL--S--10-07-1999--15:24--7700 -MYOGIO--3--1T-07-1999--097TB--6770- -HMLK10--2--ΓΤ-07-1999--Ï0T30--3750 -7HCK05--3--ΓΤ-07-1999--TT705--3750- -VMLK05--1--11-07-1999--12:09--07B5- VMLKHL 8 11-07-1999 11:16 7700 ~ COCÖLA ........ 'L ............---11707-1999 -------TTTJB" ιι,ιο ~ 30 Volgens het transformatieproces volgens de uitvin ding worden de beide samenvoegingsviews, zoals getoond in 1014288 15 codefragment 1 en 2 toegepast, ook wel genoemd geprojecteerd, op de respectieve brongegevensstructuren 1 en 2 (4 en 5 in fig. 1). De resultaten van deze bewerking zijn te vinden in de tabellen 6 en 7.'MifOGlO 3 10-07-1999 13: 17 6.30 HM1.K10 2 10-07-1999 14:28 3.9Ö -VMLK05--3--10-07-1999--14: 56--3750 - VMLKHL - S - 10-07-1999--15: 24--7700 -MYOGIO - 3--1T-07-1999--097TB - 6770- -HMLK10--2 - ΓΤ-07-1999 --Ï0T30--3750 -7HCK05--3 - ΓΤ-07-1999 - TT705--3750- -VMLK05--1--11-07-1999--12: 09--07B5- VMLKHL 8 11- 07-1999 11:16 7700 ~ COCÖLA ........ 'L ............--- 11707-1999 ------- TTTJB "ιι, ιο ~ According to the transformation process of the invention, the two merge views, as shown in 1014288 code fragment 1 and 2 applied, also called projected, are applied to the respective source data structures 1 and 2 (4 and 5 in Fig. 1). operation can be found in Tables 6 and 7.

5 Volgens de uitvinding wordt tevens bij het toepassen van de samenvoegingsviews de resultaatentiteiten elk voorzien van een GRP aanduiding. Deze GRP-aanduiding is een unieke identificatie van de entiteiten binnen de verkregen verzamelingen. Hierbij wordt elke GRP-aanduiding die wordt 10 toegekend vergeleken met een geheugenstructuur (6, 7, 6', 7' in fig. 1) waarin reeds eerder toegekende GRP-aanduidingen zijn opgeslagen. Bij entiteiten die reeds in deze geheugenstructuur zijn opgenomen, wordt telkens dezelfde, in de geheugenstructuur opgeslagen, GRP-aanduiding 15 aan die entiteit toegekend. Hierdoor blijft een eenmaal toegekende GRP-aanduiding bij opeenvolgende transformatieprocessen constant. Hierdoor blijven aanduidingen van terugkerende entiteiten persistent zodat onderlinge vergelijkingen van eerder bepaalde doelstructuren mogelijk zijn.According to the invention, when applying the merging views, the result entities are each provided with a GRP designation. This GRP designation uniquely identifies the entities within the collections obtained. Here, each GRP designation assigned is compared with a memory structure (6, 7, 6 ', 7' in Fig. 1) in which previously assigned GRP designations are stored. For entities that are already included in this memory structure, the same GRP designation 15, each stored in the memory structure, is assigned to that entity. As a result, a GRP designation once assigned remains constant in successive transformation processes. This means that indications of returning entities remain persistent to allow cross-comparisons of previously determined target structures.

2 0 . Omdat in dit uitvoeringsvoorbeeld nog geen eerder bepaalde doelverzamelingen zijn bepaald, zijn de resultaatentiteiten voorzien van een uniek, telkens oplopend identificatienummer.2 0. Since no previously determined target sets have been determined in this exemplary embodiment, the result entities are provided with a unique, ascending identification number.

25 Tabel 6. PROJECTIE SW_PRODUCT25 Table 6. PROJECTION SW_PRODUCT

—SID—-DID--TYP--VKL-["GRP" ï MYOGIO CODE MYOGIO ï 1__HMLK10 CODE__HMLK10__2_ 1 VMLK05 CODE VMLK05 3 1 VMLKHL CODE VMLKHL 4 1 MYOGIO OMSCHR 1LITERMAGEREYOGHURT 5 1 HMLK10 OMSCHR 1LITERHALFVOLLEMELK 6 1 VMLK05 OMSCHR VOLLEMELKHA.LVELITER 7 1 VMLKHL OMSCHR VOLLEMELKHALVELITER 8 2 MYOGIO CODE MYOGIO 9 2 HMLK10 CODE__HMLK10__10 2 VMLK05 CODE__VMLK05__11 2 VMLKHL CODE VMLKHL 12 ~ 2 COCOLA CODE COCO LA 13 101426$—SID —- DID - TYP - VKL - ["GRP" ï MYOGIO CODE MYOGIO ï 1__HMLK10 CODE__HMLK10__2_ 1 VMLK05 CODE VMLK05 3 1 VMLKHL CODE VMLKHL 4 1 MYOGIO OMSCHR 1LITERMAGEREYOGHURKALKHKKRMKHMKHKKKKKKKKKKKKKKKKKKK 1 VMLKHL DESCRIBE FULL MILK HALVELITER 8 2 MYOGIO CODE MYOGIO 9 2 HMLK10 CODE__HMLK10__10 2 VMLK05 CODE__VMLK05__11 2 VMLKHL CODE VMLKHL 12 ~ 2 COCOLA CODE COCO LA 13 101 426 $

Tabel 7. PROJECTIE SW__TRANSACTIETable 7. PROJECTION SW_TRANSACTION

16 -5IE--ΠΠ3--TYP--VRL- 5RP- 2 MYOGIOIO- CODE_DAG MYOG1010-07- ï __07-1999 1999__ 2 HMLK1010- CODE_DAG HMLK1010-07- 2 __07-1999___1999__ 2 VMLK0510- CODE_DAG VMLK0510-07- 3 07-1999 1999 2 VMLKHL10- CODE_DAG VMLKHL10-07- 4 __07-1999___1999__ 2 MYOG1011- CODE_DAG MYOG1011-07- 5 __07-1999___1999__ 2 HMLK1011- CODE_DAG HMLK1011-07- 6 __07-1999___1999__ 2 VMLK0511- CODE_DAG VMLK0511-07- 7 __07-1999___1999__ 2 VMLK0511- CODE_DAG VMLK0511-07- 8 __07-1999___1999__ 2 VMLKHL11- CODEJOAG VMLKHL11-07- 9 __07-1999___1999__ 2 COCOLA11- CODE_DAG COCOLA11-07- 10 _ 07-1999 __1999__ 5 Nu worden de veldwaardeviews, zoals getoond in code- fragment 3 en 4, toegepast op de brongegevens. Het resultaat is getoond in de tabellen 8 en 9 (8, 9 in fig. 1).16 -5IE - ΠΠ3 - TYP - VRL- 5RP- 2 MYOGIOIO- CODE_DAG MYOG1010-07- ï __07-1999 1999__ 2 HMLK1010- CODE_DAG HMLK1010-07- 2 __07-1999 ___ 1999__ 2 VMLK0510- CODE_DAG VMLK0510-07- 3 07 -1999 1999 2 VMLKHL10- CODE_DAG VMLKHL10-07- 4 __07-1999 ___ 1999__ 2 MYOG1011- CODE_DAG MYOG1011-07- 5 __07-1999 ___ 1999__ 2 HMLK1011- CODE_DAG HMLK1011-07- 6 __07-1999 ___ 1999__ 2 VMLK0511- 1999___1999__ 2 VMLK0511- CODE_DAG VMLK0511-07- 8 __07-1999 ___ 1999__ 2 VMLKHL11- CODEJOAG VMLKHL11-07- 9 __07-1999 ___ 1999__ 2 COCOLA11- CODE_DAG COCOLA11-07- 10 _ 07-1999 __1999__- 5 Now being shown fragment 3 and 4, applied to the source data. The result is shown in Tables 8 and 9 (8, 9 in Fig. 1).

10 Tabel 8. PROJECTIE VWVJPRODUCT10 Table 8. PROJECTION VWVJPRODUCT

—ΕΠΠ3—-UIT»- PRODUCT_OMSCHRIJ AANTAL_VERK OMZET—ΕΠΠ3 —- OFF »- PRODUCT_DESCRIPTION NUMBER_SALES TURNOVER

VING OCHTFISHING MORNING

1 MYOG10 1 liter magere 0 0 ___yoghurt___ 1 HMLK10 1 liter halfvol- 0 0 ___le melk___ 1 VMLK05 volle melk, hal- 0 0 ___ve liter___ 1 VMLKHL VOLLE MELK HALVE 0 01 MYOG10 1 liter of skimmed 0 0 ___yoghurt___ 1 HMLK10 1 liter of semi-skimmed 0 0 ___le milk___ 1 VMLK05 whole milk, half 0 0 ___ve liter___ 1 VMLKHL FULL MILK HALF 0 0

LITERLITERS

2 MYOG10 fictief MYOG10 6 12,60 2__HMLK10__fictief HMLK10__4__7,80 2 VMLK05 fictief VMLK05 9 7,85 2__VMLKHL fictief VMLKHL__16 14,00 2 COCOLA fictief COCOLA 5 11,10 15 10142882 MYOG10 fictional MYOG10 6 12.60 2__HMLK10__fictive HMLK10__4__7,80 2 VMLK05 fictional VMLK05 9 7,85 2__VMLKHL fictional VMLKHL__16 14,00 2 COCOLA fictional COCOLA 5 11,10 15 1014288

Tabel 9. PROJECTIE VWV_TRANSACTIETable 9. PROJECTION VWV_TRANSACTIE

1717

SID- -DID 'A5NTAL_ VERKOOP_ OMZETSID -DID 'A5NTAL_ SALES_ SALES

VERKOCHT DATUMSOLD DATE

2 MYOG1010-07-1999 3 10-07-1999 6TTÖ 2 HMLK1010-07-1999 2 10-07-1999 3,90 2 VMLKO 510-07-1999 4 10-07-1999 3,50 2 VMLKHL10-07-1999 8 10-07-1999 7,00 2 MYOG1011-07-1999 3 11-07-1999 6,30 2 HMLK1011-07-1999 2 11-07-1999 3,90 2 VMLKO 511-07-1999 5 11-07-1999 4,35 2 VMLKHLll-07-1999 8 11-07-1999 7,00 ~ 2 COCOLA11-07-1999 5 | 11-07-1999 11,10 5 Nu wordt de relatieview, zoals hiervoor bepaald, toegepast op de verkregen verzameling, met als resultaat de tabel 10 (10 in fig. 1).2 MYOG1010-07-1999 3 10-07-1999 6TTÖ 2 HMLK1010-07-1999 2 10-07-1999 3.90 2 VMLKO 510-07-1999 4 10-07-1999 3.50 2 VMLKHL10-07-1999 8 10-07-1999 7.00 2 MYOG1011-07-1999 3 11-07-1999 6.30 2 HMLK1011-07-1999 2 11-07-1999 3.90 2 VMLKO 511-07-1999 5 11-07 -1999 4.35 2 VMLKHLll-07-1999 8 11-07-1999 7.00 ~ 2 COCOLA11-07-1999 5 | 11-07-1999 11.10 5 Now the relationship view, as defined above, is applied to the obtained set, resulting in table 10 (10 in fig. 1).

Tabel 10. PROJECTIE RLV_TRANSACTIE_PRODUCTTable 10. PROJECTION RLV_TRANSACTIE_PRODUCT

10 FROM_SID -FROM_DID--TO_DID--TO SID- 2 MYOG1010-07-1999 MYOG10 2 2 HMLK1010-07-1999 HMLK10 2 2 VMLKO 510-07-1999 VMLKO5 2 2 VMLKHL10-07-1999 VMLKHL 2 2 MYOG1011-07-1999 MYOG10 2 2 HMLK1011-07-1999 HMLK10 2 2 VMLKO 511-07-1999 VMLKO5 2 2 VMLKHLll-07-1999 VMLKHL 2 ~ 2 COCOLA11-07-1999 COCOLA 210 FROM_SID -FROM_DID - TO_DID - TO SID- 2 MYOG1010-07-1999 MYOG10 2 2 HMLK1010-07-1999 HMLK10 2 2 VMLKO 510-07-1999 VMLKO5 2 2 VMLKHL10-07-1999 VMLKHL 2 2 MYOG1011-07- 1999 MYOG10 2 2 HMLK1011-07-1999 HMLK10 2 2 VMLKO 511-07-1999 VMLKO5 2 2 VMLKHLll-07-1999 VMLKHL 2 ~ 2 COCOLA11-07-1999 COCOLA 2

Vervolgens worden de verkregen entiteiten, zoals getoond in de tabellen 6-10 samengevoegd (11, 12 in fig. 1). Dit samenvoegen heeft plaats door: 15 Eerst worden binnen een verzameling entiteiten met een gelijke SID/DID-combinatie voorzien van een gelijke GRP-aanduiding. Omdat de SID/DID combinatie een unieke entiteit aanduidt, betekent dit dat bij een gelijke SID/DID aanduiding de betreffende entiteiten dezelfde doelentiteit 20 betreffen. Hierbij wordt rekening gehouden met de hierboven beschreven wijze voor het toekennen van initiële GRP-aanduidingen op basis van een eerder bepaalde geheugenstructuur.Then, the obtained entities are combined as shown in Tables 6-10 (11, 12 in Fig. 1). This merging takes place by: 15 First, within a collection of entities with the same SID / DID combination, the same GRP designation is provided. Because the SID / DID combination designates a unique entity, this means that with an equal SID / DID designation, the relevant entities are the same target entity. This takes into account the manner described above for allocating initial GRP designations based on a previously determined memory structure.

si : -r - i 0 Usi: -r - i 0 U

1818

Vervolgens worden binnen dezelfde verzameling entiteiten met een gelijke typ/val combinatie voorzien van een gelijke GRP-aanduiding.Subsequently, within the same collection of entities with the same typ / fall combination, an equal GRP designation is provided.

Toegepast op het voorbeeld geeft dit voor de verza-5 meling zoals getoond in tabel 6, na het verzamelen van gelijke SID/DID-combinaties, waarbij aan entiteiten met een gelijke SID/DID-combinatie eenzelfde GRP-aanduiding wordt gegeven, de waardes voor GRP zoals in de kolom GRP (tussen) is getoond.When applied to the example, for the set as shown in Table 6, after collecting equal SID / DID combinations, giving entities with the same SID / DID combination the same GRP designation, the values for GRP as shown in the GRP column (in-between).

10 Vervolgens worden de entiteiten met een gelijke typ/val combinatie voorzien van hetzelfde GRP-nummer, uitgaande van de GRP(tussen) aanduidingen, hetgeen de kolom GRP(nieuw) oplevert.10 Subsequently, entities with the same type / fall combination are given the same GRP number, based on the GRP (intermediate) indications, which yields the column GRP (new).

Het samenvoegen is klaar als aan het vereiste is 15 voldaan dat er geen entiteiten meer over zijn die een gelijke typ/val-combinatie of een gelijke sid/did combinatie hebben maar nog verschillende GRP-identificaties. Hoewel in dit voorbeeld aan dit vereiste wordt voldaan door het hiervoor beschreven herhaald uitvoeren van samenvoegingsstappen 20 waar bij het groeperen steeds het laagst mogelijke GRP-nummer wordt gekozen, is de uitvinding geenszins beperkt tot dit bij wijze van voorbeeld genoemde algoritme. De uitvinding kan worden toegepast met elk ander algoritme dat is ingericht voor het bereiken van het hiervoor genoemde ver-25 eiste voor het beëindigen van het samenvoegen.The merge is complete when the requirement is met that there are no more entities left that have the same typ / trap combination or the same sid / did combination but still have different GRP identifications. Although in this example this requirement is met by the above described repeated execution of merging steps 20 where the lowest possible GRP number is always chosen in the grouping, the invention is in no way limited to this exemplary algorithm. The invention can be applied with any other algorithm arranged to achieve the aforementioned requirement to terminate the merging.

1 0142881 014288

Tabel 11. SAMENVOEGING SW_PRODUCTTable 11. MERGER SW_PRODUCT

1919

SID""1-DIB--TYP--VKE SEP GKP 5EPSID "" 1-DIB - TYP - VKE SEP GKP 5EP

(oud) (tussen) (nieuw)(old) (in between) (new)

" I--MYOGIO--CODE-f-MYOGIO--1--1 I"I - MYOGIO - CODE-f-MYOGIO - 1--1 I

—I--ΗΜΕΚΓ0 CODE 1 HMLK1Ö : : 2 2 2 —I--VMLK05 CODE VMLKüb 3 3 3 -1--VMLKHL--CODE VMLKHL 3 3 3—I - ΗΜΕΚΓ0 CODE 1 HMLK1Ö:: 2 2 2 —I - VMLK05 CODE VMLKüb 3 3 3 -1 - VMLKHL - CODE VMLKHL 3 3 3

—I--MYOGIO--0M5CHE--ILTTEEHKGEÏÏEYOGHDET--5--1 I—I - MYOGIO - 0M5CHE - ILTTEEHKGEÏEYOGHDET - 5--1 I

-1--HMLK10--OMSCHR--ÏEITERHALPVÜLLEMEL'K--5--2 2 -1--VMLK05--OM5CHE--VOLLEMELKHAEVEL ITEE--7--3 3 —I--VMLKHL--ÖM5CHK--VOLLEMELKHALVELITER :-8--3--3- —2--MYOGIO--CODE--MYOGIO--9--9--1- —2--HMLK10--CODE--HMLKTO- ID TO 2 —2--VMLK05--CODE--VMLK05--Π Π 3 —2--VMT.KHI,--CODE--VMLKHL--12--12--3- —2— ÜÜCOLA CODE CÜCÜLA i3 'U ι3 5-1 - HMLK10 - DESCRIPTION - LIQUID-HALPVÜLLEMEL'K - 5--2 2 -1 - VMLK05 - OM5CHE - FULL MILK HAEVEL ITEE - 7--3 3 —I - VMLKHL - ÖM5CHK - FULL MILK HALVELITER : -8--3--3- —2 - MYOGIO - CODE - MYOGIO - 9--9--1- —2 - HMLK10 - CODE - HMLKTO-ID TO 2 —2-- VMLK05 - CODE - VMLK05 - Π Π 3 —2 - VMT.KHI, - CODE - VMLKHL - 12--12--3- —2— ÜÜCOLA CODE CÜCÜLA i3 'U ι3 5

De samenvoegingsbewerking zoals toegepast op tabel 6 wordt vervolgens toegepast op de tabel 7. Hierbij is gemakshalve de kolom GRP(tussen) weggelaten. Het resultaat is afgebeeld in tabel 12.The merge operation as applied to Table 6 is then applied to Table 7. For convenience, the GRP column (intermediate) is omitted. The result is shown in Table 12.

1010

Tabel 12 . SAMENVOEGING SWJTRANSACTIETable 12. MERGER SWJ TRANSACTION

-STD--DID- TYP--VSE-[~GEP—-GEP- (oud) (nieuw) -2--MYOGIOIO-TTT^--CODE_DAG--MYOG1010-ÜT-1999--1--1- 1999 2 HMLK1ÜIÜ-Ü7- CODE_DAG HMUK1Ö10-O7-1999 2 2 1999 2 VMLKOSlö-öT^ COUE_ÜAü VMiik0bl0-07-1999 3 3 1999 -2--VMLKHE10-07---CODE^DAG--VMLEHLlö-07-1999--3--3- 1999 -2--MYOGlOll-ÜY-'--C0DE_DAG--MYOGlöll-ö'r-1999--5--5- 1999 2 HMLKIÜ11-07- CODE_DAü HMLK.lüll-07-1999 δ δ 1999 -2--VMLK0311-0T=--CODE_DAG--VMLK0511-(JT-1999--7--7- 1999 -2--VMLKO b'irrD,r:--CODE_DAG--VMLKÖ511-07-1999--8--7- 1999 -2--VMLKHL11-Ö7---CÖDE_DAG--VMLKHL11-07-1999--9--9- 1999 -2--COCOLAl 1-07---CODEJDAG--COCÖLA11-Ö7-1999--Tö--Tü- __1999_____ 15 Vervolgens wordt het verkregen resultaat vastgelegd in een geheugenstructuur. Volgens de uitvinding is wordt per doelentiteit een geheugenstructuur opgesteld, waarbij het entiteittype de volgende attributen omvat: 1 0 U 2 8 8 20 - SID (System IDentification) unieke identificatie van de brongegevensstructuur, in dit voorbeeld 1 of 2, - DID (Data IDentification) unieke identificatie van entiteit binnen een brongegevensstructuur, en 5 - GRP, de GRP-aanduiding die na het samenvoegings- proces aan de SID/DID-combinatie is toegekend.-STD - DID- TYP - VSE- [~ GEP —- GEP- (old) (new) -2 - MYOGIOIO-TTT ^ - CODE_DAY - MYOG1010-ÜT-1999--1--1- 1999 2 HMLK1ÜIÜ-Ü7- CODE_DAG HMUK1Ö10-O7-1999 2 2 1999 2 VMLKOSlö-öT ^ COUE_ÜAü VMiik0bl0-07-1999 3 3 1999 -2 - VMLKHE10-07 --- CODE ^ DAY - VMLEHLlö-07-1999-- 3--3- 1999 -2 - MYOGlOll-ÜY -'-- C0DE_DAG - MYOGlöll-ö'r-1999--5--5- 1999 2 HMLKIÜ11-07- CODE_DAü HMLK.lüll-07-1999 δ δ 1999 -2 - VMLK0311-0T = - CODE_DAG - VMLK0511- (JT-1999--7--7- 1999 -2 - VMLKO b'irrD, r: - CODE_DAG - VMLKÖ511-07-1999- -8--7- 1999 -2 - VMLKHL11-Ö7 --- CÖDE_DAG - VMLKHL11-07-1999--9--9- 1999 -2 - COCOLAl 1-07 --- CODEJDAG - COCÖLA11-Ö7 -1999 - Tö - Tü- __1999_____ 15 Next, the result obtained is recorded in a memory structure. According to the invention, a memory structure is drawn up for each target entity, the entity type comprising the following attributes: 1 0 U 2 8 8 20 - SID ( System IDentification) unique identification of the source data structure, in this example 1 or 2, - DID (Data IDentific ation) unique entity identification within a source data structure, and 5 - GRP, the GRP designation assigned to the SID / DID combination after the merge process.

Toegepast op de tabellen 11 en 12 levert dit het resultaat op zoals getoond in de tabellen 13 en 14 (6, 7 in fig. 1) .When applied to Tables 11 and 12, this yields the result shown in Tables 13 and 14 (6, 7 in Fig. 1).

1010

Tabel 13. GEHEUGEN-STRUCTUUR PRODUCTTable 13. PRODUCT MEMORY STRUCTURE

5TD -DTD- GRP--' 1 MYOGIO 1 1 HMLK10 2 1 VMLK05 3 1 VMLKHL 3 2 MYOGIO 1 2 HMLK10 2 2 VMLK05 3 2 VMLKHL 3 2 COCOLA 135TD -DTD- GRP-- '1 MYOGIO 1 1 HMLK10 2 1 VMLK05 3 1 VMLKHL 3 2 MYOGIO 1 2 HMLK10 2 2 VMLK05 3 2 VMLKHL 3 2 COCOLA 13

15 Tabel 14. GEHEUGEN-STRUCTUUR TRANSACTIE15 Table 14. MEMORY STRUCTURE TRANSACTION

SID—-DID- GRP- 2 MYOGIO10-07-1999 ï 2 HMLK1010-07-1999__2_ 2 VMLKO 510-07-1999__3_ 2 VMLKHL10-07-1999 4 2 MYOGIO11-07-1999 5 2 HMLK1011-07-1999 6 2 VMLKO 511-07-1999 7 2 VMLKHL11 - 07-1999__9_ 2 COCOLA11-07-1999 10SID —- DID- GRP- 2 MYOGIO10-07-1999 ï 2 HMLK1010-07-1999__2_ 2 VMLKO 510-07-1999__3_ 2 VMLKHL10-07-1999 4 2 MYOGIO11-07-1999 5 2 HMLK1011-07-1999 6 2 VMLKO 511-07-1999 7 2 VMLKHL11 - 07-1999__9_ 2 COCOLA11-07-1999 10

Op basis van de door middel van de samenvoeging verkregen tabellen en de geheugenstructuur kan de uiteindelijk 20 beoogde doelgegevensstructuur worden opgebouwd.On the basis of the tables obtained by means of the merging and the memory structure, the ultimate target data structure can be built up.

Hierbij wordt per unieke GRP-aanduiding zoals bekend uit de tabellen 13 en 14, een doelgegevensentiteit gemaakt, waarbij de GRP-waarde wordt toegekend aan het attribuut PRODUCT_SLEUTEL (13 in fig. 1) . De overige attributen wor-25 den via de bij de GRP-aanduiding behorende SID/DID combina- «f. Cs r- -λHere, for each unique GRP designation, as known from Tables 13 and 14, a target data entity is created, whereby the GRP value is assigned to the attribute PRODUCT_Key (13 in Fig. 1). The other attributes are entered via the SID / DID combination associated with the GRP designation. Cs r- -λ

. t' 0 i - v O. t '0 i - v O

21 tie uit de reeds bepaalde verzameling gehaald, mogelijk via een bewerking, zoals in het geval van het attribuut OMZET.21 removed from the already determined collection, possibly via an operation, such as in the case of the attribute TURNOVER.

De resultaten hiervan zijn weergegeven in de tabellen 15 en 16 (3 in fig. 1).The results of this are shown in Tables 15 and 16 (3 in Fig. 1).

55

Tabel 15. PRODUCTTable 15. PRODUCT

PRÖDuCt_ PRODUCtCÖ PRÖl)Udt_OMSCHRXjVÏN AAliTAli_VÈRKOC omzEtPRÖDuCt_ PRODUCtCÖ PRÖl) Udt_OMSCHRXjVÏN AAliTAli_VÈRKOC turnover

SLEUTEL DE G HTKEY THE G HT

1 myöGio 1 liter magere yog- δ 12,60 hurt 2 HMLkio l liter iialtvolie 3 7, S”0 melk 3 VMiiküb volle melk, naive 25 21,55 liter 13 CoCOLA TicETéF C0C0L& 5 11, lö-1 myöGio 1 liter skim yog- δ 12.60 hurt 2 HMLkio l liter iialtvolie 3 7, S ”0 milk 3 VMiiküb whole milk, naive 25 21.55 liter 13 CoCOLA TicETéF C0C0L & 5 11, lö-

10 Tabel 16. TRANSACTIE10 Table 16. TRANSACTION

'TRANSACTIE PRODUCT-!" ASKTAL VERKOCHTT VERKOOPDATUM VERKOOP BEDRAG ' SLEUTEL SLEUTEL"PRODUCT TRANSACTION!" ASKTAL SOLD SALES DATE SALES AMOUNT "KEY KEY

-1--1--3--10-07-1999--57TÜ- -2--2--2--10'-07 7L999--37513- -3--3--3--10=07=1999--3750- -3--3--8--10-07-1999--7700- -5--1--3--11-07-1999--5730- -5--2--2--11-07-1999--3790- -7--3--5--11-07-1999--3755- -9--3--8--11-07=1999--7700- 10 I i3 1 a 11-07=1999 11,10------1--1--3--10-07-1999--57TÜ- -2--2--2--10'-07 7L999--37513- -3--3--3--10 = 07 = 1999--3750- -3--3--8--10-07-1999--7700- -5--1--3--11-07-1999--5730- -5--2 --2--11-07-1999--3790- -7--3--5--11-07-1999--3755- -9--3--8--11-07 = 1999-- 7700- 10 I i3 1 a 11-07 = 1999 11.10 -----

In het uitvoeringsvoorbeeld is uitgegaan van een eenduidige identificatie van een entiteit door middel van 15 een combinatie van SID en DID; de uitvinding is geenszins beperkt tot deze identificatie. Volgens de uitvinding kan eenduidige identificatie van entiteiten op elke geschikte wijze plaats vinden, bijvoorbeeld door één of meerdere attributen, al dan niet samengesteld.The exemplary embodiment assumes an unambiguous identification of an entity by means of a combination of SID and DID; the invention is by no means limited to this identification. According to the invention, unambiguous identification of entities can take place in any suitable manner, for instance by one or more attributes, whether or not composed.

20 Hoewel in het uitvoeringsvoorbeeld is uitgegaan van een relationele database, is de uitvinding niet tot deze toepassing beperkt. De uitvinding kan ook worden toegepast met andere databasesystemen, zoals bijvoorbeeld netwerk of hiërarchische systemen.Although the relational database assumes a relational database, the invention is not limited to this application. The invention can also be applied with other database systems, such as, for example, network or hierarchical systems.

25 Hoewel in het uitvoeringsvoorbeeld een gegeven- stransformatie is beschreven' met betrekking tot een product /verkoopsysteem, is de uitvinding geenszins beperkt tot 1014238 22 deze bij wijze van voorbeeld genoemde uitvoering. De uitvinding kan worden toegepast bij andere gegevenstransformaties, zoals bijvoorbeeld dataconversies, datakwaliteitsana-lyse, batch-interfaces, en datawarehousing.Although in the exemplary embodiment a data transformation is described with respect to a product / sales system, the invention is by no means limited to 1014238 22 this exemplary embodiment. The invention can be applied to other data transformations, such as, for example, data conversions, data quality analysis, batch interfaces, and data warehousing.

5 De uitvinding leent zich in het bijzonder voor ge bruik bij een softwaregeneratie tooi. Een dergelijke tooi kan na het bepalen van de transformatieregels volgens de uitvinding, in verregaande mate automatisch de programmatuur genereren voor het toepassen van de transformatiere-10 gels en de uitvoeringsregels op de brongegegevens.The invention is particularly suitable for use in a software generation tool. After determining the transformation rules according to the invention, such a tool can largely generate automatically the software for applying the transformation rules and the execution rules to the source data.

De uitvinding leent zich in het bijzonder voor uitvoering op een computer, hiertoe kan de uitvinding worden uitgevoerd als een computerprogramma, dat is ingericht om bij uitvoering door een computer de werkwijze volgens de 15 uitvinding uit te voeren. In het bijzonder kan het computerprogramma in voor een computer leesbare vorm zijn opgeslagen op een informatiedrager zoals bijvoorbeeld een geheugen, schijfgeheugen, floppydisk en cd-rom.The invention lends itself particularly to execution on a computer, for this purpose the invention can be implemented as a computer program, which is designed to carry out the method according to the invention when executed by a computer. In particular, the computer program may be stored in a computer readable form on an information carrier such as, for example, a memory, disk memory, floppy disk and CD-ROM.

10142881014288

Claims

A method for assembling a target data structure comprising at least one target entity type, from at least one source data structure comprising at least one source entity type with at least one attribute with an attribute value, characterized by the steps of projecting the source data structure comprising applying at least one aggregation view, yielding a set of candidate entities, of a candidate entity type that includes at least: 10. a unique resource identifier (sid, did) that is different for each resource, identifying an associated resource entity, - an aggregation designation (typ), and - a aggregation value (val), wherein the aggregation value is dependent on said attribute value of said associated source entity, forming an aggregation set comprising aggregating candidate entities based on at least some degree of agreement between associated with -20 e unique source identifiers (sid, did), merging candidate entities based on at least some degree of correspondence between the associated merging designations (typ) and the merging values (val), and assigning unique group identifications 25 (grp) to entities of the merge set, forming a memory structure from the merge set, comprising entities each with one of said unique source identifiers (sid, did) and each one of the unique group identifiers (grp), and transforming the source data structure into a target data structure in accordance with the memory structure, the pooling and the candidate entities. 1 014 ° 0 a

The method of claim 1, further comprising the step of projecting the source data structure comprising applying at least one field value view, yielding a set of candidate entities, of a candidate entity type comprising at least: - a unique one for each source different source identification (sid, did) identifying an associated source entity, and 10. at least one attribute that depends on said attribute value of said associated source entity.

Method according to claim 1 or 2, further comprising the step of 15 projecting the source data structure applying at least one relationship view, yielding a set of candidate entities, of a candidate entity type comprising at least: - a start identification ( from_sid, from_did) that uniquely identifies a first, target candidate entity, and - a target identifier (to_sid, to_did) that uniquely identifies a second, target candidate entity, thereby establishing a relationship between the first and second candidate entities.

Method according to any one of the preceding claims, further comprising the steps of comparing formed candidate entities with a previously determined memory structure in forming a merge set, and assigning candidate entities corresponding to entities of group identifiers occurring in the previously determined memory structure. cations (grp) equal to the relevant entities occurring in the previously determined memory structure.

A computer program product, directly inputable into the internal memory of a computer, comprising program components 10H2 & 8 parts configured to perform a method of assembling a target data structure comprising at least one target entity type, from at least one source data structure comprising at least one source entity type with at least one attribute with an attribute value, with the steps of projecting the source data structure comprising applying at least one concatenation view, yielding a set of candidate entities, of a candidate entity type comprising at least: - a unique one for each source different source identifier (sid, did) identifying a corresponding source entity, - a merging designation (typ) and 15. a merging value (val), the aggregation value being dependent on said attribute value of said associated source entity, forming a merger matching set comprising merging candidate entities based on at least some degree of agreement between associated unique source identifiers (sid, did), merging candidate entities based on at least some degree of agreement between associated combination indicia (typ) and the combination values 25 (val), and assigning unique group identifiers (grp) to entities of the merge set, forming a memory structure from the merge set, comprising entities each with one of said unique source identifiers (sid, did) and each one of the unique group identifiers (grp), and transforming the source data structure into a target data structure in accordance with the memory structure, the aggregation set and the candidate entities. 1014 ^ 88

The computer program product according to claim 5, further comprising program code parts adapted to perform the step of projecting the source data structure comprising applying at least one field value view, yielding a set of candidate entities, of a candidate entity type which at least comprises: - a unique source identifier (sid, did) different for each source, which identifies an associated source entity 10, and - at least one attribute which is dependent on said attribute value of said associated source entity.

The computer program product of claim 5 or 6, 15 further comprising program code portions adapted to perform the step of projecting the source data structure comprising applying at least one relationship view, which yields a set of candidate entities, of a candidate entity type that at least includes: - a start identification (from_sid, from_did) that uniquely identifies a first, starting candidate entity, and - a target identification (to_sid, to_did) that uniquely identifies a second, target candidate entity, thereby establishing a relationship between the first and the second candidate entity.

8. Computer program product according to any one of claims 5-7, further comprising program code parts adapted to perform the steps of comparing formed candidate entities with a previously determined memory structure in forming a merge set, and assigning candidate entities corresponding to in the previously determined memory structure occurring entities of group identifications (grp) equal to the respective entities occurring in the previously determined memory structure. 'I T *', ν 3