EP1629401A2 - Procede, dispositif et programme informatique comportant des elements de code de programme et un produit de code de programme destines a l'analyse de donnees utiles structurees selon une structure de base de donnees - Google Patents
Procede, dispositif et programme informatique comportant des elements de code de programme et un produit de code de programme destines a l'analyse de donnees utiles structurees selon une structure de base de donneesInfo
- Publication number
- EP1629401A2 EP1629401A2 EP03794981A EP03794981A EP1629401A2 EP 1629401 A2 EP1629401 A2 EP 1629401A2 EP 03794981 A EP03794981 A EP 03794981A EP 03794981 A EP03794981 A EP 03794981A EP 1629401 A2 EP1629401 A2 EP 1629401A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- user data
- statistical
- probability model
- common
- database structure
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000000034 method Methods 0.000 title claims description 73
- 238000004590 computer program Methods 0.000 title claims description 19
- 238000004458 analytical method Methods 0.000 claims description 39
- 238000007619 statistical method Methods 0.000 claims description 29
- 238000003066 decision tree Methods 0.000 claims description 15
- 238000007418 data mining Methods 0.000 claims description 11
- 238000007726 management method Methods 0.000 claims description 6
- 238000013068 supply chain management Methods 0.000 claims description 5
- 238000000528 statistical test Methods 0.000 claims description 4
- 238000003860 storage Methods 0.000 claims description 4
- 238000009826 distribution Methods 0.000 description 21
- 238000011161 development Methods 0.000 description 10
- 230000018109 developmental process Effects 0.000 description 10
- 230000006399 behavior Effects 0.000 description 7
- 230000004044 response Effects 0.000 description 3
- 238000013179 statistical model Methods 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 2
- 230000002452 interceptive effect Effects 0.000 description 2
- 244000170475 Saraca indica Species 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 238000012384 transportation and delivery Methods 0.000 description 1
- 230000003442 weekly effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation; Time management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2462—Approximate or statistical queries
Definitions
- the invention relates to an analysis of user data structured according to a database structure, such as customer or product data of a company. 0
- C RM customer relationship management systems
- SCM supply chain management systems
- data warehouses data warehouses
- Each data record Di represents a specific object from a group of objects, for example a specific customer from all registered customers of a company or a specific product from a product line of a company.
- Each data record comprises a predeterminable number of entries, Ai, Bi, Ci, ..., the individually recorded data, with categories or attributes A, B, C, ... These categories or attributes represent properties of an object group pe, such as age (A), income (B), product purchased (C), ....
- the entries Ai, Bi, Ci, ... for the respective categories A, B, C, ... can be numerical or semantic.
- a well-known and frequently used data mining process is a so-called decision tree [5].
- a disadvantage of many of the known and mentioned analysis methods is that they cannot be used adequately when analyzing large amounts of data. As a rule, there is one or multiple access to the whole Analyzing data stock, which is stored, for example, in a database, is necessary.
- a determination of a common probability model P (A, B, C, ..., X) for a data structure (A, B, C, ...) based on a hidden variable X is known from [7].
- the invention is based on the object of specifying an analysis method for the analysis of structured useful data which can also be used with large amounts of useful data and also has a high performance there.
- a common statistical probability model is first determined for the user data structured according to the database structure.
- the user data structured according to the database structure is then analyzed using a statistical analysis method, the statistical analysis method used in the analysis being applied to the common statistical probability model, not, as is customary, directly to the output data.
- the arrangement for analyzing user data structured according to a database structure has:
- a modeling unit with which a common statistical probability model for the user data structured according to the database structure can be determined
- the invention is clearly based on a two-stage procedure.
- the first step is to assume user data that can be specified and structured according to a database structure.
- a database structure of this type should be understood to mean that the user data is based on a higher-level, fixed structure, for example, data sets with the same structure (Ai, Bi, Ci, ...) with the same entry categories A, B, C, .... Such structures are generally known.
- a common, purpose-independent probability model as described, for example, in [7], [8], is formed from these user data to be analyzed and structured according to a database structure.
- the computer program according to the invention with program code means is set up to carry out all steps according to the analysis method according to the invention when the program is executed on a computer.
- the computer program product with program code means stored on a machine-readable carrier is set up to carry out all steps according to the analysis method according to the invention when the program is executed on a computer.
- the arrangement and the computer program with program code means, set up to carry out all steps according to the inventive analysis method when the program is executed on a computer, and the computer program product with program code means stored on a machine-readable medium, set up all steps according to the Carrying out analysis methods according to the invention when the program is executed on a computer are particularly suitable for carrying out the analysis method according to the invention or one of its further developments explained below.
- structured user data are used in user data records, for example user data records from a database.
- Each user data record represents a specific object from a group of objects.
- the user data associated with the respective user data record describe properties of the respective object.
- the statistical analysis method is applied to the common statistical probability model in such a way that a common probability is used as an input variable for the statistical analysis method. driving is used.
- the common probability results directly from the common probability model. This avoids unnecessary intermediate steps, costs computing time and extends response times.
- a method based on a data mining method [4], [10], [11], [12] can be used as the statistical analysis method, for example a clustering method [5] or a decision tree [6] or association rules [9].
- the analytical database image i.e. the common probability model is newly formed at predefinable time intervals, such as daily or weekly. Education can take place at night or on weekends.
- the complete analytical database image is then available as required to significantly speed up analyzes.
- the user data can be obtained from various data sources. The easiest way is to obtain the user data from a database in which the user data are stored and from which they are read. Because of the performance it can achieve in the analysis of data, the invention is particularly suitable where large amounts of data have to be processed or analyzed, such as in the area of customer relationship management (CRM) [1] or supply chain management [2]. or a data warehouse (DW) [3].
- CRM customer relationship management
- DW data warehouse
- the object is a customer, which is described by at least two of the following properties, age, income, product purchased, date of purchase, frequency of purchases.
- This enables eminently important issues to be solved for marketing departments, such as the customer behavior of certain customer groups.
- targeted target groups can be determined when customers are acquired, customer groups can be selected for specific products and marketing campaigns, and customers can be served with more foresight.
- Figure 1 sketch that schematically shows how a
- Figures 2a to g sketches showing analysis results of an analysis system for analyzing customer data according to an embodiment.
- Execution example Analysis system for analyzing customer behavior at a bank based on a customer relationship management system
- the exemplary embodiment relates to an analysis system for analyzing customer data from a bank.
- analysis system described below can be used not only at banks, but also at any company for the analysis of corresponding company data, such as for example in department stores or manufacturing companies.
- FIG. 1 shows schematically the functioning 100 of the analysis system for analyzing bank customer data 110.
- Functionality 100 is divided into knowledge acquisition 101 and implementation of the knowledge into intelligent operation of bank customers 102.
- customer data 110 Large and thus difficult-to-handle amounts of customer data 110 are first condensed 111 into a statistical model 112, a common probability model, of customer behavior.
- the shared probability model 112 used is one based on a hidden variable. The basics are described in [7].
- statistical methods 120 in general data mining methods and here in this case a decision tree, are used, which are based on the statistical model.
- the coupling is made possible by the fact that the data mining methods or decision tree 120 are based on a statistical framework, and thus use the same statistical terms or the same statistical language as the common probability model 112.
- the results of the questions can be further implemented 121 in intelligent customer service 130.
- Customer data ((Fig. 1, 110)
- the customer data 110 in the analysis system is collected as part of a customer relationship management (CRM) 150.
- CRM customer relationship management
- CRM 150 large amounts of data 110 about the bank customers from all sales channels of the bank, such as direct contacts, web, call center, are recorded and stored.
- the data is stored in a database in the form of customer-specific data records Di (AI, A2, ..., Bl-2, B2-3, ..., C, D, ...), the index i being the respective bank customer i features.
- the common probability model 112 used is one based on a hidden variable X. The foundations for this are described in [7].
- the common probability model 112 is written based on the hidden variable X as P (A, B, C, ..., X) for all attributes (A, B, C, ).
- Such a statistical image of data represents a highly compressed form of knowledge about customers and can be used to efficiently and interactively explore dependencies 120, 140.
- the common probability model 112 also provides quickly retrievable forecasts of further expected behavior and current needs of a customer.
- the forecasts can also be used to serve customers proactively and in a targeted manner and to provide proactive, personal offers 130.
- the decision tree [6] is placed 120 on the statistical model 112, the common probability model 112.
- the common distribution P (A, B, C, ..., X) goes over all attributes of the customers by summation over the hidden variable X.
- Structural learning immediately provides a common distribution P (A, B, C, ).
- the structure of the models for example those with a predefined hidden variable or those that were generated by structure learning, or a combination of the above, is used to efficiently calculate necessary sums over the common distribution.
- Decision trees are usually built using a known CHAID or a known CART procedure.
- the necessary probabilities or distributions for the construction of the decision tree can (as usual) be determined from the data or also from the most accurate probability model described in the above (inference process).
- 2a to 2g show examples of some of the possible interactive analyzes 140 that can be carried out with the decision tree 120 using the common probability model 112.
- A1 " Giro / Salary Account “), P (A3
- A1 " Giro / Salary Account ”) , P (A4
- A1 "Giro / Salary Account”), P (A5
- A1 "Giro / Salary Account”), P (B1-2
- A1 "Giro / Salary Account ⁇ ), P ( B2-3
- A1 "Gi ro / Salary Account “), P (B3-4
- A1 " Giro / Salary Account ”) and P (C
- A1 " Giro / Salary Account ”) and P (D
- A1 " Giro / Salary - Account “).
- P (A2 "insurance product
- Fig. 2e shows the probability distributions P (A1), P (A2) P (A3) P (A4), P (A5), P (Bl-2), P (B2-3), P (B3-4) and P (C) and P (D).
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Business, Economics & Management (AREA)
- Data Mining & Analysis (AREA)
- Entrepreneurship & Innovation (AREA)
- Human Resources & Organizations (AREA)
- Strategic Management (AREA)
- Probability & Statistics with Applications (AREA)
- General Physics & Mathematics (AREA)
- General Business, Economics & Management (AREA)
- Mathematical Physics (AREA)
- Quality & Reliability (AREA)
- Operations Research (AREA)
- Marketing (AREA)
- Economics (AREA)
- Fuzzy Systems (AREA)
- Tourism & Hospitality (AREA)
- Software Systems (AREA)
- Computational Linguistics (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE10240443 | 2002-09-02 | ||
PCT/EP2003/009752 WO2004025501A2 (fr) | 2002-09-02 | 2003-09-02 | Procede, dispositif et programme informatique comportant des elements de code de programme et un produit de code de programme destines a l'analyse de donnees utiles structurees selon une structure de base de donnees |
Publications (1)
Publication Number | Publication Date |
---|---|
EP1629401A2 true EP1629401A2 (fr) | 2006-03-01 |
Family
ID=31983891
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP03794981A Withdrawn EP1629401A2 (fr) | 2002-09-02 | 2003-09-02 | Procede, dispositif et programme informatique comportant des elements de code de programme et un produit de code de programme destines a l'analyse de donnees utiles structurees selon une structure de base de donnees |
Country Status (5)
Country | Link |
---|---|
US (1) | US20060173889A1 (fr) |
EP (1) | EP1629401A2 (fr) |
JP (1) | JP2005537585A (fr) |
AU (1) | AU2003264251A1 (fr) |
WO (1) | WO2004025501A2 (fr) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103685475A (zh) * | 2013-11-22 | 2014-03-26 | 广东泛在无线射频识别公共技术支持有限公司 | 一种实现跨机构共享产品主数据的数据定位方法及系统 |
US10909575B2 (en) | 2015-06-25 | 2021-02-02 | Salesforce.Com, Inc. | Account recommendations for user account sets |
US10715626B2 (en) | 2015-06-26 | 2020-07-14 | Salesforce.Com, Inc. | Account routing to user account sets |
US20160379266A1 (en) * | 2015-06-29 | 2016-12-29 | Salesforce.Com, Inc. | Prioritizing accounts in user account sets |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5704017A (en) * | 1996-02-16 | 1997-12-30 | Microsoft Corporation | Collaborative filtering utilizing a belief network |
-
2003
- 2003-09-02 AU AU2003264251A patent/AU2003264251A1/en not_active Abandoned
- 2003-09-02 EP EP03794981A patent/EP1629401A2/fr not_active Withdrawn
- 2003-09-02 US US10/526,160 patent/US20060173889A1/en not_active Abandoned
- 2003-09-02 WO PCT/EP2003/009752 patent/WO2004025501A2/fr active Application Filing
- 2003-09-02 JP JP2004535418A patent/JP2005537585A/ja active Pending
Also Published As
Publication number | Publication date |
---|---|
US20060173889A1 (en) | 2006-08-03 |
WO2004025501A2 (fr) | 2004-03-25 |
AU2003264251A1 (en) | 2004-04-30 |
JP2005537585A (ja) | 2005-12-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE69527880T2 (de) | Bestimmung des Verhaltensprofils einer Funktionseinheit mittels unscharfer Logik | |
DE69809964T2 (de) | Online-datenbank ausbeutung | |
DE69127847T2 (de) | Verfahren und Gerät zur Klassifizierung von Daten | |
DE112019004260T5 (de) | Analyse und korrektur der lieferketten-gestaltung durch maschinelles lernen | |
DE602004006485T2 (de) | Verfahren zur automatisierten anmerkung von berichten mehrdimensionaler datenbanken mit informationsobjekten eines datenspeichers | |
EP3584751A1 (fr) | Procédé de production d'un jumeau numérique | |
WO2000011581A1 (fr) | Procede ameliore d'affectation des couts basee sur la quantite a partir de differentes sources d'informations | |
EP1403793A1 (fr) | Procédé pour la consignation intégrée et automatique des documents dans l'enregistrement des transactions commerciales | |
DE69130541T2 (de) | System und Verfahren zur Geschäftsüberwachung | |
WO2004083983A2 (fr) | Comparaison de modeles de systemes complexes | |
CH704497B1 (de) | Verfahren zum Benachrichtigen, Speichermedium mit Prozessoranweisungen für ein solches Verfahren. | |
DE102020126568A1 (de) | Systeme und verfahren für das feature-engineering | |
WO2004083982A2 (fr) | Modelisation d'un systeme complexe | |
Braunwarth et al. | Economic evaluation and optimization of the degree of automation in insurance processes | |
Rippler | Das Persona-Prinzip | |
Gedenk et al. | Die Wirkung von Preis-und Nicht-Preis-Promotions auf die Markenloyalität | |
EP1629401A2 (fr) | Procede, dispositif et programme informatique comportant des elements de code de programme et un produit de code de programme destines a l'analyse de donnees utiles structurees selon une structure de base de donnees | |
DE10331810A1 (de) | Herstellungsoptimierung und Synchronisationsprozess | |
WO2012017056A1 (fr) | Procédé et dispositif de traitement automatique de données en un format de cellule | |
DE102012025349A1 (de) | Bestimmung eines Ähnlichkeitsmaßes und Verarbeitung von Dokumenten | |
EP3672200A1 (fr) | Procédé de coopération d'une pluralité d'appareil d'un réseau local | |
Mütze et al. | Die Auftragsfreigabe–Unterschätzte Aufgabe der Fertigungssteuerung? | |
DE10208682A1 (de) | Verfahren und System zur automatischen verhaltensorientierten Optimierung eines Portfolios der finanziellen Instrumente | |
Heinrich et al. | SEMPA–A Semantic Business Process Management Approach for the Planning of Process Models | |
EP1403794A1 (fr) | Procédé et système de mémorisation automatique de données relatives à la gestion d'entreprise |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20050303 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: PANORATIO DATABASE IMAGES GMBH |
|
PUAK | Availability of information related to the publication of the international search report |
Free format text: ORIGINAL CODE: 0009015 |
|
17Q | First examination report despatched |
Effective date: 20080207 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20080618 |