US6092064A - On-line mining of quantitative association rules - Google Patents

On-line mining of quantitative association rules Download PDF

Info

Publication number
US6092064A
US6092064A US08/964,064 US96406497A US6092064A US 6092064 A US6092064 A US 6092064A US 96406497 A US96406497 A US 96406497A US 6092064 A US6092064 A US 6092064A
Authority
US
United States
Prior art keywords
node
tree
confidence
index
rule
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
US08/964,064
Other languages
English (en)
Inventor
Charu Chandra Aggarwal
Philip Shi-lung Yu
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Assigned to IBM CORPORATION reassignment IBM CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: AGGARWAL, CHARU CHANDRA, YU, PHILIP SHI-LUNG
Priority to US08/964,064 priority Critical patent/US6092064A/en
Priority to TW087112467A priority patent/TW505868B/zh
Priority to PCT/GB1998/002928 priority patent/WO1999023577A1/en
Priority to ES98945396T priority patent/ES2184322T3/es
Priority to JP2000519369A priority patent/JP3575602B2/ja
Priority to CNB988108658A priority patent/CN1138222C/zh
Priority to AU92726/98A priority patent/AU750629B2/en
Priority to KR10-2000-7004749A priority patent/KR100382296B1/ko
Priority to HU0100161A priority patent/HUP0100161A3/hu
Priority to DE69809964T priority patent/DE69809964T2/de
Priority to EP98945396A priority patent/EP1034489B1/en
Priority to PL98340380A priority patent/PL340380A1/xx
Priority to CA002304646A priority patent/CA2304646C/en
Priority to CZ20001630A priority patent/CZ294171B6/cs
Publication of US6092064A publication Critical patent/US6092064A/en
Application granted granted Critical
Priority to HK01104434A priority patent/HK1033987A1/xx
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • G06F16/2246Trees, e.g. B+trees
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/953Organization of data
    • Y10S707/954Relational
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/953Organization of data
    • Y10S707/956Hierarchical
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/964Database arrangement
    • Y10S707/966Distributed
    • Y10S707/967Peer-to-peer
    • Y10S707/968Partitioning
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99932Access augmentation or optimizing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • Y10S707/99936Pattern matching access

Definitions

  • the present invention relates generally to online searching for data dependencies in large databases and more particularly to an online method of data mining of data items to find quantitative association rules, where the data items comprise various kinds of quantitative and categorical attributes.
  • Data mining also known as knowledge discovery in databases
  • knowledge discovery in databases has been recognized as a new area for database research.
  • the volume of data stored in electronic format has increased dramatically over the past two decades.
  • the increase in use of electronic data gathering devices such as point-of-sale or remote sensing devices has contributed to this explosion of available data.
  • Data storage is becoming easier and more attractive to the business community as the availability of large amounts of computing power and data storage resources are being made available at increasingly reduced costs.
  • Data mining technologies are characterized by intensive computations on large volumes of data.
  • Large databases are definable as consisting of a million records or more.
  • end users will test association rules such as; "75% of customers who buy Cola also buy corn chips", where 75% refers to the rule's confidence factor.
  • the support of the rule is the percentage of transactions that contain both Cola and corn chips.
  • the present invention is directed to a method for efficiently performing online mining of quantitative association rules.
  • An association rule can be generally defined as a conditional statement that suggests that there exists some correlation between its two component parts, antecedent and consequent.
  • both the antecedent and consequent are composed from some user specified combination of quantitative and categorical attributes.
  • the user would provide three additional inputs representing the confidence and support level of interest to the user and a value referred to as interest level. These inputs provide an indication of the strength of the rule proposed by the user (the user query). In other words the strength of the suggested correlation between antecedent and consequent defined by the user query.
  • a method for preprocessing the raw data by utilizing the antecedent attributes to partition the data so as to create a mutidimensional indexing structure,followed by an online rule generation step.
  • pre-processing the data By effectively pre-processing the data into an indexing structure it is placed in a form suitable to answer repeated online queries with practically instantaneous response times.
  • the indexing structure obviates the need to make multiple passes over the database.
  • the indexing structure creates significant performance advantages over previous techniques.
  • the indexing structure (pre-processed data) is stored in such a way that online processing may be done by applying a graph theoretic search algorithm whose complexity is proportional to the size of the output. This results in an online algorithm which is practically instantaneous in terms of response time, minimizing excessive amounts of I/O or computation.
  • FIG. 1 is an overall description of the computer network in which this invention operates.
  • FIG. 2 is an overall description of the method performed by the invention. It consists of two stages described by FIGS. 2(a) and 2(b).
  • FIG. 2(a) is a description of the preprocessing stage.
  • FIG. 2(b) is a description of the on-line stage of the algorithm.
  • FIG. 3 is a detailed description of how the index tree is constructed using the antecedent set. It can be considered an expansion of step 75 of FIG. 2(a).
  • FIG. 4 is a detailed description of how the unmerged rule tree is generated from the index tree. It can be considered an expansion of step 100 of FIG. 2(b).
  • FIG. 5 is a description of how the merged rule tree is built from the unmerged rule tree.
  • FIG. 6 is a description of how the quantitative association rules are generated from the merged rule tree at some user specified interest level r.
  • the present invention is directed to a method for online data mining of quantitative association rules.
  • Traditional database queries consisting of simple questions such as "what were the sales of orange juice in January 1995 for the Long Island area?".
  • Data mining attempts to source out discernible patterns and trends in the data and infers rules from these patterns. With these rules the user is then able to support, review and examine decisions in some related business or scientific area.
  • Typical business decisions associated with the operation concern what to put on sale, how to design coupons, and how to place merchandise on shelves in order to maximize profit, etc. Analysis of past transaction data is a commonly used approach in order to improve the quality of such decisions.
  • An example of an association rule is: 30% of transactions that contain beer also contain diapers; 2% of all transactions contain both of these items". Here 30% is called the confidence of the rule, and 2% the support of the rule.
  • association rule Another example of such an association rule is the statement that 90% of customer transactions that purchase bread and butter also purchase milk.
  • the antecedent of this rule, X consists of bread and butter and the consequent, Y, consists of milk alone.
  • Ninety percent is the confidence factor of the rule. It may be desirable, for instance to find all rules that have "bagels" in the antecedent which may help determine what products (the consequent) may be impacted if the store discontinues selling bagels.
  • the problem of mining association rules is to find all rules that have support and confidence greater than the user-specified minimum support (minsupport s) and minimum confidence (minconfidence c).
  • the rule may be interpreted as:
  • the support and confidence of the rule collectively define the strength of the rule.
  • a non-inclusive yet representative list of the kinds of online queries that such a system can support include;
  • the present method particularizes the method of discovering general association rules to finding quantitative rules from a large database consisting of a set of raw transactions, D, defined by various quantitative and categorical attributes.
  • a typical quantitative/categorical database for a general marketing survey would consist of a series of records where each record reflects some combination of consumer characteristics and preferences;
  • a quantitative association rule is a condition of the form
  • X1, X2, . . . Xk correspond to quantitative antecedent attributes
  • Y1, Y2, . . . Yr, and C correspond to categorical antecedent attributes.
  • [11 . . . u1], [12 . . . u2], . . . [1 k . . . uk] correspond to the ranges for the various quantitative attributes.
  • Z1 and Z2 correspond to a multiple consequent condition.
  • the present method requires that a user supply three inputs, a proposed rule, otherwise referred to as the user query, in the form of an antecedent/consequent pair.
  • a proposed rule otherwise referred to as the user query
  • Minconfidence 50%
  • FIG. 1 is an overall description of the architecture of the present method. There are assumed to be multiple clients 40 which can access the preprocessed data over the network 35.
  • the preprocessed data resides at the server 5. There may be a cache 25 at the server end, along with the preprocessed data 20.
  • the preprocessing as well as the online processing takes place in the CPU 10.
  • a disk 15 is present in the event that the data is stored on disk.
  • the present method comprises two stages, a pre-processing stage followed by an online processing stage.
  • FIG. 2(a) shows an overall description of the preprocessing step as well as the online processing (rule generation steps) for the algorithm.
  • the pre-processing stage involves the construction of a binary index tree structure, see step 75 of FIG. 2 and the associated detailed description of FIG. 3(a).
  • the use of an index tree structure is a well known spatial data structure in the art which is used as a means to index on multidimensional data. Related work in prior art may be found in Guttman, A., A dynamic Index Structure for Spatial Searching. Proceedings of the ACM SIGMOD Conference. In the present method a variation on this index tree structure is employed in order to perform the on-line queries.
  • Antecedent attributes are utilized to partition the data so as to create a multidimensional indexing structure.
  • the indexing structure is a two-level structure where the higher level nodes are associated with at most two successor nodes and lower level nodes may have more than two successor nodes.
  • the construction of the indexing structure is crucial to performing effective online data mining. The key advantage resides in minimizing the amount of disk I/O required to respond to user queries.
  • FIG. 3(b) A graphical analogue of the indexing structure, stored in computer memory, is shown shown in FIG. 3(b) in the form of an index tree.
  • An index tree is a well known spatial data structure which is used in order to index on multi-dimensional data.
  • a separate index structure will be created in computer memory for each dimension, defined by a particular quantitative attribute, specified by the user in the online query.
  • FIG. 3(b) is a specific example of an index tree structure which represents the antecedent condition, "Age" and its associated consequent condition, "FirstTimeBuyer". To further clarify the concept of an index tree, FIG. 3(b) could have represented the "Age" dimension in the example below;
  • the root node of the index tree structure defines the user specified quantitative attribute, Age[0-100].
  • Each of the successive nodes of the tree also represent the quantitative attribute, Age, with increasingly narrower range limits from the top to the bottom of the tree heirarchy.
  • the binary successors to the root node for age[0-100] are Age[0-45] and Age[45-100].
  • the present method stores two pieces of data at each node of the index tree representing the confidence and support levels of interest. For example, with reference to FIG. 3(b), at the root node, two pieces of data are stored consisting of;
  • FIG. 3(a) is the detailed flowchart of the preprocessing stage of the algorithm, illustrated in FIG. 2 as element 100.
  • the process steps of this stage involve generating the binary index tree structure and storing the support and confidence levels for the consequent attribute at each node of the structure, followed by utilizing a compression algorithm on the lower levels of the structure to ensure that the index tree fits into the available memory.
  • Step 300 is the point of entry into the preprocessing stage.
  • Step 310 represents the software to implement the process step of using a binarization algorithm to generate a binary index tree.
  • the binarization step has been discussed in the prior art in Aggarwal C. C., Wolf J., Yu P. S., and Epelman M. A.
  • the S-Tree An efficient index tree for multidimensional index trees.
  • Step 315 the way in which the entries of an index node are organized is unique in that both the support level and the confidence level for each value of the consequent attribute are stored at each node in the structure.
  • Step 320 represents the software to implement the process step of utilizing a compression algorithm to compress the lower level index nodes into a single node.
  • FIG. 4(a) is the detailed flowchart of the primary search algorithm which is used in order to generate the unmerged rule tree from the index tree, illustrated in FIG. 2(b) as element 100.
  • the Querybox is merely a descriptive term to denote the lefthand or antecedent portion of the user query.
  • Example C describes what is required of an online user as input in the present method;
  • An online user would, in addition be required to input a user query (proposed rule) in the form of an an(antecedent/consequent) pair, items 3&4.
  • Item three the Querybox
  • Item four the consequent attribute, can consist of one or more categorical attributes.
  • This user specified query consists of an antecedent condition, querybox, with two dimensions, Age and Lefthandedness, and a single categorical consequent condition, asmoker.
  • This user specified query consists of an antecedent condition, querybox, with two dimensions, Height and Income and a multiple consequent condition.
  • the user specified query consists of a single antecedent condition, querybox, with a single dimension, Age, and a single consequent condition.
  • Example C describes in general terms what a user supplies as input to the method.
  • Example D below provides a representative example. Using the user query in example 2 above, a typical input/output result could look as follows:
  • the output can conceivably generate no rules, one rule, or multiple rules.
  • a single rule was generated in the example above.
  • the generated rule is said to satisfy the user query, (antecedent/consequent pair), at the user specified confidence and support level, 0.5 and 0.4 respectively.
  • Step 400 is the point of entry into the primary search algorithm.
  • Step 410 represents the software to implement the process step of setting a pointer, Currentnode to point to the root node of the index tree. Pointer CurrentNode will always point to the particular node of the index tree which the algorithm is presently searching.
  • Step 420 defines LIST as a set of nodes which are considered to be eligible nodes to be scanned by the search algorithm. LIST is initialized to contain only the root node in step 420.
  • Step 430 represents the software to implement the process step of adding all the child nodes of the node pointed to by Currentnode to LIST which intersect with Querybox Q, and have support at least equal to the user supplied input value, minsupport, s.
  • a child node is said to intersect with Querybox Q, when all of the antecedent conditions associated with the child node are wholly contained within the antecedent condition defined by the Querybox.
  • Step 450 follows steps 440 and 445 and represents the software to implement the process step of deleting the node presently pointed to by Currentnode from LIST and setting the pointer Currentnode to the next node contained in LIST.
  • Step 460 determines whether LIST is empty and terminates the algorithm when the condition is met, see Step 470. Otherwise, the algorithm returns to step 430 and repeats the steps for the node currently pointed to by the pointer CurrentNode.
  • an unmerged rule tree is output which consists of all nodes in the input index tree which satisfy the user specified minimum support, minsupport s.
  • FIG. 5(a) is the detailed flowchart which describes the process of constructing the merged rule tree from the unmerged rule tree.
  • the algorithm described by the flowchart compresses the unmerged rule tree to obtain a hierarchical representation of the rules.
  • the unmerged rule tree is traversed in depth first search order where at each node a determination is made as to whether that node is meaningful.
  • a meaningful node is defined to be a node which has a rule associated with it.
  • a rule may or may not have been associated with a node when the unmerged rule tree was created.
  • FIG. 4(b) the unmerged rule tree, where meaningful nodes correspond to nodes 1, 2, and 4. All meaningful nodes are preserved in the merged rule tree. If a node is determined not to be meaningful then the algorithm either eliminates that node, or merges multiple child nodes into a single node when certain conditions are met.
  • Step 500 represents the point of entry into the algorithm.
  • Step 510 represents the software to implement the process step of insuring that the unmerged rule tree is traversed in depth first search order.
  • Step 515 represents the step of proceeding to the next node in the unmerged rule tree in the depth first traversal.
  • Step 520 represents a decision step which determines whether the current rule node is a meaningful node. A branch is made to step 530 when the current node is determined to be meaningful. Otherwise the algorithm branches to step 540 thereby classifying the node as nonmeaningful.
  • Step 540 is a decision step which determines whether the nonmeaningful node has a child node. If the nonmeaningful node does have a child node a branch is taken to step 550.
  • Step 550 represents the software to implement the process step of deleting the current nonmeaningful node. Otherwise, if it is determined in step 540 that the current node does not have a child node, a branch will be taken to step 560.
  • Step 560 is a decision step for the purpose of determining whether the current nonmeaningful node has one or more than one child nodes. If the current node has only a single child node then a branch is taken to step 570.
  • Step 570 represents the software to implement the process step of deleting the current node and directly connecting the parent and child nodes of the deleted nonmeaningful node together in the index tree. Otherwise, in the case where the current node is found to have multiple child nodes a branch is taken to step 580.
  • Step 580 is a decision step which determines whether the minimum bounding rectangle of the two child nodes are more than that of the nonmeaningful parent node.
  • the minimum bounding rectangle is defined by the upper and lower bounds (the range) of the quantitative attribute for each child node.
  • the ranges of the child nodes are combined and found to be broader than the range of the parent node, a merger occurs. For example, if the child nodes were defined as;
  • Step 590 represents the software to perform the process step of adjusting the minimum bounding rectangle of the parent to be the minimum bounding rectangle of the two child nodes.
  • a branch to decision step 600 determines whether there are any more nodes to traverse in the tree.
  • a branch to termination step 610 occurs if there are no more nodes to traverse, otherwise process steps 490-515 are repeated for the remaining index nodes.
  • FIG. 6 is the detailed flowchart which describes the process of using the merged rule tree as input to define the rules at the user specified interest level r.
  • the merged rule tree is traversed in depth first order.
  • Step 616 is the point of entry into the flowchart.
  • a user would specify an input value for r, representing the interest level.
  • Step 618 represents the software to select the next node in the merged rule tree in depth first order.
  • Step 620 is a decision step which represents the software which looks at all ancestral nodes of the current node of interest to determine whether any of them has a confidence value at least equal to 1/r of the current node.
  • a branch to Step 630 will be taken when condition is true.
  • Step 630 represents the software to prune the rule associated with the current node.
  • Step 640 is a decision step which determines whether there are any remaining nodes to be evaluated in the merged rule tree. The process steps will be repeated if there are additional nodes to be evaluated, otherwise the process terminates at this point.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
US08/964,064 1997-11-04 1997-11-04 On-line mining of quantitative association rules Expired - Lifetime US6092064A (en)

Priority Applications (15)

Application Number Priority Date Filing Date Title
US08/964,064 US6092064A (en) 1997-11-04 1997-11-04 On-line mining of quantitative association rules
TW087112467A TW505868B (en) 1997-11-04 1998-07-29 On-line mining of quantitative association rules
HU0100161A HUP0100161A3 (en) 1997-11-04 1998-09-29 Online database mining
PL98340380A PL340380A1 (en) 1997-11-04 1998-09-29 Method of acquiring data directly from data base
JP2000519369A JP3575602B2 (ja) 1997-11-04 1998-09-29 ć‚Ŗćƒ³ćƒ©ć‚¤ćƒ³ćƒ»ćƒ‡ćƒ¼ć‚æćƒ™ćƒ¼ć‚¹ćƒ»ćƒžć‚¤ćƒ‹ćƒ³ć‚°
CNB988108658A CN1138222C (zh) 1997-11-04 1998-09-29 åœØēŗæę•°ę®åŗ“ęŒ–ęŽ˜ēš„ę–¹ę³•å’Œč®¾å¤‡
AU92726/98A AU750629B2 (en) 1997-11-04 1998-09-29 Online database mining
KR10-2000-7004749A KR100382296B1 (ko) 1997-11-04 1998-09-29 ģ˜Øė¼ģø ė°ģ“ķ„°ė² ģ“ģŠ¤ ė§ˆģ“ė‹
PCT/GB1998/002928 WO1999023577A1 (en) 1997-11-04 1998-09-29 Online database mining
DE69809964T DE69809964T2 (de) 1997-11-04 1998-09-29 Online-datenbank ausbeutung
EP98945396A EP1034489B1 (en) 1997-11-04 1998-09-29 Online database mining
ES98945396T ES2184322T3 (es) 1997-11-04 1998-09-29 Extraccion de datos en linea.
CA002304646A CA2304646C (en) 1997-11-04 1998-09-29 Online database mining
CZ20001630A CZ294171B6 (cs) 1997-11-04 1998-09-29 ZpÅÆsob pÅ™Ć­mĆ©ho vyhledĆ”vĆ”nĆ­ v rozsĆ”hlĆ© databĆ”zi
HK01104434A HK1033987A1 (en) 1997-11-04 2001-06-27 Method and apparatus for online database mining

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US08/964,064 US6092064A (en) 1997-11-04 1997-11-04 On-line mining of quantitative association rules

Publications (1)

Publication Number Publication Date
US6092064A true US6092064A (en) 2000-07-18

Family

ID=25508083

Family Applications (1)

Application Number Title Priority Date Filing Date
US08/964,064 Expired - Lifetime US6092064A (en) 1997-11-04 1997-11-04 On-line mining of quantitative association rules

Country Status (15)

Country Link
US (1) US6092064A (xx)
EP (1) EP1034489B1 (xx)
JP (1) JP3575602B2 (xx)
KR (1) KR100382296B1 (xx)
CN (1) CN1138222C (xx)
AU (1) AU750629B2 (xx)
CA (1) CA2304646C (xx)
CZ (1) CZ294171B6 (xx)
DE (1) DE69809964T2 (xx)
ES (1) ES2184322T3 (xx)
HK (1) HK1033987A1 (xx)
HU (1) HUP0100161A3 (xx)
PL (1) PL340380A1 (xx)
TW (1) TW505868B (xx)
WO (1) WO1999023577A1 (xx)

Cited By (48)

* Cited by examiner, ā€  Cited by third party
Publication number Priority date Publication date Assignee Title
US6185549B1 (en) * 1998-04-29 2001-02-06 Lucent Technologies Inc. Method for mining association rules in data
US6278998B1 (en) * 1999-02-16 2001-08-21 Lucent Technologies, Inc. Data mining using cyclic association rules
US6311179B1 (en) * 1998-10-30 2001-10-30 International Business Machines Corporation System and method of generating associations
US6311173B1 (en) * 1998-01-05 2001-10-30 Wizsoft Ltd. Pattern recognition using generalized association rules
US6321225B1 (en) * 1999-04-23 2001-11-20 Microsoft Corporation Abstracting cooked variables from raw variables
US6385608B1 (en) * 1997-11-11 2002-05-07 Mitsubishi Denki Kabushiki Kaisha Method and apparatus for discovering association rules
US6405200B1 (en) 1999-04-23 2002-06-11 Microsoft Corporation Generating a model for raw variables from a model for cooked variables
US6470351B1 (en) 1994-12-15 2002-10-22 Ufil Unified Data Technologies, Ltd. Binary-oriented set sequencing
US6529895B2 (en) 1999-04-23 2003-03-04 Microsoft Corporation Determining a distribution of a numeric variable
US6542878B1 (en) 1999-04-23 2003-04-01 Microsoft Corporation Determining whether a variable is numeric or non-numeric
US20030200199A1 (en) * 2002-04-19 2003-10-23 Dow Jones Reuters Business Interactive, Llc Apparatus and method for generating data useful in indexing and searching
US20030212699A1 (en) * 2002-05-08 2003-11-13 International Business Machines Corporation Data store for knowledge-based data mining system
US20030212667A1 (en) * 2002-05-10 2003-11-13 International Business Machines Corporation Systems, methods, and computer program products to browse database query information
US20030212669A1 (en) * 2002-05-07 2003-11-13 Aatish Dedhia System and method for context based searching of electronic catalog database, aided with graphical feedback to the user
US20030212675A1 (en) * 2002-05-08 2003-11-13 International Business Machines Corporation Knowledge-based data mining system
US20030212676A1 (en) * 2002-05-10 2003-11-13 International Business Machines Corporation Systems, methods and computer program products to determine useful relationships and dimensions of a database
US20030229605A1 (en) * 2002-04-19 2003-12-11 Herrera Peter J. System and method for building a rulebase
US6711577B1 (en) 2000-10-09 2004-03-23 Battelle Memorial Institute Data mining and visualization techniques
US6714940B2 (en) 2001-11-15 2004-03-30 International Business Machines Corporation Systems, methods, and computer program products to rank and explain dimensions associated with exceptions in multidimensional data
US20040122646A1 (en) * 2002-12-18 2004-06-24 International Business Machines Corporation System and method for automatically building an OLAP model in a relational database
US6757678B2 (en) 2001-04-12 2004-06-29 International Business Machines Corporation Generalized method and system of merging and pruning of data trees
US20040139061A1 (en) * 2003-01-13 2004-07-15 International Business Machines Corporation Method, system, and program for specifying multidimensional calculations for a relational OLAP engine
US20040260679A1 (en) * 2003-06-19 2004-12-23 International Business Machines Corporation Personalized indexing and searching for information in a distributed data processing system
US20040260680A1 (en) * 2003-06-19 2004-12-23 International Business Machines Corporation Personalized indexing and searching for information in a distributed data processing system
US6865582B2 (en) * 2000-01-03 2005-03-08 Bechtel Bwxt Idaho, Llc Systems and methods for knowledge discovery in spatial data
US20050175245A1 (en) * 2004-02-10 2005-08-11 Microsoft Corporation Voting based scheme for electronic document node reuse
US20050198041A1 (en) * 2000-12-07 2005-09-08 Lewandowski Robert P. Method and apparatus for processing electronic records for physical transactions
US20050278290A1 (en) * 2004-06-14 2005-12-15 International Business Machines Corporation Systems, methods, and computer program products that automatically discover metadata objects and generate multidimensional models
US20050283488A1 (en) * 2004-06-22 2005-12-22 International Business Machines Corporation Model based optimization with focus regions
US20050283494A1 (en) * 2004-06-22 2005-12-22 International Business Machines Corporation Visualizing and manipulating multidimensional OLAP models graphically
US7007020B1 (en) * 2000-03-10 2006-02-28 Hewlett-Packard Development Company, L.P. Distributed OLAP-based association rule generation method and system
US7010526B2 (en) 2002-05-08 2006-03-07 International Business Machines Corporation Knowledge-based data mining system
US20060178960A1 (en) * 1999-04-09 2006-08-10 Berkeley *Ieor Process for determining object level profitability
US20060271528A1 (en) * 2003-09-10 2006-11-30 Exeros, Inc. Method and system for facilitating data retrieval from a plurality of data sources
US20070250476A1 (en) * 2006-04-21 2007-10-25 Lockheed Martin Corporation Approximate nearest neighbor search in metric space
US20090024551A1 (en) * 2007-07-17 2009-01-22 International Business Machines Corporation Managing validation models and rules to apply to data sets
US7539677B1 (en) 2000-10-09 2009-05-26 Battelle Memorial Institute Sequential pattern data mining and visualization
US20090150391A1 (en) * 2007-12-05 2009-06-11 Fujitsu Limited Rare pattern extracting device and rare pattern extracting method
US20090327208A1 (en) * 2008-06-30 2009-12-31 International Business Machines Corporation Discovering transformations applied to a source table to generate a target table
US20100030719A1 (en) * 2008-07-10 2010-02-04 Covey Todd M Methods and apparatus related to bioinformatics data analysis
US20100070505A1 (en) * 2008-09-18 2010-03-18 International Business Machines Corporation Classification of Data in a Hierarchical Data Structure
US7895191B2 (en) 2003-04-09 2011-02-22 International Business Machines Corporation Improving performance of database queries
US8924343B2 (en) 2005-03-23 2014-12-30 International Business Machines Coporation Method and system for using confidence factors in forming a system
US8930303B2 (en) 2012-03-30 2015-01-06 International Business Machines Corporation Discovering pivot type relationships between database objects
US20150026555A1 (en) * 2013-07-16 2015-01-22 Fujitsu Limited Data output method, computer-readable recording medium storing data output program and data output system
US20150026217A1 (en) * 2013-07-16 2015-01-22 Fujitsu Limited Data output method, computer-readable recording medium storing data output program and data output system
US20160179903A1 (en) * 2014-12-23 2016-06-23 Ran Bittmann Enhancing frequent itemset mining
US20190191004A1 (en) * 2017-05-23 2019-06-20 Hitachi ,Ltd. System and method to reduce network traffic and load of host servers

Families Citing this family (18)

* Cited by examiner, ā€  Cited by third party
Publication number Priority date Publication date Assignee Title
KR100344530B1 (ko) * 1999-12-20 2002-07-24 ķ•œźµ­ź³¼ķ•™źø°ģˆ ģ› ģ‹œź³„ģ—“ ė°ģ“ķ„°ė² ģ“ģŠ¤ģ—ģ„œ ģœˆė„ģš° źµ¬ģ„±ģ˜ ģ“ģ›ģ„±ģ„ ģ‚¬ģš©ķ•œ ģ„œėøŒģ‹œķ€€ģŠ¤ ė§¤ģ¹­ė°©ė²•
KR20020018777A (ko) * 2000-09-04 2002-03-09 ė°•ėŒ€ķ¬ ķŒØķ„“ ė¶„ė„˜ė„¼ ģœ„ķ•œ ģ ģ¦ģ  ź°±ģ‹ ģ˜ ė°ģ“ķ„° ė§ˆģ“ė‹ ė°©ė²•
CN1310172C (zh) * 2001-07-26 2007-04-11 国际商äøšęœŗå™Ø公åø ē”Ÿęˆå€™č”„同义čƍēš„ę•°ę®å¤„ē†ę–¹ę³•å’Œē³»ē»Ÿ
KR20030032096A (ko) * 2001-10-10 2003-04-26 ģ“ģ°½ķ™˜ ė°ģ“ķ„° ė§ˆģ“ė‹ ė°©ė²• ė° ģ»“ķ“Øķ„°ė”œ ķŒė…ź°€ėŠ„ķ•œ źø°ė” ė§¤ģ²“
KR100500329B1 (ko) * 2001-10-18 2005-07-11 ģ£¼ģ‹ķšŒģ‚¬ ķ•øė””ģ†Œķ”„ķŠø ģ›Œķ¬ķ”Œė”œģš° ė§ˆģ“ė‹ ģ‹œģŠ¤ķ…œ ė° ė°©ė²•
KR100497212B1 (ko) * 2002-03-02 2005-06-23 (ģ£¼)ė¹„ģ—˜ģ‹œģŠ¤ķ…œģŠ¤ ė°ģ“ķ„° ė§ˆģ“ė‹ģ—ģ„œģ˜ ģ•™ģƒėø” źø°ė²•ģ— ģ ģš©ė˜ėŠ” ģ—°ź“€ģ„± ź·œģ¹™ģƒģ„± ģž„ģ¹˜ ė° ź·ø ė°©ė²•
KR100812378B1 (ko) * 2005-11-28 2008-03-11 ģ“ģ›ģ„ ģ§€ģ†ģ ģœ¼ė”œ ė°œģƒė˜ėŠ” ķŠøėžœģž­ģ…˜ ė°ģ“ķ„° ģ§‘ķ•©ģø ė°ģ“ķ„° ģŠ¤ķŠøė¦¼ ķ™˜ź²½ģ—ģ„œ ė¹ˆė°œķ•­ėŖ©ģ§‘ķ•© ķƒģƒ‰ģ„ ģœ„ķ•œ ģ¶•ģ•½ķ˜• ģ „ģœ„ ķŠøė¦¬ė„¼ ģ“ģš©ķ•œ ė¹ˆė°œķ•­ėŖ©ģ§‘ķ•© ķƒģƒ‰ ė°©ė²•
KR100799665B1 (ko) * 2007-04-10 2008-01-30 ģ‚¼ģœ”ėŒ€ķ•™źµģ‚°ķ•™ķ˜‘ė „ė‹Ø ė…øģø ė³µģ§€ ģ„œė¹„ģŠ¤ė„¼ ģœ„ķ•œ ģš•źµ¬ ģ‚¬ģ • ģ˜ˆģø” ė°©ė²• ė° ģƒźø°ė°©ė²•ģ„ ģˆ˜ķ–‰ķ•˜źø° ģœ„ķ•œ ģ‹œģŠ¤ķ…œ
US8185531B2 (en) * 2008-07-24 2012-05-22 Nahava Inc. Method and apparatus for partitioning high-dimension vectors for use in a massive index tree
US20110035444A1 (en) * 2009-08-06 2011-02-10 Timedright Inc. Relationship security in online social and professional networks and communities
CN101996102B (zh) * 2009-08-31 2013-07-17 äø­å›½ē§»åŠØ通äæ”集团公åø ę•°ę®å…³č”č§„åˆ™ęŒ–ęŽ˜å®žēŽ°ę–¹ę³•äøŽē³»ē»Ÿ
CN102117302B (zh) * 2009-12-31 2013-01-23 南äŗ¬ē†å·„大学 ä¼ ę„Ÿå™Øę•°ę®ęµå¤ę‚ęŸ„čÆ¢ē»“ęžœēš„ę•°ę®čµ·ęŗč·ŸčøŖę–¹ę³•
JP6020031B2 (ja) 2012-10-19 2016-11-02 åƌ士通ę Ŗ式会ē¤¾ ęŠ½å‡ŗćƒ—ćƒ­ć‚°ćƒ©ćƒ ć€ęŠ½å‡ŗč£…ē½®åŠć³ęŠ½å‡ŗę–¹ę³•
JP6003561B2 (ja) 2012-11-15 2016-10-05 åƌ士通ę Ŗ式会ē¤¾ ęŠ½å‡ŗćƒ—ćƒ­ć‚°ćƒ©ćƒ ć€ęŠ½å‡ŗč£…ē½®åŠć³ęŠ½å‡ŗę–¹ę³•
JP5962471B2 (ja) 2012-11-30 2016-08-03 åƌ士通ę Ŗ式会ē¤¾ ęŠ½å‡ŗćƒ—ćƒ­ć‚°ćƒ©ćƒ ć€ęŠ½å‡ŗč£…ē½®åŠć³ęŠ½å‡ŗę–¹ę³•
US10671607B2 (en) * 2016-09-23 2020-06-02 Futurewei Technologies, Inc. Pipeline dependent tree query optimizer and scheduler
CN107703383A (zh) * 2017-09-21 2018-02-16 国ē½‘äøŠęµ·åø‚ē”µåŠ›å…¬åø äø€ē§äæ”ęÆ采集ē³»ē»Ÿę•…éšœčÆŠę–­ēŸ„čƆåŗ“ēš„å»ŗē«‹ę–¹ę³•
CN112183823B (zh) * 2020-09-08 2023-12-05 国ē½‘ę±Ÿč‹ēœē”µåŠ›ęœ‰é™å…¬åøč„é”€ęœåŠ”äø­åæƒ äø€ē§åŸŗäŗŽč§„åˆ™ę ‘ēš„ē”µčƒ½č®”é‡č£…ē½®é€‰åž‹ę–¹ę³•å’Œē³»ē»Ÿ

Citations (6)

* Cited by examiner, ā€  Cited by third party
Publication number Priority date Publication date Assignee Title
US5615341A (en) * 1995-05-08 1997-03-25 International Business Machines Corporation System and method for mining generalized association rules in databases
US5664171A (en) * 1994-04-14 1997-09-02 International Business Machines Corporation System and method for query optimization using quantile values of a large unordered data set
EP0810807A2 (en) * 1996-05-31 1997-12-03 Nortel Networks Corporation Method and apparatus for analysing network data
US5724573A (en) * 1995-12-22 1998-03-03 International Business Machines Corporation Method and system for mining quantitative association rules in large relational tables
US5794209A (en) * 1995-03-31 1998-08-11 International Business Machines Corporation System and method for quickly mining association rules in databases
US5819266A (en) * 1995-03-03 1998-10-06 International Business Machines Corporation System and method for mining sequential patterns in a large database

Family Cites Families (5)

* Cited by examiner, ā€  Cited by third party
Publication number Priority date Publication date Assignee Title
US5737550A (en) * 1995-03-28 1998-04-07 Advanced Micro Devices, Inc. Cache memory to processor bus interface and method thereof
JP2963033B2 (ja) * 1995-09-29 1999-10-12 ę Ŗ式会ē¤¾é‡Žę‘ē·åˆē ”ē©¶ę‰€ ć‚µćƒ³ćƒ—ćƒ«åˆ†é”žę”Æę“č£…ē½®
JPH09114669A (ja) * 1995-10-16 1997-05-02 Hitachi Ltd ćƒ«ćƒ¼ćƒ«ē”Ÿęˆę–¹ę³•
JPH09251467A (ja) * 1996-03-15 1997-09-22 Mitsubishi Electric Corp ćƒ‡ćƒ¼ć‚æćƒžć‚¤ćƒ‹ćƒ³ć‚°ć‚·ć‚¹ćƒ†ćƒ ćŠć‚ˆć³ćƒ‡ćƒ¼ć‚æćƒžć‚¤ćƒ‹ćƒ³ć‚°ę–¹ę³•
JP3952518B2 (ja) * 1996-03-29 2007-08-01 ę Ŗ式会ē¤¾ę—„ē«‹č£½ä½œę‰€ å¤šę¬”å…ƒćƒ‡ćƒ¼ć‚æ処ē†ę–¹ę³•

Patent Citations (6)

* Cited by examiner, ā€  Cited by third party
Publication number Priority date Publication date Assignee Title
US5664171A (en) * 1994-04-14 1997-09-02 International Business Machines Corporation System and method for query optimization using quantile values of a large unordered data set
US5819266A (en) * 1995-03-03 1998-10-06 International Business Machines Corporation System and method for mining sequential patterns in a large database
US5794209A (en) * 1995-03-31 1998-08-11 International Business Machines Corporation System and method for quickly mining association rules in databases
US5615341A (en) * 1995-05-08 1997-03-25 International Business Machines Corporation System and method for mining generalized association rules in databases
US5724573A (en) * 1995-12-22 1998-03-03 International Business Machines Corporation Method and system for mining quantitative association rules in large relational tables
EP0810807A2 (en) * 1996-05-31 1997-12-03 Nortel Networks Corporation Method and apparatus for analysing network data

Non-Patent Citations (36)

* Cited by examiner, ā€  Cited by third party
Title
Agrawal, Rakesh et al. "Database Mining: A Performance Perspective", IEEE Transaction on knowledge and Data Engineering, vol. 5, No. 6, Dec. 1993.
Agrawal, Rakesh et al. Database Mining: A Performance Perspective , IEEE Transaction on knowledge and Data Engineering, vol. 5, No. 6, Dec. 1993. *
Antonin Guttman, "R-Trees: A Dynamic Index Structure For Spatial Searching", ACM, 1984, pp. 47-57.
Antonin Guttman, R Trees: A Dynamic Index Structure For Spatial Searching , ACM, 1984, pp. 47 57. *
Bergamo, Lisa "Silicon Graphics introduces MineSet", PR Newswire Apr. 16, 1996, Dateline: San Francisco, CA Apr. 1996.
Bergamo, Lisa Silicon Graphics introduces MineSet , PR Newswire Apr. 16, 1996, Dateline: San Francisco, CA Apr. 1996. *
Charu Aggarwal, et al., "The S-Tree: An Efficient Index for Multidimensional Objects", Proceddings of the International Symposium on Spatial Databases, Jul. 1997, pp. 350-370.
Charu Aggarwal, et al., The S Tree: An Efficient Index for Multidimensional Objects , Proceddings of the International Symposium on Spatial Databases, Jul. 1997, pp. 350 370. *
Charu C. Aggarwal, et al., "Online Generation of Association Rules", Research Report, Computer Science/Mathematics, RC20899 (92609), Jun. 24, 1997, pp. 1-23.
Charu C. Aggarwal, et al., Online Generation of Association Rules , Research Report, Computer Science/Mathematics, RC20899 (92609), Jun. 24, 1997, pp. 1 23. *
Cheung, David et al., "Maintenance of Discovered Association Rules in Large Databases: An Incremental Updating Technique", 1996 IEEE Feb. 1996.
Cheung, David et al., Maintenance of Discovered Association Rules in Large Databases: An Incremental Updating Technique , 1996 IEEE Feb. 1996. *
Edelstein, Herb "Mining for gold", Informationweek n627, pp. 53-70, ISSN:8750-6874, Apr. 21, 1997.
Edelstein, Herb Mining for gold , Informationweek n627, pp. 53 70, ISSN:8750 6874, Apr. 21, 1997. *
Fayyad, Usama et al., "The KDD Process for Extracting Useful Knowledge from Volumes of Data", Http://www.research.microsoft.com/research/datamine/CACM-DM-refs/.ACM Feb. 1996.
Fayyad, Usama et al., The KDD Process for Extracting Useful Knowledge from Volumes of Data , Http://www.research.microsoft.com/research/datamine/CACM DM refs/.ACM Feb. 1996. *
Hedberg, Sara "Parallelism Speeds Data Mining", IEEE Parallel & Distributed Technology, vol. 3, Issue 4, Dec. 1995.
Hedberg, Sara Parallelism Speeds Data Mining , IEEE Parallel & Distributed Technology, vol. 3, Issue 4, Dec. 1995. *
Houtsma, Maurice et al., "Set-Oriented Mining for Association Rules in Relational Databases", Feb. 1995 IEEE pp. 25-33 Feb.
Houtsma, Maurice et al., Set Oriented Mining for Association Rules in Relational Databases , Feb. 1995 IEEE pp. 25 33 Feb. *
Jong Soo Park, et al., "An Effective Hash-based Algorithm for Mining Association Rules", ACM, 1995, pp. 175-186.
Jong Soo Park, et al., An Effective Hash based Algorithm for Mining Association Rules , ACM, 1995, pp. 175 186. *
Kaufman, K., et al., "Mining for Knowledge in Databases:Goals and General Description of the Inlen System", Knowledge Discovery in Databases, pp. 449-462(1991).
Kaufman, K., et al., Mining for Knowledge in Databases:Goals and General Description of the Inlen System , Knowledge Discovery in Databases, pp. 449 462(1991). *
Moxon, Bruce "Defining data Mining", DBMS Aug. 1996 v9 n9 ps11 (3).
Moxon, Bruce Defining data Mining , DBMS Aug. 1996 v9 n9 ps11 (3). *
Rakesh Agrawal, et al., "Fast Algorithms for Mining Association Rules", Proceedings of the 20th VLDB Conference, Santiago, Chile, 1994, pp. 487-499.
Rakesh Agrawal, et al., "Mining Association Rules Between Sets of Items in Large Databases", ACM, 1993, pp. 207-216.
Rakesh Agrawal, et al., Fast Algorithms for Mining Association Rules , Proceedings of the 20th VLDB Conference, Santiago, Chile, 1994, pp. 487 499. *
Rakesh Agrawal, et al., Mining Association Rules Between Sets of Items in Large Databases , ACM, 1993, pp. 207 216. *
Ramakrishnan Srikant, et al., "Mining Generalized Association Rules", Proceedings of the 21st VLDB Conference, Zurich, Switzerland, 1995, pp. 407-419.
Ramakrishnan Srikant, et al., Mining Generalized Association Rules , Proceedings of the 21st VLDB Conference, Zurich, Switzerland, 1995, pp. 407 419. *
Ramakrishnan Srivant, et al., "Mining Quantitative Association Rules in Large Relational Tables", ACM, 1996, pp. 1-12.
Ramakrishnan Srivant, et al., Mining Quantitative Association Rules in Large Relational Tables , ACM, 1996, pp. 1 12. *
Zhang, K., "On Rough Sets and Interference Analysis", Cambridge University Computer Laboratory, pp. 256-265, (1997).
Zhang, K., On Rough Sets and Interference Analysis , Cambridge University Computer Laboratory, pp. 256 265, (1997). *

Cited By (87)

* Cited by examiner, ā€  Cited by third party
Publication number Priority date Publication date Assignee Title
US20060064433A1 (en) * 1994-12-15 2006-03-23 Ufil Unified Data Technolgies, Ltd. Method and apparatus for binary-oriented set sequencing
US6721757B2 (en) 1994-12-15 2004-04-13 Ufil Unified Data Technologies, Ltd. Method and apparatus for binary-oriented set sequencing
US6636865B2 (en) 1994-12-15 2003-10-21 Ufil Unified Data Technologies, Ltd. Method and apparatus for binary-oriented set sequencing
US7076501B2 (en) 1994-12-15 2006-07-11 Ufil Unified Data Technologies, Ltd. Method and apparatus for binary-oriented set sequencing
US6470351B1 (en) 1994-12-15 2002-10-22 Ufil Unified Data Technologies, Ltd. Binary-oriented set sequencing
US6691129B2 (en) 1994-12-15 2004-02-10 Ufil Unified Data Technologies, Ltd. Method and apparatus for binary-oriented set sequencing
US20040139055A1 (en) * 1994-12-15 2004-07-15 Ufil Unified Data Technologies, Ltd. Method and apparatus for binary-oriented set sequencing
US6385608B1 (en) * 1997-11-11 2002-05-07 Mitsubishi Denki Kabushiki Kaisha Method and apparatus for discovering association rules
US6311173B1 (en) * 1998-01-05 2001-10-30 Wizsoft Ltd. Pattern recognition using generalized association rules
US6185549B1 (en) * 1998-04-29 2001-02-06 Lucent Technologies Inc. Method for mining association rules in data
US6311179B1 (en) * 1998-10-30 2001-10-30 International Business Machines Corporation System and method of generating associations
US6278998B1 (en) * 1999-02-16 2001-08-21 Lucent Technologies, Inc. Data mining using cyclic association rules
US7596521B2 (en) 1999-04-09 2009-09-29 Berkeley *Ieor Process for determining object level profitability
US20060178960A1 (en) * 1999-04-09 2006-08-10 Berkeley *Ieor Process for determining object level profitability
US20060190367A1 (en) * 1999-04-09 2006-08-24 Berkeley *Ieor Process for determining object level profitability
US6321225B1 (en) * 1999-04-23 2001-11-20 Microsoft Corporation Abstracting cooked variables from raw variables
US6542878B1 (en) 1999-04-23 2003-04-01 Microsoft Corporation Determining whether a variable is numeric or non-numeric
US6529895B2 (en) 1999-04-23 2003-03-04 Microsoft Corporation Determining a distribution of a numeric variable
US6405200B1 (en) 1999-04-23 2002-06-11 Microsoft Corporation Generating a model for raw variables from a model for cooked variables
US6865582B2 (en) * 2000-01-03 2005-03-08 Bechtel Bwxt Idaho, Llc Systems and methods for knowledge discovery in spatial data
US7007020B1 (en) * 2000-03-10 2006-02-28 Hewlett-Packard Development Company, L.P. Distributed OLAP-based association rule generation method and system
US6711577B1 (en) 2000-10-09 2004-03-23 Battelle Memorial Institute Data mining and visualization techniques
US7539677B1 (en) 2000-10-09 2009-05-26 Battelle Memorial Institute Sequential pattern data mining and visualization
US20050198041A1 (en) * 2000-12-07 2005-09-08 Lewandowski Robert P. Method and apparatus for processing electronic records for physical transactions
US6757678B2 (en) 2001-04-12 2004-06-29 International Business Machines Corporation Generalized method and system of merging and pruning of data trees
US6714940B2 (en) 2001-11-15 2004-03-30 International Business Machines Corporation Systems, methods, and computer program products to rank and explain dimensions associated with exceptions in multidimensional data
US20040015463A1 (en) * 2002-04-19 2004-01-22 Herrera Peter J. System and method for providing inferencing services
US20030200199A1 (en) * 2002-04-19 2003-10-23 Dow Jones Reuters Business Interactive, Llc Apparatus and method for generating data useful in indexing and searching
US7356522B2 (en) 2002-04-19 2008-04-08 Computer Associates Think, Inc. System and method for building a rulebase
US7191163B2 (en) 2002-04-19 2007-03-13 Computer Associates Think, Inc. System and method for providing inferencing services
US20030229605A1 (en) * 2002-04-19 2003-12-11 Herrera Peter J. System and method for building a rulebase
US7152056B2 (en) 2002-04-19 2006-12-19 Dow Jones Reuters Business Interactive, Llc Apparatus and method for generating data useful in indexing and searching
US6920459B2 (en) * 2002-05-07 2005-07-19 Zycus Infotech Pvt Ltd. System and method for context based searching of electronic catalog database, aided with graphical feedback to the user
US20030212669A1 (en) * 2002-05-07 2003-11-13 Aatish Dedhia System and method for context based searching of electronic catalog database, aided with graphical feedback to the user
US7010526B2 (en) 2002-05-08 2006-03-07 International Business Machines Corporation Knowledge-based data mining system
US8214391B2 (en) 2002-05-08 2012-07-03 International Business Machines Corporation Knowledge-based data mining system
US20030212699A1 (en) * 2002-05-08 2003-11-13 International Business Machines Corporation Data store for knowledge-based data mining system
US6993534B2 (en) * 2002-05-08 2006-01-31 International Business Machines Corporation Data store for knowledge-based data mining system
US20030212675A1 (en) * 2002-05-08 2003-11-13 International Business Machines Corporation Knowledge-based data mining system
US20030212676A1 (en) * 2002-05-10 2003-11-13 International Business Machines Corporation Systems, methods and computer program products to determine useful relationships and dimensions of a database
US7447687B2 (en) 2002-05-10 2008-11-04 International Business Machines Corporation Methods to browse database query information
US7873664B2 (en) 2002-05-10 2011-01-18 International Business Machines Corporation Systems and computer program products to browse database query information
US20030212667A1 (en) * 2002-05-10 2003-11-13 International Business Machines Corporation Systems, methods, and computer program products to browse database query information
US6947929B2 (en) 2002-05-10 2005-09-20 International Business Machines Corporation Systems, methods and computer program products to determine useful relationships and dimensions of a database
US20040122646A1 (en) * 2002-12-18 2004-06-24 International Business Machines Corporation System and method for automatically building an OLAP model in a relational database
US7716167B2 (en) 2002-12-18 2010-05-11 International Business Machines Corporation System and method for automatically building an OLAP model in a relational database
US7953694B2 (en) 2003-01-13 2011-05-31 International Business Machines Corporation Method, system, and program for specifying multidimensional calculations for a relational OLAP engine
US20040139061A1 (en) * 2003-01-13 2004-07-15 International Business Machines Corporation Method, system, and program for specifying multidimensional calculations for a relational OLAP engine
US7895191B2 (en) 2003-04-09 2011-02-22 International Business Machines Corporation Improving performance of database queries
US20040260680A1 (en) * 2003-06-19 2004-12-23 International Business Machines Corporation Personalized indexing and searching for information in a distributed data processing system
US20040260679A1 (en) * 2003-06-19 2004-12-23 International Business Machines Corporation Personalized indexing and searching for information in a distributed data processing system
US20070271247A1 (en) * 2003-06-19 2007-11-22 Best Steven F Personalized Indexing And Searching For Information In A Distributed Data Processing System
US7289983B2 (en) 2003-06-19 2007-10-30 International Business Machines Corporation Personalized indexing and searching for information in a distributed data processing system
US7865494B2 (en) 2003-06-19 2011-01-04 International Business Machines Corporation Personalized indexing and searching for information in a distributed data processing system
US20090094274A1 (en) * 2003-09-10 2009-04-09 Exeros, Inc. Method and apparatus for semantic discovery and mapping between data sources
US20060271528A1 (en) * 2003-09-10 2006-11-30 Exeros, Inc. Method and system for facilitating data retrieval from a plurality of data sources
US9336253B2 (en) 2003-09-10 2016-05-10 International Business Machines Corporation Semantic discovery and mapping between data sources
US8082243B2 (en) 2003-09-10 2011-12-20 International Business Machines Corporation Semantic discovery and mapping between data sources
US8442999B2 (en) 2003-09-10 2013-05-14 International Business Machines Corporation Semantic discovery and mapping between data sources
US8874613B2 (en) 2003-09-10 2014-10-28 International Business Machines Corporation Semantic discovery and mapping between data sources
US7680828B2 (en) * 2003-09-10 2010-03-16 International Business Machines Corporation Method and system for facilitating data retrieval from a plurality of data sources
US20050175245A1 (en) * 2004-02-10 2005-08-11 Microsoft Corporation Voting based scheme for electronic document node reuse
US7958132B2 (en) * 2004-02-10 2011-06-07 Microsoft Corporation Voting based scheme for electronic document node reuse
US7707143B2 (en) 2004-06-14 2010-04-27 International Business Machines Corporation Systems, methods, and computer program products that automatically discover metadata objects and generate multidimensional models
US20050278290A1 (en) * 2004-06-14 2005-12-15 International Business Machines Corporation Systems, methods, and computer program products that automatically discover metadata objects and generate multidimensional models
US7480663B2 (en) 2004-06-22 2009-01-20 International Business Machines Corporation Model based optimization with focus regions
US20050283494A1 (en) * 2004-06-22 2005-12-22 International Business Machines Corporation Visualizing and manipulating multidimensional OLAP models graphically
US20050283488A1 (en) * 2004-06-22 2005-12-22 International Business Machines Corporation Model based optimization with focus regions
US8924343B2 (en) 2005-03-23 2014-12-30 International Business Machines Coporation Method and system for using confidence factors in forming a system
US20070250476A1 (en) * 2006-04-21 2007-10-25 Lockheed Martin Corporation Approximate nearest neighbor search in metric space
US20090024551A1 (en) * 2007-07-17 2009-01-22 International Business Machines Corporation Managing validation models and rules to apply to data sets
US8401987B2 (en) 2007-07-17 2013-03-19 International Business Machines Corporation Managing validation models and rules to apply to data sets
US20090150391A1 (en) * 2007-12-05 2009-06-11 Fujitsu Limited Rare pattern extracting device and rare pattern extracting method
US8086626B2 (en) * 2007-12-05 2011-12-27 Fujitsu Limited Rare pattern extracting device and rare pattern extracting method
US20090327208A1 (en) * 2008-06-30 2009-12-31 International Business Machines Corporation Discovering transformations applied to a source table to generate a target table
US9720971B2 (en) 2008-06-30 2017-08-01 International Business Machines Corporation Discovering transformations applied to a source table to generate a target table
US20100030719A1 (en) * 2008-07-10 2010-02-04 Covey Todd M Methods and apparatus related to bioinformatics data analysis
US8290955B2 (en) * 2008-09-18 2012-10-16 International Business Machines Corporation Classification of data in a hierarchical data structure
US20100070505A1 (en) * 2008-09-18 2010-03-18 International Business Machines Corporation Classification of Data in a Hierarchical Data Structure
US8930303B2 (en) 2012-03-30 2015-01-06 International Business Machines Corporation Discovering pivot type relationships between database objects
US20150026555A1 (en) * 2013-07-16 2015-01-22 Fujitsu Limited Data output method, computer-readable recording medium storing data output program and data output system
US9720959B2 (en) * 2013-07-16 2017-08-01 Fujitsu Limited Data output method, computer-readable recording medium storing data output program and data output system
US20150026217A1 (en) * 2013-07-16 2015-01-22 Fujitsu Limited Data output method, computer-readable recording medium storing data output program and data output system
US9727662B2 (en) * 2013-07-16 2017-08-08 Fujitsu Limited Data output method, computer-readable recording medium storing data output program and data output system
US20160179903A1 (en) * 2014-12-23 2016-06-23 Ran Bittmann Enhancing frequent itemset mining
US9672495B2 (en) * 2014-12-23 2017-06-06 Sap Se Enhancing frequent itemset mining
US20190191004A1 (en) * 2017-05-23 2019-06-20 Hitachi ,Ltd. System and method to reduce network traffic and load of host servers

Also Published As

Publication number Publication date
TW505868B (en) 2002-10-11
JP3575602B2 (ja) 2004-10-13
DE69809964T2 (de) 2003-08-28
KR100382296B1 (ko) 2003-05-09
CN1138222C (zh) 2004-02-11
EP1034489B1 (en) 2002-12-04
HK1033987A1 (en) 2001-10-05
AU9272698A (en) 1999-05-24
HUP0100161A2 (hu) 2001-05-28
JP2001522095A (ja) 2001-11-13
PL340380A1 (en) 2001-01-29
CA2304646C (en) 2003-10-28
AU750629B2 (en) 2002-07-25
DE69809964D1 (de) 2003-01-16
CZ294171B6 (cs) 2004-10-13
KR20010031687A (ko) 2001-04-16
EP1034489A1 (en) 2000-09-13
CA2304646A1 (en) 1999-05-14
CN1278345A (zh) 2000-12-27
HUP0100161A3 (en) 2004-03-01
WO1999023577A1 (en) 1999-05-14
ES2184322T3 (es) 2003-04-01
CZ20001630A3 (en) 2001-05-16

Similar Documents

Publication Publication Date Title
US6092064A (en) On-line mining of quantitative association rules
US5920855A (en) On-line mining of association rules
US6094645A (en) Finding collective baskets and inference rules for internet or intranet mining for large data bases
US6643646B2 (en) Analysis of massive data accumulations using patient rule induction method and on-line analytical processing
Sumathi et al. Introduction to data mining and its applications
US5724573A (en) Method and system for mining quantitative association rules in large relational tables
Hossain et al. Market basket analysis using apriori and FP growth algorithm
US7818286B2 (en) Computer-implemented dimension engine
Lisnawati et al. Data Mining with Associated Methods to Predict Consumer Purchasing Patterns.
Hilderman et al. Mining association rules from market basket data using share measures and characterized itemsets
Sinthuja et al. Comparative Analysis of Association Rule Mining Algorithms in Mining Frequent Patterns.
Hong et al. Linguistic object-oriented web-usage mining
Sumathi et al. Data warehousing, data mining, and OLAP
Liu et al. Development of E-Commerce Recommendation System Based on FP-Growth Algorithm
Mena Machineā€learning the business: Using data mining for competitive intelligence
Banek et al. Distributed architecture for association rule mining
Rashidi et al. Factors affecting the Increase and Decrease student
Gupta Exploring Corporate Assets: Mining for the Corporate-Data Warehousing Gems
Jorge et al. A Methodology for exploring association models
Win et al. Implementation of Pharmacy Sale System Using Frequent Pattern Growth Algorithm
ZHI-HANG A Comparative Study of Association Rules Mining Algorithms
Goulbourne Knowledge Discovery in Databases
Chandar et al. Market Basket Analysis in the Library Circulated data using Apriori Algorithm
Dudgikar A layered optimizer for mining association rules over relational database management systems
Thilagavathi et al. High utility text mining using zealous algorithm

Legal Events

Date Code Title Description
AS Assignment

Owner name: IBM CORPORATION, NEW YORK

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:AGGARWAL, CHARU CHANDRA;YU, PHILIP SHI-LUNG;REEL/FRAME:008873/0246

Effective date: 19971104

FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

STCF Information on status: patent grant

Free format text: PATENTED CASE

FEPP Fee payment procedure

Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FPAY Fee payment

Year of fee payment: 4

FPAY Fee payment

Year of fee payment: 8

REMI Maintenance fee reminder mailed
FPAY Fee payment

Year of fee payment: 12

SULP Surcharge for late payment

Year of fee payment: 11