CN105260442A - Bit operation and inverted index based association rule mining algorithm - Google Patents

Bit operation and inverted index based association rule mining algorithm Download PDF

Info

Publication number
CN105260442A
CN105260442A CN201510644226.XA CN201510644226A CN105260442A CN 105260442 A CN105260442 A CN 105260442A CN 201510644226 A CN201510644226 A CN 201510644226A CN 105260442 A CN105260442 A CN 105260442A
Authority
CN
China
Prior art keywords
algorithm
item
inverted index
database
candidate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510644226.XA
Other languages
Chinese (zh)
Other versions
CN105260442B (en
Inventor
黄玉蕾
郭运凯
陈群英
林青
李艳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xian Peihua University
Original Assignee
Xian Peihua University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xian Peihua University filed Critical Xian Peihua University
Priority to CN201510644226.XA priority Critical patent/CN105260442B/en
Publication of CN105260442A publication Critical patent/CN105260442A/en
Application granted granted Critical
Publication of CN105260442B publication Critical patent/CN105260442B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/353Clustering; Classification into predefined classes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/316Indexing structures

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention discloses a bit operation and inverted index based association rule mining algorithm, and provides a bit operation and inverted index based frequent item set mining algorithm Apriori-BR by analysis on a classical Apriori algorithm. According to the Apriori algorithm, a database needs to be repeatedly scanned to calculate a support degree of a candidate item set, and different items also need to be repeatedly compared to determine whether the items exist during generation of the candidate item set, so that efficiency of the algorithm is influenced. According to the improved Apriori-BR algorithm, the database only needs to be scanned for twice in the mining process; and the improved Apriori-BR algorithm is used for obtaining a 1-item frequent item set and establishing inverted indexes of frequent items. The bit operation is utilized to accelerate detection of a subset so as to promote a connection speed. The support degree of the candidate item set can be obtained only by carrying out the bit operation on a column vector of each item of the candidate item set without scanning the database again, and in the calculation process, a non-frequent item set can be discovered as soon as possible to end the remaining operation, thereby better improving algorithm efficiency.

Description

A kind of association rules mining algorithm based on bit arithmetic and inverted index
Technical field
The present invention relates to a kind of database algorithms, particularly relate to a kind of association rules mining algorithm based on bit arithmetic and inverted index.
Background technology
Apriori algorithm is one of the most influential algorithm in association rules mining algorithm.For finding out the correlation rule meeting the given min confidence of user and minimum support in data centralization.After Apriori algorithm proposes, a large amount of association rule algorithms is suggested, and the large class of these point countings two, the first kind take Apriori algorithm as the relevant innovatory algorithm of representative, and this kind of algorithm realization is simple, but needs Multiple-Scan database.Equations of The Second Kind algorithm is the FP-Growth and innovatory algorithm thereof that propose based on Han [2], and this kind of algorithm does not produce Candidate itemsets, and adopts FP-Tree compressed database, and advantage is that efficiency of algorithm is high, and shortcoming is that algorithm realization is more complicated.For the shortcoming of class Apriori algorithm, scholar proposes multiple optimized algorithm measure: to number decoder and beta pruning, bottom-up matrix with delete affairs, reduce and be connected number of times, delete affairs.These algorithms optimize traditional Apriori algorithm in performance, and meanwhile, its algorithm complexity also improves thereupon.
Apriori algorithm adopts Bottom-up approach to complete the excavation of rule, namely utilizes k-item collection to produce (k+1)-item collection.For raising the efficiency, Apriori algorithm make use of a critical nature: in frequent item set, any subset is frequent item set, thus significantly reduces the search volume of algorithm.Its detailed step sees reference document [1].
Traditional Apriori algorithm weak point is as follows:
(1) produce time Candidate itemsets, need repeatedly to compare to judge whether condition of contact meets.
(2) subset judging a candidate whether time need repeatedly to compare.
(3) once must could obtain the support of Candidate itemsets by full dose scan database, each candidate needs full dose scan database once.
Summary of the invention
Object of the present invention is just to provide to solve the problem a kind of association rules mining algorithm based on bit arithmetic and inverted index.
The present invention is achieved through the following technical solutions above-mentioned purpose:
The present invention includes bit arithmetic, set up inverted index and be connected fast, described bit arithmetic comprises the following steps:
(1) column vector of each Ij in transaction database D is defined as R j=(t 1j, t 2j..., t nj), wherein n is the affairs sum of transaction database D, if the transaction packet of Tid=m is containing item I j, then t mj=1, no t mj=0, to transaction database D, the column vector of item c is Rc=(111011) t, after introducing vector storage, then bit arithmetic can be adopted to carry out the computing of support number and subset detection;
(2) support number of computational item Ij and Ik, is equivalent to compute vector R jand R kthe number of " 1 " is comprised after step-by-step "AND";
(3) vectorial R is judged jwhether R ksubset, be equivalent to and judge R j==(R jaMP.AMp.Amp R k), wherein " & " is step-by-step AND operation;
Described inverted index of setting up is: first carry out the scanning of full storehouse to database, and comparison is one by one carried out to every affairs, in the hope of its support number, thus increase the complexity of algorithm greatly, if the affairs number of database D is N, average transaction length is m, and when a calculating Candidate itemsets Ck, the complexity of algorithm is O (k*m*N), introduce inverted index to each project for this reason, obtain the Tid of its correspondence, and divide into groups by the transaction length at Tid place, set up inverted index;
Described quick connection is: to k-item Frequent Item Sets, 2 parts are divided into store, by its structure called after first and second, wherein first stores front k-1 item, second stores kth item, and front k-1 item is stored in a vector, when connecting, if the first of two item collection is equal, and the position of second power is unequal, then the candidate that directly generation one is new, and the position less part of power is stored in second, candidate new is in this instance fdbe, first be 10110, second is 1 power; Adopt the representation based on bit string, when generating candidate, previous section being compared, only needing once-through operation to complete, without the need to carrying out independent searching to each item, comparing, thus improve the efficiency of join algorithm.
Beneficial effect of the present invention is:
The present invention is a kind of association rules mining algorithm based on bit arithmetic and inverted index, compared with prior art, and the present invention
Accompanying drawing explanation
Fig. 1 is the inverted index figure that transaction database of the present invention is corresponding;
Fig. 2 is comparison diagram working time of the lower three kinds of algorithms of accidents data set of the present invention;
Fig. 3 is comparison diagram working time of the lower three kinds of algorithms of mushroom data set of the present invention;
Fig. 4 is comparison diagram working time of the lower three kinds of algorithms of T70I4D100K data set of the present invention.
Embodiment
Below in conjunction with accompanying drawing, the invention will be further described:
For the improvement thought of algorithm is described, be described for a transaction database D here.Minsup=50%, then minimum support number=3.
Table 1 transaction database D
Then 1 Frequent Item Sets is L1={f:5, c:5, d:4, b:3, e:3}
The present invention includes bit arithmetic, set up inverted index and be connected fast, in order to promote the efficiency that subset detects, introducing bit arithmetic.Described bit arithmetic comprises the following steps:
(1) column vector of each Ij in transaction database D is defined as R j=(t 1j, t 2j..., t nj), wherein n is the affairs sum of transaction database D, if the transaction packet of Tid=m is containing item I j, then t mj=1, no t mj=0, to transaction database D, the column vector of item c is Rc=(111011) t, after introducing vector storage, then bit arithmetic can be adopted to carry out the computing of support number and subset detection;
(2) support number of computational item Ij and Ik, is equivalent to compute vector R jand R kthe number of " 1 " is comprised after step-by-step "AND";
(3) vectorial R is judged jwhether R ksubset, be equivalent to and judge R j==(R jaMP.AMp.Amp R k), wherein " & " is step-by-step AND operation;
Described inverted index of setting up is: first carry out the scanning of full storehouse to database, and comparison is one by one carried out to every affairs, in the hope of its support number, thus increase the complexity of algorithm greatly, if the affairs number of database D is N, average transaction length is m, and when a calculating Candidate itemsets Ck, the complexity of algorithm is O (k*m*N), introduce inverted index to each project for this reason, obtain the Tid of its correspondence, and divide into groups by the transaction length at Tid place, set up inverted index; Inverted index corresponding to the D of Transaction Information as shown in Figure 1.
The inverted index built at this is made up of 3 parts.Part 1 is the position power of each frequent item, for marking position during this frequent item 2 system bit representation below.Part 2 is frequent item head node, and its content comprises frequent item and support number thereof.3rd part is inverted index node, is made up of the Tid list of item length and affairs.Here, carry out point bucket to Tid by transaction length to preserve.
Quick connection: Apriori algorithm, when performing Connection Step and generating (k+1)-item Candidate itemsets, needs from L kin find two item collection l 1and l 2, note symbol l i [j]represent l ijth item, if l 1and l 2front k-1 item all identical, and meet l 1 [k]<l 2 [k].Then l 1and l 2attachable, connect l 1and l 2the result items produced is collection is l 1 [1]l 1 [2]... l 1 [k]l 2 [k].Can see from this process, in order to generate a k+1-item candidate, optimal situation also needs to compare k time, the worst situation is a k-item frequent item item, compare k time with each collection in Lk, finally all do not have that the match is successful, make the efficiency of algorithm very low like this.
Therefore, to k-item Frequent Item Sets, 2 parts are divided into store, by its structure called after first and second, wherein first stores front k-1 item, and second stores kth item, and front k-1 item is stored in a vector, such as to frequent item set A=(fdb), then first is (10100), and second is 2 (position power); Frequent item set B=(fde), then first is (10100), and second is 1 (position power); When connecting, if the first of two item collection is equal, and the position of second power is unequal, the candidate that then directly generation one is new, and the position less part of power is stored in second, candidate new is in this instance fdbe, first is 10110, second is 1 power; Adopt the representation based on bit string, when generating candidate, previous section being compared, only needing once-through operation to complete, without the need to carrying out independent searching to each item, comparing, thus improve the efficiency of join algorithm.
Relevant nature:
Character 1: if having an affairs T in transaction database D, its length is less than (k+1), then length is greater than to the Candidate itemsets of k, affairs T is without the need to scanning.
Character 2: one Item Sets support number in a database, the number of " 1 " after equaling the step-by-step "AND" of the column vector of all items of this Item Sets.
Do as one likes matter 2 is known, the support number of calculated candidate project, by full dose Multiple-Scan database, is converted into the step-by-step AND operation of vector.That is, its support number equals the column vector of first item, the number of " 1 " after successively carrying out step-by-step AND operation with other column vector.
Optimisation strategy:
Do as one likes matter 1 and character 2, propose following optimisation strategy:
Strategy 1: when column vector generates, when excavating k-item Frequent Item Sets, only needs to be not less than the inverted index list of k from the transaction length of each project to generate.
Strategy 2: when calculating the support number of k-item candidate, if the number of " 1 " is less than k after the column vector of certain two item carries out step-by-step AND operation, then the support number of this candidate item is less than k, is not Frequent Item Sets, can directly delete.
Strategy 3: adopt the computing of bit arithmetic accelerated evolution.
Arthmetic statement:
The treatment scheme of Apriori-BR algorithm is: full dose scan database once, obtains 1-item Frequent Set, scan database again, generates inverted index.Then perform following steps: from k=1, iteration produces the set C that length is the candidate of k+1 k+1.Concrete steps are as follows: the first step, and Lk is from connecting.From Lk, options integrates and connects the candidate of generation length as k+1, and it is all identical that the item collection participating in connecting must meet (k-1)-prefix.Second step is beta pruning, and Candidate itemsets is being added C k+1time, if its k length be the subset of k not all in Lk, then delete this Candidate itemsets.When computational length is the support of the candidate of k+1, strategy 2 is adopted to carry out beta pruning.After beta pruning, all item of the k+1 frequently collection excavated are added in Lk+1.This process is until C k+1terminate when equaling empty set.
Algorithm Apriori-BR inputs: transaction database D, and minimum support number threshold value minsup exports: Frequent Item Sets L
Algorithm example:
To the database D of Section 2, if minimum support threshold value is 50%, then minimum support number minsup=3, its mining process is as follows.
First scan 2 secondary data storehouses, generate inverted index as shown in Figure 1, and obtain 1-item Frequent Item Sets L1={f:5, c:5, d:4, b:3, e:3}.
Then start to connect, generate 2-item Candidate itemsets C2.When generating C2, for project f and c.F.first=(00000), f.second=16, c.fitst=(00000), c.second=8, meets f.first=c.first, then connect generation 2 Candidate itemsets { fc}, { fc}.first=(10000), { fc}.second=8.In like manner generate other all Candidate itemsets C2={fc, fd, fb, fe, cd, cb, ce, db, de, be}.When obtaining support, so that { fc} is with { fb} is described.To { inverted index of fc}, project f is (4,1,3,5,2) corresponding column vector is R1=(111110) t, the inverted index of project c is (1,3,5,2), and corresponding column vector is R2=(111010) t, R1 & R2=(111010) t, (R1 & R2) .count=4>3 is frequent item; To { inverted index of fb}, project b is (4,6,2), and corresponding column vector is R3=(010101) t, then R1 & R3=(010100) t, (R1 & R3) .count=2<3, is not Frequent Item Sets, then deletes.Excavate successively, obtain 2-item Frequent Item Sets L2={fc:4, fd:4cd:4}; In like manner excavate 3 Frequent Item Sets L3={fcd:4}, algorithm terminates.
Experimental result and analysis:
In order to the performance of test optimization algorithm, test under the environment of CPUi5-4200u2.5GHz, 2048MBDDRRAM, WindowsXPProfessionalSP2.Achieve Apriori with VC++6.0, Apriori-BR, Fp-growth algorithm, experimental data divides two kinds, True Data (accidents, mushroom) and generated data (T70I20D100K).Wherein True Data takes from the database of FIMI (http://fimi.ua.ac.be/data/).Generated data is generated by the maker in IBMAlmaden laboratory.
Accidents data set comprises the traffic hazard record in 1991 to 2000 Flanders areas.The size of this data set is 34678KB, totally 340184 affairs, has the item that 572 different, and average every bar transaction packet is containing 45 items.Mushroom is 557KB, and project adds up to 119, records several 8124, and average every bar transaction packet contains 23 different items.T70I20D100K represents that project adds up to 70, and average transaction length is 20, and record number is 100000.
Test result wherein on accidents data set as shown in Figure 2.Test result on mushroom database as shown in Figure 3.Test result on T70I20D100K database as shown in Figure 4.
Table 2 test database describes
As can be seen from experimental result, in different tests, working time of algorithm with minimum support diminish and elongated gradually.Can find out in fig. 2, the amplitude of variation of the working time of tradition Apriori algorithm is more much larger than two other algorithm, trace it to its cause, when support is less, Apriori generates a large amount of Candidate itemsets, and each candidate needs full dose scan database to obtain support, the plenty of time is consumed the connection at candidate, in the acquisition of subset test and support.FP-Growth algorithm in an experiment performance is more excellent, and it adopts the size of FP-Tree significantly compressed database, and without the need to generating Candidate itemsets, makes it on Dense Databases, play good performance.Apriori-BR algorithm in this paper obtains good performance, mainly be only to need scanning 2 secondary data storehouse, set up inverted index, and utilize bit arithmetic to accelerate the detection of subset, in the support number process obtaining Candidate itemsets, can obtain without the need to scan database, and stop remaining comparison process immediately after detecting Infrequent item-set, reduce algorithm complex, obtain good performance.
Conclusion
The present invention, by analysis to apriori traditional, proposes a kind of Frequent Itemsets Mining Algorithm Apriori-BR based on bit arithmetic and inverted index.Whether Apriori algorithm needs repeatedly scan database so that the support of calculated candidate item collection, and also need to exist different contrast projects repeatedly when generating Candidate itemsets, have impact on the efficiency of algorithm.Innovatory algorithm Apriori-BR algorithm only needs scanning 2 secondary data storehouse in mining process, for obtaining 1-item Frequent Item Sets, and sets up the inverted index of frequent episode.Utilize bit arithmetic to accelerate the detection of subset, promote connection speed.When obtaining the support of Candidate itemsets, without the need to scan database again, only need carry out bit arithmetic to each column vector of Candidate itemsets can obtain, and can find Infrequent item-set early in computation process, stop remaining computing, thus better boosting algorithm efficiency.
More than show and describe ultimate principle of the present invention and principal character and advantage of the present invention.The technician of the industry should understand; the present invention is not restricted to the described embodiments; what describe in above-described embodiment and instructions just illustrates principle of the present invention; without departing from the spirit and scope of the present invention; the present invention also has various changes and modifications, and these changes and improvements all fall in the claimed scope of the invention.Application claims protection domain is defined by appending claims and equivalent thereof.

Claims (1)

1., based on an association rules mining algorithm for bit arithmetic and inverted index, it is characterized in that: comprise bit arithmetic, set up inverted index and be connected fast, described bit arithmetic comprises the following steps:
(1) column vector of each Ij in transaction database D is defined as R j=(t 1j, t 2j..., t nj), wherein n is the affairs sum of transaction database D, if the transaction packet of Tid=m is containing item I j, then t mj=1, no t mj=0, to transaction database D, the column vector of item c is Rc=(111011) t, after introducing vector storage, then bit arithmetic can be adopted to carry out the computing of support number and subset detection;
(2) support number of computational item Ij and Ik, is equivalent to compute vector R jand R kthe number of " 1 " is comprised after step-by-step "AND";
(3) vectorial R is judged jwhether R ksubset, be equivalent to and judge R j==(R jaMP.AMp.Amp R k), wherein " & " is step-by-step AND operation;
Described inverted index of setting up is: first carry out the scanning of full storehouse to database, and comparison is one by one carried out to every affairs, in the hope of its support number, thus increase the complexity of algorithm greatly, if the affairs number of database D is N, average transaction length is m, and when a calculating Candidate itemsets Ck, the complexity of algorithm is O (k*m*N), introduce inverted index to each project for this reason, obtain the Tid of its correspondence, and divide into groups by the transaction length at Tid place, set up inverted index;
Described quick connection is: to k-item Frequent Item Sets, 2 parts are divided into store, by its structure called after first and second, wherein first stores front k-1 item, second stores kth item, and front k-1 item is stored in a vector, when connecting, if the first of two item collection is equal, and the position of second power is unequal, then the candidate that directly generation one is new, and the position less part of power is stored in second, candidate new is in this instance fdbe, first be 10110, second is 1 power; Adopt the representation based on bit string, when generating candidate, previous section being compared, only needing once-through operation to complete, without the need to carrying out independent searching to each item, comparing, thus improve the efficiency of join algorithm.
CN201510644226.XA 2015-10-08 2015-10-08 A kind of association rules mining algorithm based on bit arithmetic and inverted index Active CN105260442B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510644226.XA CN105260442B (en) 2015-10-08 2015-10-08 A kind of association rules mining algorithm based on bit arithmetic and inverted index

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510644226.XA CN105260442B (en) 2015-10-08 2015-10-08 A kind of association rules mining algorithm based on bit arithmetic and inverted index

Publications (2)

Publication Number Publication Date
CN105260442A true CN105260442A (en) 2016-01-20
CN105260442B CN105260442B (en) 2018-11-02

Family

ID=55100133

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510644226.XA Active CN105260442B (en) 2015-10-08 2015-10-08 A kind of association rules mining algorithm based on bit arithmetic and inverted index

Country Status (1)

Country Link
CN (1) CN105260442B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106022886A (en) * 2016-05-25 2016-10-12 黑龙江大学 Method of generating association rules based on bit combination
CN110489411A (en) * 2019-07-11 2019-11-22 齐鲁工业大学 A kind of association rule mining method based on virtual value storage and operation mode
WO2020147335A1 (en) * 2019-01-16 2020-07-23 苏宁云计算有限公司 Method and system for clustering member data on electronic commerce platform

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104715073A (en) * 2015-04-03 2015-06-17 江苏物联网研究发展中心 Association rule mining system based on improved Apriori algorithm

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104715073A (en) * 2015-04-03 2015-06-17 江苏物联网研究发展中心 Association rule mining system based on improved Apriori algorithm

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
CH.BHAVANI等: "Improving Efficiency of Apriori Algorithm", 《INTERNATIONAL JOURNAL OF COMPUTER TRENDS AND TECHNOLOGY (IJCTT)》 *
JAISHREE SINGH 等: "Improving Efficiency of Apriori Algorithm Using Transaction Reduction", 《INTERNATIONAL JOURNAL OF SCIENTIFIC AND RESEARCH PUBLICATIONS》 *
傅向华 等: "基于倒排索引位运算的深度优先频繁项集挖掘", 《小型微型计算机系统》 *
刘敏娴 等: "基于频繁矩阵的Apriori算法改进", 《计算机工程与设计》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106022886A (en) * 2016-05-25 2016-10-12 黑龙江大学 Method of generating association rules based on bit combination
WO2020147335A1 (en) * 2019-01-16 2020-07-23 苏宁云计算有限公司 Method and system for clustering member data on electronic commerce platform
CN110489411A (en) * 2019-07-11 2019-11-22 齐鲁工业大学 A kind of association rule mining method based on virtual value storage and operation mode
CN110489411B (en) * 2019-07-11 2023-08-22 齐鲁工业大学 Association rule mining method based on effective value storage and operation mode

Also Published As

Publication number Publication date
CN105260442B (en) 2018-11-02

Similar Documents

Publication Publication Date Title
Papapetrou et al. Discovering frequent arrangements of temporal intervals
WO2021088385A1 (en) Online log analysis method, system, and electronic terminal device thereof
CN105306475B (en) A kind of network inbreak detection method based on Classification of Association Rules
Tatti et al. Mining closed strict episodes
CN108446540A (en) Program code based on source code multi-tag figure neural network plagiarizes type detection method and system
Liu Study on application of apriori algorithm in data mining
CN107291877A (en) A kind of Mining Frequent Itemsets based on Apriori algorithm
CN105260442A (en) Bit operation and inverted index based association rule mining algorithm
Van Den Brink et al. Quality assessment for embedded SQL
Argyros et al. Efficient subsequence matching in time series databases under time and amplitude transformations
CN104376261B (en) A kind of method of the automatic detection malicious process under evidence obtaining scene
Woo et al. EstMax: Tracing maximal frequent item sets instantly over online transactional data streams
CN110489448A (en) The method for digging of big data correlation rule based on Hadoop
Flouris et al. Issues in complex event processing systems
Gopalan et al. Treeitl-mine: Mining frequent itemsets using pattern growth, tid intersection, and prefix tree
Luo et al. Research on mining positive and negative association rules
CN108228607B (en) Maximum frequent item set mining method based on connectivity
Cule et al. Mining interesting itemsets in graph datasets
Chang et al. A weight-order-based lattice algorithm for mining maximal weighted frequent patterns over a data stream sliding window
Chawla et al. Implementation of association rule mining using reverse apriori algorithmic approach
Shen et al. A recycle technique of association rule for missing value completion
Song et al. A sequential pattern mining algorithm for misuse intrusion detection
Rajeswari et al. Mining Association Rules Using Hash Table
Tumasonis et al. The Probabilistic Algorithm for Mining Frequent Sequences.
Kolli Mining for Frequent Community Structures using Approximate Graph Matching

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant