CN106296401A - A kind of Strong association rule method for digging understood for stock market's operation logic - Google Patents

A kind of Strong association rule method for digging understood for stock market's operation logic Download PDF

Info

Publication number
CN106296401A
CN106296401A CN201611038982.9A CN201611038982A CN106296401A CN 106296401 A CN106296401 A CN 106296401A CN 201611038982 A CN201611038982 A CN 201611038982A CN 106296401 A CN106296401 A CN 106296401A
Authority
CN
China
Prior art keywords
stock
association rule
rule
strong association
item
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201611038982.9A
Other languages
Chinese (zh)
Inventor
吴梅红
洪志令
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201611038982.9A priority Critical patent/CN106296401A/en
Publication of CN106296401A publication Critical patent/CN106296401A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/04Trading; Exchange, e.g. stocks, commodities, derivatives or currency exchange

Abstract

The invention discloses a kind of Strong association rule method for digging understood for stock market's operation logic.The main thought of method is: after the amount of increase and amount of decrease numerical quantization of all stocks, and these values regard the set of correlation rule middle term as, and the quantized value of the recent adjacent short time sequence of every stock then regards the transaction of affairs as;Then with the thought of association rule algorithm, all stock certificate datas are added up and rule discovery;Finally obtain and meet support threshold and the of a relatively high Strong association rule of confidence level and analyzed, thus the operation logic of stock market is understood and makes explanations.The inventive method is with the difference of traditional association rule and method: the item of traditional association rule is unordered set, and what we's rule requirement was ordered into.It is actual that the inventive method combines stock, can relatively profound understanding stock market operation logic, provide decision support for Stock-operation.

Description

A kind of Strong association rule method for digging understood for stock market's operation logic
Technical field
The present invention relates to stock certificate data digging technology field, especially relate to a kind of for stock market's operation logic understand strong Association rule mining method.
Background technology
Stock market is always numerous scholars and the object of study of investor, the prediction of stock price tendency since setting up It it is the problem of investment and security theoretical circles common concern.Due to various by domestic and international political economy environment and enterprise self etc. The impact of factor, among stock price is always at constantly fluctuating, the influence mode of various factors is the most extremely complex, so stock Price ups and downs are unordered, have the biggest random walk.
Stock price is conducted in-depth research by a large amount of scholars, and its prediction principle is: by history and current stock Data are analyzed, and are predicted the price trend that stock is following, thus provide reference frame for stock invester.Conventional stock valency Lattice Forecasting Methodology has: time series analysis method, neural net prediction method, regression analysis, time series smoothing techniques, trend are bent Line model method, Random time sequence Forecasting Methodology, Markov Pre-measurement and discriminant analysis predicted method etc..Share price internal system is tied The complexity of structure, the polytropy of external factor determine the arduousness of Stock Market Forecasting, existing analyzing and predicting method application effect Unsatisfactory.
Although having various factors to result in the bad predictability of stock, but we still can be by certain methods to stock Operation logic have individual basic insight.Stock market's operation logic described in the inventive method does not implies that the operating mechanism of stock market, And the logic combined before and after referring to advance versus decline width, the most common chasing after is risen to kill and is fallen, i.e. rise sharply today, and tomorrow and then rises sharply Probability is the biggest;Otherwise today slumps, the probability slumped tomorrow is the biggest.
Summary of the invention
The invention discloses a kind of Strong association rule method for digging understood for stock market's operation logic.The main think of of method Think be: after the amount of increase and amount of decrease numerical quantization of all stocks, these values regard the set of correlation rule middle term as, and every stock The quantized value of recent adjacent short time sequence then regards the transaction of affairs as;Then with the thought pair of association rule algorithm All stock certificate datas carry out adding up and rule discovery;Finally obtain and meet support threshold and of a relatively high the closing by force of confidence level Connection rule is also analyzed, thus understands the operation logic of stock market and make explanations.
The inventive method is with the difference of traditional association rule and method: the item of traditional association rule is unordered collection Close, and what we's rule requirement was ordered into.The inventive method combines the reality of stock, can run relatively profound understanding stock market Logic, provides decision support for Stock-operation.
Assume that stock list is S, S=[S1, S2,…,Si,…,Sm], m is the quantity of stock in stock pond, in China The quantity of city's stock or the quantity of listed stock of the U.S..The inventive method learns common rule for all of stock.This The step of bright method is as follows:
(1) the ups and downs amplitude of all stocks is quantified;
(2) adjacency to all stocks carries out orderly frequent item set statistics;
(3) stock market's Strong association rule is extracted based on support and confidence level sequence;
(4) stock market's operation logic understood and make explanations.
Wherein, the ups and downs amplitude to all stocks of step (1) quantifies, particularly as follows: for every stock, obtain certain Amount of increase and amount of decrease data (such as on January 1st, 2005) since individual time point, then quantify ups and downs amplitude, i.e. to ups and downs amplitude Carry out the operation rounded up, be transformed to the integer value that [-10,10] are interval;Every stock in last stock pond is all transformed to Comprise the integer-valued array of advance versus decline.
Wherein, the adjacency to all stocks of step (2) carries out orderly frequent item set statistics, particularly as follows: with all stocks The amount of increase and amount of decrease integer value of ticket is object of statistics, simultaneously on the basis of requiring to obtain adjacent quantization value in order, carries out k-item collection Frequency statistics, k here takes 1 to 5.When carrying out k-item collection statistics, first one k dimension group of definition, the most one-dimensional is a length of 21, then travel through the amount of increase and amount of decrease integer value array of all stock, take k adjacent value every time, the common appearance that this k is worth Number of times carries out cumulative statistics, and puts in the groove that k dimension group is corresponding.
Wherein, stock market's Strong association rule is extracted in sorting based on support and confidence level of step (3), particularly as follows: based on often Group k item collection and the statistics of k-1 item collection, calculate the support under often organizing k item collection and confidence level, and k takes 2 to 5 here.Under k item collection Support be the ratio that the secondary numerical value in the k each groove of dimension group accounts for all number of times sums;The molecule of confidence level is the system of k-item collection Metering number, denominator is that k-item collection removes tail Xiang Houqi remainder in the k-1 item corresponding statistics number of concentration.To more than minimum support Frequent item set sort from big to small according to confidence value, the rule corresponding to front t confidence value is Strong association rule.
For often organizing k item collection, the Strong association rule of acquisition is divided into former piece and consequent, after former piece refers to that tail item removed by k item collection Residue other, consequent then refers to tail item, i.e. last.
Wherein, stock market's operation logic being understood and makes explanations, particularly as follows: obtain according to previous step of step (4) The Strong association rule of number of packages before the various differences taken, learns these rules, selects significant rule, and when stock to be predicted is current When recent transaction data matches the operation logic of stock market, i.e. before and after advance versus decline width, combine the upper rule extracted of arranging in pairs or groups Time, prompting user pays close attention to, and informs the follow-up possible combination of history run logic and the probability occurred thereof.
Accompanying drawing explanation
Fig. 1 is the flow chart of the Strong association rule method for digging that the present invention understands for stock market's operation logic.
Fig. 2 is sub-fraction Strong association rule based on the inventive method output.Here rule is to carry out based on stock The support obtained after data mining and the higher rule of confidence level.Owing to Strong association rule is more, the most only list little A part.Rule can be used for helping to understand the operation logic of stock market.
Detailed description of the invention
Below in conjunction with the accompanying drawings and example, the present invention is described in detail.
Correlation rule is the implications of shape such as X → Y, X and Y is called former piece and the consequent of correlation rule.Wherein, association , there is support and confidence level in rule XY.Correlation rule is defined as: assume I={I1, I2 ... .Im} is the set of item.Given one Individual transaction data base D, the most each affairs t are the nonvoid subsets of I, i.e. each transaction is corresponding with a unique TID. Correlation rule support (support) in D is the percentage ratio that in D, affairs comprise X, Y simultaneously, i.e. probability;Confidence level (confidence) it is in the case of affairs have comprised X in D, comprises the percentage ratio of Y, i.e. conditional probability.If item collection meets Minimum support threshold value, then this Xiang Jiwei frequent item set;If meeting minimum support threshold value and minimal confidence threshold, then recognize It is interesting for correlation rule.These threshold values are manually set according to excavating needs.
After the amount of increase and amount of decrease numerical quantization of all stocks, these values regard the set of correlation rule middle term as;And every stock The quantized value of ticket recent short time sequence is then considered as the transaction of affairs.It is actual that the inventive method combines stock, with All stock certificate datas are added up and rule discovery by association rule algorithm thought, obtain and meet support threshold and confidence Spend of a relatively high Strong association rule and analyzed, thus strengthening the understanding to stock market's operation logic.
The inventive method is with the difference of traditional association rule and method: the item of traditional association rule is unordered collection Close, and what we's rule requirement was ordered into.
Assume that stock list is S, S=[S1, S2,…,Si,…,Sm], m is the quantity of stock in stock pond, in China The quantity of city's stock or the quantity of listed stock of the U.S..Method learns common rule for all stocks.Stock market associates by force rule Mining process then is specific as follows.
One, the ups and downs amplitude of stock is quantified.
For every stock, amount of increase and amount of decrease data (such as on January 1st, 2005) since obtaining certain time point, then to rising Drop range value quantifies, the operation i.e. rounded up ups and downs amplitude, is transformed to the integer value that [-10,10] are interval;Finally Every stock in stock pond is all transformed to comprise the integer-valued array of advance versus decline.
Two, stock frequent item set statistics.
2.1 carry out 1-item collection statistics.Owing to stock has amount of increase and amount of decrease to limit, the integer value after quantization has 21 kinds of situations, i.e. [- 10,10] interval integer, therefore arranges an array having 21 grooves, travels through the amount of increase and amount of decrease integer value array of all stock, according to The integer-valued occurrence number of amount of increase and amount of decrease carries out cumulative statistics, and puts in the groove of correspondence.
2.2 carry out 2-item collection statistics.The two-dimensional array of one 21X21 is set, travels through the amount of increase and amount of decrease integer value of all stock Array, takes adjacent two value every time, the common number of times occurred of the two value is carried out cumulative statistics, and puts into two-dimensional array In corresponding groove.
2.3 carry out 3-item collection statistics.The three-dimensional array of one 21X21X21 is set, travels through the amount of increase and amount of decrease integer of all stock Value array, takes adjacent three value every time, the common number of times occurred of these three value is carried out cumulative statistics, and puts into three dimensions In the groove that group is corresponding.
2.4 carry out 4-item collection statistics.The four-dimensional array of one 21X21X21X21 is set, travels through advance versus decline width integer value Array, takes adjacent four value every time, the common number of times occurred of these four values is carried out cumulative statistics, and puts into four-dimensional array In corresponding groove.
2.5 carry out 5-item collection statistics.The five dimension groups of one 21X21X21X21X21 are set, travel through advance versus decline width integer Value array, takes adjacent five value every time, the common number of times occurred of these five values is carried out cumulative statistics, and puts into five dimensions In the groove that group is corresponding.
Owing to stock certificate data is limited, when typically proceeding to 5-item collection statistics, the frequency of occurrences of co-occurrence tuple is the least , the higher-dimension array that statistics obtains is the most sparse, carries out necessity of higher item collection statistics the most again.
Three, stock market's Strong association rule is extracted.
Due to the particularity of stock application, be only concerned inside frequent item set last, i.e. next amount of increase and amount of decrease goes out the day of trade Existing probability, is therefore different from traditional association rules mining algorithm, and the rear number of packages of the rule extracted here is all fixed as 1.
3.1 obtain 2-frequent item set and Strong association rule.2-item collection is calculated support, i.e. calculates two-dimensional array each Secondary numerical value in groove accounts for the ratio of all number of times sums.Set a support threshold, filter out support less than support threshold Groove, residual term collection is frequent item set.For each 2-item frequent item set, the residual term removing last is concentrated at 1-item Obtain corresponding statistics number, the statistics number of 2-item collection divided by corresponding 1-item collection statistics number confidence level.To obtaining The all confidence values obtained sort from big to small, and the rule corresponding to front t confidence value is Strong association rule.
Strong association rule the following table is some the 1-former pieces obtained.
3.2 obtain 3-frequent item set and Strong association rule.3-item collection is calculated support, i.e. calculates three-dimensional array each Secondary numerical value in groove accounts for the ratio of all number of times sums.Set a support threshold, filter out support less than support threshold Groove, residual term collection is frequent item set.For each 3-item frequent item set, the residual term removing last is concentrated at 2-item Obtain corresponding statistics number, the statistics number of 3-item collection divided by corresponding 2-item collection statistics number confidence level.To obtaining The all confidence values obtained sort from big to small, and the rule corresponding to front t confidence value is Strong association rule.
Strong association rule the following table is some the 2-former pieces obtained.
3.3 obtain 4-frequent item set and Strong association rule.4-item collection is calculated support, i.e. calculates four-dimensional array each Secondary numerical value in groove accounts for the ratio of all number of times sums.Set a support threshold, filter out support less than support threshold Groove, residual term collection is frequent item set.For each 4-item frequent item set, the residual term removing last is concentrated at 3-item Obtain corresponding statistics number, the statistics number of 4-item collection divided by corresponding 3-item collection statistics number confidence level.To obtaining The all confidence values obtained sort from big to small, and the rule corresponding to front t confidence value is Strong association rule.
Strong association rule the following table is some the 3-former pieces obtained.
3.4 obtain 5-frequent item set and Strong association rule.5-item collection is calculated support, i.e. calculates five dimension groups each Secondary numerical value in groove accounts for the ratio of all number of times sums.Set a support threshold, filter out support less than support threshold Groove, residual term collection is frequent item set.For each 5-item frequent item set, the residual term removing last is concentrated at 4-item Obtain corresponding statistics number, the statistics number of 5-item collection divided by corresponding 4-item collection statistics number confidence level.To obtaining The all confidence values obtained sort from big to small, and the rule corresponding to front t confidence value is Strong association rule.
Strong association rule the following table is some the 4-former pieces obtained.
Four, stock market's operation logic understands and explains.
According to the Strong association rule of number of packages before the various differences that previous step obtains, learn these rules, select significantly Rule.Knowable to the Strong association rule of number of packages before any of the above difference, generally stock market is to become surely, i.e. next friendship The amount of increase and amount of decrease integer value of Yi is all close to 0.When stock limit-up (amount of increase 10%), there is the biggest probability meeting next day of trade Continue limit-up;When stock limit down (drop range-10%), next day of trade there is the biggest probability and may proceed to limit down;This also illustrates Chasing after in stock market is risen and is killed that to fall be reasonable.Owing to Strong association rule is more, illustrate the most one by one.
Operation logic based on stock, i.e. stock Strong association rule, when the most recent transaction data of stock to be predicted When matching the operation logic of stock market, i.e. before and after advance versus decline width combination can arrange in pairs or groups go up extraction regular time, at this moment point out use Family is paid close attention to, and informs the follow-up possible combination of history run logic and the probability occurred thereof.
In sum, the invention discloses a kind of Strong association rule method for digging understood for stock market's operation logic.Side All stock certificate datas are added up and rule discovery by method with the thought of association rule algorithm.It is different from traditional association rule and method Item be unordered set, process require that what item was ordered into, and the consequent of rule only have an item.Method combines stock Reality, can relatively profound understanding stock market operation logic, for Stock-operation provide decision support.
The inventive method is similarly applied to security class and has the data of time series feature, such as fund, futures etc..Cause This, although disclosing the specific embodiments and the drawings of the present invention for the purpose of illustration, its object is to help to understand that the present invention's is interior Hold and implement according to this, but it will be appreciated by those skilled in the art that: without departing from the present invention and the essence of appended claim In god and scope, various replacements, to change and modifications be all impossible.Therefore, the present invention should not be limited to most preferred embodiment and Accompanying drawing disclosure of that.Presently disclosed embodiment should be understood illustrative in all respects rather than it be claimed The restriction of scope.

Claims (4)

1. the Strong association rule method for digging understood for stock market's operation logic, it is characterised in that described method includes as follows Step:
(1) the ups and downs amplitude of all stocks is quantified;
(2) adjacency to all stocks carries out orderly frequent item set statistics;
(3) stock market's Strong association rule is extracted based on support and confidence level sequence;
(4) stock market's operation logic understood and make explanations.
The Strong association rule method for digging understood for stock market's operation logic the most according to claim 1, it is characterised in that When all stocks are associated rule frequent episode statistics, the item of statistics is the quantized value of amount of increase and amount of decrease, and is by item Carry out what Ordered Statistic adjacent for k-completed.
The Strong association rule method for digging understood for stock market's operation logic the most according to claim 1, it is characterised in that During calculating correlation rule often organizes support and the confidence level of k item collection, it is only necessary to utilize k item collection and the statistics of k-1 item collection Result carries out simple division arithmetic and can be quickly completed.
The Strong association rule method for digging understood for stock market's operation logic the most according to claim 1, it is characterised in that In the extraction of Strong association rule, define that the consequent of rule has been only capable of an item, the most i.e. met next day of trade of stock Prediction need, the acquisition process of Strong association rule can be made again to be greatly simplified.
CN201611038982.9A 2016-11-24 2016-11-24 A kind of Strong association rule method for digging understood for stock market's operation logic Pending CN106296401A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611038982.9A CN106296401A (en) 2016-11-24 2016-11-24 A kind of Strong association rule method for digging understood for stock market's operation logic

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611038982.9A CN106296401A (en) 2016-11-24 2016-11-24 A kind of Strong association rule method for digging understood for stock market's operation logic

Publications (1)

Publication Number Publication Date
CN106296401A true CN106296401A (en) 2017-01-04

Family

ID=57720970

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611038982.9A Pending CN106296401A (en) 2016-11-24 2016-11-24 A kind of Strong association rule method for digging understood for stock market's operation logic

Country Status (1)

Country Link
CN (1) CN106296401A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108090220A (en) * 2017-12-29 2018-05-29 科大讯飞股份有限公司 Point of interest search sort method and system
CN109508386A (en) * 2018-11-07 2019-03-22 福建工程学院 A kind of relevance metric method of stock information press center word and related stock
CN109697619A (en) * 2017-10-20 2019-04-30 中移(苏州)软件技术有限公司 A kind of data analysing method and device
CN111046090A (en) * 2020-03-10 2020-04-21 深圳开源互联网安全技术有限公司 Vehicle data mining method and system based on vehicle-mounted self-organizing network

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109697619A (en) * 2017-10-20 2019-04-30 中移(苏州)软件技术有限公司 A kind of data analysing method and device
CN108090220A (en) * 2017-12-29 2018-05-29 科大讯飞股份有限公司 Point of interest search sort method and system
CN108090220B (en) * 2017-12-29 2021-05-04 科大讯飞股份有限公司 Method and system for searching and sequencing points of interest
CN109508386A (en) * 2018-11-07 2019-03-22 福建工程学院 A kind of relevance metric method of stock information press center word and related stock
CN111046090A (en) * 2020-03-10 2020-04-21 深圳开源互联网安全技术有限公司 Vehicle data mining method and system based on vehicle-mounted self-organizing network

Similar Documents

Publication Publication Date Title
Paithankar et al. Open pit mine production schedule optimization using a hybrid of maximum-flow and genetic algorithms
CN106296401A (en) A kind of Strong association rule method for digging understood for stock market's operation logic
CN107291895B (en) Quick hierarchical document query method
Francés et al. The cryptocurrency market: A network analysis
Guo Oil price forecast using deep learning and ARIMA
CN110826618A (en) Personal credit risk assessment method based on random forest
CN113052225A (en) Alarm convergence method and device based on clustering algorithm and time sequence association rule
Amayuelas et al. Neural methods for logical reasoning over knowledge graphs
CN111967971A (en) Bank client data processing method and device
CN108038211A (en) A kind of unsupervised relation data method for detecting abnormality based on context
CN106780018A (en) A kind of stock certificate data mining model reappeared based on history and system
Liu Incomplete big data imputation mining algorithm based on BP neural network
Li et al. Chart GCN: Learning chart information with a graph convolutional network for stock movement prediction
Gupta et al. A classification method to classify high dimensional data
Malik et al. A comprehensive approach towards data preprocessing techniques & association rules
Garciarena et al. Evolving imputation strategies for missing data in classification problems with TPOT
CN105678430A (en) Improved user recommendation method based on neighbor project slope one algorithm
Wang Predicting the rise and fall of Shanghai composite index based on artificial intelligence
CN113378842A (en) Recommendation method based on segmented image feature extraction
Yang RETRACTED ARTICLE: Financial Information Extraction Using the Improved Hidden Markov Model and Deep Learning
Iyer et al. Comparison of logistic regression and artificial neural network based bankruptcy prediction models
CN111984842A (en) Bank client data processing method and device
CN109766332A (en) Data processing empty value method, apparatus and terminal device
Khatun et al. An efficient genetic algorithm for discovering diverse-frequent patterns
Moodi et al. Evaluation of feature selection performance for identification of best effective technical indicators on stock market price prediction

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20170104