CN111798061A - Financial data prediction system based on block chain and cloud computing - Google Patents

Financial data prediction system based on block chain and cloud computing Download PDF

Info

Publication number
CN111798061A
CN111798061A CN202010652720.1A CN202010652720A CN111798061A CN 111798061 A CN111798061 A CN 111798061A CN 202010652720 A CN202010652720 A CN 202010652720A CN 111798061 A CN111798061 A CN 111798061A
Authority
CN
China
Prior art keywords
data
financial
financial data
subclass
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010652720.1A
Other languages
Chinese (zh)
Other versions
CN111798061B (en
Inventor
刘星
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen rongyida Technology Development Co.,Ltd.
Original Assignee
Yangpu Minoan Electronic Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yangpu Minoan Electronic Technology Co ltd filed Critical Yangpu Minoan Electronic Technology Co ltd
Priority to CN202010652720.1A priority Critical patent/CN111798061B/en
Publication of CN111798061A publication Critical patent/CN111798061A/en
Application granted granted Critical
Publication of CN111798061B publication Critical patent/CN111798061B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/004Artificial life, i.e. computing arrangements simulating life
    • G06N3/006Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/04Trading; Exchange, e.g. stocks, commodities, derivatives or currency exchange

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Development Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Technology Law (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Game Theory and Decision Science (AREA)
  • Evolutionary Biology (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)

Abstract

The financial data prediction system comprises a first data acquisition module, a second data acquisition module, a financial data preprocessing module, a block chain storage module, a financial data prediction module and a visualization module, wherein the first data acquisition module is used for acquiring historical financial time sequence data, the second data acquisition module is used for acquiring real-time financial time sequence data, the financial data preprocessing module is used for processing the financial data, and the block chain storage module is used for storing the processed historical financial time sequence data. The method determines the abnormal degree of the financial data to be detected by measuring the distribution relation of the financial data to be detected and the neighborhood financial data thereof on the whole by using the distribution detection coefficient, and determines the abnormal degree of the financial data to be detected by measuring the local relation between the financial data to be detected and the subclass in the neighborhood detection set by using the neighborhood detection coefficient, thereby effectively improving the detection precision of the noise data.

Description

Financial data prediction system based on block chain and cloud computing
Technical Field
The invention relates to the field of financial data processing, in particular to a financial data prediction system based on a block chain and cloud computing.
Background
In recent years, with the rapid development of economy, the trend of change of the financial market is more and more complex, understanding the mode of financial activities and predicting the development and change of the financial activities are important points of academic circles and industry research, the future trend of financial data is predicted, the development and change of the financial market at a macroscopic level are facilitated to be known, and a basis is provided for investors and companies to make trading decisions and plans at a microscopic level.
Disclosure of Invention
In view of the above problems, the present invention aims to provide a financial data prediction system based on a block chain and cloud computing.
The invention points improved based on the two major algorithms of the invention do not have singleness and belong to the same technical framework, so that the applicant applies for related contents of 2 technologies on the same day, the invention and the applicant's application on the same day, namely a financial data prediction system based on a block chain and big data share partial contents, the application mainly requests to protect the algorithm improvement and the whole of a preprocessing module, and the other application mainly requests to protect the algorithm improvement of a neural network of the prediction module.
The purpose of the invention is realized by the following technical scheme:
a financial data prediction system based on a block chain and cloud computing comprises a first data acquisition module, a second data acquisition module, a financial data preprocessing module, a block chain storage module, a financial data prediction module and a visualization module, wherein the first data acquisition module is used for collecting historical financial time sequence data and inputting the collected historical financial time sequence data to the financial data preprocessing module for processing, the processed historical financial time sequence data is transmitted to the block chain storage module for storage, the second data acquisition module is used for collecting real-time financial time sequence data and inputting the collected real-time financial time sequence data to the financial data preprocessing module for processing, the processed real-time financial time sequence data is input to the financial data prediction module, and the financial data prediction module predicts the future trend of the current financial data according to the input real-time sequence financial data, the visualization module is used for displaying the prediction result of the financial data prediction module.
Preferably, the financial data preprocessing module is configured to normalize the input financial time series data and remove noise data in the normalized financial time series data, where R denotes a normalized financial time series data set, and R ═ { R(s) ·, s ═ 1, 2,..,. m }, where R(s) denotes the s-th financial data in the financial time series data set R, and m denotes a total number of financial data in the financial time series data set R; let c(s) denote a neighborhood data set corresponding to financial data R(s), given a positive integer n, and 0 < n < m, n financial data closest to the financial data R(s) in the financial time series data set R is selected and added to the set c(s), and c(s) { R (e, s), e ═ 1, 2.., n }, where R (e, s) denotes the e-th financial data in the set c(s); screening and classifying the financial data in the set C(s), Cu(s) represents the u-th sub-category obtained by screening and classifying the financial data in the set C(s), and the financial data in the set C(s) is selected and added into the sub-category C in the following wayuIn(s), specifically:
given a distance threshold D, assuming that r (p, s) represents the p-th financial data in the set C(s) without any subclass added, and r (q, s) represents the q-th financial data in the set C(s) without any subclass added, then
Figure BDA0002575587370000021
Figure BDA0002575587370000022
Randomly selecting one financial data which is not added into any subclass from the set C(s) to be added into the subclass Cu(s) continuing to screen and classify the financial data in the set C(s) without any subclass, wherein r (v, s) represents the v-th financial data in the set C(s) without any subclass, and r (v, s) represents the v-th financial data in the set C(s) without any subclassu(d, s) represents subclass Cu(s) when the financial data r (v, s) satisfies:
Figure BDA0002575587370000023
then financial data r (v, s) is added to subclass Cu(s) and continuing to screen and sort financial data in the set C(s) that is not added with any subclass, when no new financial data is added into subclass CuIf the data is in(s), the data is stopped from being selected from the set C(s) and added to the subclass Cu(s) wherein η (r (v, s), r)u(d, s)) represents a value function, when | r |uWhen (D, s) -r (v, s) | is less than or equal to D, then eta (r (v, s), ru(d, s)) -0, when | r |u(d,s)-r(v,s) | > D, then η (r (v, s), ru(d,s))=1;
Let mu(s) represents subclass CuFinancial data in(s) when muWhen(s) is greater than or equal to 2, subclass C is retainedu(s) when muWhen(s) is 1, subclass C will beuThe financial data in(s) is deleted in the set C(s), and the method is adopted again to select the financial data in the set C(s) to be added into the subclass Cu(s) stopping screening and classifying the financial data in the set C(s) until all the financial data in the set C(s) are added into the subclasses;
let C'(s) denote the set of financial data in the set C(s) that has not been deleted after being filtered and classified, Cθ(s) represents the theta sub-class resulting from screening and classifying the financial data in the set C(s), and Cθ(s) is a subclass closest to the financial data r(s) among the subclasses obtained by screening and classifying the financial data in the set C(s), and
Figure BDA0002575587370000024
Figure BDA0002575587370000025
where m(s) represents the number of sub-categories into which the financial data in set C(s) is divided, ru(g, s) denotes subclass CuThe g-th financial data in(s); the threshold H (θ) is determined for a given neighborhood, and
Figure BDA0002575587370000026
Figure BDA0002575587370000027
wherein r isθ(ψ, s) represents the subclass CθThe ψ th financial data in(s); let C "(s) denote the neighborhood detection set of the financial data r(s), let r ' (a, s) denote the a-th financial data in the set C '(s), and when the financial data r ' (a, s) satisfies: when | < r(s) | ≦ H (θ), adding financial data r' (a, s) to the set C "(s), where C"(s) { r "(a, s), a ═ 1, 2,. and m"(s) }, where r "(a, s) represents the a-th financial data in the set C"(s), m "(s) represents the total number of financial data in the set C"(s), defining an abnormal value y(s) corresponding to the financial data r(s), and the expression of y(s) is:
y(s)=y1(s)*y2(s)
Figure BDA0002575587370000031
Figure BDA0002575587370000032
in the formula, y1(s) represents the distribution detection coefficient of the financial data r(s) in the set C'(s), y2(s) represents the neighborhood detection coefficients of financial data r(s) in set C'(s), ρ(s) represents the distribution coefficients of financial data r(s) in set C "(s), and
Figure BDA0002575587370000033
Figure BDA0002575587370000034
wherein, R (max) represents the maximum financial data value in the financial time sequence data set R, R (min) represents the minimum financial data value in the financial time sequence data set R, f1(ρ(s), H (ρ)) represents a first judgment function corresponding to the financial data r(s), where f (ρ(s), H (ρ)) > H (ρ) is 1, and when ρ(s) ≦ H (ρ), f (ρ(s), H (ρ)) is 0, and C "(a, s) represents the financial data r" (a, s)A neighborhood detection set, ρ (a, s) represents the distribution coefficient of the financial data r "(a, s) in the set C" (a, s),
Figure BDA0002575587370000035
where r '(b, a, s) represents the b-th financial data in the set C' (a, s), m '(a, s) represents the total number of financial data in the set C' (a, s), and f1(ρ (a, s), H (ρ)) represents a first judgment function corresponding to the financial data r ″ (a, s), and f (a, s) > H (ρ) is the first judgment function1(ρ (a, s), H (ρ)) -1, and when ρ (a, s) ≦ H (ρ), then f (p)) is set to 11(ρ (a, s), H (ρ)) -0, where H (ρ) is a given distribution detection threshold,
Figure BDA0002575587370000036
Figure BDA0002575587370000037
represents a second judgment function when
Figure BDA0002575587370000038
When it is, then
Figure BDA0002575587370000039
When in use
Figure BDA00025755873700000310
When it is, then
Figure BDA00025755873700000311
Represents subclass CθThe Lth financial data in(s), D (C)θ(s)) represents subclass Cθ(s) a corresponding class detection threshold, and
Figure BDA00025755873700000312
wherein r isθ(K, s) denotes subclass Cθ(s) the kth financial data;
when the abnormal value y(s) corresponding to the financial data R(s) is greater than 1, the financial data R(s) is judged to be noise data, the financial data R(s) is deleted in the financial time sequence data set R, when the abnormal value y(s) corresponding to the financial data R(s) is less than or equal to 1, the financial data R(s) is judged to be normal data, and the financial data R(s) is reserved in the financial time sequence data set R.
Preferably, the financial data prediction module predicts the future trend of the financial time series data by using a BP neural network, the financial data prediction module calls historical financial time series data stored in the block chain storage module to train the financial data future trend prediction on the BP neural network, the processed real-time financial time series data is used as the output value of the trained BP neural network, and the output value of the trained BP neural network is the predicted future trend of the financial data.
Preferably, in the training process of the BP neural network, the initial weight and the threshold of the BP neural network are optimized by adopting a particle swarm algorithm.
Preferably, the particles in the set of particles are updated using the following formula:
Vi(t+1)=ωi(t)Vi(t)+c1rand()(Pi(t)-Xi(t))+c2rand()(G(t)-Xi(t))
Xi(t+1)=Xi(t)+Vi(t+1)
in the formula, Xi(t +1) and Vi(t +1) denotes the position and step size of the particle i at the (t +1) th iteration, X, respectivelyi(t) and Vi(t) denotes the position and step size of the particle i at the t-th iteration, c1And c2Respectively, a learning factor, rand () a random number between (0, 1) randomly generated, Pi(t) represents the historical optimal solution, ω, for particle i at the t-th iterationi(t) represents an inertial weight factor for particle i at the tth iteration, g (t) represents a global reference solution for the particle population at the tth iteration, g (t) is determined in the following manner:
let P (t) denote the historical optimal solution set of particles in the particle swarm at the t-th iteration, and P (t) { P }i(t), i ═ 1, 2.., N }, where N represents the number of particles in the population, the historical optimal solutions in set p (t) are screened, and when the same historical optimal solution exists in set p (t), the historical optimal solution is screenedWhen only one of the same historical optimal solutions is reserved, deleting the other historical optimal solutions in the same historical optimal solutions, representing the filtered set P (t) as P ' (t), and setting P ' (t) ═ P (j, t), j ═ 1, 2,. and N ' (t) }, wherein P (j, t) represents the jth historical optimal solution in the set P ' (t), N ' (t) represents the number of historical optimal solutions in the set P ' (t), and the neighborhood detection distance corresponding to the historical optimal solution in the set P ' (t) is defined as d (t), and then the expression of d (t) is as follows:
Figure BDA0002575587370000041
in the formula (d)0Representing a given initial neighborhood detection distance, TmaxRepresenting a given maximum number of iterations;
detecting the historical optimal solution in the set P' (t), wherein O (j, t) is used for representing a neighborhood detection range corresponding to the historical optimal solution P (j, t), and the O (j, t) is a circular area which takes the historical optimal solution P (j, t) as a center and d (t) as a radius, and defines a global reference value corresponding to the historical optimal solution P (j, t) as
Figure BDA0002575587370000042
Then
Figure BDA0002575587370000043
The expression of (a) is:
Figure BDA0002575587370000044
in the formula (f)3(h (g (j, t))) represents a third value function corresponding to the historical optimal solution P (j, t), h (P (j, t)) represents a fitness function value corresponding to the historical optimal solution P (j, t),
Figure BDA0002575587370000051
represents the mean value of fitness function values corresponding to the historical optimal solution in the set P' (t)
Figure BDA0002575587370000052
When f is greater3(h (P (j, t)))) 1, when
Figure BDA0002575587370000053
When f is greater3(h (P (j, t))) + ∞, h (min, t) and h (max, t) respectively represent the minimum and maximum fitness function values corresponding to the particles in the particle swarm at the time of the tth iteration, (O (j, t)) represents the historical optimal number of solutions in the set P' (t) existing in the neighborhood detection range O (j, t), and α (t) and β (t) are weight coefficients, and
Figure BDA0002575587370000054
and selecting the historical optimal solution with the minimum global reference value in the set P' (t) as the global reference solution G (t).
Preferably, the inertial weight factor ω of the particle i at the t-th iterationiThe expression of (t) is:
Figure BDA0002575587370000055
where ω (start) represents the initial inertia weight factor value, and ω (start) is 0.9, and ω (end) represents the evolution of the particle population to the maximum number of iterations TmaxThe value of the inertial weight factor of time, and ω (end) is 0.4, h (X)i(t)) represents the position X of the particle i at the t-th iterationi(t) the corresponding fitness function value, and h (G (t)) represents the fitness function value corresponding to the global reference solution G (t) of the particle swarm in the t iteration.
The beneficial effects created by the invention are as follows: the financial data prediction system based on the block chain and cloud computing is provided, and the future trend of financial data is predicted by adopting a BP neural network, so that the development and change of a financial market can be known in time; the method comprises the steps of setting a financial data preprocessing module for removing noise data in financial time series data, selecting a certain amount of financial data closest to the financial data to be detected to construct a neighborhood data set corresponding to the financial data to be detected, screening and classifying the financial data in the neighborhood data set, removing isolated data in the neighborhood detection set, obtaining subclasses with similar data, selecting the subclasses closest to the financial data to be detected to construct a neighborhood detection set of the financial data to be detected, effectively avoiding influence on accuracy of noise detection on the financial data to be detected due to the noise data in the neighborhood detection set, ensuring similarity between the financial data to be detected and the neighborhood financial data in the neighborhood detection set, defining abnormal values corresponding to the financial data to be detected, and determining that the financial data to be detected is compared with the neighborhood financial data to be detected through distribution detection coefficients and neighborhood detection coefficients Fusing the abnormal degree of the data, wherein the distribution detection coefficient measures the distribution situation between the financial data and the neighborhood financial data thereof through the distribution coefficient, when the distribution coefficient of the financial data is smaller than a given distribution detection threshold, the distribution detection coefficient indicates that the financial data is closer to the distribution distance of the neighborhood financial data thereof, therefore, the financial data to be detected is directly judged to be normal data, when the distribution coefficient of the financial data to be detected is larger than the given distribution detection threshold, the distribution distance between the financial data to be detected and the neighborhood financial data thereof is farther, considering the situation that the density distribution of the financial data is larger, whether the financial data to be detected is in an area with larger density distribution is determined by calculating the distribution coefficient of the neighborhood financial data of the financial data to be detected, when the distribution coefficient of the neighborhood financial data with more density distribution is larger than the given distribution detection threshold, the financial data to be detected is indicated to be in an area with large density distribution, namely the financial data to be detected is judged to be normal data, when the financial data to be detected has fewer distribution coefficients of neighborhood financial data smaller than a given distribution detection threshold value, the abnormal condition of the financial data to be detected is further determined through the neighborhood detection coefficients, the neighborhood detection coefficients are further judged by measuring the abnormal condition of the sub-class to be detected closest to the financial data to be detected, when the minimum distance between the financial data to be detected and the fusion data in the sub-class closest to the financial data to be detected is larger than the class detection threshold value corresponding to the sub-class, the financial data to be detected is indicated to be outside the sub-class closest to the financial data to be detected, namely the financial data to be detected is judged to be noise data; in summary, the financial data preprocessing module determines the abnormal degree of the financial data to be detected by measuring the distribution relation between the financial data to be detected and the neighborhood financial data thereof on the whole by using the distribution detection coefficients, and the adopted distribution detection coefficients can effectively detect the abnormal of the financial data in different density areas; determining the abnormal degree of the financial data to be detected by measuring the local relation between the data to be detected and the subclasses in the neighborhood detection set by utilizing the neighborhood detection coefficient, thereby effectively improving the detection precision of the noise data; the initial weight and the threshold of the BP neural network are optimized by adopting a particle swarm algorithm, the prediction precision of the BP neural network can be effectively improved, in a traditional updating mode of a particle swarm, the global optimal solution directly influences the updating of the position of the next generation of particles, namely, the selection of the global optimal solution has important influence on the optimization result of the particle swarm, compared with a mode that particles in the traditional particle swarm directly learn to the global optimal solution in the updating process, the preferred embodiment introduces a global reference solution to replace the traditional global optimal solution, the global reference solution is selected from historical optimal solutions of the particles in the particle swarm during the current iteration, the global reference value corresponding to the historical optimal solution is defined, the global reference value of the historical optimal solution is calculated in the neighborhood detection range corresponding to the historical optimal solution, and the detection distance corresponding to the historical optimal solution is determined by the iteration number of the particle swarm and the number of different historical optimal solutions in the particle swarm, with the increase of the iteration times, the neighborhood detection distance is reduced, so that the local search and the global search of the particle swarm algorithm can be effectively balanced, and in addition, when the number of different historical optimal solutions in the particle swarm is small, the neighborhood detection distance of the historical optimal solution is increased, so that the solution space can be more comprehensively covered by the detection of the historical optimal solution in the particle swarm, and the global search capability of the particle swarm is increased; when calculating the global reference value of the historical optimal solution, comprehensively considering the fitness function value level of the historical optimal solution and the number of the historical optimal solutions in the neighborhood detection range of the historical optimal solution, selecting the historical optimal solution with the minimum global reference value in the set as the global reference solution, namely selecting the historical optimal solution with smaller fitness function value in the particle swarm and less other historical optimal solutions in the neighborhood detection range as the global reference solution, selecting the historical optimal solution with smaller fitness function value as the global reference solution, namely ensuring that the particles in the particle swarm advance towards the target solution, selecting the historical optimal solution with less other historical optimal solutions in the neighborhood detection range as the global reference solution, namely increasing the diversity of the particle swarm solution and avoiding the defect that the particle swarm algorithm is trapped in local optimization, when more other historical optimal solutions are included in the neighborhood detection range of the historical optimal solution, if the historical optimal solution is taken as a local reference solution, the particle swarm is easy to fall into local optimal, in addition, considering the condition that the historical optimal solution of more particles is around the global optimal solution at the later iteration stage of the particle swarm algorithm, the preferred embodiment introduces a weight coefficient into the global reference value of the historical optimal solution, the weight coefficient enables the global reference solution to be selected to pay more attention to the number of other historical optimal solutions contained in the neighborhood detection range of the historical optimal solution in the early iteration stage of the particle swarm, namely, the diversity of the particle swarm solution is more noticed, the particle swarm is prevented from falling into the local optimum, the global reference solution is selected to more notice the fitness function value level of the historical optimum solution in the later iteration stage of the particle swarm according to the weight coefficient, the particle swarm optimization is accelerated to move forward towards the direction of a target solution, so that the convergence speed of the particle swarm optimization is improved; the inertia weight factors of the particles in the particle swarm are set to be adjusted along with the difference value of the fitness function value between the particles and the global reference solution in a self-adaptive mode, when the difference value between the fitness function value of the particles and the fitness function value of the global reference solution is large, the value of the inertia weight factors of the particles is large at the moment, namely the particles advance towards the global reference solution by adopting a large step length, so that the particles pay more attention to global optimization, when the difference value between the fitness function value of the particles and the fitness function value of the global reference solution is small, the value of the inertia weight factors of the particles is small at the moment, namely the particles advance towards the global reference solution by adopting a small step length, so that the particles pay more attention to local optimization, namely the inertia weight factors adopted by the preferred embodiment can effectively balance global optimization and local optimization of the particles.
Drawings
The invention is further described with the aid of the accompanying drawings, in which, however, the embodiments do not constitute any limitation to the invention, and for a person skilled in the art, without inventive effort, further drawings may be derived from the following figures.
FIG. 1 is a schematic diagram of the present invention.
Detailed Description
The invention is further described with reference to the following examples.
Referring to fig. 1, the financial data prediction system based on a block chain and cloud computing in this embodiment includes a first data acquisition module, a second data acquisition module, a financial data preprocessing module, a block chain storage module, a financial data prediction module, and a visualization module, where the first data acquisition module is configured to collect historical financial timing data and input the collected historical financial timing data to the financial data preprocessing module for processing, the processed historical financial timing data is transmitted to the block chain storage module for storage, the second data acquisition module is configured to collect real-time financial timing data and input the collected real-time financial timing data to the financial data preprocessing module for processing, the processed real-time financial timing data is input to the financial data prediction module, and the financial data prediction module predicts future behavior of current financial data according to the input real-time financial timing data, the visualization module is used for displaying the prediction result of the financial data prediction module.
The preferred embodiment provides a financial data prediction system based on a block chain and cloud computing, and the future trend of financial data is predicted by adopting a BP neural network, so that the development and change of a financial market can be known in time.
Preferably, the financial data preprocessing module is configured to perform normalization on the input financial time series data, remove noise data in the normalized financial time series data, and set R to represent the normalized fundFusing the time series data set, and R ═ R(s), s ═ 1, 2,. and m }, wherein R(s) represents the s-th financial data in the financial time series data set R, and m represents the total number of financial data in the financial time series data set R; let c(s) denote a neighborhood data set corresponding to financial data R(s), given a positive integer n, and 0 < n < m, n financial data closest to the financial data R(s) in the financial time series data set R is selected and added to the set c(s), and c(s) { R (e, s), e ═ 1, 2.., n }, where R (e, s) denotes the e-th financial data in the set c(s); screening and classifying the financial data in the set C(s), Cu(s) represents the u-th sub-category obtained by screening and classifying the financial data in the set C(s), and the financial data in the set C(s) is selected and added into the sub-category C in the following wayuIn(s), specifically:
given a distance threshold D, assuming that r (p, s) represents the p-th financial data in the set C(s) without any subclass added, and r (q, s) represents the q-th financial data in the set C(s) without any subclass added, then
Figure BDA0002575587370000081
Figure BDA0002575587370000082
Randomly selecting one financial data which is not added into any subclass from the set C(s) to be added into the subclass Cu(s) continuing to screen and classify the financial data in the set C(s) without any subclass, wherein r (v, s) represents the v-th financial data in the set C(s) without any subclass, and r (v, s) represents the v-th financial data in the set C(s) without any subclassu(d, s) represents subclass Cu(s) when the financial data r (v, s) satisfies:
Figure BDA0002575587370000083
then financial data r (v, s) is added to subclass Cu(s) and continuing to screen and sort financial data in the set C(s) that is not added with any subclass, when no new financial data is added into subclass Cu(s) stopping the selection of melting data from the set C(s)To subclass Cu(s) wherein η (r (v, s), r)u(d, s)) represents a value function, when | r |uWhen (D, s) -r (v, s) | is less than or equal to D, then eta (r (v, s), ru(d, s)) -0, when | r |uWhen (D, s) -r (v, s) | > D, η (r (v, s), ru(d,s))=1;
Let mu(s) represents subclass CuFinancial data in(s) when muWhen(s) is greater than or equal to 2, subclass C is retainedu(s) when muWhen(s) is 1, subclass C will beuThe financial data in(s) is deleted in the set C(s), and the method is adopted again to select the financial data in the set C(s) to be added into the subclass Cu(s) stopping screening and classifying the financial data in the set C(s) until all the financial data in the set C(s) are added into the subclasses;
let C'(s) denote the set of financial data in the set C(s) that has not been deleted after being filtered and classified, Cθ(s) represents the theta sub-class resulting from screening and classifying the financial data in the set C(s), and Cθ(s) is a subclass closest to the financial data r(s) among the subclasses obtained by screening and classifying the financial data in the set C(s), and
Figure BDA0002575587370000084
Figure BDA0002575587370000085
where m(s) represents the number of sub-categories into which the financial data in set C(s) is divided, ru(g, s) denotes subclass CuThe g-th financial data in(s); the threshold H (θ) is determined for a given neighborhood, and
Figure BDA0002575587370000086
Figure BDA0002575587370000091
wherein r isθ(ψ, s) represents the subclass CθThe ψ th financial data in(s); let C "(s) denote the neighborhood detection set of financial data r(s), let r '(A, s) denote the A-th financial data in set C'(s), when finance is detectedThe data r' (a, s) satisfy: when | < r(s) | ≦ H (θ), adding financial data r' (a, s) to the set C "(s), where C"(s) { r "(a, s), a ═ 1, 2,. and m"(s) }, where r "(a, s) represents the a-th financial data in the set C"(s), m "(s) represents the total number of financial data in the set C"(s), defining an abnormal value y(s) corresponding to the financial data r(s), and the expression of y(s) is:
y(s)=y1(s)*y2(s)
Figure BDA0002575587370000092
Figure BDA0002575587370000093
in the formula, y1(s) represents the distribution detection coefficient of the financial data r(s) in the set C'(s), y2(s) represents the neighborhood detection coefficients of financial data r(s) in set C'(s), ρ(s) represents the distribution coefficients of financial data r(s) in set C "(s), and
Figure BDA0002575587370000094
Figure BDA0002575587370000095
wherein, R (max) represents the maximum financial data value in the financial time sequence data set R, R (min) represents the minimum financial data value in the financial time sequence data set R, f1(ρ(s), H (ρ)) represents a first judgment function corresponding to the financial data r(s), where f (ρ(s), H (ρ)) > H (ρ) is 1, and when ρ(s) ≦ H (ρ), f (ρ(s), H (ρ)) is 0, C "(a, s) is assumed to represent a neighborhood detection set of the financial data r" (a, s), ρ (a, s) represents a distribution coefficient of the financial data r "(a, s) in the set C" (a, s),
Figure BDA0002575587370000096
where r '(b, a, s) represents the b-th financial data in the set C' (a, s), m '(a, s) represents the total number of financial data in the set C' (a, s), and f1(ρ (a, s), H (ρ)) represents a first judgment function corresponding to the financial data r ″ (a, s), and f (a, s) > H (ρ) is the first judgment function1(ρ (a, s), H (ρ)) -1, and when ρ (a, s) ≦ H (ρ), then f (p)) is set to 11(ρ (a, s), H (ρ)) -0, where H (ρ) is a given distribution detection threshold,
Figure BDA0002575587370000097
Figure BDA0002575587370000098
represents a second judgment function when
Figure BDA0002575587370000099
When it is, then
Figure BDA00025755873700000910
When in use
Figure BDA00025755873700000911
When it is, then
Figure BDA00025755873700000912
Represents subclass CθThe Lth financial data in(s), D (C)θ(s)) represents subclass Cθ(s) a corresponding class detection threshold, and
Figure BDA00025755873700000913
wherein r isθ(K, s) denotes subclass Cθ(s) the kth financial data;
when the abnormal value y(s) corresponding to the financial data R(s) is greater than 1, the financial data R(s) is judged to be noise data, the financial data R(s) is deleted in the financial time sequence data set R, when the abnormal value y(s) corresponding to the financial data R(s) is less than or equal to 1, the financial data R(s) is judged to be normal data, and the financial data R(s) is reserved in the financial time sequence data set R.
The preferred embodiment is used for removing noise data in financial time series data, selecting a certain amount of financial data closest to the financial data to be detected to construct a neighborhood data set corresponding to the financial data to be detected, screening and classifying the financial data in the neighborhood data set, removing isolated data in the neighborhood detection set, obtaining subclasses with similar data, selecting the subclasses closest to the financial data to be detected to construct a neighborhood detection set of the financial data to be detected, effectively avoiding the influence on the accuracy of noise detection on the financial data to be detected due to the noise data in the neighborhood detection set, ensuring the similarity between the financial data to be detected and the neighborhood financial data in the neighborhood detection set, defining abnormal values corresponding to the financial data to be detected, and determining the abnormal values of the financial data to be detected compared with the neighborhood financial data to be detected through distribution detection coefficients and neighborhood detection coefficients The distribution detection coefficient measures the distribution situation between the financial data and the neighborhood financial data through the distribution coefficient, when the distribution coefficient of the financial data is smaller than a given distribution detection threshold, the distribution detection coefficient indicates that the financial data is closer to the distribution distance of the neighborhood financial data, therefore, the financial data to be detected is directly judged to be normal data, when the distribution coefficient of the financial data to be detected is larger than the given distribution detection threshold, the distribution distance between the financial data to be detected and the neighborhood financial data is farther, considering the situation that the density distribution of the financial data is larger, whether the financial data to be detected is in an area with larger density distribution is determined by calculating the distribution coefficient of the neighborhood financial data of the financial data to be detected, when the distribution coefficient of the neighborhood financial data with more existence of the financial data to be detected is larger than the given distribution detection threshold, the financial data to be detected is indicated to be in an area with large density distribution, namely the financial data to be detected is judged to be normal data, when the financial data to be detected has fewer distribution coefficients of neighborhood financial data smaller than a given distribution detection threshold value, the abnormal condition of the financial data to be detected is further determined through the neighborhood detection coefficients, the neighborhood detection coefficients are further judged by measuring the abnormal condition of the sub-class to be detected closest to the financial data to be detected, when the minimum distance between the financial data to be detected and the fusion data in the sub-class closest to the financial data to be detected is larger than the class detection threshold value corresponding to the sub-class, the financial data to be detected is indicated to be outside the sub-class closest to the financial data to be detected, namely the financial data to be detected is judged to be noise data; in summary, the preferred embodiment determines the abnormal degree of the financial data to be detected by measuring the distribution relation between the financial data to be detected and the neighborhood financial data thereof, and the adopted distribution detection coefficients can effectively detect the abnormal of the financial data in different density areas; the abnormal degree of the financial data to be detected is determined by measuring the local relation between the data to be detected and the subclasses in the neighborhood detection set by utilizing the neighborhood detection coefficients, so that the detection precision of the noise data is effectively improved.
Preferably, the financial data prediction module predicts the future trend of the financial time series data by using a BP neural network, the financial data prediction module calls historical financial time series data stored in the block chain storage module to train the BP neural network for future trend prediction of the financial data, the processed real-time financial time series data is used as an input value of the trained BP neural network, and an output value of the trained BP neural network is the predicted future trend of the financial data.
Preferably, in the training process of the BP neural network, the particle swarm algorithm is adopted to optimize the initial weight and the threshold of the BP neural network, and a fitness function of the particle swarm algorithm is defined as:
Figure BDA0002575587370000111
wherein M is the number of training samples, YpFor the output value of the p-th sample,
Figure BDA0002575587370000112
the smaller the fitness function value of the particles in the particle swarm is, the better the optimization result of the particles is.
Preferably, the particles in the set of particles are updated using the following formula:
Vi(t+1)=ωi(t)Vi(t)+c1rand()(Pi(t)-Xi(t))+c2rand()(G(t)-Xi(t))
Xi(t+1)=Xi(t)+Vi(t+1)
in the formula, Xi(t +1) and Vi(t +1) denotes the position and step size of the particle i at the (t +1) th iteration, X, respectivelyi(t) and Vi(t) denotes the position and step size of the particle i at the t-th iteration, c1And c2Respectively, a learning factor, rand () a random number between (0, 1) randomly generated, Pi(t) represents the historical optimal solution, ω, for particle i at the t-th iterationi(t) represents an inertial weight factor for particle i at the tth iteration, g (t) represents a global reference solution for the particle population at the tth iteration, g (t) is determined in the following manner:
let P (t) denote the historical optimal solution set of particles in the particle swarm at the t-th iteration, and P (t) { P }i(t), i ═ 1, 2., N }, where N denotes the number of particles in the particle swarm, the historical optimal solutions in the set P (t) are screened, when the same historical optimal solutions exist in the set P (t), only one of the same historical optimal solutions is retained, the other historical optimal solutions in the same historical optimal solutions are deleted, the screened set P (t) is denoted as P ' (t), P ' (t) { P (j, t), j ═ 1, 2., N ' (t) }, where P (j, t) denotes the jth historical optimal solution in the set P ' (t), N ' (t) denotes the number of historical optimal solutions in the set P ' (t), and the neighborhood detection distance corresponding to the historical optimal solution in the set P ' (t) is defined as d (t), then d (t) is expressed as:
Figure BDA0002575587370000113
in the formula (d)0Representing a given initial neighborhood detection distance, TmaxRepresenting a given maximum number of iterations;
detecting the historical optimal solution in the set P' (t), wherein O (j, t) represents a neighborhood detection range corresponding to the historical optimal solution P (j, t), and the O (j, t) is a circle with the historical optimal solution P (j, t) as a center and d (t) as a radiusThe region defines the global reference value corresponding to the historical optimal solution P (j, t) as
Figure BDA0002575587370000114
Then
Figure BDA0002575587370000115
The expression of (a) is:
Figure BDA0002575587370000116
in the formula (f)3(h (g (j, t))) represents a third value function corresponding to the historical optimal solution P (j, t), h (P (j, t)) represents a fitness function value corresponding to the historical optimal solution P (j, t),
Figure BDA0002575587370000124
represents the mean value of fitness function values corresponding to the historical optimal solution in the set P' (t)
Figure BDA0002575587370000121
When f is greater3(h (P (j, t)))) 1, when
Figure BDA0002575587370000122
When f is greater3(h (P (j, t))) + ∞, h (min, t) and h (max, t) respectively represent the minimum and maximum fitness function values corresponding to the particles in the particle swarm at the time of the tth iteration, (O (j, t)) represents the historical optimal number of solutions in the set P' (t) existing in the neighborhood detection range O (j, t), and α (t) and β (t) are weight coefficients, and
Figure BDA0002575587370000123
and selecting the historical optimal solution with the minimum global reference value in the set P' (t) as the global reference solution G (t).
The preferred embodiment is used for determining a global reference solution in the particle swarm updating process, in the traditional updating mode of the particle swarm, the global optimal solution directly influences the updating of the next generation particle position, namely the selection of the global optimal solution has an important influence on the optimization result of the particle swarm, compared with the mode that the particles in the traditional particle swarm directly learn the global optimal solution in the updating process, the preferred embodiment introduces the global reference solution to replace the traditional global optimal solution, the global reference solution is selected from the historical optimal solutions of the particles in the particle swarm during the current iteration, the global reference value corresponding to the historical optimal solution is defined, the global reference value of the historical optimal solution is calculated in the neighborhood detection range corresponding to the historical optimal solution, and the neighborhood detection distance corresponding to the historical optimal solution is determined by the iteration number of the particle swarm and the number of different historical optimal solutions in the particle swarm, with the increase of the iteration times, the neighborhood detection distance is reduced, so that the local search and the global search of the particle swarm algorithm can be effectively balanced, and in addition, when the number of different historical optimal solutions in the particle swarm is small, the neighborhood detection distance of the historical optimal solution is increased, so that the solution space can be more comprehensively covered by the detection of the historical optimal solution in the particle swarm, and the global search capability of the particle swarm is increased; when calculating the global reference value of the historical optimal solution, comprehensively considering the fitness function value level of the historical optimal solution and the number of the historical optimal solutions in the neighborhood detection range of the historical optimal solution, selecting the historical optimal solution with the minimum global reference value in the set as the global reference solution, namely selecting the historical optimal solution with smaller fitness function value in the particle swarm and less other historical optimal solutions in the neighborhood detection range as the global reference solution, selecting the historical optimal solution with smaller fitness function value as the global reference solution, namely ensuring that the particles in the particle swarm advance towards the target solution, selecting the historical optimal solution with less other historical optimal solutions in the neighborhood detection range as the global reference solution, namely increasing the diversity of the particle swarm solution and avoiding the defect that the particle swarm algorithm is trapped in local optimization, when more other historical optimal solutions are included in the neighborhood detection range of the historical optimal solution, if the historical optimal solution is taken as a local reference solution, the particle swarm is easy to fall into local optimal, in addition, considering the condition that the historical optimal solution of more particles is around the global optimal solution at the later iteration stage of the particle swarm algorithm, the preferred embodiment introduces a weight coefficient into the global reference value of the historical optimal solution, the weight coefficient enables the global reference solution to be selected to pay more attention to the number of other historical optimal solutions contained in the neighborhood detection range of the historical optimal solution in the early iteration stage of the particle swarm, namely, the diversity of the particle swarm solution is more noticed, the particle swarm is prevented from falling into the local optimum, the global reference solution is selected to more notice the fitness function value level of the historical optimum solution in the later iteration stage of the particle swarm according to the weight coefficient, namely, the particle swarm algorithm is accelerated to move forward towards the direction of the target solution, and the convergence speed of the particle swarm algorithm is improved.
Preferably, the inertial weight factor ω of the particle i at the t-th iterationiThe expression of (t) is:
Figure BDA0002575587370000131
where ω (start) represents the initial inertia weight factor value, and ω (start) is 0.9, and ω (end) represents the evolution of the particle population to the maximum number of iterations TmaxThe value of the inertial weight factor of time, and ω (end) is 0.4, h (X)i(t)) represents the position X of the particle i at the t-th iterationi(t) the corresponding fitness function value, and h (G (t)) represents the fitness function value corresponding to the global reference solution G (t) of the particle swarm in the t iteration.
The preferred embodiment is for determining inertial weight factors for particles in a population of particles, the inertial weight factors being adaptively adjusted with a difference in fitness function values between the particles and a global reference solution, when the difference between the fitness function value of the particle and the fitness function value of the global reference solution is large, the value of the inertial weight factor of the particle is large at this time, i.e. the particles advance towards the global reference solution with larger steps, making the particles more focused on global optimization, when the difference between the fitness function value of the particle and the fitness function value of the global reference solution is small, the value of the inertial weight factor of the particle is small at this time, i.e. the particles advance towards the global reference solution with smaller steps, making the particles more focused on local optimization, that is, the inertial weight factor adopted in the preferred embodiment can effectively balance global optimization and local optimization of particles.
Finally, it should be noted that the above embodiments are only used for illustrating the technical solutions of the present invention, and not for limiting the protection scope of the present invention, although the present invention is described in detail with reference to the preferred embodiments, it should be understood by those skilled in the art that modifications or equivalent substitutions can be made on the technical solutions of the present invention without departing from the spirit and scope of the technical solutions of the present invention.

Claims (4)

1. A financial data prediction system based on a block chain and cloud computing is characterized by comprising a first data acquisition module, a second data acquisition module, a financial data preprocessing module, a block chain storage module, a financial data prediction module and a visualization module, wherein the first data acquisition module is used for acquiring historical financial time sequence data and inputting the acquired historical financial time sequence data into the financial data preprocessing module for processing, the processed historical financial time sequence data is transmitted to the block chain storage module for storage, the second data acquisition module is used for acquiring real-time financial time sequence data and inputting the acquired real-time financial time sequence data into the financial data preprocessing module for processing, the processed real-time financial time sequence data is input into the financial data prediction module, and the financial data prediction module predicts the future trend of the current financial data according to the input real-time financial time sequence data, the visualization module is used for displaying the prediction result of the financial data prediction module.
2. The system according to claim 1, wherein the financial data preprocessing module is configured to normalize the input financial time series data and remove noise data from the normalized financial time series data, where R denotes a normalized financial time series data set, and R ═ R(s), s ═ 1, 2.. and m }, where R(s) denotes the s-th financial data in the financial time series data set R, and m denotes the total number of financial data in the financial time series data set R; let C(s) denote the neighborhood corresponding to the financial data r(s)A domain data set, wherein n is given as a positive integer n, and 0 < n < m, n financial data closest to financial data R(s) in a financial time sequence data set R are selected to be added into a set c(s), and c(s) { R (e, s), e ═ 1, 2,. said., n }, wherein R (e, s) represents the e-th financial data in the set c(s); screening and classifying the financial data in the set C(s), Cu(s) represents the u-th sub-category obtained by screening and classifying the financial data in the set C(s), and the financial data in the set C(s) is selected and added into the sub-category C in the following wayuIn(s), specifically:
given a distance threshold D, assuming that r (p, s) represents the p-th financial data in the set C(s) without any subclass added, and r (q, s) represents the q-th financial data in the set C(s) without any subclass added, then
Figure FDA0002575587360000011
Figure FDA0002575587360000012
Randomly selecting one financial data which is not added into any subclass from the set C(s) to be added into the subclass Cu(s) continuing to screen and classify the financial data in the set C(s) without any subclass, wherein r (v, s) represents the v-th financial data in the set C(s) without any subclass, and r (v, s) represents the v-th financial data in the set C(s) without any subclassu(d, s) represents subclass Cu(s) when the financial data r (v, s) satisfies:
Figure FDA0002575587360000013
then financial data r (v, s) is added to subclass Cu(s) and continuing to screen and sort financial data in the set C(s) that is not added with any subclass, when no new financial data is added into subclass CuIf the data is in(s), the data is stopped from being selected from the set C(s) and added to the subclass Cu(s) wherein η (r (v, s), r)u(d, s)) represents a value function, when | r |uWhen (D, s) -r (v, s) | is less than or equal to D, then eta (r (v, s), ru(d, s)) -0, when | r |uWhen (D, s) -r (v, s) | > D, η (r (v, s), ru(d,s))=1;
Let mu(s) represents subclass CuFinancial data in(s) when muWhen(s) is greater than or equal to 2, subclass C is retainedu(s) when muWhen(s) is 1, subclass C will beuThe financial data in(s) is deleted in the set C(s), and the method is adopted again to select the financial data in the set C(s) to be added into the subclass Cu(s) stopping screening and classifying the financial data in the set C(s) until all the financial data in the set C(s) are added into the subclasses;
let C'(s) denote the set of financial data in the set C(s) that has not been deleted after being filtered and classified, Cθ(s) represents the theta sub-class resulting from screening and classifying the financial data in the set C(s), and Cθ(s) is a subclass closest to the financial data r(s) among the subclasses obtained by screening and classifying the financial data in the set C(s), and
Figure FDA0002575587360000021
Figure FDA0002575587360000022
where m(s) represents the number of sub-categories into which the financial data in set C(s) is divided, ru(g, s) denotes subclass CuThe g-th financial data in(s); the threshold H (θ) is determined for a given neighborhood, and
Figure FDA0002575587360000023
Figure FDA0002575587360000024
wherein r isθ(ψ, s) represents the subclass CθThe ψ th financial data in(s); let C "(s) denote the neighborhood detection set of the constructed financial data r(s), let r ' (a, s) denote the a-th financial data in the set C '(s), and when the financial data r ' (a, s) satisfy: if | < r(s) | ≦ H (θ), the financial data r' (a, s) is added to the set C ″(s), where C ″(s) = { r ″ (a, s) ≦ H (θ)) 1, 2., m "(s) }, where r" (a, s) represents the a-th financial data in the set C "(s), m"(s) represents the total number of financial data in the set C "(s), the abnormal value corresponding to the financial data r(s) is defined as y(s), and the expression of y(s) is:
y(s)=y1(s)*y2(s)
Figure FDA0002575587360000025
Figure FDA0002575587360000026
in the formula, y1(s) represents the distribution detection coefficient of the financial data r(s) in the set C'(s), y2(s) represents the neighborhood detection coefficients of financial data r(s) in set C'(s), ρ(s) represents the distribution coefficients of financial data r(s) in set C "(s), and
Figure FDA00025755873600000210
Figure FDA0002575587360000028
wherein, R (max) represents the maximum financial data value in the financial time sequence data set R, R (min) represents the minimum financial data value in the financial time sequence data set R, f1(ρ(s), H (ρ)) represents a first judgment function corresponding to the financial data r(s), where f (ρ(s), H (ρ)) > H (ρ) is 1, and when ρ(s) ≦ H (ρ), f (ρ(s), H (ρ)) is 0, C "(a, s) is assumed to represent a neighborhood detection set of the financial data r" (a, s), ρ (a, s) represents a distribution coefficient of the financial data r "(a, s) in the set C" (a, s),
Figure FDA0002575587360000029
where r '(b, a, s) represents the b-th financial data in the set C' (a, s), m '(a, s) represents the total number of financial data in the set C' (a, s), and f1(ρ (a, s), H (ρ)) represents a first judgment corresponding to the financial data r ″ (a, s)Function, when ρ (a, s) > H (ρ), then f1(ρ (a, s), H (ρ)) -1, and when ρ (a, s) ≦ H (ρ), then f (p)) is set to 11(ρ (a, s), H (ρ)) -0, where H (ρ) is a given distribution detection threshold,
Figure FDA0002575587360000031
Figure FDA0002575587360000032
Figure FDA0002575587360000033
represents a second judgment function when
Figure FDA0002575587360000034
When it is, then
Figure FDA0002575587360000035
When in use
Figure FDA0002575587360000036
When it is, then
Figure FDA0002575587360000037
rθ(L, s) denotes subclass CθThe Lth financial data in(s), D (C)θ(s)) represents subclass Cθ(s) a corresponding class detection threshold, and
Figure FDA0002575587360000038
wherein r isθ(K, s) denotes subclass Cθ(s) the kth financial data;
when the abnormal value y(s) corresponding to the financial data R(s) is greater than 1, the financial data R(s) is judged to be noise data, the financial data R(s) is deleted in the financial time sequence data set R, when the abnormal value y(s) corresponding to the financial data R(s) is less than or equal to 1, the financial data R(s) is judged to be normal data, and the financial data R(s) is reserved in the financial time sequence data set R.
3. The financial data prediction system based on the block chain and cloud computing as claimed in claim 2, wherein the financial data prediction module predicts the future trend of the financial time series data by using a BP neural network, the financial data prediction module calls historical financial time series data stored in the block chain storage module to train the BP neural network for future trend prediction of the financial data, the processed real-time financial time series data is used as an input value of the trained BP neural network, and an output value of the trained BP neural network is the predicted future trend of the financial data.
4. The system of claim 3, wherein in the training process of the BP neural network, the initial weight and the threshold of the BP neural network are optimized by a particle swarm algorithm.
CN202010652720.1A 2020-07-08 2020-07-08 Financial data prediction system based on block chain and cloud computing Active CN111798061B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010652720.1A CN111798061B (en) 2020-07-08 2020-07-08 Financial data prediction system based on block chain and cloud computing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010652720.1A CN111798061B (en) 2020-07-08 2020-07-08 Financial data prediction system based on block chain and cloud computing

Publications (2)

Publication Number Publication Date
CN111798061A true CN111798061A (en) 2020-10-20
CN111798061B CN111798061B (en) 2020-12-29

Family

ID=72810607

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010652720.1A Active CN111798061B (en) 2020-07-08 2020-07-08 Financial data prediction system based on block chain and cloud computing

Country Status (1)

Country Link
CN (1) CN111798061B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111930844A (en) * 2020-08-11 2020-11-13 罗忠明 Financial prediction system based on block chain and artificial intelligence
CN113065710A (en) * 2021-04-09 2021-07-02 深圳市小金象科技有限公司 Financial prediction system based on artificial intelligence and block chain
CN117474686A (en) * 2023-12-11 2024-01-30 万链指数(青岛)信息科技有限公司 Financial data prediction system based on blockchain and big data

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109242207A (en) * 2018-10-10 2019-01-18 中山大学 A kind of Financial Time Series prediction technique based on deeply study
CN110288123A (en) * 2019-05-17 2019-09-27 福建江夏学院 A kind of Financial Information prediction technique and device based on deep learning
US20200007322A1 (en) * 2018-06-27 2020-01-02 International Business Machines Corporation Automated task management on a blockchain based on predictive and analytical analysis
CN111192144A (en) * 2020-01-03 2020-05-22 湖南工商大学 Financial data prediction method, device, equipment and storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200007322A1 (en) * 2018-06-27 2020-01-02 International Business Machines Corporation Automated task management on a blockchain based on predictive and analytical analysis
CN109242207A (en) * 2018-10-10 2019-01-18 中山大学 A kind of Financial Time Series prediction technique based on deeply study
CN110288123A (en) * 2019-05-17 2019-09-27 福建江夏学院 A kind of Financial Information prediction technique and device based on deep learning
CN111192144A (en) * 2020-01-03 2020-05-22 湖南工商大学 Financial data prediction method, device, equipment and storage medium

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LAHMIRI: "A variational mode decompoisition approach for analysis and forecasting of economic and financial time series", 《EXPERT SYSTEMS WITH APPLICATIONS》 *
周博: ""基于多维泰勒网及其扩展的金融数据建模与预测研究"", 《中国博士学位论文全文数据库-基础科学辑》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111930844A (en) * 2020-08-11 2020-11-13 罗忠明 Financial prediction system based on block chain and artificial intelligence
CN113065710A (en) * 2021-04-09 2021-07-02 深圳市小金象科技有限公司 Financial prediction system based on artificial intelligence and block chain
CN117474686A (en) * 2023-12-11 2024-01-30 万链指数(青岛)信息科技有限公司 Financial data prediction system based on blockchain and big data
CN117474686B (en) * 2023-12-11 2024-03-29 万链指数(青岛)信息科技有限公司 Financial data prediction system based on blockchain and big data

Also Published As

Publication number Publication date
CN111798061B (en) 2020-12-29

Similar Documents

Publication Publication Date Title
CN111798061B (en) Financial data prediction system based on block chain and cloud computing
CN110070141B (en) Network intrusion detection method
Yu et al. Convolutional neural networks with refined loss functions for the real-time crash risk analysis
Bamakan et al. A new intrusion detection approach using PSO based multiple criteria linear programming
CN106599935B (en) Three decision unbalanced data oversampler methods based on Spark big data platform
Shen et al. A cost-sensitive logistic regression credit scoring model based on multi-objective optimization approach
CN108304316B (en) Software defect prediction method based on collaborative migration
CN111798062B (en) Financial data prediction system based on block chain and big data
CN111898685B (en) Target detection method based on long tail distribution data set
CN116612120B (en) Two-stage road defect detection method for data unbalance
CN109840413A (en) A kind of detection method for phishing site and device
CN111160959A (en) User click conversion estimation method and device
Hanafy et al. Using machine learning models to compare various resampling methods in predicting insurance fraud
Wang et al. Preventing discriminatory decision-making in evolving data streams
US20230298167A1 (en) Automated disease detection system
CN112801231B (en) Decision model training method and device for business object classification
Yahaya et al. An enhanced bank customers churn prediction model using a hybrid genetic algorithm and k-means filter and artificial neural network
JP2007066114A (en) Sorting method and apparatus using sorter
Narasimhan et al. Altered particle swarm optimization based attribute selection strategy with improved fuzzy Artificial Neural Network classifier for coronary artery heart disease risk prediction
Mokheleli et al. Machine learning approach for credit score predictions
Chowdhury et al. Bankruptcy prediction for imbalanced dataset using oversampling and ensemble machine learning methods
CN114066173A (en) Capital flow behavior analysis method and storage medium
Himani et al. A comparative study on machine learning based prediction of citations of articles
Almas et al. Enhancing the performance of decision tree: A research study of dealing with unbalanced data
CN112396507A (en) Shadow division-based integrated SVM personal credit evaluation method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20201209

Address after: 518000 building 904-54, building 2, Meidu community, Meilin street, Futian District, Shenzhen City, Guangdong Province

Applicant after: Shenzhen rongyida Technology Development Co.,Ltd.

Address before: No.8, Yantian Road, xinyingwan District, Yangpu Economic Development Zone, Danzhou City, Hainan Province

Applicant before: Yangpu Minoan Electronic Technology Co.,Ltd.

GR01 Patent grant
GR01 Patent grant