CN107943986A - A kind of big data analysis digging system - Google Patents

A kind of big data analysis digging system Download PDF

Info

Publication number
CN107943986A
CN107943986A CN201711244293.8A CN201711244293A CN107943986A CN 107943986 A CN107943986 A CN 107943986A CN 201711244293 A CN201711244293 A CN 201711244293A CN 107943986 A CN107943986 A CN 107943986A
Authority
CN
China
Prior art keywords
data
module
data analysis
mining
interface
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711244293.8A
Other languages
Chinese (zh)
Other versions
CN107943986B (en
Inventor
洪少华
吴琦
肖潇
龚纯斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ruishi Chikaku (shenzhen) Algorithm Technology Co Ltd
Original Assignee
Ruishi Chikaku (shenzhen) Algorithm Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ruishi Chikaku (shenzhen) Algorithm Technology Co Ltd filed Critical Ruishi Chikaku (shenzhen) Algorithm Technology Co Ltd
Priority to CN201711244293.8A priority Critical patent/CN107943986B/en
Publication of CN107943986A publication Critical patent/CN107943986A/en
Application granted granted Critical
Publication of CN107943986B publication Critical patent/CN107943986B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases

Abstract

The present invention discloses a kind of big data analysis digging system, including several data analysis modules, data analysis module manager, several data-mining modules, data-mining module manager and frame execution module, each data analysis module carries out respective simple function realization according to interface, while obtains data from raw data base and analyzed and be stored in respective module database;Multiple data analysis modules are managed collectively by data analysis module manager;Each data-mining module carries out respective simple function realization according to interface;Multiple data-mining modules are managed collectively by data-mining module manager;Frame execution module is parsed according to incoming structured document, and that extracts from data analysis module manager and data mining module management corresponding function realizes module, execution module function, thus obtain it is final needed for data.Such a system can save human resources, improve demand response efficiency, and excavating demand for different pieces of information provides facility.

Description

A kind of big data analysis digging system
Technical field
The invention belongs to big data analysis field, more particularly to a kind of module based on structured documents such as XML and JSON It polymerize big data analysis digging system.
Background technology
Under the historical background that information technology continues to develop, mass data all is being produced daily, big data analysis is in each row Industry is widely used in field, or even gradually starts to penetrate into our daily life.The data analysis of high efficient and flexible It is one of problem of big data system regions most study to excavate framework, its main effect is exactly the sea that will be stored in database It is required big data report that amount data carry out effectively analyzing and neatly merging statistics as desired, corresponding to carry out with this Meaningful judgement.
With the arrival of Internet era, network data is continuously increased, and analytic statistics is carried out to various network datas Demand is also following, and the analysis in industry to data at present also rests on the progress special exploitation of particular analysis module according to demand Stage on, flexible partition can not be carried out to data analysis mining demand in general form and integrated.Therefore, by various data Analysis demand is split as multiple single realizing, and module and the system being polymerized with structured document are a feasible solution party Case.
The content of the invention
The purpose of the present invention, is to provide a kind of big data analysis digging system, it can save human resources, improves demand Response efficiency, excavates demand for different pieces of information and provides facility.
In order to achieve the above objectives, solution of the invention is:
A kind of big data analysis digging system, including several data analysis modules, data analysis module manager, several numbers According to excavation module, data-mining module manager and frame execution module, wherein, data analysis module defines unified module and connects Mouthful, each data analysis module carries out respective simple function realization according to interface, while each data analysis module is from initial data Data are obtained in storehouse to be analyzed and be stored in respective module database;Multiple data analysis modules are by data analysis module pipe Reason device is managed collectively;Data-mining module defines unified module interface, and each data-mining module carries out each according to interface From simple function realize;Multiple data-mining modules are managed collectively by data-mining module manager;Frame performs mould Block is responsible for being parsed according to incoming structured document, and according to file structure and parameter from data analysis module manager and The module of realizing of corresponding function is extracted in data-mining module manager, structure forms aggregation module, according to pre-defined system One analysis module interface excavates module interface execution module function with unified, so as to obtain final required data.
Above-mentioned each data-mining module is also built-in with initial data bank interface, for extracting original number from raw data base According to.
Above-mentioned each data-mining module is also built-in with data analysis module manager, for being pre-processed from each analysis module Data are extracted in database, and respective statistics is generated according to the data mining algorithm of realization.
Said structure document is XML or JSON document.
After using the above scheme, the present invention by the disassembling of data analysis requirements, the modular implementation of simple function and Based on the final module polymerization of structured document, realize flexible data digging system, system framework code etc. can not changed Under the premise of, by increasing analysis module and excavating module and the incoming file structure of adjustment, adapting to different data minings automatically needs Ask, it is not necessary to excavate the corresponding function of demand overlapping development further according to different pieces of information, save human resources, improve demand response effect Rate, excavates demand for different pieces of information and provides facility.The present invention is suitable for various data mining scenes.
Brief description of the drawings
Fig. 1 is the integrated stand composition of the present invention.
Embodiment
Below with reference to attached drawing, technical scheme is described in detail.
As shown in Figure 1, the present invention provides a kind of big data analysis digging system, including data analysis module, data mining Module and frame execution module, wherein, data analysis module defines unified module interface, respectively realizes that module is carried out according to interface Respective simple function is realized, while each module obtains data from raw data base and analyzed and be stored in respective number of modules According in storehouse, multiple data analysis modules are managed collectively by data analysis module manager.Data-mining module definition is unified Module interface, respectively realize that module carries out respective simple function realization according to interface, while built-in initial data bank interface, For extracting initial data, and built-in data analysis module manager from raw data base, for pre- from each analysis module Data are extracted in processing database, and respective statistics, multiple data minings are generated according to the data mining algorithm of realization Module is managed collectively by data-mining module manager.Frame execution module (namely aggregation module actuator in Fig. 1) It is mainly responsible for and is parsed according to structured documents such as incoming XML or JSON, and is divided according to file structure and parameter from data That corresponding function is extracted in analysis module management and data mining module management realizes that module is built, and will finally build Aggregation module be passed in aggregation module actuator, aggregation module actuator according to pre-defined united analysis module interface with It is unified to excavate module interface execution module function, so as to obtain final required data.
To sum up, the present invention obtains corresponding function module by the module title identified in parameter first;Secondly, according to mark Corresponding module parameter carry out data mining;Finally, the data of excavation are integrated according to file structure, so as to reach generation Meet the purpose of the data of demand.
Above example is merely illustrative of the invention's technical idea, it is impossible to protection scope of the present invention is limited with this, it is every According to technological thought proposed by the present invention, any change done on the basis of technical solution, each falls within the scope of the present invention Within.

Claims (4)

  1. A kind of 1. big data analysis digging system, it is characterised in that:Including several data analysis modules, data analysis module management Device, several data-mining modules, data-mining module manager and frame execution module, wherein, data analysis module definition system One module interface, each data analysis module carry out respective simple function realization, while each data analysis module according to interface Data are obtained from raw data base to be analyzed and be stored in respective module database;Multiple data analysis modules are by data Analysis module manager is managed collectively;Data-mining module defines unified module interface, each data-mining module according to Interface carries out respective simple function realization;Multiple data-mining modules are managed collectively by data-mining module manager; Frame execution module is responsible for being parsed according to incoming structured document, and according to file structure and parameter from data analysis mould The module of realizing of corresponding function is extracted in block manager and data mining module management, structure forms aggregation module, according to pre- The united analysis module interface first defined excavates module interface execution module function with unified, so as to obtain final required data.
  2. A kind of 2. big data analysis digging system as claimed in claim 1, it is characterised in that:Each data-mining module is also Initial data bank interface is built-in with, for extracting initial data from raw data base.
  3. A kind of 3. big data analysis digging system as claimed in claim 1, it is characterised in that:Each data-mining module is also Data analysis module manager is built-in with, for extracting data from each analysis module preprocessed data storehouse, and according to realization Data mining algorithm generate respective statistics.
  4. A kind of 4. big data analysis digging system as claimed in claim 1, it is characterised in that:The structured document is XML Or JSON document.
CN201711244293.8A 2017-11-30 2017-11-30 Big data analysis mining system Active CN107943986B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711244293.8A CN107943986B (en) 2017-11-30 2017-11-30 Big data analysis mining system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711244293.8A CN107943986B (en) 2017-11-30 2017-11-30 Big data analysis mining system

Publications (2)

Publication Number Publication Date
CN107943986A true CN107943986A (en) 2018-04-20
CN107943986B CN107943986B (en) 2022-05-17

Family

ID=61947156

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711244293.8A Active CN107943986B (en) 2017-11-30 2017-11-30 Big data analysis mining system

Country Status (1)

Country Link
CN (1) CN107943986B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5692107A (en) * 1994-03-15 1997-11-25 Lockheed Missiles & Space Company, Inc. Method for generating predictive models in a computer system
US20020169735A1 (en) * 2001-03-07 2002-11-14 David Kil Automatic mapping from data to preprocessing algorithms
CN1399228A (en) * 2002-08-29 2003-02-26 北京北大方正技术研究院有限公司 Text excavating method of semi-structural document set
US6865573B1 (en) * 2001-07-27 2005-03-08 Oracle International Corporation Data mining application programming interface
CN103577605A (en) * 2013-11-20 2014-02-12 贵州电网公司电力调度控制中心 Data warehouse based on data fusion and data mining and application method of data warehouse
CN107025288A (en) * 2017-04-14 2017-08-08 四川九鼎瑞信软件开发有限公司 Distributed data digging method and system
CN107038167A (en) * 2016-02-03 2017-08-11 普华诚信信息技术有限公司 Big data excavating analysis system and its analysis method based on model evaluation
CN107357873A (en) * 2017-07-04 2017-11-17 深圳齐心集团股份有限公司 A kind of big data storage management system

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5692107A (en) * 1994-03-15 1997-11-25 Lockheed Missiles & Space Company, Inc. Method for generating predictive models in a computer system
US20020169735A1 (en) * 2001-03-07 2002-11-14 David Kil Automatic mapping from data to preprocessing algorithms
US6865573B1 (en) * 2001-07-27 2005-03-08 Oracle International Corporation Data mining application programming interface
CN1399228A (en) * 2002-08-29 2003-02-26 北京北大方正技术研究院有限公司 Text excavating method of semi-structural document set
CN103577605A (en) * 2013-11-20 2014-02-12 贵州电网公司电力调度控制中心 Data warehouse based on data fusion and data mining and application method of data warehouse
CN107038167A (en) * 2016-02-03 2017-08-11 普华诚信信息技术有限公司 Big data excavating analysis system and its analysis method based on model evaluation
CN107025288A (en) * 2017-04-14 2017-08-08 四川九鼎瑞信软件开发有限公司 Distributed data digging method and system
CN107357873A (en) * 2017-07-04 2017-11-17 深圳齐心集团股份有限公司 A kind of big data storage management system

Also Published As

Publication number Publication date
CN107943986B (en) 2022-05-17

Similar Documents

Publication Publication Date Title
CN102222092B (en) Massive high-dimension data clustering method for MapReduce platform
CN106709012A (en) Method and device for analyzing big data
CN102222105B (en) Method for generating real-time statistical report
CN104915378B (en) A kind of statistics task quick-speed generation system and method suitable for big data
CN107577771A (en) A kind of big data digging system
CN110750650A (en) Construction method and device of enterprise knowledge graph
CN104598565B (en) A kind of K mean value large-scale data clustering methods based on stochastic gradient descent algorithm
CN103440566A (en) Method and device for generating order picking collection lists and method for optimizing order picking route
US11520825B2 (en) Method and system for converting one type of data schema to another type of data schema
CN106959948A (en) The system and its preprocess method pre-processed for distributed nature to big data
CN103853826B (en) A kind of distributed performance data processing method
CN105574032A (en) Rule matching operation method and device
CN107967347A (en) Batch data processing method, server, system and storage medium
CN106789347A (en) A kind of method that alarm association and network fault diagnosis are realized based on alarm data
CN105760511B (en) A kind of big data adaptive topology processing method based on storm
CN105138650A (en) Hadoop data cleaning method and system based on outlier mining
CN103425692B (en) Data export method and device
CN108829884A (en) data mapping method and device
CN104143122A (en) Intelligent service approval scheme
CN1831855A (en) Real-time analysing and control system and method for large underground hole group construction at network environment
CN109829660A (en) Data processing system and its design method based on electric power enterprise grade data model
CN106657099A (en) Spark data analysis service release system
CN107590225A (en) A kind of Visualized management system based on distributed data digging algorithm
CN105391777A (en) Algorithm escrow PaaS platform for decoupling logic code and performance code
CN104166701A (en) Machine learning method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant