CN104933112A - Distributed Internet transaction information storage and processing method - Google Patents

Distributed Internet transaction information storage and processing method Download PDF

Info

Publication number
CN104933112A
CN104933112A CN201510302559.4A CN201510302559A CN104933112A CN 104933112 A CN104933112 A CN 104933112A CN 201510302559 A CN201510302559 A CN 201510302559A CN 104933112 A CN104933112 A CN 104933112A
Authority
CN
China
Prior art keywords
data
distributed
warehouse
transaction information
fairground
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510302559.4A
Other languages
Chinese (zh)
Other versions
CN104933112B (en
Inventor
陈海江
吕浩
邵奇可
颜世航
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Li Shi Science And Technology Co Ltd
Original Assignee
Zhejiang Li Shi Science And Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Li Shi Science And Technology Co Ltd filed Critical Zhejiang Li Shi Science And Technology Co Ltd
Priority to CN201510302559.4A priority Critical patent/CN104933112B/en
Publication of CN104933112A publication Critical patent/CN104933112A/en
Application granted granted Critical
Publication of CN104933112B publication Critical patent/CN104933112B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/283Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/254Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses

Abstract

The invention provides a distributed Internet transaction information storage and processing method. The method comprises the following steps that: an operation data storage function module extracts structured transaction information data from an external data layer; a distributed data warehouse and a non-relational database read non-structured network log data from a distributed file system respectively; an Internet transaction information data warehouse extracts, transforms and loads the transaction information data and the network log data from the operation data storage function module and the distributed data warehouse, integrates the transaction information data and the network log data into a data tuple, and stores the data tuple with a star model; a data market extracts, transforms and loads market-specified data from the Internet transaction information data warehouse and/or the distributed data warehouse; and a business intelligence system acquires data required by analysis from the data market and/or the non-relational database according to data analysis requirements. The method is specific to the application scene of Internet transaction data analysis and processing.

Description

Distributed interconnection Transaction Information storage processing method
Technical field
The present invention relates to a kind of distributed interconnection Transaction Information storage processing method.
Background technology
The model of Data Analysis Services and result only apply to could produce real value in some decision-making application, and to know by carrying out aid decision making based on the back-up system of the fact etc. that some serial Theories and methods are business intelligence (Bussiness Intelligence are called for short BI).Along with information-based high development, business intelligence is more and more taken seriously, especially associating on-line analysis OLAP(is namely based on the online express-analysis of database, large data), be the main methods of business intelligence especially in a way, the core data warehouse architecture design in the BI solution of traditional industries as shown in Figure 1.
But internet business information data is not suitable for adopting BI solution, main cause have following some:
1. the data source of internet electronic business transaction is varied, comprises the data and network log data etc. of different electric business's platforms, various relevant database, the generation of social software; So single ETL instrument cannot handle all data well.
2. the data volume of e-commerce transaction is huge, and traditional centralized relevant database cannot meet the requirement processing large-scale data like this.
3. internet business monitoring is higher to the requirement of real-time of Data Analysis Services, and most monitoring and warning needs to complete the value that process could embody data at short notice, and traditional off-line ETL processing mode cannot satisfy the demands.
4. the number of users of internet business is huge, and traditional BI represents tool design mainly for the user on the middle and senior level of enterprise, is transplanted to e-commerce industry and all there is larger difference from interactive efficiency and Consumer's Experience.The business intelligence system of current traditional mode, can not well be applicable to the extensive of internet business information, heterogeneous data source and the demand to data analysis real-time.
Summary of the invention
The present invention is directed to the application scenarios of internet business Data Analysis Services, in conjunction with on the feature basis of internet electronic business analysis and early warning, for defect of the prior art, the object of this invention is to provide distributed interconnection Transaction Information storage processing method.
According to a kind of distributed interconnection Transaction Information storage processing method provided by the invention, comprising:
Operation data storage function module is from the trading information data of external data layer drawing-out structure;
Distributed Data Warehouse and non-relational database read non-structured network log data respectively from distributed file system;
Internet business information data warehouse extracts respectively after conversion loads described trading information data and network log data and carries out being integrated into data tuple from described operation data storage function module and described Distributed Data Warehouse, and stores this data tuple with Star Model;
Data Mart root from described internet business information data warehouse, and/or extracts conversion loading fairground specific data in described Distributed Data Warehouse;
Business intelligence system needs from described Data Mart according to data analysis, and/or obtains analysis desired data in described non-relational database.
As a kind of prioritization scheme, described Data Mart comprises exchanges, sells fairground and customer service fairground;
Described exchanges, sale fairground are extracted conversion according to service needed from described internet business information data warehouse and are loaded described fairground specific data;
Described customer service fairground is extracted conversion according to service needed and is loaded described fairground specific data from described Distributed Data Warehouse.
As a kind of prioritization scheme, between described customer service fairground and described business intelligence system, also carry out data transmission through MemCache caching system;
Described business intelligence system first checks data needed for asked analysis whether in MemCache caching system when obtaining analyze desired data to described customer service fairground, if have, then directly to obtain from MemCache caching system, if do not exist, then obtain from described customer service fairground analyze desired data and MemCache caching system buffer memory a.
As a kind of prioritization scheme, described business intelligent system is used for data mining, enterprise diagnosis, customer analysis, data file analysis and on-line analysis.
As a kind of prioritization scheme, described operation data storage function module comprises three-decker:
Mapping layer: the field mappings of former for the data of external data layer table in the local data base of operation data storage function module, complete the association of data from operation layer to analysis layer;
Data prediction layer: pre-service is carried out to described trading information data, this pre-service comprises integration, screening and increases contingency table;
Data cleansing layer: data cleansing operation is carried out for defective in quality trading information data.
As a kind of prioritization scheme, the fact table that described Star Model is positioned at star centre comprise some data tuple time address, number of transaction corresponding to domain addresses, store address, product address and this data tuple and dealing money;
The dimension table of described Star Model comprises shop dimension table, time dimension table, product dimension table and region dimension table;
Described shop dimension table comprises electric business's platform information, landing slab block message and shop management information;
Described time dimension table comprises temporal information;
Described product dimension table comprises name of product, product description, product price and product quality;
Described region dimension table comprises geographical location information, and this geographical location information comprises country, province, city.
As a kind of prioritization scheme, described operation data storage function module uses full dose load mode, is specially:
S101, empties the object table of described operation data storage function module,
S102, inserts this object table by the full dose trading information data of external data layer.
As a kind of prioritization scheme, described operation data storage function module uses step increment method mode, is specially:
S201, empties the temporary table of described operation data storage function module,
S202, inserts this temporary table by the increment trading information data of external data layer,
S203, the data that deletion object table and this temporary table repeat,
Data in temporary table are inserted after in object table and are returned step S1 until data all extract end by S204.
As a kind of prioritization scheme, the dimension table in described internet business information data warehouse is step increment method mode, and fact table is full dose load mode;
The dimension table step increment method process in internet business information data warehouse is specially:
S301, calculates sequence to incremental data according to the line number of dimension table historical data in temporary table,
S302, inserts temporary table by the incremental data with described sequence,
S303, empties the dimension table in internet business information data warehouse,
S304, associates the tables of data of described operation data storage function module with described temporary table, the described incremental data with described sequence is inserted the dimension table in internet business information data warehouse;
The step increment method process of fact table is specially:
After emptying temporary table, judge whether that index takes from different fact tables in addition;
If also have index to take from different fact tables, then repeat the process of the different business event of taking out in fact table within the scope of timestamp, terminate until all business event are all removed Posterior circle;
If do not have index to take from different fact tables, then take out the different business event in fact table within the scope of timestamp, the data of the first temporary table are aggregated in the second temporary table according to dimension field, delete the data repeated with this second temporary table in target fact table, the data of described second temporary table are inserted target fact table.
Compared with prior art, the present invention has following beneficial effect:
The present invention is directed to internet electronic business Transaction Information feature, design application oriented, integrated, that there is temporal characteristics, stable data acquisition, for transaction data process, analysis and monitoring decision-making provide support.
Accompanying drawing explanation
In order to be illustrated more clearly in the technical scheme of the embodiment of the present invention, below the accompanying drawing used required in describing embodiment is briefly described, obviously, accompanying drawing in the following describes is only some embodiments of the present invention, for those skilled in the art, under the prerequisite not paying creative work, other accompanying drawing can also be obtained according to these accompanying drawings.In accompanying drawing:
Fig. 1 is the core data warehouse Organization Chart in the BI solution of traditional industries;
Fig. 2 is a kind of distributed interconnection Transaction Information storage architecture schematic diagram in embodiment;
Fig. 3 is the principle framework of data warehouse;
Fig. 4 is a kind of distributed interconnection Transaction Information storage processing method schematic diagram in embodiment;
Fig. 5 is data warehouse Star Model schematic diagram;
Fig. 6 is that operation data storage function module (ODS) full dose loads process flow diagram;
Fig. 7 is operation data storage function module (ODS) step increment method process flow diagram;
Fig. 8 is the dimension table step increment method process flow diagram in internet business information data warehouse (DW);
Fig. 9 is fact table (DM) the step increment method process flow diagram of DW;
Figure 10 is operation data storage function module ODS, relation schematic diagram between internet business information data warehouse DW, fact table DM.
Embodiment
Hereafter in the mode of specific embodiment, the present invention is described in detail by reference to the accompanying drawings.Following examples will contribute to those skilled in the art and understand the present invention further, but not limit the present invention in any form.It should be pointed out that the embodiment that can also use other, or the amendment on 26S Proteasome Structure and Function is carried out to the embodiment enumerated herein, and can not depart from the scope and spirit of the present invention.
The present invention is directed to the application scenarios of internet business Data Analysis Services, in conjunction with on the feature basis of internet electronic business analysis and early warning, mainly comprise leading portion Reports module, ETL module, data warehouse module, database management module, data dispatch module and Web service module composition.The present invention is directed to internet electronic business Transaction Information feature, design application oriented, integrated, that have temporal characteristics, stable data acquisition, for transaction data process, analysis and monitoring decision-making provide support, overall architecture as shown in Figure 2.
In the embodiment of a kind of distributed interconnection Transaction Information storage processing method provided by the invention, as shown in Figure 2 and Figure 4, comprising:
Operation data storage function module is from the trading information data of external data layer drawing-out structure;
Distributed Data Warehouse and non-relational database read non-structured network log data respectively from distributed file system;
Internet business information data warehouse extracts respectively after conversion loads described trading information data and network log data and carries out being integrated into data tuple from described operation data storage function module and described Distributed Data Warehouse, and stores this data tuple with Star Model;
Data Mart root from described internet business information data warehouse, and/or extracts conversion loading fairground specific data in described Distributed Data Warehouse;
Business intelligence system needs from described Data Mart according to data analysis, and/or obtains analysis desired data in described non-relational database.
Described extraction conversion is loaded as ETL, is the abbreviation of English Extract-Transform-Load, is used for describing data are passed through extraction (extract) from source terminal, change (transform), loaded (load) process to destination.
Along with the development of ecommerce, need to carry out on-line analysis to the trading activity of user in real time, such as show all history access and the inquiry record of certain shop on net, simultaneously real-time tracing shows this shop on electric business's platform just in information such as the accessed pages, adopts the relevant database of traditional support off-line analysis and complex query to be difficult to satisfied such demand.Simultaneously, the data mining process of the semi-structured large data processing combination complexity that the increasing web log file of e-commerce industry, user behavior are such, therefore the present invention adopts the large data platform of power stone science and technology or Hadoop to realize mass data volume handling implement.The large data processing core module of power stone science and technology mainly comprises cloud database, cloud storage, search engine and data analysis, structuring, semi-structured, unstructured data can be processed, support standard interface, provide the data of one-stop robotization to dispose, move, back up, recovery, the function such as disaster tolerance.The principle framework of data warehouse as shown in Figure 3.
The data of data warehouse, by the data-interface of standard, are derived from internet electronic business transaction platform and open to applications.Data warehouse is divided into three-decker according to data flow: data Layer, Information Level and analysis layer, as shown in Figure 4.
Data Layer
By the standard data interface of propelling movement type, use the external data obtaining electric business's platform with the mode of the consistent model of electric business's platform database or middle table, then carry out data pick-up data grabber in other words by ODS, the form of extraction comprises XML and TXT etc.
Information Level
Operation data storage function module (ODS:Operation Data Storage) is increased in the middle of data Layer and internet business information data depot layer.Object is as a buffer pool, by the data integration of multiple data source a to extra buffer for data warehouse, effectively alleviate the pressure of data source and ETL.
Wherein, ODS comprises three-decker:
Mapping layer: the field mappings of former for the data of external data layer table in the local data base of operation data storage function module, complete the association of data from operation layer to analysis layer is also manage mapping layer by system to concentrate for external data.
Data prediction layer: pre-service is carried out to described trading information data, this pre-service comprises integration, screening and increases contingency table, and object is the work simplifying and promote ETL;
Data cleansing: data cleansing operation is carried out for defective in quality trading information data.
What ODS stored is all the internet business information data captured from all kinds of electric business's platform.
Analysis layer
By BI system and Hadoop instrument, the process such as data mining, enterprise diagnosis, customer analysis, Data support and on-line analysis are carried out to the trading information data of all kinds of electric business's platform and non-structured web log file.Described business intelligent system is used for data mining, enterprise diagnosis, customer analysis, data file analysis and on-line analysis.
The easy information data warehouse in internet for stores processor structuring trading information data adopts the synthesis of relational database, memory database and distributed data base, adopts applicable traditional BI service to carry out analyzing and processing for relational database; For the non-relational database (Nosql) based on the storage of a large amount of real time datas and the Hadoop distributed file system HDFS of real-time query analysis employing support HBase.
In embodiment as shown in Figure 4, described Data Mart comprises exchanges, sells fairground and customer service fairground;
Described exchanges, sale fairground are extracted conversion according to service needed from described internet business information data warehouse and are loaded described fairground specific data;
Described customer service fairground is extracted conversion according to service needed and is loaded described fairground specific data from described Distributed Data Warehouse.
Described exchanges are used for the transaction related information in stores processor buyer and shop, as exchange hour, number of transaction etc.
Described sale fairground is used for the sale related data in stores processor shop, as shop visit capacity, sales situation etc.
The calling information that described customer service fairground is used for stores processor buyer is mutual with the both sides in transaction.
Distributed Data Warehouse in the present embodiment is power stone cloud database, the distributed relation database all-in-one of high-performance, High Availabitity is provided, OLAP, OLTP and Combination application can be supported, support high-performance (distributed), High Availabitity, support that thermophoresis, Hot Spare, heat are recovered, support stsndard SQL, support main flow development language, support based on x86, Godson, soar, the chip server such as PowerPC, low to hardware requirement.
Also can adopt HBase, i.e. Hadoop Database, be one high reliablyproperty, high-performance, towards row, telescopic distributed memory system.
In Fig. 4, Distributed Data Warehouse and non-relational database read non-structured network log data respectively from distributed file system.When customer service needs to transfer transaction record and transaction data corresponding with it, directly extracted by ETL mode from HBASE, and if third-party business intelligence system needs to carry out statistical study to internet business data, without the need to detailed Transaction Information, then the direct network log data obtaining all kinds of electric business's platform from described NOSQL.Which thereby enhance travelling speed, make system storage process more efficient.
As a kind of embodiment, between described customer service fairground and described business intelligence system, also carry out data transmission through MemCache caching system;
Described business intelligence system first checks data needed for asked analysis whether in MemCache caching system when obtaining analyze desired data to described customer service fairground, if have, then directly to obtain from MemCache caching system, if do not exist, then obtain from described customer service fairground analyze desired data and MemCache caching system buffer memory a.
Memcache is a high performance distributed memory object caching system, and by safeguarding the huge hash table of a unification in internal memory, it can be used for storing the data of various form, comprises the result etc. of image, video, file and database retrieval.Be exactly briefly by data call in internal memory, then read from internal memory, thus greatly improve reading speed.
The metadata store pattern in internet business information data warehouse adopts the Star Model being applicable to dimension and being separated with the fact, as shown in Figure 5.Data are through pre-service, and the dimensional information about the fact has detached out and has been based upon in corresponding dimension table from the fact.Treatment scheme is: ODS layer is drawn into about the relevant data of operation flow from operation layer; Data Layer design (mainly comprising the common dimension such as time dimension table, region dimension table) is carried out according to business function at depot layer (DW layer); Fact table (DM layer) is the fact of historical data, does not have and repeats operational, and correspond to business association relation table, prestige table, visit capacity etc., fact table is the core of star structure, the trunk content of record main body.
In the embodiment as shown in fig .5, the fact table that described Star Model is positioned at star centre comprise some data tuple time address, number of transaction corresponding to domain addresses, store address, product address and this data tuple and dealing money;
The dimension table of described Star Model comprises shop dimension table, time dimension table, product dimension table and region dimension table;
Described shop dimension table comprises electric business's platform information, landing slab block message and shop management information;
Described time dimension table comprises temporal information;
Described product dimension table comprises name of product, product description, product price and product quality;
Described region dimension table comprises geographical location information, and this geographical location information comprises country, province, city.
As a kind of embodiment, as shown in Figure 6, described operation data storage function module uses full dose load mode, is specially:
S101, empties the object table of described operation data storage function module,
S102, inserts this object table by the full dose trading information data of external data layer.
As a kind of embodiment, as shown in Figure 7, described operation data storage function module uses step increment method mode, is specially:
S201, empties the temporary table of described operation data storage function module,
S202, inserts this temporary table by the increment trading information data of external data layer,
S203, the data that deletion object table and this temporary table repeat,
Data in temporary table are inserted after in object table and are returned step S1 until data all extract end by S204.
As a kind of embodiment, as shown in Figure 8, the dimension table in described internet business information data warehouse is step increment method mode, and fact table is full dose load mode;
The dimension table step increment method process in internet business information data warehouse is specially:
S301, calculates sequence to incremental data according to the line number of dimension table historical data in temporary table,
S302, inserts temporary table by the incremental data with described sequence,
S303, empties the dimension table in internet business information data warehouse,
S304, associates the tables of data of described operation data storage function module with described temporary table, the described incremental data with described sequence is inserted the dimension table in internet business information data warehouse.
As a kind of embodiment, as shown in Figure 9, the step increment method process of fact table is specially:
After emptying temporary table, judge whether that index takes from different fact tables in addition;
If also have index to take from different fact tables, then repeat the process of the different business event of taking out in fact table within the scope of timestamp, terminate until all business event are all removed Posterior circle;
If do not have index to take from different fact tables, then take out the different business event in fact table within the scope of timestamp, the data of the first temporary table are aggregated in the second temporary table according to dimension field, delete the data repeated with this second temporary table in target fact table, the data of described second temporary table are inserted target fact table.
One is all also comprised by the step of the situation of insertion writing system daily record after all kinds of full dose loading of the present embodiment intelligence or step increment method complete.
Adopt Multi-dimension on-line analytical process (OLAP:Online Analysis Processing) to come according to different business demand, from different demand angle (as visual angles such as sale, customer service, finance, time, region, industries), alternate analysis is carried out to the related data from other data structure.By presenting the analysis of data and the multidimensional of leading portion system, realize internet business information display and real-time trend analysis early warning.First dimension and indication information is extracted; Secondly because dimension between each main body is all separate when defining, in order to consistance and the incidence relation of data, will set up the relation information of each analysis personnel dimension, realization body is expanded and is associated; After establishing data model, utilize ETL that the data in data warehouse are carried out corresponding statistical summaries according to customer demand and obtain multidimensional analysis data, finally form form.
The present invention devises rational system architecture and the design of the ETL adapted, data warehouse and data dimension, makes it be applicable to the stores processor of internet business information.
Consider and the data of the electric business's platform of difference are processed and stored, so the demand of application layer is different; Be not only that unitem can be used to the changes in demand of user simultaneously, so need to take into account the demand of search efficiency and data dynamics and good extensibility simultaneously, adopt star-like Multidimensional Data Model as data warehouse model.
Through pretreated data, the dimensional information about fact table extracted from the fact is based upon in corresponding dimension table.Therefore, processing layer functional module only needs to carry out inquiry to fact table just can obtain Transaction Information, substantially increases the efficiency of access.Operation data storage function module ODS, relation schematic diagram between internet business information data warehouse DW, fact table DM are as shown in Figure 10.
ODS layer correspondence is drawn into the data about internet business main body and behavior from data Layer; DW layer is data warehouse layer, main store to tie up correlation time with internet business show, region ties up and the public information such as to show and tie up and show; DM corresponding fact table, the namely core of hub-and-spoke configuration, the information such as the incidence relation of record transaction.
ETL update mechanism, warehouse upgrades at first, and fairground is in renewal; Dimension first upgrades, the true mechanism upgraded again.
The foregoing is only preferred embodiment of the present invention, those skilled in the art know, without departing from the spirit and scope of the present invention, can carry out various change or equivalent replacement to these characteristic sum embodiments.In addition, under the teachings of the present invention, can modify to adapt to concrete situation and material to these characteristic sum embodiments and can not the spirit and scope of the present invention be departed from.Therefore, the present invention is not by the restriction of specific embodiment disclosed herein, and the embodiment in the right of all the application of falling into all belongs to protection scope of the present invention.

Claims (9)

1. a distributed interconnection Transaction Information storage processing method, is characterized in that, comprising:
Operation data storage function module is from the trading information data of external data layer drawing-out structure;
Distributed Data Warehouse and non-relational database read non-structured network log data respectively from distributed file system;
Internet business information data warehouse extracts respectively after conversion loads described trading information data and network log data and carries out being integrated into data tuple from described operation data storage function module and described Distributed Data Warehouse, and stores this data tuple with Star Model;
Data Mart from described internet business information data warehouse, and/or extracts conversion loading fairground specific data in described Distributed Data Warehouse;
Business intelligence system needs from described Data Mart according to data analysis, and/or obtains analysis desired data in described non-relational database.
2. a kind of distributed interconnection Transaction Information storage processing method according to claim 1, is characterized in that, described Data Mart comprises exchanges, sells fairground and customer service fairground;
Described exchanges, sale fairground are extracted conversion according to service needed from described internet business information data warehouse and are loaded described fairground specific data;
Described customer service fairground is extracted conversion according to service needed and is loaded described fairground specific data from described Distributed Data Warehouse.
3. a kind of distributed interconnection Transaction Information storage processing method according to claim 2, is characterized in that, also carry out data transmission through MemCache caching system between described customer service fairground and described business intelligence system;
Described business intelligence system first checks data needed for asked analysis whether in MemCache caching system when obtaining analyze desired data to described customer service fairground, if have, then directly to obtain from MemCache caching system, if do not exist, then obtain from described customer service fairground analyze desired data and MemCache caching system buffer memory a.
4. a kind of distributed interconnection Transaction Information storage processing method according to claim 1, is characterized in that, described business intelligent system is used for data mining, enterprise diagnosis, customer analysis, data file analysis and on-line analysis.
5. a kind of distributed interconnection Transaction Information storage processing method according to claim 1, it is characterized in that, described operation data storage function module comprises three-decker:
Mapping layer: the field mappings of former for the data of external data layer table in the local data base of operation data storage function module, complete the association of data from operation layer to analysis layer;
Data prediction layer: pre-service is carried out to described trading information data, this pre-service comprises integration, screening and increases contingency table;
Data cleansing layer: data cleansing operation is carried out for defective in quality trading information data.
6. a kind of distributed interconnection Transaction Information storage processing method according to claim 1, it is characterized in that, the fact table that described Star Model is positioned at star centre comprise some data tuple time address, number of transaction corresponding to domain addresses, store address, product address and this data tuple and dealing money;
The dimension table of described Star Model comprises shop dimension table, time dimension table, product dimension table and region dimension table;
Described shop dimension table comprises electric business's platform information, landing slab block message and shop management information;
Described time dimension table comprises temporal information;
Described product dimension table comprises name of product, product description, product price and product quality;
Described region dimension table comprises geographical location information, and this geographical location information comprises country, province, city.
7. a kind of distributed interconnection Transaction Information storage processing method according to claim 1, is characterized in that, described operation data storage function module uses full dose load mode, is specially:
S101, empties the object table of described operation data storage function module,
S102, inserts this object table by the full dose trading information data of external data layer.
8. a kind of distributed interconnection Transaction Information storage processing method according to claim 1, is characterized in that, described operation data storage function module uses step increment method mode, is specially:
S201, empties the temporary table of described operation data storage function module,
S202, inserts this temporary table by the increment trading information data of external data layer,
S203, the data that deletion object table and this temporary table repeat,
Data in temporary table are inserted after in object table and are returned step S1 until data all extract end by S204.
9. a kind of distributed interconnection Transaction Information storage processing method according to claim 1, is characterized in that, the dimension table in described internet business information data warehouse is step increment method mode, and fact table is full dose load mode;
The dimension table step increment method process in internet business information data warehouse is specially:
S301, calculates sequence to incremental data according to the line number of dimension table historical data in temporary table,
S302, inserts temporary table by the incremental data with described sequence,
S303, empties the dimension table in internet business information data warehouse,
S304, associates the tables of data of described operation data storage function module with described temporary table, the described incremental data with described sequence is inserted the dimension table in internet business information data warehouse;
The step increment method process of fact table is specially:
After emptying temporary table, judge whether that index takes from different fact tables in addition;
If also have index to take from different fact tables, then repeat the process of the different business event of taking out in fact table within the scope of timestamp, terminate until all business event are all removed Posterior circle;
If do not have index to take from different fact tables, then take out the different business event in fact table within the scope of timestamp, the data of the first temporary table are aggregated in the second temporary table according to dimension field, delete the data repeated with this second temporary table in target fact table, the data of described second temporary table are inserted target fact table.
CN201510302559.4A 2015-06-04 2015-06-04 Distributed interconnection Transaction Information storage processing method Active CN104933112B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510302559.4A CN104933112B (en) 2015-06-04 2015-06-04 Distributed interconnection Transaction Information storage processing method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510302559.4A CN104933112B (en) 2015-06-04 2015-06-04 Distributed interconnection Transaction Information storage processing method

Publications (2)

Publication Number Publication Date
CN104933112A true CN104933112A (en) 2015-09-23
CN104933112B CN104933112B (en) 2018-12-21

Family

ID=54120280

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510302559.4A Active CN104933112B (en) 2015-06-04 2015-06-04 Distributed interconnection Transaction Information storage processing method

Country Status (1)

Country Link
CN (1) CN104933112B (en)

Cited By (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105320757A (en) * 2015-10-19 2016-02-10 杭州华量软件有限公司 Business intelligent analysis method for quickly processing data
CN105590259A (en) * 2015-11-04 2016-05-18 中国银联股份有限公司 Device and method for diagnosis of transaction system
CN105589940A (en) * 2015-12-16 2016-05-18 南京联成科技发展有限公司 Safety management operation and maintenance service platform based on unstructured real-time database
CN105653696A (en) * 2015-12-29 2016-06-08 台山核电合营有限公司 Data processing method and system for nuclear power plant databases
CN105787660A (en) * 2016-02-24 2016-07-20 国家电网公司 Information management system for photovoltaic power distribution network
JP2016170778A (en) * 2015-03-10 2016-09-23 技研商事インターナショナル株式会社 Market area analysis system
CN106227862A (en) * 2016-07-29 2016-12-14 浪潮软件集团有限公司 E-commerce data integration method based on distribution
CN106934023A (en) * 2017-03-13 2017-07-07 山东浪潮云服务信息科技有限公司 A kind of data managing method and device
CN107832392A (en) * 2017-10-31 2018-03-23 链家网(北京)科技有限公司 A kind of metadata management system
CN107944866A (en) * 2017-10-17 2018-04-20 厦门市美亚柏科信息股份有限公司 Transaction record rearrangement and computer-readable recording medium
CN107958046A (en) * 2017-11-24 2018-04-24 小花互联网金融服务(深圳)有限公司 Internet finance big data warehouse analysis mining method
CN108280084A (en) * 2017-01-06 2018-07-13 上海前隆信息科技有限公司 A kind of construction method of data warehouse, system and server
CN108595685A (en) * 2018-05-04 2018-09-28 北京顶象技术有限公司 A kind of data processing method and device
CN108733758A (en) * 2018-04-11 2018-11-02 北京三快在线科技有限公司 Hotel's static data method for pushing, device, electronic equipment and readable storage medium storing program for executing
CN109189861A (en) * 2018-06-29 2019-01-11 深圳市彬讯科技有限公司 Data stream statistics method, server and storage medium based on index
CN109325648A (en) * 2018-06-29 2019-02-12 深圳市彬讯科技有限公司 Multi-dimensional data stream statistics method, server and storage medium based on index
CN109656910A (en) * 2018-12-06 2019-04-19 哈尔滨工业大学 Expansible Large Scale Biology medicine sample management and Visualization Platform
CN111581254A (en) * 2020-05-03 2020-08-25 上海维信荟智金融科技有限公司 ETL method and system based on internet financial data
CN112256523A (en) * 2020-09-23 2021-01-22 贝壳技术有限公司 Service data processing method and device
CN112380218A (en) * 2020-11-18 2021-02-19 浪潮天元通信信息系统有限公司 ETL-based automatic triggering method for summarizing data tables of data warehouse layers
CN112395345A (en) * 2020-12-04 2021-02-23 江苏苏宁云计算有限公司 HBase full data import method and device, computer equipment and storage medium
CN112416630A (en) * 2020-12-10 2021-02-26 湖南新云网科技有限公司 Data flow architecture and data processing method
CN112650738A (en) * 2020-12-31 2021-04-13 广西中科曙光云计算有限公司 Construction method of open database
CN112947844A (en) * 2019-12-11 2021-06-11 北京金山云网络技术有限公司 Data storage method and device, electronic equipment and medium
CN113362018A (en) * 2021-05-25 2021-09-07 北京明略软件系统有限公司 Conference time processing method and system
CN113515362A (en) * 2021-07-12 2021-10-19 广州云从洪荒智能科技有限公司 Data processing method, data processing device, computer equipment and storage medium
CN113742320A (en) * 2021-11-05 2021-12-03 亿景智联(北京)科技有限公司 Management method and device of OLAP data warehouse
WO2022133981A1 (en) * 2020-12-25 2022-06-30 京东方科技集团股份有限公司 Data processing method, platform, computer-readable storage medium, and electronic device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102043841A (en) * 2010-12-10 2011-05-04 上海市城市建设设计研究院 Multi-source information supplying method based on Web technology and integrated service system thereof
CN102867282A (en) * 2012-09-13 2013-01-09 福建富士通信息软件有限公司 Implementation method for mobile Internet-based customer service quality analysis system
CN103678665A (en) * 2013-12-24 2014-03-26 焦点科技股份有限公司 Heterogeneous large data integration method and system based on data warehouses
CN104298779A (en) * 2014-11-04 2015-01-21 中国银行股份有限公司 Processing method and system for massive data processing

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102043841A (en) * 2010-12-10 2011-05-04 上海市城市建设设计研究院 Multi-source information supplying method based on Web technology and integrated service system thereof
CN102867282A (en) * 2012-09-13 2013-01-09 福建富士通信息软件有限公司 Implementation method for mobile Internet-based customer service quality analysis system
CN103678665A (en) * 2013-12-24 2014-03-26 焦点科技股份有限公司 Heterogeneous large data integration method and system based on data warehouses
CN104298779A (en) * 2014-11-04 2015-01-21 中国银行股份有限公司 Processing method and system for massive data processing

Cited By (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2016170778A (en) * 2015-03-10 2016-09-23 技研商事インターナショナル株式会社 Market area analysis system
CN105320757A (en) * 2015-10-19 2016-02-10 杭州华量软件有限公司 Business intelligent analysis method for quickly processing data
CN105590259A (en) * 2015-11-04 2016-05-18 中国银联股份有限公司 Device and method for diagnosis of transaction system
CN105589940A (en) * 2015-12-16 2016-05-18 南京联成科技发展有限公司 Safety management operation and maintenance service platform based on unstructured real-time database
CN105653696A (en) * 2015-12-29 2016-06-08 台山核电合营有限公司 Data processing method and system for nuclear power plant databases
CN105787660A (en) * 2016-02-24 2016-07-20 国家电网公司 Information management system for photovoltaic power distribution network
CN106227862A (en) * 2016-07-29 2016-12-14 浪潮软件集团有限公司 E-commerce data integration method based on distribution
CN108280084A (en) * 2017-01-06 2018-07-13 上海前隆信息科技有限公司 A kind of construction method of data warehouse, system and server
CN106934023A (en) * 2017-03-13 2017-07-07 山东浪潮云服务信息科技有限公司 A kind of data managing method and device
CN107944866B (en) * 2017-10-17 2021-08-31 厦门市美亚柏科信息股份有限公司 Transaction record duplication elimination method and computer-readable storage medium
CN107944866A (en) * 2017-10-17 2018-04-20 厦门市美亚柏科信息股份有限公司 Transaction record rearrangement and computer-readable recording medium
CN107832392A (en) * 2017-10-31 2018-03-23 链家网(北京)科技有限公司 A kind of metadata management system
CN107958046A (en) * 2017-11-24 2018-04-24 小花互联网金融服务(深圳)有限公司 Internet finance big data warehouse analysis mining method
CN108733758A (en) * 2018-04-11 2018-11-02 北京三快在线科技有限公司 Hotel's static data method for pushing, device, electronic equipment and readable storage medium storing program for executing
CN108733758B (en) * 2018-04-11 2022-04-05 北京三快在线科技有限公司 Hotel static data pushing method and device, electronic equipment and readable storage medium
CN108595685A (en) * 2018-05-04 2018-09-28 北京顶象技术有限公司 A kind of data processing method and device
CN109189861A (en) * 2018-06-29 2019-01-11 深圳市彬讯科技有限公司 Data stream statistics method, server and storage medium based on index
CN109325648A (en) * 2018-06-29 2019-02-12 深圳市彬讯科技有限公司 Multi-dimensional data stream statistics method, server and storage medium based on index
CN109656910A (en) * 2018-12-06 2019-04-19 哈尔滨工业大学 Expansible Large Scale Biology medicine sample management and Visualization Platform
CN109656910B (en) * 2018-12-06 2021-04-13 哈尔滨工业大学 Extensible large-scale biomedical sample management and visualization platform
CN112947844A (en) * 2019-12-11 2021-06-11 北京金山云网络技术有限公司 Data storage method and device, electronic equipment and medium
CN111581254A (en) * 2020-05-03 2020-08-25 上海维信荟智金融科技有限公司 ETL method and system based on internet financial data
CN112256523A (en) * 2020-09-23 2021-01-22 贝壳技术有限公司 Service data processing method and device
CN112380218A (en) * 2020-11-18 2021-02-19 浪潮天元通信信息系统有限公司 ETL-based automatic triggering method for summarizing data tables of data warehouse layers
CN112380218B (en) * 2020-11-18 2023-03-28 浪潮通信信息系统有限公司 ETL-based automatic triggering method for summarizing data tables of data warehouse layers
CN112395345A (en) * 2020-12-04 2021-02-23 江苏苏宁云计算有限公司 HBase full data import method and device, computer equipment and storage medium
CN112416630A (en) * 2020-12-10 2021-02-26 湖南新云网科技有限公司 Data flow architecture and data processing method
WO2022133981A1 (en) * 2020-12-25 2022-06-30 京东方科技集团股份有限公司 Data processing method, platform, computer-readable storage medium, and electronic device
CN112650738B (en) * 2020-12-31 2021-09-21 广西中科曙光云计算有限公司 Construction method of open database
CN112650738A (en) * 2020-12-31 2021-04-13 广西中科曙光云计算有限公司 Construction method of open database
CN113362018A (en) * 2021-05-25 2021-09-07 北京明略软件系统有限公司 Conference time processing method and system
CN113515362A (en) * 2021-07-12 2021-10-19 广州云从洪荒智能科技有限公司 Data processing method, data processing device, computer equipment and storage medium
CN113515362B (en) * 2021-07-12 2023-10-20 广州云从洪荒智能科技有限公司 Data processing method, device, computer equipment and storage medium
CN113742320A (en) * 2021-11-05 2021-12-03 亿景智联(北京)科技有限公司 Management method and device of OLAP data warehouse
CN113742320B (en) * 2021-11-05 2022-03-01 亿景智联(北京)科技有限公司 Management method and device of OLAP data warehouse

Also Published As

Publication number Publication date
CN104933112B (en) 2018-12-21

Similar Documents

Publication Publication Date Title
CN104933112A (en) Distributed Internet transaction information storage and processing method
CN103365929B (en) The management method of a kind of data base connection and system
CN111459985B (en) Identification information processing method and device
CN104750681B (en) A kind of processing method and processing device of mass data
Bao et al. Managing massive trajectories on the cloud
US9798813B2 (en) Extensible person container
CN112767058A (en) AIOT DaaS digital twin cloud platform
CN103268336A (en) Fast data and big data combined data processing method and system
CN103678665A (en) Heterogeneous large data integration method and system based on data warehouses
CN103440288A (en) Big data storage method and device
CN104239377A (en) Platform-crossing data retrieval method and device
CN110990474A (en) Regional industry image analysis method and device
CN110674152B (en) Data synchronization method and device, storage medium and electronic equipment
US20190050435A1 (en) Object data association index system and methods for the construction and applications thereof
CN112148718A (en) Big data support management system for city-level data middling station
CN107832392A (en) A kind of metadata management system
CN110837520A (en) Data processing method, platform and system
CN105630934A (en) Data statistic method and system
CN111737364B (en) Safe multi-party data fusion and federal sharing method, device, equipment and medium
CN107832463A (en) A kind of finance data service platform
CN113127741B (en) Cache method for reading and writing data of mass users and posts in part-time post recommendation system
CN102945270B (en) Parallel distribution type network public opinion data management method and system
CN109446167A (en) A kind of storage of daily record data, extracting method and device
Singh et al. Easy designing steps of a local data warehouse for possible analytical data processing
CN111708895A (en) Method and device for constructing knowledge graph system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: Distributed Internet transaction information storage and processing method

Effective date of registration: 20200306

Granted publication date: 20181221

Pledgee: Huaxia Bank Co., Ltd. Hangzhou Yuhang sub branch

Pledgor: Zhejiang Li Shi Science and Technology Co., Ltd.

Registration number: Y2020330000080

PE01 Entry into force of the registration of the contract for pledge of patent right