CN105740462A - Method for supporting data migration between different environments - Google Patents

Method for supporting data migration between different environments Download PDF

Info

Publication number
CN105740462A
CN105740462A CN201610116440.2A CN201610116440A CN105740462A CN 105740462 A CN105740462 A CN 105740462A CN 201610116440 A CN201610116440 A CN 201610116440A CN 105740462 A CN105740462 A CN 105740462A
Authority
CN
China
Prior art keywords
data
etl
database
migration method
java
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610116440.2A
Other languages
Chinese (zh)
Inventor
程永新
胡永
刘赞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SHANGHAI XINJU NETWORK INFORMATION TECHNOLOGY Co Ltd
Original Assignee
SHANGHAI XINJU NETWORK INFORMATION TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SHANGHAI XINJU NETWORK INFORMATION TECHNOLOGY Co Ltd filed Critical SHANGHAI XINJU NETWORK INFORMATION TECHNOLOGY Co Ltd
Priority to CN201610116440.2A priority Critical patent/CN105740462A/en
Publication of CN105740462A publication Critical patent/CN105740462A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/214Database migration support

Abstract

The invention discloses a method for supporting data migration between different environments. The method includes the following steps of firstly, data extraction and storage, wherein to-be-migrated data of tables under a data source user is extracted in an ETL mode piece by piece according to the Java database connection technology and is temporarily stored in a Java stack memory; secondly, data operation and analysis, wherein the data temporarily stored in the memory is read according to the Java IO stream technology, according to the type of a database into which the data is to be written and the number of field columns of the tables, the data is converted into line records matched with the standard of the database, and the data is packaged into an available data object meeting ETL; thirdly, database writing-in of the data, wherein the standardized data is distributed to ETL multithread tasks according to the ETL database writing-in mechanism, continuous and rapid writing-in operation is conducted, and the data is inserted table by table till the line records are completely written to the tail. By means of the method, the data migration work can be conducted more efficiently and stably, faults can be reduced, and operation is easy.

Description

A kind of data migration method supported between varying environment
Technical field
The present invention relates to a kind of data migration method, particularly relate to a kind of data migration method supported between varying environment.
Background technology
Current internet has been enter into the epoch of data, in the face of growing mass data, it is necessary to be managed, test, analyze.Such as: data derive test library from storage facility located at processing plant, import certain a part of data from disparate databases, data analysis is done to new data base, inevitably need to be timed, quantitatively, have for Data Migration, how solving that the data efficient between varying environment migrates is emphasis.
Data are stored in different data bases, and such as: ORACLE, DB2, MYSQL etc., each data base it is also contemplated that concrete version.A such as scene, it is necessary to extract a part of data in these several data sources, import to new MYSQL storehouse and be analyzed, then complete this process, very time-consuming, effort, substantially increase the job costs of enterprise.
The Data Migration of current data base, major part is performed manually by, carry out data combing, verification by the dba personnel of specialty, then utilize export (swf) command to be operated, then import at target source, this can also use at same type of database manipulation, if but related to inter-library, the data importing a table of Cross-environment, or the other data volume of millions, whole process is extremely complex by what become, and difficulty.
Additionally, the mode of operation of this order, there is no daily record, also without the result data that interface is relevant, after having processed, it is not easy management and represents, also cannot verifying whether to meet result requirement, even occur that data are confusing, mistake such as repeats at the situation, the effective migration mechanism of neither one manages, and quality, efficiency all cannot be protected.
Therefore, the Data Transference Technology of existing database has a disadvantage in that 1) transport efficiency is low, every secondary entrance different pieces of information lab environment, performs loaded down with trivial details order;2) level requirement for treatment people is higher;3) do not keep track of, it is impossible to statistics.
Summary of the invention
The technical problem to be solved is to provide a kind of data migration method supported between varying environment, can allow the work of Data Migration, more smoothly run efficiently, reduce the generation of fault, and it is simple to operate, the problem being substantially reduced difficulty of starting with, ordinary person can be on duty through simple training.
The present invention solves that above-mentioned technical problem employed technical scheme comprise that a kind of data migration method supported between varying environment of offer, comprise the steps: a) data pick-up storage: utilize Java to connect the technology of data base, the data of data source user's following table to be migrated are extracted by ETL mode one by one, and is temporarily stored to the stack memory of Java;B) data operation analysis: utilize the IO Flow Technique of Java to read the data of interim internal memory, field number of columns according to the type of database to put in storage and table, convert data to mate the row record of this normalizing database, and be packaged into meet ETL can data object;C) data loading: utilize data base's writing mechanism of ETL, the data that specification is good are assigned to the task of ETL multithreading, carries out quickly write operation continuously, inserts data by table, until row record is completely written to end.
The above-mentioned data migration method between support varying environment, wherein, described data base is ORACLE, DB2 or MYSQL, and described step a) adopts multiple table data of multithreading operation support to extract simultaneously.
The above-mentioned data migration method between support varying environment, wherein, in described step c), ETL multithreading adopts queue to make to migrate data and buffers into storehouse, and it is 2~30,000 row/second that average data inserts speed.
The above-mentioned data migration method between support varying environment, wherein, described data migration method also includes arranging all-the-way tracking record, the data volume that statistics migrates, and adopts visualization display interface to carry out complete monitoring according to log recording.
The above-mentioned data migration method between support varying environment, wherein, described data migration method also includes arranging execution migration strategy with the form of task scheduling, and data source, target source and user right are configured management.
The present invention contrasts prior art following beneficial effect: the data migration method between support varying environment provided by the invention, stored by data pick-up, data operation analysis and data loading can allow the work of Data Migration, more smoothly run efficiently, reduce the generation of fault, and it is simple to operate, the problem being substantially reduced difficulty of starting with, ordinary person can be on duty through simple training.
Accompanying drawing explanation
Fig. 1 is the data mover system configuration diagram that the present invention supports between varying environment;
Fig. 2 is the Data Migration schematic flow sheet that the present invention supports between varying environment.
Detailed description of the invention
Below in conjunction with drawings and Examples, the invention will be further described.
Fig. 1 is the data mover system configuration diagram that the present invention supports between varying environment;Fig. 2 is the Data Migration schematic flow sheet that the present invention supports between varying environment.
Refer to Fig. 1 and Fig. 2, the data migration method between support varying environment provided by the invention, comprise the steps:
A) data pick-up storage: utilize Java to connect the technology of data base, the data of data source user's following table to be migrated are extracted by ETL (ExtractTransformLoad) mode one by one, and is temporarily stored to the stack memory of Java;
B) data operation analysis: utilize the IO Flow Technique of Java to read the data of interim internal memory, field number of columns according to the type of database to put in storage and table, convert data to mate the row record of this normalizing database, and be packaged into meet ETL can data object;
C) data loading: utilize data base's writing mechanism of ETL, the data that specification is good are assigned to the task of ETL multithreading, carries out quickly write operation continuously, inserts data by table, until row record is completely written to end.
The present invention data by specifying for data base, carry out quantitative extraction storage, operational analysis, include queue in, adopt buffer memory warehouse-in to insert;And with the format management migration strategy of task, the monitoring of whole process is done by the mode of journal displaying, efficiently solve the data migration problems between varying environment, attendant only needs to pass through interface operation, start to put in order and can complete work, the mode at complicated background commands interface is simplified operation, and the problem that start with difficulty is greatly reduced, ordinary person can be on duty through simple training.Core link is divided into following 3 layers:
1) data pick-up accumulation layer:
Mainly realize flow process: utilizing Java to connect the technology of data base, the data of certain data source user's following table are extracted by ETL mode one by one, and is temporarily stored to the stack memory of Java, this step completes data pick-up action.(this layer can support the different editions of the data bases such as ORACLE, DB2, MYSQL, supports that multiple table data extract simultaneously, supports that multithreading runs.The big data-handling capacity such as bill/inventory is on average in 0.4-1GB left and right per minute, and concrete processing speed is looked CPU, IO speed and network network speed and determined).
2) data operation analysis layer:
Mainly realize flow process: utilize the IO Flow Technique of Java to read the data of interim internal memory, field number of columns according to data base's (ORACLE, DB2, MYSQL) type to put in storage and table, convert data to mate the row record of this normalizing database, and be packaged into meet ETL can data object, this step completes data operation analysis action.
3) data efficient warehouse-in:
Mainly realizing flow process: utilize data base's writing mechanism of ETL, the data that specification is good are assigned to the task of ETL multithreading, carry out quickly write operation continuously, insert data by table, until row record is completely written to end, this step completes data efficient warehouse-in.(speed, on average up to 2~3w row/per second, supports disparate databases type, for instance ORACLE, DB2, MYSQL etc.)
In sum, particularly advantage of the invention is as follows: 1) greatly simplify specialty command operation difficulty, reduces attendant and requires threshold;2) the pattern striking order line, it has been advanced to visualized operation pattern;3) whole process has track record, it is possible to the data volume that statistics migrates;4) promote the transport efficiency of data, reduce cost of labor, reduce the fault because tired maloperation brings, indirectly decrease the loss of enterprise.
Although the present invention discloses as above with preferred embodiment; so it is not limited to the present invention, any those skilled in the art, without departing from the spirit and scope of the present invention; when doing a little amendment and perfect, therefore protection scope of the present invention is when with being as the criterion that claims define.

Claims (5)

1. the data migration method supported between varying environment, it is characterised in that comprise the steps:
A) data pick-up storage: utilize Java to connect the technology of data base, the data of data source user's following table to be migrated are extracted by ETL mode one by one, and is temporarily stored to the stack memory of Java;
B) data operation analysis: utilize the IO Flow Technique of Java to read the data of interim internal memory, field number of columns according to the type of database to put in storage and table, convert data to mate the row record of this normalizing database, and be packaged into meet ETL can data object;
C) data loading: utilize data base's writing mechanism of ETL, the data that specification is good are assigned to the task of ETL multithreading, carries out quickly write operation continuously, inserts data by table, until row record is completely written to end.
2. the data migration method supported between varying environment as claimed in claim 1, it is characterised in that described data base is ORACLE, DB2 or MYSQL, and described step a) adopts multiple table data of multithreading operation support to extract simultaneously.
3. the data migration method supported between varying environment as claimed in claim 1, it is characterised in that in described step c), ETL multithreading adopts queue to make to migrate data and buffers into storehouse, and it is 2~30,000 row/second that average data inserts speed.
4. the data migration method supported between varying environment as claimed in claim 1, it is characterized in that, described data migration method also includes arranging all-the-way tracking record, the data volume that statistics migrates, and adopts visualization display interface to carry out complete monitoring according to log recording.
5. the data migration method supported between varying environment as claimed in claim 1, it is characterised in that described data migration method also includes arranging execution migration strategy with the form of task scheduling, and data source, target source and user right are configured management.
CN201610116440.2A 2016-03-02 2016-03-02 Method for supporting data migration between different environments Pending CN105740462A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610116440.2A CN105740462A (en) 2016-03-02 2016-03-02 Method for supporting data migration between different environments

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610116440.2A CN105740462A (en) 2016-03-02 2016-03-02 Method for supporting data migration between different environments

Publications (1)

Publication Number Publication Date
CN105740462A true CN105740462A (en) 2016-07-06

Family

ID=56248951

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610116440.2A Pending CN105740462A (en) 2016-03-02 2016-03-02 Method for supporting data migration between different environments

Country Status (1)

Country Link
CN (1) CN105740462A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106547892A (en) * 2016-11-01 2017-03-29 山东浪潮云服务信息科技有限公司 A kind of data resource management platform gathered based on internet data
CN106777345A (en) * 2017-01-16 2017-05-31 山东浪潮商用系统有限公司 A kind of data pick-up loading method based on mass data migration
CN107943889A (en) * 2017-11-16 2018-04-20 郑州云海信息技术有限公司 A kind of migration scheme of MySQL to KDB databases
CN108647270A (en) * 2018-04-28 2018-10-12 尚谷科技(天津)有限公司 A method of the Data Migration based on fault-tolerant time daily record
CN111367890A (en) * 2020-03-13 2020-07-03 深圳壹账通智能科技有限公司 Data migration method and device, computer equipment and readable storage medium
CN112597221A (en) * 2020-12-17 2021-04-02 四川新网银行股份有限公司 Test environment data extraction optimization execution method based on cross section data

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101739436A (en) * 2009-09-28 2010-06-16 孙彬 XML-based flexible data migration method
CN102156699A (en) * 2010-02-11 2011-08-17 陈巍 Data migration method based on JDOM revolving technology
US20120265726A1 (en) * 2011-04-18 2012-10-18 Infosys Limited Automated data warehouse migration
CN102831211A (en) * 2012-08-14 2012-12-19 中山大学 Data sheet migration method based on sheet relation analysis
CN103440273A (en) * 2013-08-06 2013-12-11 北京航空航天大学 Data cross-platform migration method and device
CN104915341A (en) * 2014-03-10 2015-09-16 中国科学院沈阳自动化研究所 Visual multi-database ETL integration method and system
CN105095425A (en) * 2015-07-17 2015-11-25 北京京东尚科信息技术有限公司 Cross-database transfer method and device for databases

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101739436A (en) * 2009-09-28 2010-06-16 孙彬 XML-based flexible data migration method
CN102156699A (en) * 2010-02-11 2011-08-17 陈巍 Data migration method based on JDOM revolving technology
US20120265726A1 (en) * 2011-04-18 2012-10-18 Infosys Limited Automated data warehouse migration
CN102831211A (en) * 2012-08-14 2012-12-19 中山大学 Data sheet migration method based on sheet relation analysis
CN103440273A (en) * 2013-08-06 2013-12-11 北京航空航天大学 Data cross-platform migration method and device
CN104915341A (en) * 2014-03-10 2015-09-16 中国科学院沈阳自动化研究所 Visual multi-database ETL integration method and system
CN105095425A (en) * 2015-07-17 2015-11-25 北京京东尚科信息技术有限公司 Cross-database transfer method and device for databases

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
JEYSON的博客: "数据迁移工具kettle简单上手", 《HTTPS://BLOG.CSDN.NET/JS_SKY/ARTICLE/DETAILS/47833793?LOCATIONNUM=11》 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106547892A (en) * 2016-11-01 2017-03-29 山东浪潮云服务信息科技有限公司 A kind of data resource management platform gathered based on internet data
CN106777345A (en) * 2017-01-16 2017-05-31 山东浪潮商用系统有限公司 A kind of data pick-up loading method based on mass data migration
CN106777345B (en) * 2017-01-16 2020-07-28 浪潮软件科技有限公司 Data extraction loading method based on mass data migration
CN107943889A (en) * 2017-11-16 2018-04-20 郑州云海信息技术有限公司 A kind of migration scheme of MySQL to KDB databases
CN108647270A (en) * 2018-04-28 2018-10-12 尚谷科技(天津)有限公司 A method of the Data Migration based on fault-tolerant time daily record
CN111367890A (en) * 2020-03-13 2020-07-03 深圳壹账通智能科技有限公司 Data migration method and device, computer equipment and readable storage medium
CN112597221A (en) * 2020-12-17 2021-04-02 四川新网银行股份有限公司 Test environment data extraction optimization execution method based on cross section data
CN112597221B (en) * 2020-12-17 2023-04-11 四川新网银行股份有限公司 Test environment data extraction optimization execution method based on cross section data

Similar Documents

Publication Publication Date Title
CN105740462A (en) Method for supporting data migration between different environments
US11714554B2 (en) Method for aggregation optimization of time series data
CN110297866A (en) Method of data synchronization and data synchronization unit based on log analysis
US20140101167A1 (en) Creation of Inverted Index System, and Data Processing Method and Apparatus
CN108694195B (en) Management method and system of distributed data warehouse
CN103294724B (en) A kind of management method and system of database structure
CN104239443B (en) A kind of storage method of serialized data operation log
CN102542071A (en) Distributed data processing system and method
CN104484131B (en) The data processing equipment of multiple disks server and corresponding processing method
CN109522312A (en) A kind of data processing method, device, server and storage medium
CN106126601A (en) A kind of social security distributed preprocess method of big data and system
US10311111B2 (en) Search method and apparatus and storage medium
CN103827852B (en) Assemble WEB page on search engine results page
CN102651008A (en) Method and equipment for organizing data records in relational data base
US11144424B2 (en) System and method for recording the beginning and ending of job level activity in a mainframe computing environment
CN103955530A (en) Data reconstruction and optimization method of on-line repeating data deletion system
CN104199963A (en) Method and device for HBase data backup and recovery
CN105630934A (en) Data statistic method and system
US20120317113A1 (en) Computing device, storage medium, and method for processing bill of material of electronic products
CN107665219A (en) A kind of blog management method and device
CA2950686C (en) System and method for dynamic collection of system management data in a mainframe computing environment
CN108959456A (en) A kind of method of automatic recovery completion data
CN111159117B (en) Low-overhead file operation log acquisition method
JP2017156931A (en) Data storage device, data storage device control program, and data storage device control method
CN109254961A (en) A kind of distribution multi engine data quality management system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: Qingpu 201707 waiqingsong road Shanghai City, No. 588 Lane 7548 Building 1 R zone 1 room 113

Applicant after: Shanghai new torch network information technology Limited by Share Ltd

Address before: Qingpu 201707 waiqingsong road Shanghai City, No. 588 Lane 7548 Building 1 R zone 1 room 113

Applicant before: SHANGHAI XINJU NETWORK INFORMATION TECHNOLOGY CO., LTD.

CB02 Change of applicant information
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20160706

WD01 Invention patent application deemed withdrawn after publication