CN103810256B - Method based on partitioning technique quick distribution data in big data network optimization platform - Google Patents
Method based on partitioning technique quick distribution data in big data network optimization platform Download PDFInfo
- Publication number
- CN103810256B CN103810256B CN201410034358.6A CN201410034358A CN103810256B CN 103810256 B CN103810256 B CN 103810256B CN 201410034358 A CN201410034358 A CN 201410034358A CN 103810256 B CN103810256 B CN 103810256B
- Authority
- CN
- China
- Prior art keywords
- data
- subregion
- transfer
- specially
- network optimization
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
- G06F16/258—Data format conversion from or to a database
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
- G06F16/214—Database migration support
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a kind of method for quickly distributing data in big data network optimization platform based on partitioning technique, this method includes:The step of A, transfer data partition are calculated;The step of B, generation transfer object subregion;The step of C, progress subregion filtering;The step of D, long-range export data;The step of E, parallel SQLLDR data are imported.Using the inventive method, solving data query time length present in existing network optimization data transfer process, data write-in slowly, and needs the deficiency of human intervention.
Description
Technical field
The present invention relates to the 3G communication technologys and database technology, more particularly to a kind of partitioning technique that is based on is in big data network optimization
The method of quick distribution data in platform.
Background technology
With developing rapidly for mobile communication business, the quantity of mobile communication network optimization is also more and more.In order to solve
The problem of data are preserved, performance optimizes is, it is necessary to which using a set of database acquisition platform data storage, many data servers are carried out
The solution framework of data query.
At present, there is data query time length, data write-in in network optimization data transfer process slowly, and needs human intervention
Deficiency.How to distribute substantial amounts of data in multiple database servers, will not generate larger inquiry pressure to accepting and believing platform again
Power, is also a current urgent problem to be solved.
The content of the invention
In view of this, it is a primary object of the present invention to provide it is a kind of based on partitioning technique in big data network optimization platform it is fast
The method of speed distribution data, is improved to processes such as data importing, data processing, data operation, data filings, existing to solve
There are data query time length present in network optimization data transfer process, data write-in slow, and need the deficiency of human intervention.
To reach above-mentioned purpose, the technical proposal of the invention is realized in this way:
Based on the method for partitioning technique quick distribution data in big data network optimization platform, this method includes:
The step of A, transfer data partition are calculated;
The step of B, generation transfer object subregion;
The step of C, progress subregion filtering;
The step of D, long-range export data;
The step of E, parallel SQLLDR data are imported.
Wherein, the step of data partition is calculated is shifted described in step A, refers to the time range according to transfer data, passes through
Database manipulation, finding out needs to shift the subregion where data.
The step of transfer object subregion being generated described in step B, be specially:Subregion described in step A is converted into Linux
The treatable objects of Shell.
Step B further comprises the computational methods for realizing data conversion subregion, is specially:
B1, distribute data when, extracted according to the time value of data;Carry out data creation when, by data according to
Time carries out strict segmentation, and the data storage of different time is in different object subregions;
B2, in change data, input need distribute data period, the time of change data as needed, in number
According to being matched in subregion according to time range, until obtaining the zone object to be changed;
B3, by zone object with completed distribution data be compared, find out the data there is presently no distribution, transfer to
Subsequent distribution process is handled.
The step of subregion filtering being carried out described in step C, be specially:The data existed according to target database, to needing
The zone object to be changed is filtered, and inquires the object for being actually needed conversion.
Described in step D the step of long-range export data, it is specially:Using data export instrument OCIULDR by where subregion
Data export.
The step of parallel SQLLDR data are imported described in step E, be specially:It will be led using data base tool described in step D
The data gone out import target database.
It is provided by the present invention based on partitioning technique in big data network optimization platform quick distribution data method, with
Lower advantage:
The present invention is for problem present in network optimization data transfer process, according to the distribution situation of gathered data, with reference to fast
Fast database export, importing work, can quickly realize the transfer of data, and whole process is automated, than general SQL statement
The speed of raising about 90%.
Brief description of the drawings
Fig. 1 is process schematic of the present invention based on partitioning technique quick distribution data in big data network optimization platform;
Fig. 2 is a specific implementation process of transfer data partition calculating in Fig. 1;
Fig. 3 is a specific embodiment of progress subregion filtering in Fig. 1;
Fig. 4 is that a specific embodiment of target database is imported data to using the OCIULDR management tools in Fig. 1;
Fig. 5 is the process embodiments in Fig. 1 using SQLLDR progress data importings.
Embodiment
Below in conjunction with the accompanying drawings and embodiments of the invention to the present invention the quick distribution data in big data network optimization platform
Method is described in further detail.
The present invention relates to subregion, the quick method for distributing data in multiple database, root are used in big data network optimization platform
According to data distribution, per class data in transmission, according to ORACLE management minimum unit:Subregion is shifted, and is reduced to data
Pressure.Wherein in data deriving step, using the OCIULDR instruments of specialty, the mode speed that can be extracted than SQL query is carried
High 10 times or so.
Fig. 1 is process schematic of the present invention based on partitioning technique quick distribution data in big data network optimization platform.Such as
Shown in Fig. 1, the process mainly comprises the following steps:
Step 11:The step of transfer data partition is calculated.The transfer data partition is calculated, and is referred to according to transfer data
Time range, by database manipulation, finding out needs to shift the subregion where data.
The specific implementation process that transfer data partition is calculated is illustrated in figure 2, the process includes:
Step 111:Specify data transfer time range be:20130101~20130201.
Step 112:From partitions of database Object table, subregion is extracted.
Step 113:From partitions of database Object table, inquiry subregion date range is:20130101~20130201.
Step 114:Check whether the subregion date is 20130101~20130201, if it is, step 115 is performed, it is no
Then, step 116 is performed.
Step 115:Obtain changing subregion.
Step 116:Return to step 112.
Step 12:The step of generating transfer object subregion.Specially:According to step 11, the subregion inquired about is converted to
The treatable objects of Linux Shell, i.e., by Linux language, the conversion division result that step 11 is obtained prints to one
In file.
Here, in the step of generation transfer object subregion, the computational methods of data partition is employed, number is realized with this
Calculated according to the subregion of conversion, its main process comprises the following steps:
Step 121:For the management of big data, typically all it is managed by partitions of database technology.Database
Partitioning technique have many kinds.Because the present invention is when distributing data, extracted according to the time value of data.Therefore,
The present invention is needed in data creation, data is carried out to strict segmentation according to the time, the data storage of different time is in difference
Object subregion in.
Step 122:In change data, input needs to distribute the period of data, algorithm change data as needed
Time, matched in data partition according to time range, until obtaining the zone object to be changed.
Step 123:Zone object is compared with distribution data have been completed, the number there is presently no distribution is found out
According to transferring to subsequent distribution process to be handled.
Step 13:The step of carrying out subregion filtering.Specially:The data existed according to target database, to needing
The zone object of conversion is filtered, and inquires the object for being actually needed conversion.It is illustrated in figure 3 carry out subregion filtering one
Specific embodiment, its process includes:
Step 131:Circulating filtration is carried out to the transfer object subregion described in step 12.
Step 132:Judge that the subregion of each in database whether there is, if it is not, then return to step 131, if so, then performing step
Rapid 133.
Step 133:Described subregion is transferred to guiding flow.
Step 14:The step of carrying out long-range export data.Specially:Will using the data export instrument OCIULDR of specialty
Data export where subregion.
Here, described OCIULDR is a generalized database management instrument.The present invention directly uses the instrument, specifies
The data file of subregion, and the destination object imported are imported, target database is imported data to.
Be illustrated in figure 4 the present invention using the OCIULDR management tools import data to one of target database it is specific
Embodiment, its process comprises the following steps:
Step 141:OCIULDR is specified to need derived subregion.
Step 142:Specify OCIULDR data input files.
Step 143:Specify the information of OCIULDR connection source databases.
Step 144:Start OCIULDR and perform data extraction.
Step 145:OCIULDR carries out data write-in, completes data export.
Step 15:The step of parallel SQLLDR data are imported.That is, derived data in step 14 are passed through into database work
The data are imported target database, are finally completed the export work of data by tool.
Here, the SQLLDR is also a management tool of database, is realized in the present invention using SQLLDR by text
Data are changed, and import database.
The process embodiments that the present invention carries out data importing using SQLLDR are illustrated in figure 5, the process includes:
Step 151:SQLLDR is specified to need the file imported.
Step 152:SQLLDR is specified to carry out the form of data parsing.
Step 153:Specify the information of SQLLDR connection source databases.
Step 154:Perform SQLLDR and carry out data parsing.
Step 155:Data write-in is carried out using SQLLDR, data export is completed.
Step 156:Script is called, subregion write-in target database will be imported.The data that the step is obtained, are used for and step
The transfer object subregion of 12 generations is compared.
Existing most database technology application scenarios, extract data, it is impossible to according to number using the method based on SQL statement
Data extraction is carried out according to business and distribution situation., can be according to business, the distribution feelings of data and the present invention is then based on partitioning technique
Condition is that subregion carries out data extraction for the administrative unit of database.The present invention is by the way that data base administration, Linux Shell are compiled
Journey and OCIULDR instruments are unified using in the processing of the big data business of mobile network, respective advantage can be played, so that high
Efficient complete the migration work of data.
The foregoing is only a preferred embodiment of the present invention, is not intended to limit the scope of the present invention.
Claims (1)
1. the method based on partitioning technique quick distribution data in big data network optimization platform, it is characterised in that this method is main
Including:
The step of A, transfer data partition are calculated;
The step of B, generation transfer object subregion;
The step of C, progress subregion filtering;
The step of D, long-range export data;
The step of E, parallel SQLLDR data are imported;
The step of data partition is calculated is shifted described in step A, refers to the time range according to transfer data, is grasped by database
Make, finding out needs to shift the subregion where data;
The step of transfer object subregion being generated described in step B, be specially:Subregion described in step A is converted into LinuxShell
Treatable object;
Step B further comprises the computational methods for realizing data conversion subregion, is specially:
B1, distribute data when, extracted according to the time value of data;When carrying out data creation, by data according to the time
Strict segmentation is carried out, the data storage of different time is in different object subregions;
B2, in change data, input needs to distribute the period of data, the time of change data as needed, in data point
Matched in area according to time range, until obtaining the zone object to be changed;
B3, zone object is compared with distribution data have been completed, finds out the data there is presently no distribution, transfer to follow-up
Distribution process is handled;
The step of subregion filtering being carried out described in step C, be specially:The data existed according to target database, to needing to turn
The zone object changed is filtered, and inquires the object for being actually needed conversion;
Described in step D the step of long-range export data, it is specially:Instrument OCIULDR is exported by the number where subregion using data
According to export;
The step of parallel SQLLDR data are imported described in step E, be specially:Will be derived described in step D using data base tool
Data import target database.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410034358.6A CN103810256B (en) | 2014-01-24 | 2014-01-24 | Method based on partitioning technique quick distribution data in big data network optimization platform |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410034358.6A CN103810256B (en) | 2014-01-24 | 2014-01-24 | Method based on partitioning technique quick distribution data in big data network optimization platform |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103810256A CN103810256A (en) | 2014-05-21 |
CN103810256B true CN103810256B (en) | 2017-09-26 |
Family
ID=50707026
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410034358.6A Active CN103810256B (en) | 2014-01-24 | 2014-01-24 | Method based on partitioning technique quick distribution data in big data network optimization platform |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103810256B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104866568A (en) * | 2015-05-22 | 2015-08-26 | 国云科技股份有限公司 | Method for quickly importing big data files into relation-based database |
CN106776598B (en) * | 2015-11-19 | 2019-12-13 | 中国移动通信集团公司 | Information processing method and device |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101086732A (en) * | 2006-06-11 | 2007-12-12 | 上海全成通信技术有限公司 | A high magnitude of data management method |
CN101098495A (en) * | 2007-06-14 | 2008-01-02 | 中兴通讯股份有限公司 | System and method for improving intelligent business on-line statistical task performance |
CN101795211A (en) * | 2010-01-13 | 2010-08-04 | 北京中创信测科技股份有限公司 | Data storage method and system |
CN102495906A (en) * | 2011-12-23 | 2012-06-13 | 天津神舟通用数据技术有限公司 | Incremental data migration method capable of realizing breakpoint transmission |
CN102567428A (en) * | 2010-12-30 | 2012-07-11 | 中国移动通信集团浙江有限公司 | Method and device for managing life cycle of online data |
-
2014
- 2014-01-24 CN CN201410034358.6A patent/CN103810256B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101086732A (en) * | 2006-06-11 | 2007-12-12 | 上海全成通信技术有限公司 | A high magnitude of data management method |
CN101098495A (en) * | 2007-06-14 | 2008-01-02 | 中兴通讯股份有限公司 | System and method for improving intelligent business on-line statistical task performance |
CN101795211A (en) * | 2010-01-13 | 2010-08-04 | 北京中创信测科技股份有限公司 | Data storage method and system |
CN102567428A (en) * | 2010-12-30 | 2012-07-11 | 中国移动通信集团浙江有限公司 | Method and device for managing life cycle of online data |
CN102495906A (en) * | 2011-12-23 | 2012-06-13 | 天津神舟通用数据技术有限公司 | Incremental data migration method capable of realizing breakpoint transmission |
Also Published As
Publication number | Publication date |
---|---|
CN103810256A (en) | 2014-05-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11392604B2 (en) | Designating fields in machine data using templates | |
CN105389402B (en) | A kind of ETL method and apparatus towards big data | |
CN107622103B (en) | Managing data queries | |
CN104268428B (en) | A kind of visual configuration method calculated for index | |
CN107016019B (en) | Database index creation method and device | |
CN108369584B (en) | Information processing system, descriptor creation method, and descriptor creation program | |
CN105683940A (en) | Processing a data flow graph of a hybrid flow | |
CN106168963B (en) | Real-time streaming data processing method and device and server | |
CN104778236A (en) | ETL (Extract-Transform-Load) realization method and system based on metadata | |
CN106156070A (en) | A kind of querying method, Piece file mergence method and relevant apparatus | |
CN103955577A (en) | Computer automatic design method for mechanical equipment | |
CN107506383A (en) | A kind of audit data processing method and computer equipment | |
CN107291770A (en) | The querying method and device of mass data in a kind of distributed system | |
CN104092659A (en) | General protocol data analysis method | |
CN104268275A (en) | Method for carrying out business abstraction and path finding analysis on data | |
CN104077369A (en) | Multi-dimension data matching device and method | |
CN110851511A (en) | Data synchronization method and device | |
CN104794130B (en) | Relation query method and device between a kind of table | |
CN103810256B (en) | Method based on partitioning technique quick distribution data in big data network optimization platform | |
CN106648839A (en) | Method and device for processing data | |
CN103942280A (en) | Automatic code generating method based on data structure | |
CN104504221A (en) | Evaluation data processing method and system | |
CN103677852A (en) | Design method of extensible class natural language formula editor | |
CN108874395A (en) | Hard Compilation Method and device during a kind of modularization stream process | |
CN102708157A (en) | Apparatus and method for determining stage using technology lifecycle |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20171229 Address after: 430023 Hubei province Wuhan city East Lake New Technology Development Zone Guandong Industrial Park beacon light communication system equipment and device production workshop 1 4 Patentee after: Wuhan Hong Xin technological service Co., Ltd Address before: 430023 Hubei Wuhan city East Lake Development Zone Guandong Industrial Park beacon light communication system equipment and device production workshop 4 layers Patentee before: Wuhan Hongyi Information Co., Ltd. |
|
TR01 | Transfer of patent right |