CN113254508A

CN113254508A - Data warehouse for natural gas pipeline pigging operation and data mining method

Info

Publication number: CN113254508A
Application number: CN202110694989.0A
Authority: CN
Inventors: 贾文龙; 邓乾星; 吴瑕; 李俊逸; 雍雪梅; 江林峪; 孙璐璐
Original assignee: Southwest Petroleum University
Current assignee: Southwest Petroleum University
Priority date: 2021-06-22
Filing date: 2021-06-22
Publication date: 2021-08-13
Anticipated expiration: 2041-06-22
Also published as: CN113254508B

Abstract

The invention discloses a data warehouse for natural gas pipeline pigging operation and a data mining method. The method comprises the following steps: firstly, determining a data source of natural gas pipeline pigging operation; secondly, completing data integration by utilizing an ETL tool; then establishing a natural gas pipeline pigging operation data warehouse according to the Epicentre data model; then, extracting the data from the warehouse and loading the data into a data mart; and finally, carrying out data mining of the natural gas pipeline pigging operation based on a defined algorithm. The method fully considers the value and the discreteness of the actual pigging operation data, realizes the consistency and the integrality of the pigging operation data, provides a management basis for evaluating the safety of the pigging operation of the natural gas pipeline, and provides technical support for developing data mining and intelligent pigging of the pigging operation of the natural gas pipeline.

Description

Data warehouse for natural gas pipeline pigging operation and data mining method

Technical Field

The invention belongs to the field of safety guarantee of natural gas conveying pipelines, and particularly relates to a data warehouse for natural gas pipeline pigging operation and a data mining method.

Background

Natural gas is a clean high-quality energy and chemical raw material, and pipeline transportation is the main transportation mode of the natural gas. According to the middle-long-term pipe network development planning released by the national development and improvement commission and the national energy agency, the total mileage of natural gas pipelines in China is estimated to reach 16.3 ten thousand kilometers in 2025. In the operation process of production and daily production of the natural gas pipeline, dirt, impurities, accumulated liquid and the like left in the process of construction or transportation need to be removed through pipe cleaning operation, so that the purposes of providing pipeline conveying efficiency, prolonging the service life of the pipeline and guaranteeing the safe operation of the pipeline are achieved.

At present, the natural gas pipeline pigging operation is mainly developed by taking a station as a unit and taking a pipe section as an interval, and the generated pigging operation data is visual embodiment of a pigging scheme and is an important basis for reflecting the running condition of the natural gas pipeline. However, the natural gas pipeline pigging operation data has a large amount of information and little useful information, and is stored in a relatively dispersed form in the work logs of different stations or databases of different systems, so that inconsistency and incompleteness of the data are easy to occur. The data warehouse is an efficient data storage system, and can store, extract and maintain data from different places, heterogeneous data sources or other databases in the data warehouse after processing to form a series of theme-oriented, integrated, stable and different time-series sets, so that a comprehensive analysis-oriented and decision-making-oriented application environment is provided for users. Data mining is the mining of useful information from large amounts of data as decision support. At present, the existing natural gas pipeline pigging operation data have the characteristics of discreteness, immediacy, authenticity, visibility, relevance, value, specialty and the like, and the problems of great difficulty in developing data mining and intelligent research exist. Therefore, the existing data storage mode for the natural gas pipeline pigging operation cannot specially classify, efficiently store and effectively utilize data, cannot provide technical support for data mining, and cannot probe the pigging rule of a certain natural gas pipeline. Therefore, a method is needed to establish a data warehouse for gas pipeline pigging operation, and analyze and calculate historical pigging operation data of a certain gas pipeline, so as to solve inconsistency and incompleteness of the gas pipeline pigging operation data, meet the requirement of mining useful information from the historical pigging operation data, and provide technical support for guiding future intelligent pigging construction and application.

To sum up, the existing natural gas pipeline pigging operation data has the characteristics of discreteness, immediacy, authenticity, visibility, relevance, value, specialty and the like, and has the problems of inconsistency, incompleteness and the like, no mature and efficient pigging operation data warehouse is available for specially classifying and efficiently storing the historical pigging operation data, and the historical pigging operation data cannot be conveniently maintained and effectively utilized. The method can provide a data warehouse for the natural gas pipeline pigging operation and a data mining method, so that a data warehouse system for the natural gas pipeline pigging operation is constructed, historical pigging operation data is analyzed and processed, and data mining is carried out, so that the problems of inconsistency and incompleteness of pigging operation data of the natural gas pipeline are solved, the uniformity and the completeness of the pigging operation data are realized, technical support is provided for carrying out information mining and effective utilization of the pigging operation data of the natural gas pipeline, a decision basis is provided for formulating a pigging scheme, and reference is provided for evaluating the safety of a pigging process of the natural gas pipeline.

Disclosure of Invention

The invention provides a data warehouse for natural gas pipeline pigging operation and a data mining method, aiming at solving various defects and shortcomings in the prior art, and adopting the following technical scheme:

the data warehouse and the data mining method for the natural gas pipeline pigging operation are provided, and the method comprises the following steps:

step one, determining a data source of the natural gas pipeline pigging operation. The data source is the most original data stored in each business department, and is information record and calculation result generated by natural gas pipeline pigging operation, including work logs of each business department, databases of other software or systems, pigging operation record, design data of natural gas pipeline, and the like;

and step two, integrating data of the natural gas pipeline pigging operation. The data integration is to extract the data of the pigging operation from a data source by utilizing an ETL tool, load the data into a data warehouse according to a defined mapping relation, check the legality of the data in the loading process, and record a loading result into a working log of the data integration for query;

and step three, establishing a data warehouse for natural gas pipeline pigging operation. And establishing a data warehouse for the natural gas pipeline pigging operation based on the common data model and the software development standard. Adopting an Epicentre data model of an oil exploration and development integrated platform of an international POSC organization to construct a data warehouse of natural gas pipeline pigging operation;

and step four, extracting data of the natural gas pipeline pigging operation. The data extraction is to automatically extract data information in a data warehouse of natural gas pipeline pigging operation according to a defined data warehouse model and load the data information into data marts with different application themes according to a defined mapping relation;

and fifthly, performing data mart of the natural gas pipeline pigging operation. Data marts are collections of data built on the basis of the subject of the research or application in question, and are direct vehicles for data mining. The method comprises the steps of pressure data mart, flow data mart, pipe cleaner operation speed data mart, pigging time data mart, terrain change data mart and the like of natural gas pipeline pigging operation;

and sixthly, data mining of the natural gas pipeline pigging operation. Data mining is to extract relevant data from a data mart for analysis according to different research or application topics, and mine the knowledge hidden behind the data based on appropriate algorithms.

Step one, determining a data source of the natural gas pipeline pigging operation. The data source is the most original data stored in each business department, and is information records and calculation results generated by the gas pipeline pigging operation. The method comprises the following concrete steps:

(1) the data source is determined as a data source, and the data source of the natural gas pipeline pigging operation is mainly a business department or software related to the pigging operation and comprises a ball serving station, a ball receiving station, a valve chamber or station yard along the natural gas pipeline, an SCADA system, a dispatching center and the like;

(2) and determining metadata, wherein the metadata is data describing the whole natural gas pipeline pigging operation process and comprises work logs of all service departments, databases of other software or systems, pigging operation records, design data of the natural gas pipeline and the like.

And step two, integrating data of the natural gas pipeline pigging operation. The data integration is to extract the data of the pigging operation from a data source by utilizing an ETL tool, load the data into a data warehouse according to a defined mapping relation, check the legality of the data in the loading process, and record the loading result into a working log of the data integration for query. The method comprises the following concrete steps:

(1) the ETL tool is used for extracting data, so that the original distributed pigging operation data in the application system is not influenced, the originally existing heterogeneous data source still operates independently, data service is provided for the respective application system, and the original application architecture of an enterprise is not damaged.

(2) A mapping relationship is defined. Defining a mapping relation according to the serve time of the pigging operation and the region where the pipeline or the serve station is located by using the line name of the natural gas pipe as a folder; wherein, the name of metadata is defined by the area where the pipeline or the service station is located, and the arrangement mode of the metadata is defined by the service time of the pigging operation;

(3) and selecting the metadata according to a definition relation, checking the legality of the metadata such as format, definition and the like, loading the legal metadata into a data warehouse, and isolating the illegal metadata.

And step three, establishing a data warehouse for natural gas pipeline pigging operation. And adopting an Epicentre data model of an oil exploration and development integrated platform of the international POSC organization to construct a data warehouse of the gas pipeline pigging operation. The method comprises the following concrete steps:

(1) establishing a natural gas pipeline basic parameter data warehouse:

extracting the natural gas pipeline basic parameters from a data source, and loading the natural gas pipeline basic parameters into a natural gas pipeline basic parameter data warehouse in a data integration mode, so as to establish a natural gas pipeline basic parameter data warehouse;

the natural gas pipeline basic parameter data warehouse comprises pipeline names, pipeline lengths, wall thicknesses, inner diameters, outer diameters, roughness, pipeline along-line elevation change conditions, design pressure, design gas transmission quantity, crossing structures, environment temperatures, mileage and elevations of pipeline along-line station yards or valve chambers and the like.

(2) Establishing a pigging operation running state parameter data warehouse:

extracting the operation state parameters of the pigging operation from a data source, and loading the operation state parameters into a pigging operation state parameter data warehouse in a data integration mode so as to establish a pigging operation state parameter data warehouse;

the warehouse for the operational state parameter data of the pipe cleaning operation comprises the pressure and the gas transmission amount of a ball sending station, the pressure and the gas transmission amount of a ball receiving station, the change situation of the pressure and the flow along the pipeline along the time, the gas inlet and distribution situation along the pipeline, the operation speed of a pipe cleaner, the time when the pipe cleaner reaches each station or valve chamber, the pipe cleaning time and the like.

(3) Establishing a pig parameter data warehouse:

extracting the pipe cleaner parameters from the data source, and loading the pipe cleaner parameters into a pipe cleaner parameter data warehouse in a data integration mode, so as to establish a pipe cleaner parameter data warehouse;

the pig parameter data repository includes: pig type, material, length, diameter, interference, mass, modulus of elasticity, coefficient of sliding friction, and the like.

And step four, extracting data of the natural gas pipeline pigging operation. The data extraction is to automatically extract data information in a data warehouse of natural gas pipeline pigging operation according to a defined data warehouse model and load the data information into data marts with different application themes according to a defined mapping relation. The method comprises the following concrete steps:

(1) defining an automatic extraction data warehouse model:

according to the application theme of the data mart, different automatic extraction data warehouse models are defined, so that metadata in a data warehouse can be efficiently extracted and maintained;

the automatic extraction data warehouse model comprises a pipe cleaner running speed extraction model, a pipe cleaning time extraction model, a pipeline line pressure extraction model, a pipeline line flow extraction model, a pipeline line terrain change extraction model and the like.

(2) Extraction model for running speed of pipe cleaner

The data extracted from the natural gas pipeline basic parameter data warehouse by the pipeline cleaner running speed extraction model comprises the pipeline name, the pipeline length, the wall thickness, the inner diameter, the outer diameter, the roughness, the elevation change condition along the pipeline, the environment temperature, the mileage and the elevation of a station yard or a valve chamber along the pipeline and the like;

the data extracted by the pipeline cleaner operation speed extraction model from the pipeline cleaning operation state parameter data warehouse comprises the change situation of pressure and flow along the pipeline along the time, the gas inlet and distribution situation along the pipeline, the pipeline cleaner operation speed, the time when the pipeline cleaner reaches each station or valve chamber, the pipeline cleaning time and the like;

the data extracted by the pig run speed extraction model from the pig parameter data repository includes: pig type, material, length, diameter, interference, mass, modulus of elasticity, coefficient of sliding friction, and the like.

(3) Pigging time extraction model

The data extracted from the data warehouse by the pigging time extraction model comprises the following data: pipeline basic parameter data such as pipeline name, pipeline length, elevation change condition of the pipeline along the pipeline, a crossing structure, mileage and elevation of a station yard or a valve chamber along the pipeline and the like, and pipeline cleaning operation state parameter data such as pipeline cleaning operation speed, time of the pipeline cleaning to reach each station yard or valve chamber, pipeline cleaning time and the like.

(4) Pipeline pressure extraction model

The data extracted from the data warehouse by the pipeline pressure extraction model comprises the following data: the pressure of the ball serving station, the pressure of the ball receiving station, the change situation of the pressure along the pipeline along with time, the pipeline name, the pipeline length and the like.

(5) Pipeline flow extraction model

The data extracted from the data warehouse by the pipeline flow extraction model comprises the following data: the flow of the ball serving station, the flow of the ball receiving station, the change situation of the flow along the pipeline along with time, the air inlet and distribution situation along the pipeline, the pipeline name, the pipeline length and other data.

(6) Pipeline terrain change extraction model

The data extracted from the data warehouse by the pipeline terrain change extraction model comprises the following data: pipeline name, pipeline length, elevation change condition of the pipeline along the pipeline, crossing structure, mileage and elevation of the pipeline along a station yard or a valve chamber, and the like.

And step five, establishing a data mart of the natural gas pipeline pigging operation. The method comprises a pressure data market, a flow data market, a tube cleaner running speed data market, a tube cleaning time data market, a terrain data market and the like of the natural gas pipeline cleaning operation.

The method comprises the following concrete steps:

(1) pig operating speed data mart

And according to the pipeline pig running speed extraction model, relevant data related to the pipeline pig running speed is extracted from the data warehouse and loaded into a pipeline pig running speed data mart.

(2) Inventory time data mart

And according to the pigging time extraction model, extracting relevant data related to the pigging time from the data warehouse and loading the relevant data into the pigging time data mart.

(3) Pressure data mart

And according to the pipeline pressure extraction model, extracting relevant books related to the pressure from a data warehouse and loading the books into a pressure data mart.

(4) Traffic data mart

Extracting die according to flow along pipeline

Type, relevant data related to the traffic is extracted from the data warehouse and loaded into the traffic data mart.

(5) Terrain data marts

And according to the terrain change extraction model along the pipeline, relevant data related to the terrain are extracted from the data warehouse and loaded into the terrain data mart.

And sixthly, data mining of the natural gas pipeline pigging operation. According to different research or application subjects, respectively five application subjects or research objects including pig running speed, pigging time, flow, pressure and terrain are extracted from the data mart for analysis, and the knowledge hidden behind the data is mined based on a proper algorithm. The method comprises the following concrete steps:

(1) determining application topics or research objects for data mining

For the natural gas pipeline cleaning operation, an application theme or a research object for developing data mining comprises 5 aspects, namely the running speed of a pipe cleaner, the pipe cleaning time, the flow change of the pipeline along the pipeline, the pressure change of the pipeline along the pipeline and the terrain change of the pipeline along the pipeline.

(2) Basis of data mining

Based on different application themes, the data mining is carried out according to data marts, including 5 data marts such as pipe cleaner running speed, pipe cleaning time, pressure, flow and terrain. And according to the user requirements, carrying out corresponding data mining work according to the corresponding data marts, thereby extracting useful information as a decision basis.

(3) Data mining algorithm

When data mining of natural gas pipeline pigging operation is carried out, a statistical method and a induction method are introduced to analyze the existing information. A mathematical statistical model is first established, and then the relevant knowledge is extracted by the model.

(4) Algorithm of pig running speed

Aiming at a folder with a pipeline name as a file name, algorithms of historical data of the running speed of the existing pipeline cleaner can be divided into two types, wherein one type is used for analyzing a specific one-time cleaning operation, and the other type is used for analyzing all cleaning operations of the pipeline;

when a specific pipe cleaning operation is analyzed, a mathematical calculation method is adopted to respectively calculate the maximum pipe cleaner running speed, the minimum pipe cleaner running speed and the average pipe cleaner running speed, so that the maximum pipe cleaner running speed, the minimum pipe cleaner running speed and the average pipe cleaner running speed are used as reference basis for judging whether the pipe cleaner running speed of the specific pipe cleaning operation is reasonable or not;

in the formula (I), the compound is shown in the specification,

-average pig run speed, km/h;

l-the pig travel distance, km;

t is pigging time, h;

when all the pigging operations of a certain natural gas pipeline are analyzed, a statistical method and a regression method are adopted to calculate the distribution interval of the running speed of the pipe cleaner, and the proportion of each distribution interval of the running speed of the pipe cleaner is obtained, so that the overall situation of the pipeline pigging operation is judged.

(5) Time-to-pigging algorithm

Aiming at a folder with a pipeline name as a file name, analyzing historical data of the existing pipeline cleaner time by an algorithm of the historical data, wherein the algorithm is mainly aiming at all pipeline cleaning operations of the pipeline;

counting all the pigging time of the pipeline by adopting a statistical method, and carrying out comparative analysis to obtain the maximum value, the minimum value and the average value of historical data of all the pigging time, the distribution interval of the pigging time and the proportion of the distribution interval to the pigging time, thereby obtaining a statistical result of judging the pigging operation time;

(6) flow and pressure algorithm

The change conditions of the flow and the pressure along the pipeline can be respectively obtained from a flow data mart and a pressure data mart according to flow and pressure algorithms, and the flow and the pressure algorithms are mainly used for obtaining useful information such as maximum flow, minimum flow, average flow, maximum pressure drop, minimum pressure drop, average pressure drop, maximum pressure drop rate and the like through analysis.

(7) Terrain algorithm

The change condition of the terrain along the pipeline can be obtained from the terrain data mart by a terrain algorithm, and the maximum elevation and the minimum elevation are mainly obtained through statistics, and data such as the maximum elevation difference and the minimum elevation difference are calculated.

The invention provides a data warehouse for natural gas pipeline cleaning operation and a data mining method, which are novel methods for developing related researches or applications of pipe cleaner operation speed, pipe cleaning time, pipeline pressure along a pipeline, flow and terrain change conditions through data integration, data extraction and data fitting on the basis of pipeline basic parameters, pipe cleaning operation state parameters and pipe cleaner parameters of a natural gas long-distance pipeline and in combination with related theories of the data warehouse and data mining. The method is easy to realize the consistency and the completeness of the data of the pigging operation, can be convenient for maintaining and managing the pigging operation data and providing technical service for developing data mining, and solves the problem that no mature technology and a calculation method are available at present for carrying out unified maintenance management and efficient utilization on the pigging operation data. The method has clear working flow, convenient maintenance and management and wide application prospect, can extract useful data information from massive pigging operation data, provides technical support for developing data mining and intelligent pigging, and provides decision basis for evaluating the safety of pigging operation of the natural gas pipeline.

Drawings

FIG. 1 is a flow chart of a data warehouse and data mining method for gas pipeline pigging;

FIG. 2 is a data warehouse model diagram of a natural gas pipeline pigging operation;

FIG. 3 is a data mart model diagram of a natural gas pipeline pigging operation.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail with reference to the following examples. It should be understood that the specific examples described herein are intended to be illustrative only and are not intended to be limiting.

The invention provides a data warehouse for natural gas pipeline pigging operation and a data mining method, wherein the method mainly comprises the following calculation steps:

step one, determining a data source of the natural gas pipeline pigging operation. The data source is the most original data stored in each business department, and is information record and calculation result generated by the natural gas pipeline pigging operation, including work logs of each business department, databases of other software or systems, pigging operation record, design data of the natural gas pipeline, and the like.

And step two, integrating data of the natural gas pipeline pigging operation. The data integration is to extract the data of the pigging operation from a data source by utilizing an ETL tool, load the data into a data warehouse according to a defined mapping relation, check the legality of the data in the loading process, and record the loading result into a working log of the data integration for query.

And step three, establishing a data warehouse for natural gas pipeline pigging operation. And establishing a data warehouse for the natural gas pipeline pigging operation based on the common data model and the software development standard. And adopting an Epicentre data model of an oil exploration and development integrated platform of the international POSC organization to construct a data warehouse of the gas pipeline pigging operation.

And step four, extracting data of the natural gas pipeline pigging operation. The data extraction is to automatically extract data information in a data warehouse of natural gas pipeline pigging operation according to a defined data warehouse model and load the data information into data marts with different application themes according to a defined mapping relation.

And fifthly, performing data mart of the natural gas pipeline pigging operation. Data marts are collections of data built on the basis of the subject of the research or application in question, and are direct vehicles for data mining. The method comprises a pressure data market, a flow data market, a tube cleaner running speed data market, a tube cleaning time data market, a terrain change data market and the like of natural gas pipeline cleaning operation.

In step one, data sources are determined, including work logs of each business department, databases of other software or systems, pigging operation records, design data of natural gas pipelines, and the like. The method comprises data information such as pipeline name, diameter, wall thickness, length, elevation along the pipeline, mileage and elevation of a station yard or a valve chamber, pressure, gas transmission amount, pigging time, pig parameters and the like.

In the second step, the data of the pigging operation needs to be extracted from the data source by using an ETL tool according to a defined mapping relation, the legality of the data is checked, the legal data is loaded into a data warehouse, the inconsistency and the incompleteness of the pigging operation data in different data sources are solved, and the uniformity of the pigging operation data is realized.

In the third step, a data warehouse of the gas pipeline pigging operation needs to be constructed based on an Epicentre data model of an oil exploration and development integrated platform of the international POSC organization, and the data warehouse is respectively 3 data warehouses of gas pipeline basic parameters, pigging operation state parameters, pipe cleaner parameters and the like:

(1) natural gas pipeline basic parameter data warehouse

The metadata in the natural gas pipeline basic parameter data warehouse is natural gas pipeline basic parameters which are extracted from a data source and accord with a mapping relation, and comprises pipeline names, pipeline lengths, wall thicknesses, inner diameters, outer diameters, roughness, elevation change conditions along pipelines, design pressure, design gas transmission quantity, crossing structures, environment temperatures, mileage and elevation of a pipeline station yard or a valve chamber along the pipelines and the like;

(2) pigging operation running state parameter data warehouse

The metadata in the pigging operation running state parameter data warehouse is pigging operation running state parameters which are extracted from a data source and accord with a mapping relation, and comprises pressure and gas transmission of a ball sending station, pressure and gas transmission of a ball receiving station, pressure and flow along a pipeline along the time, gas inlet and distribution conditions along the pipeline, running speed of a pipe cleaner, time when the pipe cleaner reaches each station or valve chamber, pigging time and the like;

(3) pig parameter data warehouse

Metadata in the pig parameter data repository, which are pig parameters extracted from the data sources that conform to the mapping relationship, includes: pig type, material, length, diameter, interference, mass, modulus of elasticity, coefficient of sliding friction, and the like.

In the fourth step, an automatic extraction data warehouse model is defined, and then the data information in the data warehouse is loaded into the data marts of different application topics or research objects according to the defined mapping relationship:

(1) extraction model for running speed of pipe cleaner

The pipeline cleaner operation speed extraction model needs more data information extracted from the data warehouse, and the data in 3 data warehouses such as natural gas pipeline basic parameters, pipeline cleaner operation state parameters, pipeline cleaner parameters and the like are all related;

the data extracted from the natural gas pipeline basic parameter data warehouse comprises the pipeline name, the pipeline length, the wall thickness, the inner diameter, the outer diameter, the roughness, the elevation change condition along the pipeline, the environmental temperature, the mileage and the elevation of a station yard or a valve chamber along the pipeline and the like;

the data extracted from the pipe cleaning operation state parameter data warehouse comprises the change situation of pressure and flow along the pipeline along the time, the gas inlet and distribution situation along the pipeline, the running speed of a pipe cleaner, the time when the pipe cleaner reaches each station or valve chamber, the pipe cleaning time and the like;

data extracted from the pig parameter data warehouse includes: pig type, material, length, diameter, interference, mass, modulus of elasticity, coefficient of sliding friction, and the like;

(2) pigging time extraction model

The data extracted from the data warehouse by the pigging time extraction model mainly relates to two data warehouses of natural gas pipeline basic parameters and pigging operation running state parameters, and comprises the following steps: pipeline basic parameter data such as pipeline name, pipeline length, elevation change condition of the pipeline along the pipeline, a crossing structure, mileage and elevation of a station yard or a valve chamber along the pipeline and the like, and pipeline cleaning operation state parameter data such as pipeline cleaning operation speed, the time when the pipeline cleaner reaches each station yard or valve chamber, pipeline cleaning time and the like;

(3) pipeline pressure extraction model

The data extracted from the data warehouse by the pipeline pressure extraction model comprises the following data: the ball serving station pressure, the ball receiving station pressure, the change situation of the pressure along the pipeline along with time, the pipeline name, the pipeline length and other data;

(4) pipeline flow extraction model

The data extracted from the data warehouse by the pipeline flow extraction model comprises the following data: the data of the flow of the ball serving station, the flow of the ball receiving station, the change situation of the flow along the pipeline along with time, the air inlet and distribution situation along the pipeline, the pipeline name, the pipeline length and the like;

(5) pipeline terrain change extraction model

In the fifth step, according to different application themes, different data extraction models are adopted to establish corresponding data marts, including 5 data marts such as pressure, flow, pipe cleaner running speed, pipe cleaning time and terrain change:

(1) pig operating speed data mart

Extracting relevant data related to the running speed of the pipe cleaner from a data warehouse according to the pipe cleaner running speed extraction model, and loading the relevant data into a pipe cleaner running speed data mart for analyzing the running speed of the pipe cleaner;

(2) inventory time data mart

Extracting relevant data related to the pigging time from a data warehouse according to a pigging time extraction model, and loading the data into a pigging time data mart for analyzing the pigging time;

(3) pressure data mart

Extracting relevant books related to the pressure from a data warehouse according to the pipeline pressure extraction model, and loading the books into a pressure data mart for analyzing the change condition of the pipeline pressure along the pipeline;

(4) traffic data mart

According to the pipeline flow extraction model along the pipeline, relevant data related to the flow is extracted from a data warehouse and loaded into a flow data mart for analyzing the change condition of the pipeline flow along the pipeline;

(5) terrain data marts

And according to the terrain change extraction model along the pipeline, extracting relevant data related to the terrain from a data warehouse, and loading the relevant data into a terrain data mart for analyzing the change condition of the terrain along the pipeline.

In the sixth step, data mining of the gas pipeline pigging operation can be further performed. And respectively starting from five application themes of pipe cleaner running speed, pipe cleaning time, flow, pressure and terrain, mining the hidden knowledge behind the data based on a proper algorithm:

(1) basis of data mining

The data mining is carried out according to data marts, and the data marts comprise 5 data marts such as pipe cleaner running speed, pipe cleaning time, pressure, flow and terrain. And according to the user requirements, carrying out corresponding data mining work according to the corresponding data marts, thereby extracting useful information as a decision basis.

(2) Algorithm of pig running speed

(3) Time-to-pigging algorithm

(4) flow and pressure algorithm

(5) Terrain algorithm

The application of the principles of the present invention will now be further described with reference to specific examples.

Example (c): the length of a natural gas long-distance pipeline is 27.5km, the diameter of the pipeline is 813mm, the wall thickness is 8.8mm, the starting point height range is 282m, the end point height is 297m, the inlet pressure of the pipeline in the pipeline cleaning process is 5.15MPa, the outlet pressure is 5.08MPa, and the gas transmission capacity is 620 multiplied by 10⁴m³And d, the ball serving time is 9:30, and the ball receiving time is 12: 15. In addition, historical data of the gas pipeline pigging operation are stored in a dispatching center in an Excel table mode, and data analysis of pigging time is tried to be developed.

The first step is as follows: determining a data source, wherein the data source is historical data of the pigging operation stored in a dispatching center of the pipeline; the pipeline length L is 27.5km, the pipeline diameter d is 813mm, the wall thickness delta is 8.8mm, and the starting point height H is long₀282m, end point elevation H_i297m, ball serving time 9:30, ball receiving time, pipe inlet pressure Q₀5.15MPa, pipeline outlet pressure Q_i5.08MPa, gas transmission quantity Q at pipeline inlet_{Transfusion system}＝620×10⁴m³/d。

The second step is that: data integration, namely extracting historical data of the pigging operation from a data center by using an ETL tool, and checking the type of data storage to enable the data storage to conform to a defined mapping relation;

thirdly, a data warehouse sets the pipeline length L as 27.5km, the pipeline diameter d as 813mm, the wall thickness delta as 8.8mm and the starting point height H₀282m, end point elevation H_iStoring data of 297m and the like into a natural gas pipeline basic parameter data warehouse;

the ball serving time, the ball receiving time and the pipeline inlet pressure Q₀5.15MPa, pipeline outlet pressure Q_i5.08MPa, gas transmission quantity Q at pipeline inlet_{Transfusion system}＝620×10⁴m³The data such as the data/d and the like are stored in a data warehouse of the operation state parameters of the pigging operation;

storing the pipe cleaner parameters in the historical data of the pipeline into a pipe cleaner parameter data warehouse;

fourthly, data extraction, namely defining a pigging time extraction model according to the application theme as the pigging time, then extracting related data information from a data warehouse,the method comprises the following steps: pipeline length L equal to 27.5km and starting point height H₀282m, end point elevation H_iPipeline basic parameter data such as 297m and pigging operation running state parameter data such as serve time, receive time and pigging time;

fifthly, performing data mart, namely forming a pigging time data mart according to related data obtained by a pigging time extraction model so as to lay a foundation for data mining;

sixthly, data mining, namely analyzing the historical data of similar or similar pigging operation by taking parameters such as ball sending time, gas transmission amount, gas transmission pressure and the like as references according to pigging time data mart and adopting a statistical method aiming at the pipeline to obtain the maximum value, the minimum value and the average value of all pigging time historical data, and the distribution interval and the occupied proportion of the pigging time;

and finally, comparing the ball sending time 9:30 and the ball receiving time 12:15 of the cleaning operation with various data obtained by analysis in historical data, finding that the time consumed by the cleaning operation is similar to the average cleaning time and belongs to the working condition of normal cleaning operation, and accordingly, judging that the cleaning operation of the long-distance natural gas pipeline is normal and the natural gas pipeline runs basically normally.

And comparing the data analysis conclusion of the pigging time with the actual working condition, and finding that the analysis conclusion basically accords with the actual working condition, so that the analysis result of the method is in accordance with the actual pigging operation data and the operation rule of the natural gas pipeline cleaner, and the method can be used as a basis for making a pigging scheme and can be used for guiding the field actual production. Therefore, the present invention has its necessity and utility.

The above description is only a preferred embodiment of the present invention, and should not be taken as limiting the invention, and any modifications, equivalents, improvements, etc. made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims

1. A data warehouse for gas pipeline pigging operation and a data mining method are mainly used for extracting pigging operation data from other storage positions and loading the pigging operation data into the data warehouse, so that the integrality and consistency of the pigging operation data are realized, technical support is provided for developing data mining, and decision basis is provided for evaluating the safety of the gas pipeline pigging operation. The method mainly comprises the following steps:

2. The data warehouse and the data mining method for gas pipeline pigging operation according to claim 1, characterized in that the ETL tool is used to extract the gas pipeline pigging operation data from the data source and load the data into the data warehouse, including the work logs of each business department, the database of other software or systems, the pigging operation records, the design data of the gas pipeline, etc.

3. The data warehouse and the data mining method for the gas pipeline pigging operation according to claim 1, wherein the data warehouse for the gas pipeline pigging operation is constructed based on an Epicentre data model and comprises a gas pipeline basic parameter data warehouse, a pigging operation state parameter data warehouse and a pipe cleaner parameter data warehouse.

4. The data warehouse and the data mining method for gas pipeline pigging according to claim 1, characterized in that based on different application themes and defining a mapping relation, pigging history data in the data warehouse are extracted and loaded into data marts, including 5 data marts of pressure, flow, terrain, pigging time, pig running speed, etc.