CN112328667B

CN112328667B - Shale gas field ground engineering digital handover method based on data blood margin

Info

Publication number: CN112328667B
Application number: CN202010693603.XA
Authority: CN
Inventors: 王鸿捷; 何益萍; 梁谷; 张锦涛; 肖坤; 徐心童; 胡耀义; 杨政; 魏士尧; 刘雅琪
Original assignee: Sichuan Changning Natural Gas Development Co ltd; China Petroleum Engineering and Construction Corp
Current assignee: Sichuan Changning Natural Gas Development Co ltd; China Petroleum Engineering and Construction Corp
Priority date: 2020-07-17
Filing date: 2020-07-17
Publication date: 2023-09-08
Anticipated expiration: 2040-07-17
Also published as: CN112328667A

Abstract

The invention provides a data blood edge technology-based shale gas field ground engineering digital handover method, which comprises a whole set of technical scheme, data blood edge relation configuration, interface packaging method, visual data blood edge diagram construction and the like in the processes of data acquisition, storage, service and achievement restoring, so that the whole data link is completely processed and managed, the whole life cycle of the data is opened, and a data closed loop is formed, thereby continuously improving the quality and service capability of the data in the whole data link.

Description

Shale gas field ground engineering digital handover method based on data blood margin

Technical Field

The invention belongs to the field of data handover, and particularly relates to a shale gas field ground engineering digital handover method based on data blood edges.

Background

The traditional engineering construction digital handover integrates the data related to the project artificially, carries out phase handover according to project milestones, and the handover content comprises various documents, drawings and data/models distributed in various professional software generated in the design/purchase/construction process. The data provider and the data receiver need to check the integrity, consistency, correctness and timeliness of the handover data together in the handover process, and the handover workload is huge.

When the data problem occurs, the data source is required to be searched from the mass documents through manual comparison, the data problem is found, the efficiency is low, errors are easy to occur, and a large amount of manpower and material resources are wasted. If the data cannot be effectively traced, errors are gradually accumulated, and normal production and operation of the engineering object are finally affected.

In many project practices, tracing back data relationships often resides at the system or table level, managing relationships by way of data flow, and defining and viewing relationships at the field level is not ideal.

The shale gas field ground engineering construction process involves a lot of professions, generates a large amount of data with different formats in the design, construction and debugging processes, and is distributed on different professional software. In the period of centralized construction and data transfer, engineering data transfer is complex and has large workload, and standardized and normalized engineering data acquisition, processing and storage are required, so that orderly transfer work is ensured, and production preparation efficiency is improved. A unified platform is needed to collect data required by each specialty, unified standard management is carried out on the bottom data, data specifications and management specifications are formulated, unified data outlets are provided for each upper application system in the future, the data retrieval time cost is reduced, and data sharing intercommunication among the specialty is realized.

Disclosure of Invention

Aiming at the problems that the data transfer is complex and the engineering quantity is large in the conventional shale gas field ground engineering construction, the invention provides a shale gas field ground engineering digital transfer method based on data blood margin, and the data blood margin technology is used for making blood margin relations to realize unified management of data through metadata setting blood margin relations.

The invention has the following specific implementation contents:

the invention discloses a data blood margin-based shale gas field ground engineering digital handover method, which comprises the steps of defining a data blood margin relation by using a reaction front-end technology and a Spring Cloud micro-service technology before data acquisition, storing the defined data blood margin relation into a database according to a Json protocol for persistence, applying the defined relation to a data acquisition layer for data analysis and storing the data analysis into a data warehouse; and then collecting data, registering and registering metadata of the collected data, taking the metadata of the collected data as input data of a data blood relationship, and caching the collected data.

In order to better implement the invention, further, the collected data comprises structured data, model data and unstructured data;

the metadata of the model data and the unstructured data comprise source information and structure information of the collected data; defining and configuring the blood-edge relation of the data flow direction through source information and structure information on a configuration interface for configuring the blood-edge relation of the data, and caching the received acquired data according to an original structure;

the structural data definition data blood-edge relation is realized in two ways, and the specific method is as follows:

the method comprises the following steps: manufacturing a data blood relationship definition Excel template, and then carrying out batch data import on a configuration interface for configuring the data blood relationship through uploading the data blood relationship definition Excel template;

the second method is as follows: defining and configuring the blood-edge relation of the data flow direction through a source system, a source owner, a source table, a source field, a target system, a target owner, a target table and a target field in a configuration interface for configuring the blood-edge relation of the data; the source system, the source owner, the source table, the source field, the target system, the target owner, the target table and the target field are metadata of the structured data.

In order to better realize the invention, further, for the caching of the acquired data, different modes of caching are carried out according to different data classifications:

the structured data comprises structured data in a relational database, content data identified through tables and fields is collected by means of an open-source Kafka message queue, the data is packaged into Json messages and pushed to a Kafka platform, and then the messages are analyzed and stored through a data processing program; the specific operation of the storage is as follows: storing the structured data into a MySQL database, and storing the association relation of the structured data into a Neo4j graph database;

the model data comprise 3D model file data, an AVEVA three-dimensional model, an SP3D three-dimensional model, a Revit software building three-dimensional model, a line GIS three-dimensional model and an AutoCAD Civil 3D total image three-dimensional model; importing model data through a Kafka system, and pulling out actual model data by task scheduling according to a Uniform Resource Locator (URL) corresponding to the model in the metadata for storage; storing the model data in a FastDFS distributed file system;

the unstructured data comprise engineering drawings, SP PID intelligent PID data drawings and data, office files and PDF files; for unstructured data, pushing metadata of the unstructured data to a platform through a Kafka message queue, and extracting actual file data for storage according to a file URL in the metadata by task scheduling; the unstructured file is stored into the FastDFS distributed file system.

In order to better realize the invention, further, for the unstructured data, in the process of analyzing and extracting the message, engineering object identification information associated with the unstructured data is extracted, the engineering object identification information is structured data information, and the association between the engineering object identification information and the corresponding unstructured data is established in a structured database, so that the data link between the structured data and the unstructured data is realized.

In order to better realize the invention, further, the collected data is converted according to the international standard, unordered scattered data is formed into standard and general engineering data, and the converted data is stored in a standard storage area to form a steady-state data structure; storing the structured data into a MySQL database, and storing the association relation of the structured data into a Neo4Jj graph database; model data and unstructured files are stored into the FastDFS distributed file system.

The standard and general engineering data are normalized through a data model, a service entity list, a service entity data structure list and a service entity document list which are established in advance; the data model, the business entity list, the business entity data structure list and the business entity document list are required to meet the following requirements:

the service structure is carded in a hierarchical mode of the tree directory, and the service entities are classified;

defining and configuring logical categories and classifications of asset services;

and defining and designing a data structure table for the physical storage layer.

In order to better realize the invention, further, for the data stored in the standard storage area, the data is not directly pushed to the upper layer application to provide data service, the data is required to be subjected to service conversion processing according to service requirements in a service delivery mode, and the data requirements are stored into the upper layer application of the theme storage area for use after service conversion; the personalized application is required to be personalized and customized and then stored into a customized storage area for upper-layer application; the personalized application comprises interface personalization, protocol personalization and query personalization.

In order to better realize the invention, further, when the service delivery is carried out, the construction of a data service interface is realized by registering and opening an interface in a service catalog through a standard RESTful API based on an API gateway in a highly configurable and lightly developed mode; realizing the data service under different scenes by combining and arranging the constructed data service interfaces; and takes the transmission object of the data service as one input of the data blood relationship.

In order to better realize the invention, further, after the upper layer application uses the data provided by the theme storage area or the customized storage area, the output result data is restored to the result restoring area, and then the data stored in the result restoring area is standardized in a data rule base mode, so that the restored result data accords with the standard definition of the international standard ISO 159226 and the enterprise data storage standard and is stored in the standard storage area; the achievement restoring area is based on a MongoDB database;

the method comprises the steps of recording data conversion in each step in the processes of data acquisition, storage, service delivery application, personalized customization application and achievement restoring by adopting a message queue and data persistence technology, and storing the records in a MySQL database and a Neo4j graph database.

In order to better realize the invention, further, the structured data blood-lineage diagram is generated by the Echarts technology for visual display; the structural data blood-edge map uses metadata as a distinguishing basis of field-level data blood-edge relations and uses a data table as a distinguishing basis of table-level data blood-edge relations;

carrying out data blood relationship display on the collected, stored, service delivery application, personalized customization application and result restoring data by using the structured data blood relationship graph; and the blood-edge relation contained in the structured data blood-edge graph is checked sequentially according to the sequence of the source system, the source owner, the source field, the source table, the target field, the target owner and the target system.

In order to better realize the invention, the unstructured data is further subjected to labeling, sources and phases of the unstructured data are refined, and data labels are set according to the format of the deliveries and keywords, so that the sources of the data can be quickly searched and queried.

Compared with the prior art, the invention has the following advantages:

(1) The data blood-source relation of the field level is checked through the visual data blood-source map, so that quick and simple tracing of engineering data is realized;

(2) Unified data acquisition, unified standard management is carried out on the bottom data, data specifications and management specifications are formulated, unified data outlets are provided for various upper application systems in the future, the data retrieval time cost is reduced, and data sharing and intercommunication among professions are realized.

Drawings

FIG. 1 is a complete flow chart of the data processing of the present invention;

FIG. 2 is a diagram illustrating an exemplary configuration interface for downloading a data blood relationship definition Excel template in the actual operation of the present invention;

FIG. 3 is a diagram illustrating an exemplary configuration interface for defining metadata for a data blood relationship in an actual operation of the present invention;

fig. 4 is a diagram showing an example of data blood-edge relationship viewing in the actual operation of the present invention.

Detailed Description

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the technical solutions of the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it should be understood that the described embodiments are only some embodiments of the present invention, but not all embodiments, and therefore should not be considered as limiting the scope of protection. All other embodiments, which are obtained by a worker of ordinary skill in the art without creative efforts, are within the protection scope of the present invention based on the embodiments of the present invention.

Example 1:

the invention discloses a shale gas field ground engineering digital handover method based on a blood-margin technology, as shown in fig. 1, firstly, data are required to be collected, the collected data comprise structured data, unstructured data, an industrial drawing, a model and the like, then the collected data are cached and stored according to a rule of storing the data and a rule of shale gas field business, and in the storage, the data of different classifications are stored into different databases according to different classifications of the data, such as the structured data, the unstructured data and the like; for the collected data after the storage is distinguished, conversion processing is carried out according to international standards, the external unordered scattered data is formed into standardized and generalized engineering data, and the converted data enters a standard storage area to form a steady-state data structure; storing the structured data into a MySQL database, and storing the association relation of the structured data into a Neo4j graph database; model data and unstructured files are stored into the FastDFS distributed file system. Then, the general, standard and steady data are stored in a theme storage area or a custom storage area for upper application according to the requirements; after the provided data are used by the upper layer application, the result data are output, the output result data are traced back through blood relationship, data management such as quick search analysis is realized, and the result data after big data analysis are restored or history isolation is carried out.

Working principle: the invention solves the problems of large workload, low efficiency, easy error and difficult data tracing when the traditional data delivery is carried out by artificial integration, introduces a data blood source technology, realizes the communication with databases in a construction period and an operation period, receives data from the cloud and infrastructure such as design, purchase, construction, maintenance and the like, carries out asset storage and management, and flexibly responds to the data requirements of upper intelligent application.

Example 2:

in order to better realize the invention, on the basis of the embodiment 1, further, in the process of collecting data, the receiving adapter for receiving the data registers and registers metadata of the collected data, and takes the metadata of the collected data as input data of data blood edges; the metadata comprises source information and structure information of the acquired data; and caching the received acquisition data according to an original structure.

For the caching of the collected data, caching in different modes is carried out according to classification, wherein the specific classification comprises: structured data, model data, unstructured data, real-time data;

processing the real-time data, converting the real-time data into structured data, and storing the structured data in a MySQL database;

Working principle: in the aspect of collection, aiming at heterogeneous data collection, the method unifies a collector, encapsulates technical components and scheduling operation and provides the technical components and scheduling operation for users; in the aspect of storage, aiming at the situation that each layer of functions of data storage are shared respectively and the database types are more, the method integrates the data storage and unified call in an object-oriented manner and provides an integrated storage and call interface.

Other portions of this embodiment are the same as those of embodiment 1 described above, and thus will not be described again.

Example 3:

in order to better implement the present invention on the basis of any one of the above embodiments 1-2, further, in the parsing and extracting process of the message, the engineering object identification information associated with the unstructured data is also extracted, the engineering object identification information is structured data information, and an association is established between the engineering object identification information and the corresponding unstructured data in the structured database, so as to implement data connection between the structured data and the unstructured data.

Meanwhile, converting the collected data according to international standards, forming unordered scattered data into standard and general engineering data, storing the converted data into a standard storage area, forming a steady-state data structure, storing the structured data into a MySQL database, and storing the association relation of the structured data into a Neo4j graph database; model data and unstructured files are stored into the FastDFS distributed file system.

(1) The service structure is carded in a hierarchical mode of the tree directory, and the service entities are classified;

(2) Defining and configuring logical categories and classifications of asset services;

(3) And defining and designing a data structure table for the physical storage layer.

Other portions of this embodiment are the same as any of embodiments 1-2 described above, and thus will not be described again.

Example 4:

on the basis of any one of the above embodiments 1 to 3, in order to better implement the present invention, further, for the data stored in the standard storage area, the data is not directly pushed to the upper layer application to provide data service, and the data needs are subjected to service conversion processing according to service requirements in a service delivery mode, and then are stored into the upper layer application of the subject storage area for use after service conversion; the personalized application is required to be personalized and customized and then stored into a customized storage area for upper-layer application; the personalized application comprises interface personalization, protocol personalization and query personalization; the data of the standard storage area, the theme storage area and the custom storage area are structured data and are also stored in the MySQL database.

When the service delivery is carried out, the construction of a data service interface is realized by registering and opening an interface in a service catalog through a standard RESTful API based on an API gateway in a highly configurable and lightly developed mode; realizing the data service under different scenes by combining and arranging the constructed data service interfaces; and takes the transmission object of the data service as one input of the data blood relationship.

Working principle: in terms of service output, the data interface is packaged and modified into micro service by using a micro service technology framework, and the micro service is exposed to an upper layer application to be called in a mode of an API gateway. At the service delivery level, rather than custom developing for each application to provide data services, the data service interface is built through a highly configurable, lightweight development approach. The technical level is based on the manner of an API gateway, and the interfaces are registered and opened in a service directory through a standard RESTful API. All standardized, generalized interfaces in the service directory can be combined and orchestrated to meet the data services in different scenarios. The transmission object of the data service is constructed and integrated based on the logic entity in the data model and is also used as one of the important inputs of the data blood relationship

Other portions of this embodiment are the same as any of embodiments 1 to 3 described above, and thus will not be described again.

Example 5:

in order to better implement the present invention on the basis of any one of the above embodiments 1 to 4, as shown in fig. 2, 3 and 4, further, after the upper layer application uses the data provided by the theme storage area or the customized storage area, the output result data is restored to the result restoring area, and then the data stored in the result restoring area is standardized in a data rule base manner, so that the restored result data accords with the specification definition of international standard ISO 15926 and enterprise data storage standard, and then is stored in the standard storage area; the achievement restoring area is based on a MongoDB database;

Defining a data blood-edge relationship by using a reaction front-end technology and a Spring Cloud micro-service technology, storing the defined data blood-edge relationship into a database according to a Json protocol for persistence, and carrying out data analysis by applying the defined relationship to a data acquisition layer and storing the data analysis into a data warehouse;

the definition of the data blood-edge relationship is realized in two ways, and the specific method is as follows:

the method comprises the following steps: preparing a data blood edge relationship definition Excel template, and then carrying out batch data import on a configuration interface for configuring the data blood edge relationship through uploading the data blood edge relationship definition Excel template, wherein an interface schematic diagram in actual operation is shown in fig. 2;

the second method is as follows: defining and configuring the blood-edge relation of the data flow direction through a source system, a source owner, a source table, a source field, a target system, a target owner, a target table and a target field in a configuration interface for configuring the blood-edge relation of the data; the source system, the source owner, the source table, the source field, the target system, the target owner, the target table and the target field are metadata of the structured data; in actual operation, the interface is schematically shown in fig. 3.

Generating a structured data blood-lineage diagram through Echarts technology for visual display; the structural data blood-edge map uses metadata as a distinguishing basis of field-level data blood-edge relations and uses a data table as a distinguishing basis of table-level data blood-edge relations;

carrying out data blood relationship display on the collected, stored, service delivery application, personalized customization application and result restoring data by using the structured data blood relationship graph; the blood-edge relation contained in the structured data blood-edge graph is checked sequentially according to the sequence of a source system, a source owner, a source field, a source table, a target field, a target owner and a target system; specific practical operation examples are shown in fig. 4, and in fig. 4, due to color limitation, a red color ball in an original image is shown as black in fig. 4, and a mouse is hovered above a circular graph node in practical operation, so that the blood relationship between corresponding fields and a data table can be independently checked; clicking the drop-down box of the selected target field can check the blood-edge relation corresponding to the selected target field.

The small red color balls in fig. 4 represent the data fields of the engineering entities, the large red color balls represent the engineering entity data tables corresponding to the fields, and the arrows represent the data flow directions from the source field to the source table, the target table and the target field in sequence; the data blood-edge map provides a scaling function, and the blood-edge relation is enlarged or reduced to be checked according to the whole perception or local focusing positioning requirement.

And (3) carrying out labeling setting on unstructured data, refining sources and stages of the unstructured data, and setting data labels according to formats and keywords of deliveries so as to facilitate quick retrieval and query of the sources of the data.

Working principle: defining the relation of the data flow direction between external data and internal data and between the internal data in a visual mode, storing the blood-edge relation defined by the front end into a database for persistence according to a Json protocol by combining a real front end technology with a Spring Cloud micro-service technology, analyzing the data by applying the defined relation to a data acquisition layer and storing the data in a data warehouse, and displaying the mapping of the data flow direction through a visual data blood-edge diagram. In the process of data transfer by using the platform, if data tracing is needed, the field-level data blood-edge relationship can be checked through the visual data blood-edge map, so that a simple method is provided for engineering construction data tracing. After the upper layer application uses the data provided by the platform, restoring the result data of the output comprehensive study to a result restoring area of the platform, and carrying out standardized processing on the result data according to a data rule base mode to ensure that the result data accords with the standard definition of the ISO 15926 and the enterprise data storage standard and finally enters a standard storage area; the information processing link based on the data blood-edge technology records the change of the data, so that whether the processing relationship between the data is reasonable or not can be analyzed, and the influence of the change of the upstream data on the downstream can be analyzed; tracking the source of upstream problems as downstream data changes, etc. The table level and field level data blood relationship analysis clearly locates the association relationship of mapping, calculation, circulation and the like among different engineering entities, has high analysis precision and wider application prospect.

Other portions of this embodiment are the same as any of embodiments 1 to 4 described above, and thus will not be described again.

The foregoing description is only a preferred embodiment of the present invention, and is not intended to limit the present invention in any way, and any simple modification, equivalent variation, etc. of the above embodiment according to the technical matter of the present invention fall within the scope of the present invention.

Claims

1. The data blood margin-based shale gas field ground engineering digital handover method is characterized in that before data acquisition, a reaction front-end technology is combined with a Spring Cloud micro-service technology to define a data blood margin relation, the defined data blood margin relation is stored into a database according to a Json protocol for persistence, and a defined relation is applied to a data acquisition layer for data analysis and storage in a data warehouse; then, data acquisition is carried out, metadata of acquired data are registered and registered, the metadata of the acquired data are used as input data of a data blood-edge relationship, and the acquired data are cached;

the collected data comprises structured data, model data and unstructured data;

the second method is as follows: defining and configuring the blood-edge relation of the data flow direction through a source system, a source owner, a source table, a source field, a target system, a target owner, a target table and a target field in a configuration interface for configuring the blood-edge relation of the data; the source system, the source owner, the source table, the source field, the target system, the target owner, the target table and the target field are metadata of the structured data;

converting the collected data according to international standard, forming unordered scattered data into standard and general engineering data, and storing the converted data into a standard storage area to form a steady-state data structure; storing the structured data into a MySQL database, and storing the association relation of the structured data into a Neo4j graph database; storing the model data and unstructured files in a FastDFS distributed file system;

(3) Defining and designing a data structure table for a physical storage layer;

2. The data blood-based shale gas field surface engineering digital handover method as claimed in claim 1, wherein for the data acquisition caching, caching in different modes is performed according to data classification:

3. The method for digitally handing over the shale gas field ground engineering based on the data blood margin according to claim 2, wherein for the unstructured data, in the process of analyzing and extracting the message, engineering object identification information associated with the unstructured data is extracted, the engineering object identification information is structured data information, and an association is established between the engineering object identification information and the corresponding unstructured data in a structured database, so that data connection between the structured data and the unstructured data is realized.

4. The method for digitally handing over the shale gas field ground engineering based on the data blood margin according to claim 1, wherein the data stored in the standard storage area is not directly pushed to an upper layer application to provide data service, the data is required to be subjected to service conversion processing according to service requirements in a service delivery mode, and the data requirements are stored into the upper layer application of the theme storage area for use after service conversion; the personalized application is required to be personalized and customized and then stored into a customized storage area for upper-layer application; the personalized application comprises interface personalization, protocol personalization and query personalization.

5. The data blood-based shale gas field ground engineering digital handover method according to claim 4, wherein the construction of the data service interface is realized by registering and opening an interface in a service catalog through a standard RESTful API based on an API gateway in a highly configurable and light development mode when the service delivery is carried out; realizing the data service under different scenes by combining and arranging the constructed data service interfaces; and takes the transmission object of the data service as one input of the data blood relationship.

6. The method for digitally handing over shale gas field ground engineering based on data blood margin according to claim 5, wherein after the data provided by the subject storage area or the customized storage area is used by the upper layer application, the output result data is restored to the result restoring area, and then the data stored in the result restoring area is standardized in a data rule base mode, so that the restored result data accords with the specification definition of international standard ISO 1596 and enterprise data storage standard, and then is stored in the standard storage area; the achievement restoring area is based on a MongoDB database;

7. The data blood-margin-based shale gas field ground engineering digital handover method according to claim 1, wherein the unstructured data is subjected to labeling, sources and the stages of the unstructured data are refined, and data labels are set according to the formats and keywords of deliverables so as to facilitate rapid searching and querying of the sources of the data.