CN117829106A - Automatic extraction method and system for drilling engineering report information - Google Patents

Info

Publication number
CN117829106A
Authority
CN
China
Prior art keywords
data
drilling engineering
target
information
engineering report
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202410009784.8A
Other languages
Chinese (zh)
Inventor
林伯韬
张家豪
韩雪银
金衍
曹倩雯
樊永东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China University of Petroleum Beijing
Original Assignee
China University of Petroleum Beijing
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China University of Petroleum Beijing
Priority to CN202410009784.8A
Publication of CN117829106A
Current legal status: Pending

Abstract

The invention discloses a method and a system for automatically extracting report information of drilling engineering, wherein the method comprises the following steps: acquiring a drilling engineering report document; positioning and cutting the drilling engineering report document according to the target keywords to obtain target data containing the target keywords; and carrying out ternary relation recognition extraction based on the target data to obtain relation triplet information. According to the invention, the extraction and data structuring of the target key data are realized through the automatic identification of the drilling engineering report document, so that the dependence of the drilling engineering report information extraction on manual work and industry experience is reduced.

Description

Automatic extraction method and system for drilling engineering report information
Technical Field
The invention relates to the technical field of oil and gas artificial intelligence, in particular to a method and a system for automatically extracting report information of drilling engineering.
Background
The practice and management of drilling engineering increasingly depend on computer assistance and data analysis. Drilling design and construction require managing and processing large amounts of data and events, including geological knowledge and geophysical data, drilling parameters and drilling tool assemblies, wellhead equipment data, drill bit usage, and records of complex downhole conditions. These data are recorded as text or tables in electronic and paper drilling engineering reports, so they are heterogeneous and cannot be used together. As a result, massive drilling data resources are wasted, and their information value and potential knowledge remain to be mined.
Drilling engineering reports are stored in various formats (Word, PDF, or image) and contain structured, semi-structured, and unstructured information, and their terminology presents a professional barrier. Personnel with industry background and practical experience are therefore needed to search, process, and interpret the information during specific operations or information base construction. This consumes considerable manpower and material resources, and the whole workflow suffers from low data quality, low processing efficiency, and unreliable results. Moreover, although drilling engineering reports follow certain writing specifications, in practice they differ because of the writing habits of their authors and other factors, so an approach that relies solely on hand-written rules is not feasible.
Disclosure of Invention
To solve at least one of the above technical problems, the invention provides a method and a system for automatically extracting drilling engineering report information, addressing the problem that current extraction of drilling engineering report information depends too heavily on manual work and industry experience.
In a first aspect, a method for automatically extracting drilling engineering report information is provided, comprising the following steps:
acquiring a drilling engineering report document;
positioning and cutting the drilling engineering report document according to the target keywords to obtain target data containing the target keywords;
and carrying out ternary relation recognition extraction based on the target data to obtain relation triplet information.
Preferably, after obtaining the relationship triplet information, the method further includes:
carrying out structuring treatment on the relation triplet information to obtain ternary structured data;
and carrying out abnormal data correction on the ternary structured data based on geological stratum data.
Preferably, the acquiring the drilling engineering report document includes:
acquiring a drilling engineering report;
if the drilling engineering report is in Word or PDF format, converting the text format of the drilling engineering report to obtain a drilling engineering report document;
and if the drilling engineering report is in the image format, performing OCR text recognition on the drilling engineering report to obtain a drilling engineering report document.
Preferably, the positioning and cutting the drilling engineering report document according to the target keyword to obtain target data including the target keyword includes:
writing a regular expression according to the target keyword to obtain a keyword segmentation regular expression;
positioning and cutting the drilling engineering report document according to the keyword segmentation regular expression to obtain the target data;
wherein the target data includes target form data and target text block data.
Preferably, the performing ternary relation recognition and extraction based on the target data to obtain relation triplet information includes:
and if the target data is target table data, performing table shape mapping and content analysis on the target data to obtain relation triplet information.
Preferably, the performing ternary relation recognition and extraction based on the target data to obtain relation triplet information further includes:
and if the target data is the target text block data, performing named entity recognition and ternary relation extraction on the target data to obtain relation triplet information.
Preferably, after the abnormal data correction is performed on the ternary structured data based on geological stratum data, the method further includes:
and transmitting the corrected ternary structured data into an oilfield data lake based on an oilfield transmission protocol.
In a second aspect, an automatic drilling engineering report information extraction system is provided, including:
the data acquisition unit is used for acquiring a drilling engineering report document;
the first data processing unit is used for carrying out positioning cutting on the drilling engineering report document according to the target keywords to obtain target data containing the target keywords;
and the second data processing unit is used for carrying out ternary relation recognition extraction based on the target data to obtain relation triplet information.
In a third aspect, an electronic device is provided, including a processor, a communication interface, a memory, and a communication bus, where the processor, the communication interface, and the memory complete communication with each other through the communication bus; a memory for storing a computer program; and the processor is used for realizing the automatic extraction method of the drilling engineering report information when executing the program stored in the memory.
In a fourth aspect, a computer readable storage medium is provided, in which a computer program is stored, the computer program comprising program instructions which, when executed by a processor of an electronic device, cause the processor to perform the method of automatically extracting drilling engineering report information.
According to the method, the device and the system, the drilling engineering report document is automatically identified and positioned, the data containing the target keywords are extracted, the ternary relationship is further identified, extraction and data structuring of the target key data are achieved, and therefore dependence of drilling engineering report information extraction on manual and industry experience is reduced; the abnormal data is corrected based on the geological stratum data, so that the problems of accuracy and reliability of the geological data are solved, and the data quality is improved; based on the report writing specification of the drilling industry, regular expressions are written aiming at target keywords, so that the report documents can be accurately positioned and cut, required target data can be efficiently obtained, and the accuracy and efficiency of data extraction are improved.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.
Drawings
In order to more clearly describe the embodiments of the present invention or the technical solutions in the background art, the following description will describe the drawings that are required to be used in the embodiments of the present invention or the background art.
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the disclosure and together with the description, serve to explain the principles of the disclosure.
FIG. 1 is a schematic flow chart of a method for automatically extracting report information of drilling engineering according to an embodiment of the present invention;
FIG. 2 is a diagram showing a ternary relation extraction process performed by the BiLSTM-Attention-CRF model provided by the embodiment of the invention;
FIG. 3 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
In order that those skilled in the art will better understand the present invention, a technical solution in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in which it is apparent that the described embodiments are only some embodiments of the present invention, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The terms first, second and the like in the description and in the claims and in the above-described figures are used for distinguishing between different objects and not necessarily for describing a sequential or chronological order. Furthermore, the terms "comprise" and "have," as well as any variations thereof, are intended to cover a non-exclusive inclusion. For example, a process, method, system, article, or apparatus that comprises a list of steps or elements is not limited to only those listed steps or elements but may include other steps or elements not listed or inherent to such process, method, article, or apparatus.
The term "and/or" is herein merely an association relationship describing an associated object, meaning that there may be three relationships, e.g., a and/or B, may represent: a exists alone, A and B exist together, and B exists alone. In addition, the term "at least one" herein means any one of a plurality or any combination of at least two of a plurality, for example, including at least one of A, B, C, and may mean including any one or more elements selected from the group consisting of A, B and C.
Reference herein to "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment may be included in at least one embodiment of the invention. The appearances of such phrases in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. Those of skill in the art will explicitly and implicitly appreciate that the embodiments described herein may be combined with other embodiments.
Furthermore, in the following detailed description, numerous specific details are set forth in order to provide a better illustration of the invention. It will be understood by those skilled in the art that the present invention may be practiced without some of these specific details. In some instances, well known methods, procedures, components, and circuits have not been described in detail so as not to obscure the present invention.
At present, drilling engineering reports are stored in various formats and contain structured, semi-structured, and unstructured information, and their terminology presents a professional barrier. Personnel with industry background and practical experience are therefore needed to search, process, and interpret the information during specific operations or information base construction, which consumes considerable manpower and material resources, and the whole workflow suffers from low data quality, low processing efficiency, and unreliable results. Accordingly, the present invention is directed to a method for automatically extracting drilling engineering report information, the method comprising: acquiring a drilling engineering report document; positioning and cutting the drilling engineering report document according to the target keywords to obtain target data containing the target keywords; and performing ternary relation recognition and extraction based on the target data to obtain relation triplet information. By automatically recognizing the drilling engineering report document, the invention extracts and structures the target key data, thereby reducing the dependence of drilling engineering report information extraction on manual work and industry experience.
Example 1
An automatic extraction method of drilling engineering report information is provided, referring to fig. 1, including:
s100, acquiring a drilling engineering report document;
s200, positioning and cutting the drilling engineering report document according to the target keywords to obtain target data containing the target keywords;
s300, performing ternary relation recognition extraction based on the target data to obtain relation triplet information.
In this embodiment, referring to FIG. 2, the drilling engineering report document is located and cut using target keywords such as "wellbore trajectory", "wellbore structure", "drilling tool assembly", "formation temperature gradient", "formation pressure gradient", and "physical lithology", yielding multiple pieces of target data that contain the target keywords. The target data are input into a pre-trained BiLSTM-Attention-CRF model for ternary relation extraction to obtain relation triplet information. The relation triplet information has the structure "subject-relation-object": for example, the subject "Huangliu Formation", the relation "develops", and the object "thick-layer mudstone" form the relation triplet "Huangliu Formation develops thick-layer mudstone".
In one possible embodiment, the sentence or paragraph containing the target keyword may be automatically output by locating and cutting the document using natural language processing techniques and machine learning algorithms. The natural language processing technology comprises text word segmentation, part-of-speech tagging, syntactic analysis and the like, and language information in the document can be converted into a form which can be processed by a computer. On this basis, the labeled data can be trained by using a machine learning algorithm to construct a classification model to automatically identify and extract sentences or paragraphs containing target keywords. The method can efficiently extract the information related to the target keywords from the drilling engineering report document, help the user to acquire the effective information more quickly, and reduce the data processing cost and workload.
Preferably, after obtaining the relationship triplet information, the method further comprises:
carrying out structuring treatment on the relation triplet information to obtain ternary structured data;
and carrying out abnormal data correction on the ternary structured data based on the geological stratum data.
In this embodiment, the extracted relation triplet information is mapped into XML elements that carry the corresponding attributes. This converts unstructured data into structured data so that subsequent data analysis and processing are more convenient. In this process, appropriate attributes need to be defined for the different data types and for the reasonable ranges of the geological formation data, and the elements are mapped to their corresponding attributes so that the data are represented effectively.
To ensure data quality and accuracy, data cleaning and abnormal data correction are guided by drilling engineering design formulas and by the reasonable ranges of the geological formation data. The relations between data items can be predicted from the drilling engineering design formulas, and missing and abnormal values can be interpolated or replaced to achieve higher data accuracy. With abnormal data correction and structured data preprocessing, drilling engineering data become easy to compare, generalize, and analyze, which provides strong support for subsequent information extraction.
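The structuring and correction steps above can be sketched as follows. This is a minimal illustration, not the patented implementation: the element and attribute names (`drillingReport`, `triple`, `subject`, `relation`, `corrected`), the relation keys, and the numeric bounds in `PLAUSIBLE_RANGES` are all hypothetical, and replacing an out-of-range value with the mean of the in-range values is only one assumed replacement strategy.

```python
import xml.etree.ElementTree as ET

# Hypothetical plausibility ranges; in practice the bounds would come from
# geological formation data and drilling engineering design formulas.
PLAUSIBLE_RANGES = {
    "formation_pressure_gradient": (0.8, 2.5),
    "formation_temperature_gradient": (1.0, 6.0),
}


def triples_to_xml(triples):
    """Map (subject, relation, object) triples to XML elements with attributes."""
    root = ET.Element("drillingReport")
    for subject, relation, obj in triples:
        item = ET.SubElement(
            root, "triple", {"subject": subject, "relation": relation}
        )
        item.text = str(obj)
    return root


def correct_anomalies(root, ranges=PLAUSIBLE_RANGES):
    """Replace missing or out-of-range values with the mean of the valid ones."""
    for relation, (low, high) in ranges.items():
        items = [e for e in root.iter("triple") if e.get("relation") == relation]
        values = []
        for element in items:
            try:
                values.append(float(element.text))
            except (TypeError, ValueError):
                values.append(None)  # missing or non-numeric value
        valid = [v for v in values if v is not None and low <= v <= high]
        if not valid:
            continue  # nothing trustworthy to replace with
        fallback = sum(valid) / len(valid)
        for element, value in zip(items, values):
            if value is None or not (low <= value <= high):
                element.text = f"{fallback:.3f}"
                element.set("corrected", "true")  # keep the correction traceable
    return root
```

Marking corrected elements with an attribute keeps replaced values distinguishable from values read directly from the report.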
Preferably, obtaining the drilling engineering report document comprises:
acquiring a drilling engineering report;
if the drilling engineering report is in Word or PDF format, converting the text format of the drilling engineering report to obtain a drilling engineering report document;
and if the drilling engineering report is in the image format, performing OCR text recognition on the drilling engineering report to obtain a drilling engineering report document.
In this embodiment, electronic drilling engineering reports in formats such as Word, PDF, and JPG are obtained. To analyze and process files in different formats, the format of the file must be determined first. If the file is a Word document, it is parsed with the docx module of Python, and the text, paragraphs, tables, and other information in the document are extracted to obtain the drilling engineering report document. If the file is a PDF document, it is parsed with the Fitz module of Python, and the text, images, tables, and other content in the document are extracted to obtain the drilling engineering report document. If the file is an image such as JPG or PNG, the text in the image is extracted with OCR technology (e.g., Tesseract or Google Cloud Vision), the table structure is then recognized with a TableMaster model to identify and extract the table information in the image, and the text information and table information are merged to obtain the drilling engineering report document.
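The format dispatch described above can be sketched as follows. This is a hedged sketch under several assumptions: the format is decided from the file extension only, the function names (`detect_format`, `read_word`, `read_pdf`, `read_image`, `load_report_text`) are hypothetical, OCR uses the `pytesseract` wrapper with the `chi_sim` language pack, and the TableMaster table-structure step is omitted. The third-party packages (python-docx, PyMuPDF, pytesseract, Pillow) are imported lazily, so only the reader that is actually used needs to be installed.

```python
from pathlib import Path

WORD_EXTENSIONS = {".docx"}  # python-docx reads .docx, not legacy .doc
PDF_EXTENSIONS = {".pdf"}
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def detect_format(path):
    """Classify a report file by extension (an assumed, simplified check)."""
    extension = Path(path).suffix.lower()
    if extension in WORD_EXTENSIONS:
        return "word"
    if extension in PDF_EXTENSIONS:
        return "pdf"
    if extension in IMAGE_EXTENSIONS:
        return "image"
    raise ValueError(f"unsupported report format: {extension!r}")


def read_word(path):
    import docx  # provided by the python-docx package

    document = docx.Document(path)
    lines = [p.text for p in document.paragraphs if p.text.strip()]
    for table in document.tables:
        for row in table.rows:
            # Flatten each table row to a tab-separated line of cell text.
            lines.append("\t".join(cell.text.strip() for cell in row.cells))
    return "\n".join(lines)


def read_pdf(path):
    import fitz  # provided by the PyMuPDF package

    with fitz.open(path) as pdf:
        return "\n".join(page.get_text() for page in pdf)


def read_image(path):
    import pytesseract
    from PIL import Image

    # Assumes a Tesseract install with the simplified-Chinese language data.
    return pytesseract.image_to_string(Image.open(path), lang="chi_sim")


def load_report_text(path):
    """Return the plain text of a report, whatever its storage format."""
    readers = {"word": read_word, "pdf": read_pdf, "image": read_image}
    return readers[detect_format(path)](path)
```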
Preferably, positioning and cutting the drilling engineering report document according to the target keywords to obtain target data containing the target keywords, including:
writing a regular expression according to the target keywords to obtain keyword segmentation regular expressions;
positioning and cutting the drilling engineering report document according to the keyword segmentation regular expression to obtain target data;
wherein the target data includes target form data and target text block data.
In this embodiment, if the file is judged to be a design report from the file name and the first-page title of the drilling engineering report document, regular expressions are written for fields such as "well name", "formation temperature and pressure prediction", "directional well data table", "wellbore and casing program", "drilling tool assembly design", "system and main performance of each well section", "hydraulic parameter calculation", and "drilling tool friction and torque calculation" to obtain the keyword segmentation regular expressions. The drilling engineering report document is then located and cut according to the keyword segmentation regular expressions to obtain the corresponding target table data or target text block data.
In one possible embodiment, the file is judged to be a completion report from the file name and the first-page title of the drilling engineering report document. Based on the drilling engineering design format in the ocean drilling manual, regular expressions are written for fields such as "well name", "formation temperature and pressure prediction", "drilling engineering basic data", "casing program", "directional well data", "brief description of formation layering and lithology", and "drilling engineering log" to obtain the keyword segmentation regular expressions. The drilling engineering report document is then located and cut according to the keyword segmentation regular expressions to obtain the corresponding target table data or target text block data.
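Keyword-based positioning and cutting can be sketched with Python's `re` module as below. The sketch assumes that each target keyword appears as a heading on its own line, optionally preceded by a section number such as "2.1" and optionally followed by a colon; the function names and this heading convention are assumptions for illustration, not the regular expressions used in the patent.

```python
import re


def build_heading_pattern(keywords):
    """Compile one regex matching any keyword used as a section heading."""
    # Longer keywords first, so a short keyword cannot shadow a longer one.
    ordered = sorted(keywords, key=len, reverse=True)
    alternatives = "|".join(re.escape(keyword) for keyword in ordered)
    return re.compile(
        r"^[ \t]*(?:\d+(?:\.\d+)*[.、]?[ \t]*)?"  # optional section numbering
        rf"(?P<kw>{alternatives})"
        r"[ \t]*[:：]?[ \t]*$",                   # optional trailing colon
        re.MULTILINE,
    )


def cut_sections(text, keywords):
    """Return {keyword: body text up to the next keyword heading}."""
    pattern = build_heading_pattern(keywords)
    matches = list(pattern.finditer(text))
    sections = {}
    for index, match in enumerate(matches):
        if index + 1 < len(matches):
            end = matches[index + 1].start()
        else:
            end = len(text)
        sections[match.group("kw")] = text[match.end():end].strip()
    return sections
```

Each returned body can then be routed to table parsing or text-block extraction, depending on its content.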
Preferably, the performing ternary relation recognition and extraction based on the target data to obtain relation triplet information includes:
and if the target data is the target table data, performing table shape mapping and content analysis on the target data to obtain the relation triplet information.
In this embodiment, table shape mapping and content analysis are performed with TableMaster to obtain structured data such as the casing program data table and the directional well data table. The keywords "drilling operation time efficiency" and "complex conditions and downhole accidents" are then queried, and semi-structured table information is extracted using a position matching method for merged-cell table information to obtain the relation triplet information.
In one possible embodiment, table shape mapping is performed according to the text block positions, and the Bert-base-Chinese pre-trained model is called to perform syntactic analysis on the table content to obtain relation triplet information for the directional well data table, the wellbore and casing program, and the like.
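Once a table has been recovered as rows of cells, mapping it to relation triplets can be as simple as the sketch below. It assumes a regular table whose first row is the header and whose first column names the subject of each row. Merged cells and the TableMaster or BERT steps described above are not modeled, and the table contents in the usage example are made-up illustrative values.

```python
def table_to_triples(rows):
    """Turn a header-plus-body table into (subject, relation, object) triples.

    rows[0] is assumed to be the header row; the first cell of every later
    row is assumed to name the subject that the remaining cells describe.
    """
    header, *body = rows
    triples = []
    for row in body:
        subject = row[0].strip()
        for column_name, cell in zip(header[1:], row[1:]):
            value = cell.strip()
            if subject and value:  # skip empty cells instead of guessing
                triples.append((subject, column_name.strip(), value))
    return triples


# Usage with illustrative (not real) casing program values.
casing_table = [
    ["Casing", "Outer diameter (mm)", "Setting depth (m)"],
    ["Surface casing", "508", "300"],
    ["Production casing", "177.8", ""],
]
casing_triples = table_to_triples(casing_table)
```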
Preferably, the performing ternary relation recognition and extraction based on the target data to obtain the relation triplet information further comprises:
and if the target data is the target text block data, performing named entity recognition and ternary relation extraction on the target data to obtain relation triplet information.
In this embodiment, a pre-trained BiLSTM-Attention-CRF model is used to recognize and disambiguate named entities such as geographic positions and geological descriptions; longitude and latitude, lithology, and the corresponding values or descriptions are extracted, and relation triplet information is established.
In one possible embodiment, a pre-trained BiLSTM-Attention-CRF model is used to recognize and semantically disambiguate named entities such as well sections and formations, and temperature, pressure, and the corresponding values are extracted to establish relation triplet information.
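A sequence-labeling model with a CRF output layer emits one tag per token, and those tags still have to be decoded into entities before a triplet can be assembled. The sketch below shows that post-processing step only. It assumes a BIO tagging scheme with the hypothetical labels `SUBJ`, `REL`, and `OBJ`; the patent does not state the label set, and the model itself is not reproduced here.

```python
def decode_bio(tokens, tags):
    """Group tokens into (label, text) entities from BIO tags."""
    entities = []
    label, words = None, []
    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            if words:
                entities.append((label, " ".join(words)))
            label, words = tag[2:], [token]  # start a new entity
        elif tag.startswith("I-") and label == tag[2:]:
            words.append(token)              # continue the current entity
        else:
            # "O" tag, or an "I-" tag that does not continue the entity.
            if words:
                entities.append((label, " ".join(words)))
            label, words = None, []
    if words:
        entities.append((label, " ".join(words)))
    return entities


def assemble_triple(entities):
    """Build one (subject, relation, object) triple, or None if incomplete."""
    first = {}
    for label, text in entities:
        first.setdefault(label, text)  # keep the first entity of each label
    if all(key in first for key in ("SUBJ", "REL", "OBJ")):
        return (first["SUBJ"], first["REL"], first["OBJ"])
    return None
```

For Chinese text the tokens would typically be characters joined without spaces; the space join here is chosen for the English illustration.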
Preferably, after performing the abnormal data correction on the ternary structured data based on the geological formation data, the method further comprises:
and transmitting the corrected ternary structured data into an oilfield data lake based on an oilfield transmission protocol.
It should be noted that the oilfield transmission protocol is a standard protocol for transmitting data in oilfield engineering. The protocol aims at realizing data exchange and sharing among different devices and systems so as to support real-time monitoring, control and optimization of oil well and drilling operation and ensure consistency, accuracy and timeliness of data. Data exchange becomes more reliable and efficient due to the same standard used between the device and the system. In addition, oilfield transport protocols provide a data security and integrity protection mechanism to prevent unauthorized access and data tampering.
In this embodiment, the corrected ternary structured data (XML data) are transferred into the oilfield data lake according to the oilfield transmission protocol WITS or WITSML. Drilling engineering data transferred into the oilfield data lake can be stored and indexed effectively. The data lake uses distributed storage technology, so its storage capacity can be expanded while it provides high performance and high availability. In addition, transmitting the data with the WITS protocol helps ensure the accuracy and consistency of the data and avoids errors or losses during transmission.
In one possible embodiment, a training set and a test set are generated from a sample set containing drilling engineering report documents of multiple format types; the BiLSTM-Attention-CRF model is trained with the training set to obtain a trained BiLSTM-Attention-CRF model; the trained BiLSTM-Attention-CRF model is tested with the test set, and if the test result meets the prediction accuracy requirement, the trained BiLSTM-Attention-CRF model is determined to be the ternary relation extraction model.
For training and testing, the dataset is split into a training set and a test set at a ratio of 8:2 for model development and performance evaluation. This ensures that the model can extract triplets accurately and that its generalization ability is verified on unseen data.
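The 8:2 split can be sketched with the standard library alone. Shuffling before the split and fixing a seed for reproducibility are assumptions; the source states only the ratio.

```python
import random


def split_dataset(samples, train_ratio=0.8, seed=42):
    """Shuffle a copy of the samples and split it into train and test sets."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)  # seeded for a reproducible split
    cut = int(len(shuffled) * train_ratio)
    return shuffled[:cut], shuffled[cut:]


# Usage with placeholder sample identifiers.
train_set, test_set = split_dataset([f"report_{i}" for i in range(10)])
```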
Example 2
An automatic extraction system for report information of drilling engineering is provided, which comprises:
the data acquisition unit is used for acquiring a drilling engineering report document;
the first data processing unit is used for carrying out positioning cutting on the drilling engineering report document according to the target keywords to obtain target data containing the target keywords;
and the second data processing unit is used for carrying out ternary relation recognition extraction based on the target data to obtain relation triplet information.
Example 3
Referring to fig. 3, fig. 3 is a schematic structural diagram of an electronic device according to an embodiment of the present invention. The electronic device 600 as shown in fig. 3 includes: at least one processor 601, memory 602, at least one network interface 604, and other user interfaces 603. The various components in the electronic device 600 are coupled together by a bus system 605. It is understood that the bus system 605 is used to enable connected communications between these components. The bus system 605 includes a power bus, a control bus, and a status signal bus in addition to a data bus. But for clarity of illustration the various buses are labeled as bus system 605 in fig. 3.
The user interface 603 may include, among other things, a display, a keyboard, or a pointing device (e.g., a mouse, a trackball, a touch pad, or a touch screen, etc.).
It is to be appreciated that the memory 602 in embodiments of the invention may be either volatile memory or nonvolatile memory, or may include both volatile and nonvolatile memory. The nonvolatile memory may be a Read-Only Memory (ROM), a Programmable ROM (PROM), an Erasable PROM (EPROM), an Electrically Erasable PROM (EEPROM), or a flash memory. The volatile memory may be Random Access Memory (RAM), which acts as an external cache. By way of example, and not limitation, many forms of RAM are available, such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDR SDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), and Direct Rambus RAM (DRRAM). The memory 602 described herein is intended to comprise, without being limited to, these and any other suitable types of memory.
In some implementations, the memory 602 stores the following elements, executable units or data structures, or a subset thereof, or an extended set thereof: an operating system 6021 and application programs 6022.
The operating system 6021 includes various system programs, such as a framework layer, a core library layer, a driver layer, and the like, for implementing various basic services and processing hardware-based tasks. The application 6022 includes various application programs such as a Media Player (Media Player), a Browser (Browser), and the like for realizing various application services. The program for implementing the method of the embodiment of the present invention may be included in the application 6022.
In the embodiment of the present invention, the processor 601 is configured to execute the steps of a method for automatically extracting drilling engineering report information provided by the embodiments of the present invention by calling a program or an instruction stored in the memory 602, specifically, a program or an instruction stored in the application 6022.
Example 4
There is provided a computer readable storage medium having stored therein a computer program comprising program instructions which, when executed by a processor of an electronic device, cause the processor to perform the method for automatically extracting drilling engineering report information.
The foregoing is only a specific embodiment of the invention to enable those skilled in the art to understand or practice the invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (10)

1. The automatic extraction method of the drilling engineering report information is characterized by comprising the following steps of:
acquiring a drilling engineering report document;
positioning and cutting the drilling engineering report document according to the target keywords to obtain target data containing the target keywords;
and carrying out ternary relation recognition extraction based on the target data to obtain relation triplet information.
2. The automatic extraction method for drilling engineering report information according to claim 1, wherein, after the relation triplet information is obtained, the method further comprises:
structuring the relation triplet information to obtain ternary structured data; and
correcting abnormal data in the ternary structured data based on geological formation data.
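By way of non-limiting illustration only, the structuring and abnormal data correction of claim 2 might be sketched in Python as follows. The record layout, the reference table `FORMATION_DEPTHS`, its values, and the rule of flagging a depth that falls outside a reference interval are all assumptions made for this sketch; the claim does not specify them.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Record:
    subject: str            # e.g. a formation name
    relation: str           # e.g. "bottom_depth_m"
    value: Optional[float]  # None when the object is not numeric
    abnormal: bool = False


# Hypothetical geological formation data: name -> (min depth, max depth) in metres.
FORMATION_DEPTHS = {"Formation A": (1200.0, 1850.0)}


def structure(triples):
    """Turn (subject, relation, object) string triples into typed records."""
    records = []
    for subject, relation, obj in triples:
        try:
            value = float(obj)
        except ValueError:
            value = None
        records.append(Record(subject, relation, value))
    return records


def correct(records, reference=FORMATION_DEPTHS):
    """Flag numeric values that fall outside the reference interval (assumed rule)."""
    for rec in records:
        bounds = reference.get(rec.subject)
        if bounds is None or rec.value is None:
            continue  # nothing to compare against
        low, high = bounds
        if not low <= rec.value <= high:
            rec.abnormal = True
    return records
```

In this sketch a flagged record is kept rather than overwritten, so that a later step (or a reviewer) can decide how to repair it.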
3. The automatic extraction method for drilling engineering report information according to claim 1, wherein acquiring the drilling engineering report document comprises:
acquiring a drilling engineering report;
if the drilling engineering report is in Word or PDF format, performing text format conversion on the drilling engineering report to obtain the drilling engineering report document; and
if the drilling engineering report is in an image format, performing OCR text recognition on the drilling engineering report to obtain the drilling engineering report document.
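By way of non-limiting illustration only, the format-dependent branching of claim 3 might be sketched as below. The file-extension sets and the caller-supplied callables `convert_text` and `run_ocr` are hypothetical; no particular conversion or OCR library is implied by the claim or by this sketch.

```python
from pathlib import Path

# Assumed extension sets; the claim names only "Word or PDF" and "image" formats.
TEXT_FORMATS = {".doc", ".docx", ".pdf"}
IMAGE_FORMATS = {".png", ".jpg", ".jpeg", ".tif", ".tiff"}


def choose_route(report_path):
    """Decide whether a report needs text conversion or OCR."""
    suffix = Path(report_path).suffix.lower()
    if suffix in TEXT_FORMATS:
        return "convert"
    if suffix in IMAGE_FORMATS:
        return "ocr"
    raise ValueError(f"unsupported report format: {suffix!r}")


def acquire_document(report_path, convert_text, run_ocr):
    """Return the report document text.

    convert_text and run_ocr are hypothetical callables taking a path and
    returning text; they stand in for whatever converter and OCR engine are used.
    """
    handler = convert_text if choose_route(report_path) == "convert" else run_ocr
    return handler(report_path)
```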
4. The automatic extraction method for drilling engineering report information according to claim 1, wherein positioning and cutting the drilling engineering report document according to the target keywords to obtain the target data containing the target keywords comprises:
writing a regular expression according to the target keywords to obtain a keyword segmentation regular expression;
positioning and cutting the drilling engineering report document according to the keyword segmentation regular expression to obtain the target data;
wherein the target data includes target table data and target text block data.
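By way of non-limiting illustration only, building a keyword segmentation regular expression and cutting a document with it might look like the following sketch. The function names and the choice to cut from each keyword occurrence up to the next one are assumptions; the sketch also does not distinguish table data from text block data.

```python
import re


def build_pattern(keywords):
    """Compile one alternation pattern from the target keywords."""
    if not keywords:
        raise ValueError("at least one target keyword is required")
    # Longer keywords first, so a keyword is not shadowed by its own prefix.
    ordered = sorted(keywords, key=len, reverse=True)
    return re.compile("|".join(re.escape(k) for k in ordered))


def cut_by_keywords(document, keywords):
    """Return (keyword, segment) pairs; each segment runs to the next keyword."""
    matches = list(build_pattern(keywords).finditer(document))
    segments = []
    for i, match in enumerate(matches):
        end = matches[i + 1].start() if i + 1 < len(matches) else len(document)
        segments.append((match.group(0), document[match.start():end].strip()))
    return segments
```

Escaping each keyword with `re.escape` keeps characters such as parentheses in a keyword from being read as regular expression syntax.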
5. The automatic extraction method for drilling engineering report information according to claim 4, wherein performing ternary relation recognition and extraction based on the target data to obtain the relation triplet information comprises:
if the target data is the target table data, performing table shape mapping and content analysis on the target data to obtain the relation triplet information.
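By way of non-limiting illustration only, one simple way to map a table to relation triplets is sketched below. It assumes a table shape in which the first row is a header and the first column names the subject of each row; other table shapes would need their own mapping, and this layout is an assumption rather than something the claim specifies.

```python
def table_to_triples(rows):
    """Map a header-plus-rows table to (subject, relation, object) triples.

    Assumed shape: rows[0] is the header, column 0 holds the row subject,
    and every other header cell names a relation.
    """
    if len(rows) < 2:
        return []
    header, *body = rows
    triples = []
    for row in body:
        subject = row[0].strip()
        for relation, cell in zip(header[1:], row[1:]):
            cell = cell.strip()
            if subject and cell:  # skip empty subjects and empty cells
                triples.append((subject, relation.strip(), cell))
    return triples
```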
6. The automatic extraction method for drilling engineering report information according to claim 4, wherein performing ternary relation recognition and extraction based on the target data to obtain the relation triplet information further comprises:
if the target data is the target text block data, performing named entity recognition and ternary relation extraction on the target data to obtain the relation triplet information.
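By way of non-limiting illustration only, the sketch below extracts triplets from a text block with a hand-written rule. This is a rule-based stand-in chosen for brevity, not a trained named entity recognition model; the two relation phrases and the metre-valued object are assumptions made for the example.

```python
import re

# Assumed rule: "<subject> is set at|is drilled to <number> m".
RELATION_PATTERN = re.compile(
    r"(?P<subject>[A-Za-z][A-Za-z ]*?)\s+"
    r"(?P<relation>is set at|is drilled to)\s+"
    r"(?P<object>\d+(?:\.\d+)?\s*m)\b"
)
LEADING_ARTICLE = re.compile(r"^(?:the|a|an)\s+", re.IGNORECASE)


def extract_triples(text_block):
    """Return (subject, relation, object) triples found by the rule above."""
    triples = []
    for match in RELATION_PATTERN.finditer(text_block):
        subject = LEADING_ARTICLE.sub("", match.group("subject").strip())
        triples.append((subject, match.group("relation"), match.group("object")))
    return triples
```

A learned model would replace `RELATION_PATTERN` while keeping the same triple-shaped output, so downstream structuring does not need to change.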
7. The automatic extraction method for drilling engineering report information according to claim 2, wherein, after the abnormal data correction is performed on the ternary structured data based on the geological formation data, the method further comprises:
transmitting the corrected ternary structured data to an oilfield data lake based on an oilfield transmission protocol.
8. An automatic extraction system for drilling engineering report information, comprising:
a data acquisition unit, configured to acquire a drilling engineering report document;
a first data processing unit, configured to position and cut the drilling engineering report document according to target keywords to obtain target data containing the target keywords; and
a second data processing unit, configured to perform ternary relation recognition and extraction based on the target data to obtain relation triplet information.
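By way of non-limiting illustration only, the three units of claim 8 might be composed as in the sketch below. The class name `ExtractionSystem` and the representation of each unit as a plain callable are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Triple = Tuple[str, str, str]


@dataclass
class ExtractionSystem:
    acquire: Callable[[str], str]               # data acquisition unit
    locate_and_cut: Callable[[str], List[str]]  # first data processing unit
    extract: Callable[[str], List[Triple]]      # second data processing unit

    def run(self, report_source: str) -> List[Triple]:
        """Acquire the document, cut out target data, then extract triples."""
        document = self.acquire(report_source)
        triples: List[Triple] = []
        for target in self.locate_and_cut(document):
            triples.extend(self.extract(target))
        return triples
```

Keeping each unit behind a callable lets a table extractor and a text extractor be swapped in without changing `run`.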
9. An electronic device, characterized by comprising a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory communicate with one another through the communication bus;
the memory is configured to store a computer program; and
the processor is configured to implement the automatic extraction method for drilling engineering report information according to any one of claims 1 to 7 when executing the program stored in the memory.
10. A computer readable storage medium, characterized in that the computer readable storage medium has stored therein a computer program comprising program instructions which, when executed by a processor of an electronic device, cause the processor to perform the automatic extraction method for drilling engineering report information according to any one of claims 1 to 7.
CN202410009784.8A 2024-01-02 2024-01-02 Automatic extraction method and system for drilling engineering report information Pending CN117829106A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202410009784.8A CN117829106A (en) 2024-01-02 2024-01-02 Automatic extraction method and system for drilling engineering report information

Publications (1)

Publication Number Publication Date
CN117829106A (en) 2024-04-05

Family

ID=90520751

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202410009784.8A Pending CN117829106A (en) 2024-01-02 2024-01-02 Automatic extraction method and system for drilling engineering report information

Country Status (1)

Country Link
CN (1) CN117829106A (en)

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination