CN115132366A - Multi-source data processing method and system based on health and medical big data standard library - Google Patents

Multi-source data processing method and system based on health and medical big data standard library Download PDF

Info

Publication number
CN115132366A
CN115132366A CN202210758887.5A CN202210758887A CN115132366A CN 115132366 A CN115132366 A CN 115132366A CN 202210758887 A CN202210758887 A CN 202210758887A CN 115132366 A CN115132366 A CN 115132366A
Authority
CN
China
Prior art keywords
metadata
data
health
medical
standard library
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210758887.5A
Other languages
Chinese (zh)
Inventor
殷晋
洪磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Electronic Science and Technology of China
Original Assignee
University of Electronic Science and Technology of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Electronic Science and Technology of China filed Critical University of Electronic Science and Technology of China
Priority to CN202210758887.5A priority Critical patent/CN115132366A/en
Publication of CN115132366A publication Critical patent/CN115132366A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/70ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/252Integrating or interfacing systems involving database management systems between a Database Management System and a front-end application
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/258Data format conversion from or to a database
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/80Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
    • G06F16/84Mapping; Conversion

Abstract

The invention relates to a multi-source data processing method and a system based on a health and medical big data standard library, comprising the following steps: acquiring medical health data of each service platform, and establishing a medical health database; extracting metadata in the medical health database, and converting the storage format of the metadata; classifying and storing the metadata according to the application category of the metadata, and establishing a metadata standard library based on a CWM (CWM meta model); analyzing and fusing metadata, managing metadata life cycle, managing metadata change and standardizing metadata; and performing data quality management, metadata standard management and metadata knowledge graph construction on the metadata. The method can automatically perform unified conversion and data standardization processing on data sources from different service platforms, establish a metadata standard library based on a CWM (CWM-metadata) meta-model, standardize the processing on metadata objects and facilitate the query and maintenance of data; and valuable data can be acquired beneficially by fusing and managing the data.

Description

Multi-source data processing method and system based on health and medical big data standard library
Technical Field
The invention belongs to the technical field of big data processing, and particularly relates to a multisource data processing method based on a health and medical big data standard library.
Background
Big data analysis techniques are continuously developed to provide better conditions for finding knowledge from a large amount of medical health data, in particular to process a large amount of patient data in medical and medical information and identify clusters and correlations between data.
However, at present, medical institutions in different areas usually adopt a plurality of different systems to process various medical health data, a large amount of data are processed by different information systems, once the data of the information systems are mixed, data integration is difficult, data responsibility boundary is fuzzy and the like due to the fact that the data have heterogeneity, and valuable data are difficult to obtain.
Disclosure of Invention
The invention provides a multisource data processing method and system based on a health and medical treatment big data standard library for solving the technical problems.
The technical scheme for solving the technical problems is as follows:
in a first aspect, the invention provides a multisource data processing method based on a health and medical big data standard library, which comprises the following steps:
acquiring medical health data of each service platform, and establishing a medical health database;
extracting metadata in the medical health database, and converting the storage format of the metadata;
classifying and storing the metadata according to the application types of the metadata, and establishing a metadata standard library based on the CWM;
establishing a metadata function component, and analyzing and fusing metadata, managing metadata life cycle, managing metadata change and standardizing the metadata by using the metadata function component;
and performing data quality management, metadata standard management and metadata knowledge graph construction on the metadata.
In a second aspect, the present invention provides a multisource data processing system based on a health and medical big data standard library, comprising: the system comprises a metadata acquisition layer, a metadata storage layer, a metadata analysis layer and a metadata application layer;
the metadata acquisition layer comprises a database acquisition unit, a data extraction unit and a database analysis and verification unit;
the database acquisition unit is used for acquiring the medical health data of each service platform and establishing a medical health database;
the data extraction unit is used for establishing a plurality of parallel data extraction processes to extract ETL data of the medical health data to obtain metadata, and cleaning and converting the storage format of the metadata;
the database analysis and verification unit is used for acquiring the health database logs, analyzing and verifying the health database logs and sending the verified metadata to the metadata storage layer;
the metadata storage layer includes: the system comprises a standard library construction unit, a warehousing conversion unit, a classification unit and a data warehouse unit;
the standard library building unit is used for building a metadata standard library based on the CWM;
the warehouse entry conversion unit is used for processing the metadata by using the public warehouse meta-model and converting the storage format of the metadata;
a classification unit for classifying the metadata according to an application category of the metadata;
the data warehouse unit is used for describing the classified metadata in an XML form and importing the metadata into the data warehouse through a metadata access interface of the development data warehouse;
the metadata analysis layer comprises an analysis and fusion component, a metadata life cycle management component, a metadata change management component and a metadata standardization processing component, and is used for analyzing and fusing metadata in the data warehouse, managing the metadata life cycle, managing the metadata change and standardizing the metadata;
the metadata application layer comprises a metadata quality management unit, a metadata standard management unit and a metadata knowledge graph construction unit and is used for performing data quality management and metadata standard management on metadata and constructing a metadata knowledge graph.
The invention has the beneficial effects that: the method can automatically collect data sources from different service platforms, uniformly convert the heterogeneous data and standardize the data, establish a metadata standard library based on a CWM (CWM-W-M) meta-model, standardize the processing of metadata objects and facilitate the query and maintenance of the data; and valuable data can be acquired beneficially by fusing and managing the data.
On the basis of the technical scheme, the invention can be improved as follows.
Further, the method for extracting metadata in the medical health database is to establish a plurality of parallel data extraction processes to extract ETL data of the medical health data, or to analyze health database logs.
The beneficial effect of adopting the further scheme is that the data in the distributed and heterogeneous data sources can be extracted to the temporary middle layer for cleaning, conversion and integration by carrying out ETL data extraction on the medical health data and analyzing the health database logs, and the metadata can be written into the data warehouse.
Further, the classified storage of the metadata according to the application categories of the metadata comprises:
establishing a typical word list related to the application category semantics according to the metadata application category;
performing text processing on the metadata, and dividing the metadata into text feature word combinations;
and matching the text characteristic word combinations according to the typical word list, taking the matching result as a metadata set, and storing the metadata set.
The method has the advantages that the metadata are matched with the text feature word combination through the typical word list, and the metadata are classified and stored according to the application categories of the metadata, so that the metadata can be quickly retrieved and extracted.
Further, parsing and fusing the metadata includes:
establishing a basic metadata corpus, and translating all metadata based on the metadata corpus;
carrying out duplicate removal, normalization and disambiguation processing on the metadata by using a natural language processing algorithm;
establishing a business data model, and classifying the metadata according to business types;
and establishing a knowledge base, and adding the classified metadata into a corresponding directory tree of the knowledge base.
The beneficial effect of adopting the further scheme is that the metadata from different systems are translated into the uniform language, which is beneficial to uniformly classifying the metadata, and the classified metadata is added into the corresponding directory tree of the knowledge base to be beneficial to searching the metadata index.
Further, metadata lifecycle management includes:
monitoring the metadata according to the progress of each data processing node in the life cycle of the metadata, generating XML data streams for all processing flows of the metadata, and converting the XML data streams into XML files in a standard metadata format;
and analyzing the association between each metadata in the same XML file based on the XML data stream.
The beneficial effect of adopting the further scheme is that the XML data stream is utilized to monitor the metadata, which is beneficial to ensuring the consistency and correctness of the metadata.
Further, the metadata change management includes: and comparing the structural change of the metadata of each data processing flow, and changing the structural change into a metadata standard library.
The beneficial effect of adopting the further scheme is that the standardized upgrading of the metadata standard library is realized.
Further, the metadata normalization process includes: metadata structure standardization, metadata value range standardization and interface service standardization.
The further scheme has the beneficial effect that the structure, the value threshold and the interface of the metadata are subjected to standardized processing, so that the processing efficiency of the metadata is improved.
Drawings
Fig. 1 is a flowchart of a multisource data processing method based on a big health and medical data standard library provided in embodiment 1 of the present invention;
FIG. 2 is a system diagram of a multi-source data processing system based on a health and medical big data standard library.
Detailed Description
The principles and features of this invention are described below in conjunction with examples which are set forth to illustrate, but are not to be construed to limit the scope of the invention.
Example 1
The embodiment provides a multi-source data processing method based on a health and medical big data standard library, as shown in fig. 1, the method includes:
acquiring medical health data of each service platform, and establishing a medical health database;
extracting metadata in the medical health database, and converting the storage format of the metadata;
classifying and storing the metadata according to the application types of the metadata, and establishing a metadata standard library based on the CWM;
establishing a metadata function component, and analyzing and fusing metadata, managing the life cycle of the metadata, managing metadata change and standardizing the metadata by using the metadata function component;
and performing data quality management, metadata standard management and metadata knowledge graph construction on the metadata.
The invention has the beneficial effects that: the method can automatically collect data sources from different service platforms, uniformly convert the heterogeneous data and standardize the data, establish a metadata standard library based on a CWM (CWM-W-M) meta-model, standardize the processing of metadata objects and facilitate the query and maintenance of the data; and valuable data can be acquired beneficially by fusing and managing the data.
Optionally, the method for extracting metadata from the medical health database is to establish multiple parallel data extraction processes to extract ETL data from the medical health data, or to analyze logs of the health database.
In actual application, before ETL data extraction is carried out on the medical health database, unstructured data is removed firstly. By carrying out ETL data extraction on the medical health data and analyzing the health database log, the data in the distributed and heterogeneous data sources can be extracted to the temporary middle layer for cleaning, conversion and integration, the metadata is written into the data warehouse, the medical health database log records the state of the data base, the execution state of statements and the consumption condition of resources, and the metadata is written into the data warehouse by analyzing the health database log.
Optionally, the classifying and storing the metadata according to the application category of the metadata includes:
establishing a typical word list related to the application category semantics according to the metadata application category;
performing text processing on the metadata, and dividing the metadata into text feature word combinations;
and matching the text characteristic word combinations according to the typical word list, taking the matching result as a metadata set, and storing the metadata set.
In the actual application process, the metadata is matched with the text characteristic word combination through the typical word list, and the metadata is classified and stored according to the application categories of the metadata, so that the metadata is standardized and classified and stored. The application categories of the metadata include: business metadata, technical metadata, and management metadata. The service metadata is used for describing service logic in the metadata processing process; the technical metadata is used for describing the storage structure type of the metadata, attribute values (such as the address of the metadata) data acquisition ports and the like; the management metadata is used for describing management processes, personnel organization and role responsibilities and the like; and rapidly retrieving and extracting the metadata by using the metadata field name and the field attribute value. And decoupling the target incidence relation among the technical metadata, the service metadata and the management metadata in the metadata through a map knowledge map engine to construct a metadata knowledge map.
Optionally, parsing and fusing the metadata includes:
establishing a basic metadata corpus, and translating all metadata based on the metadata corpus;
carrying out duplicate removal, normalization and disambiguation processing on the metadata by using a natural language processing algorithm;
establishing a service data model, and classifying the metadata according to service types;
and establishing a knowledge base, and adding the classified metadata into a corresponding directory tree of the knowledge base.
In the actual application process, the metadata from different systems are translated into the unified language, so that the metadata can be classified uniformly, and the classified metadata is added into the corresponding directory tree of the knowledge base to facilitate the metadata index search.
Optionally, the metadata lifecycle management includes:
monitoring the metadata according to the progress of each data processing node in the life cycle of the metadata, generating an XML data stream for all processing flows of the metadata, and converting the XML data stream into an XML file in a standard metadata format;
and analyzing the association between each metadata in the same XML file based on the XML data stream.
In the actual application process, the XML data stream is used for monitoring the metadata, so that the consistency and the correctness of the metadata can be ensured.
Optionally, the metadata change management includes: and comparing the structural change of the metadata of each data processing flow, and changing the structural change into a metadata standard library.
In the actual application process, the standardized upgrading of the metadata standard library is realized.
Optionally, the metadata normalization process includes: metadata structure standardization, metadata value range standardization and interface service standardization.
In the actual application process, the structure, the value threshold and the interface of the metadata are subjected to standardized processing, so that the metadata processing efficiency is improved.
Example 2
As shown in FIG. 2, the present invention provides a multi-source data processing system based on a health and medical big data standard library, comprising: the system comprises a metadata acquisition layer, a metadata storage layer, a metadata analysis layer and a metadata application layer;
the metadata acquisition layer comprises a database acquisition unit, a data extraction unit and a database analysis and verification unit;
the database acquisition unit is used for acquiring the medical health data of each service platform and establishing a medical health database;
the data extraction unit is used for establishing a plurality of parallel data extraction processes to extract ETL data of the medical health data to obtain metadata, and cleaning and converting the storage format of the metadata;
the database analysis and verification unit is used for acquiring the health database logs, analyzing and verifying the health database logs and sending the verified metadata to the metadata storage layer;
the metadata storage layer includes: the system comprises a standard library construction unit, a warehousing conversion unit, a classification unit and a data warehouse unit;
the standard library construction unit is used for establishing a metadata standard library based on the CWM meta-model;
the warehouse entry conversion unit is used for processing the metadata by using the public warehouse meta-model and converting the storage format of the metadata;
a classification unit for classifying the metadata according to an application category of the metadata;
the data warehouse unit is used for describing the classified metadata in an XML form and importing the metadata into the data warehouse through a metadata access interface of the development data warehouse;
the metadata analysis layer comprises an analysis and fusion component, a metadata life cycle management component, a metadata change management component and a metadata standardization processing component, and is used for analyzing and fusing metadata in the data warehouse, managing the metadata life cycle, managing the metadata change and carrying out metadata standardization processing;
the metadata application layer comprises a metadata quality management unit, a metadata standard management unit and a metadata knowledge graph construction unit and is used for performing data quality management and metadata standard management on metadata and constructing a metadata knowledge graph.
Optionally, the classifying and storing the metadata according to the application category of the metadata includes:
establishing a typical word list related to the application category semantics according to the metadata application category;
performing text processing on the metadata, and dividing the metadata into text feature word combinations;
and matching the text characteristic word combinations according to the typical word list, taking the matching result as a metadata set, and storing the metadata set.
Optionally, the parsing and fusing component includes:
the translation unit is used for establishing a basic metadata corpus and translating all metadata based on the metadata corpus;
the processing unit is used for carrying out duplicate removal, normalization and disambiguation processing on the metadata by utilizing a natural language processing algorithm;
the service classification unit is used for establishing a service data model and classifying the metadata according to service types;
and the knowledge base management unit is used for establishing a knowledge base and adding the classified metadata into a corresponding directory tree of the knowledge base.
Optionally, the metadata lifecycle management unit includes:
the file conversion unit is used for monitoring the metadata according to the progress of each data processing node in the life cycle of the metadata, generating XML data streams for all processing flows of the metadata and converting the XML data streams into XML files in a standard metadata format;
and the analysis unit is used for analyzing the association among the metadata in the same XML file based on the XML data stream.
Optionally, the metadata change management component includes: and the structure comparison component is used for comparing the structural change of the metadata of each data processing flow and changing the structural change into the metadata standard library.
Optionally, the metadata standardization processing component is used for standardization processing of a metadata structure, standardization processing of a metadata value range, and standardization processing of an interface service.
Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made to the above embodiments by those of ordinary skill in the art within the scope of the present invention.

Claims (8)

1. The multi-source data processing method based on the health medical big data standard library is characterized by comprising the following steps:
acquiring medical health data of each service platform, and establishing a medical health database;
extracting metadata in the medical health database, and converting the storage format of the metadata;
classifying and storing the metadata according to the application category of the metadata, and establishing a metadata standard library based on a CWM (CWM) meta-model;
establishing a metadata functional component, and analyzing and fusing the metadata, managing the life cycle of the metadata, changing and managing the metadata and standardizing the metadata by using the metadata functional component;
and performing data quality management, metadata standard management and metadata knowledge graph construction on the metadata.
2. The multi-source data processing method based on the big health and medical data standard library according to claim 1, wherein the method for extracting the metadata in the medical health database is to establish a plurality of parallel data extraction processes to perform ETL data extraction on the medical health data or to perform analysis on the health database logs.
3. The multi-source data processing method based on the big health and medical data standard library according to claim 1, wherein the classifying and storing the metadata according to the application categories of the metadata comprises:
establishing a typical word list semantically associated with the application category according to the metadata application category;
performing text processing on the metadata, and dividing the metadata into text feature word combinations;
and matching the text characteristic word combinations according to the typical word list, taking the matching result as a metadata set, and storing the metadata set.
4. The multi-source data processing method based on the health medical big data standard library according to claim 1, wherein the parsing and fusing the metadata comprises:
establishing a basic metadata corpus, and translating all metadata based on the metadata corpus;
carrying out duplicate removal, normalization and disambiguation on the metadata by using a natural language processing algorithm;
establishing a service data model, and classifying the metadata according to service types;
and establishing a knowledge base, and adding the classified metadata into a corresponding directory tree of the knowledge base.
5. The multi-source data processing method based on the big healthy data standard library according to claim 4, wherein the metadata life cycle management comprises:
monitoring the metadata according to the progress of each data processing node in the life cycle of the metadata, generating XML data streams for all processing flows of the metadata, and converting the XML data streams into XML files in a standard metadata format;
and analyzing the association between the metadata in the same XML file based on the XML data stream.
6. The multi-source data processing method based on the health care big data standard library according to claim 1, wherein the metadata change management comprises the following steps: and comparing the structural change of the metadata of each data processing flow, and changing the structural change into the metadata standard library.
7. The multi-source data processing method based on the big healthy and medical data standard library according to claim 1, wherein the metadata standardization process comprises the following steps: metadata structure standardization, metadata value range standardization and interface service standardization.
8. Multisource data processing system based on big data standard storehouse of health and medical care, its characterized in that includes: the system comprises a metadata acquisition layer, a metadata storage layer, a metadata analysis layer and a metadata application layer;
the metadata acquisition layer comprises a database acquisition unit, a data extraction unit and a database analysis and verification unit;
the database acquisition unit is used for acquiring medical health data of each service platform and establishing a medical health database;
the data extraction unit is used for establishing a plurality of parallel data extraction processes to extract ETL data of the medical health data to obtain metadata, and cleaning and converting the storage format of the metadata;
the database analysis and verification unit is used for acquiring a health database log, analyzing and verifying the health database log and sending the verified metadata to the metadata storage layer;
the metadata storage layer includes: the system comprises a standard library construction unit, a warehousing conversion unit, a classification unit and a data warehouse unit;
the standard library construction unit is used for establishing a metadata standard library based on the CWM;
the warehousing conversion unit is used for processing the metadata by using the public warehouse meta-model and converting the storage format of the metadata;
the classification unit is used for classifying the metadata according to the application categories of the metadata;
the data warehouse unit is used for describing the classified metadata in an XML form and importing the metadata into the data warehouse through a metadata access interface of the development data warehouse;
the metadata analysis layer comprises an analysis and fusion component, a metadata life cycle management component, a metadata change management component and a metadata standardization processing component and is used for analyzing and fusing the metadata in the data warehouse, managing the metadata life cycle, managing the metadata change and standardizing the metadata;
the metadata application layer comprises a metadata quality management unit, a metadata standard management unit and a metadata knowledge graph construction unit, and is used for performing data quality management, metadata standard management and metadata knowledge graph construction on the metadata.
CN202210758887.5A 2022-06-29 2022-06-29 Multi-source data processing method and system based on health and medical big data standard library Pending CN115132366A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210758887.5A CN115132366A (en) 2022-06-29 2022-06-29 Multi-source data processing method and system based on health and medical big data standard library

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210758887.5A CN115132366A (en) 2022-06-29 2022-06-29 Multi-source data processing method and system based on health and medical big data standard library

Publications (1)

Publication Number Publication Date
CN115132366A true CN115132366A (en) 2022-09-30

Family

ID=83382071

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210758887.5A Pending CN115132366A (en) 2022-06-29 2022-06-29 Multi-source data processing method and system based on health and medical big data standard library

Country Status (1)

Country Link
CN (1) CN115132366A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117056312A (en) * 2023-08-17 2023-11-14 安徽派偌汇科技咨询有限公司 Quick development platform based on metadata model

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117056312A (en) * 2023-08-17 2023-11-14 安徽派偌汇科技咨询有限公司 Quick development platform based on metadata model

Similar Documents

Publication Publication Date Title
CN109446344B (en) Intelligent analysis report automatic generation system based on big data
CN111708773A (en) Multi-source scientific and creative resource data fusion method
CN111967761B (en) Knowledge graph-based monitoring and early warning method and device and electronic equipment
US20080235220A1 (en) Methodologies and analytics tools for identifying white space opportunities in a given industry
CN108922633A (en) A kind of disease name standard convention method and canonical system
CN114003791B (en) Depth map matching-based automatic classification method and system for medical data elements
CN112100149B (en) Automatic log analysis system
CN111192176A (en) Online data acquisition method and device supporting education informatization assessment
CN115422155A (en) Modeling method of data lake metadata model
CN115132366A (en) Multi-source data processing method and system based on health and medical big data standard library
CN111522950A (en) Rapid identification system for unstructured massive text sensitive data
CN112363996B (en) Method, system and medium for establishing physical model of power grid knowledge graph
US20050210382A1 (en) System and method for performing functional analyses making use of a plurality of inputs
CN115794798B (en) Market supervision informatization standard management and dynamic maintenance system and method
CN112200212A (en) Artificial intelligence-based enterprise material classification catalogue construction method
Adderley et al. Semantic mining and analysis of heterogeneous data for novel intelligence insights
CN114880483A (en) Metadata knowledge graph construction method, storage medium and system
KR101718599B1 (en) System for analyzing social media data and method for analyzing social media data using the same
CN105868381A (en) Knowledge base retrieval system for agricultural information service
CN112364177B (en) Method, system and medium for establishing logic model of power grid knowledge graph
Qi et al. XML-based data mining design and implementation
KR102605931B1 (en) Method for processing structured data and unstructured data on a plurality of databases and data processing platform providing the method
Goraseb et al. Using conditional random field in named entity recognition for crime location identification
Solovyev et al. Problems of database integration in the context of digitalization
Soundararajan et al. Knowledge discovery tools and techniques

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination