CN113901060A - Method for creating an employee health database - Google Patents
- Publication number
- CN113901060A (application CN202111372679.3A)
- Authority
- CN
- China
- Prior art keywords
- data
- database
- entity
- health
- standardized
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
- G06F16/22: Indexing; Data structures therefor; Storage structures (information retrieval of structured data, e.g. relational data)
- G06F16/31: Indexing; Data structures therefor; Storage structures (information retrieval of unstructured textual data)
- G06Q40/08: Insurance
- G16H10/60: ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
- G16H15/00: ICT specially adapted for medical reports, e.g. generation or transmission thereof
- G16H50/20: ICT specially adapted for medical diagnosis, medical simulation or medical data mining, for computer-aided diagnosis, e.g. based on medical expert systems
- G16H50/30: ICT specially adapted for medical diagnosis, medical simulation or medical data mining, for calculating health indices; for individual health risk assessment
- G16H70/00: ICT specially adapted for the handling or processing of medical references
- G16H70/40: ICT specially adapted for the handling or processing of medical references relating to drugs, e.g. their side effects or intended usage
Abstract
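A method for creating an employee health database, relating to electronic information technology and in particular to a method for creating a database from multiple systems. The method first divides the database into important data and ordinary data. Second, it describes the data entities into which the database is divided and classifies each entity's data by its nature as structured or unstructured. Finally, it establishes a one-to-one relationship between data entities and original data, that is, each original document corresponds to one and only one data entity, and then enters the data into the system to achieve standardized data construction. In an employee health database built by this method, the structure of the physical examination data is standardized and the resulting unstructured descriptive text is standardized as well, which makes statistical analysis of the data possible, extends service capability, and provides an important basis for multi-center data applications.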
Description
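Technical Field
The invention relates to electronic information technology, and in particular to a method for creating a database from multi-system data sources and unstructured data.
Background
A database is a data management technology. It stores data efficiently and in an organized way, so that people can manage data more quickly and conveniently. A database stores large volumes of data in a structured form, which lets users retrieve and access the data effectively. It can also sort and persist data and provide fast queries. In addition, data stored in a database can be kept valid and intact while meeting applications' requirements for sharing and security. With a database, useful new information can be derived from a mass of data.
The employee health management cloud platform is a data platform for managing employee health. The platform standardizes employees' physical examination data, comprehensively structures and analyzes that data, and builds a standard physical examination database, yielding a system that provides basic digital and intelligent functions for managing employee health and social security funds.
The platform needs to build a standardized physical examination database according to the structure types of hospital physical examination big data, so as to serve employees better through health monitoring, health assessment, and health intervention. Existing employee health information comes mainly from data stored in physical examination systems, social security systems, and hospital information systems (HIS). This data is voluminous and heterogeneous, with diverse data types and different storage locations. The structure of the physical examination data needs to be standardized, and the resulting unstructured descriptive text needs to be standardized as well.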
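Summary of the Invention
To address the difficulty of standardizing the structure of employee health databases, a method is proposed for building an employee health database, that is, a standardized physical examination database.
A method for creating an employee health database, characterized in that:
First, the database is divided into important data and ordinary data, wherein:
important data includes user data, personal health data, and personal physical examination data;
ordinary data includes health knowledge base data, expert model data, and risk assessment information;
the sources of the above data, that is, their storage locations, are then analyzed and defined;
Second, the data entities into which the database is divided are described, and each entity's data is classified by its nature as structured data or unstructured data;
Finally, a one-to-one relationship is established between data entities and original data, that is, each original document corresponds to one and only one data entity; the data is then entered into the system, achieving standardized data construction.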
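The first step, splitting the database into important and ordinary data, can be sketched in Python. This is a minimal illustration under stated assumptions, not the patent's implementation: the six category names come from the text, while `Tier`, `CATEGORY_TIER`, and `tier_of` are hypothetical names introduced here.

```python
from enum import Enum


class Tier(Enum):
    """The two tiers the method divides the database into."""
    IMPORTANT = "important"
    ORDINARY = "ordinary"


# Category names are taken from the text; the lookup table itself is an
# assumption about how the split might be recorded.
CATEGORY_TIER = {
    "user data": Tier.IMPORTANT,
    "personal health data": Tier.IMPORTANT,
    "personal physical examination data": Tier.IMPORTANT,
    "health knowledge base data": Tier.ORDINARY,
    "expert model data": Tier.ORDINARY,
    "risk assessment information": Tier.ORDINARY,
}


def tier_of(category: str) -> Tier:
    """Return the tier of a data category, rejecting unclassified names."""
    try:
        return CATEGORY_TIER[category]
    except KeyError:
        raise ValueError(f"unclassified data category: {category!r}") from None
```

Rejecting unknown categories, rather than defaulting them to ordinary data, is a design choice of this sketch so that new data sources are classified explicitly.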
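In building the standard physical examination database, data structure standardization means that the data has a clear hierarchical structure and that each data element has a uniform identifier, name, definition, data type, and representation format. Data content standardization means that each data element has clearly defined allowed values and a bounded, uniform set of value-domain codes. Taking imaging examinations as an example: in terms of data structure, a "subject + three-level classification" structure should be established, with a standard code and name at every level; in terms of data content, the allowed values of a data element are divided into two levels, where the first level classifies what the ultrasound shows and the second level describes the different body sites under each first-level class.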
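A standardized data element of this kind might be modeled as follows. This is a hedged sketch: the field list (identifier, name, definition, data type, format), the "subject + three-level classification" path, and the two-level allowed values follow the text, but the class name `DataElement`, the method `is_allowed`, and every code and value in the example instance are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, Tuple


@dataclass
class DataElement:
    identifier: str
    name: str
    definition: str
    data_type: str
    display_format: str
    # (code, name) pairs: the subject followed by three classification levels.
    classification: Tuple[Tuple[str, str], ...]
    # Level 1: category of finding. Level 2: body sites allowed under it.
    allowed_values: Dict[str, FrozenSet[str]]

    def is_allowed(self, finding: str, site: str) -> bool:
        """True only if the finding is a known level-1 value and the site
        is one of the level-2 values listed under it."""
        return site in self.allowed_values.get(finding, frozenset())


# Hypothetical example; none of these codes or values come from the patent.
ULTRASOUND_FINDING = DataElement(
    identifier="DE-0001",
    name="ultrasound finding",
    definition="Standardized ultrasound finding and the body site it concerns",
    data_type="code",
    display_format="finding/site",
    classification=(
        ("S01", "imaging examination"),
        ("S01.1", "ultrasound"),
        ("S01.1.1", "abdominal ultrasound"),
        ("S01.1.1.1", "hepatobiliary ultrasound"),
    ),
    allowed_values={
        "no abnormality": frozenset({"liver", "gallbladder"}),
        "cyst": frozenset({"liver"}),
    },
)
```

With allowed values restricted this way, free-text report phrases can be checked against a closed vocabulary before they are stored, for example `ULTRASOUND_FINDING.is_allowed("cyst", "liver")`.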
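In an employee health database built by the method of the invention, the structure of the physical examination data is standardized and the resulting unstructured descriptive text is standardized as well, which makes statistical analysis of the data possible, extends service capability, and provides an important basis for multi-center data applications.
Once the standardized database is built, it can provide information support for the full scope and full workflow of employee health management, substantially improving the capability and efficiency of health management services and helping to deliver higher-value health management services to employees.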
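Detailed Description
Embodiment 1: in the method for creating an employee health database, the database is first divided into important data and ordinary data, wherein:
important data includes user data, personal health data, and personal physical examination data;
ordinary data includes health knowledge base data, expert model data, and risk assessment information;
the sources of the above data, that is, their storage locations, are then analyzed and defined, as shown in Table 1 below:
Table 1:
Second, the data entities into which the database is divided are described, and each entity's data is classified by its nature as structured data or unstructured data;
Structured data is information expressed as numeric values or standardized categories, such as the basic information of the person examined and laboratory test results. Unstructured data is information expressed in descriptive language, such as the results of imaging examinations like ultrasound and X-ray. The details are listed in Table 2 below;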
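Table 2
Data entity | Data description |
---|---|
User table | Structured |
Organizational structure table | Structured |
User table | Structured |
Role table | Structured |
User-role association table | Structured |
Permission table | Structured |
Permission-role relationship table | Structured |
Function table | Structured |
Permission-function relationship table | Structured |
User object table | Structured |
Menu table | Structured |
Data dictionary table | Structured |
Basic physical examination information table | Structured |
Physical examination report | Unstructured |
Social security fund table | Structured |
Knowledge base table | Structured |
Drug data table | Structured |
Outpatient information table | Structured |
Expert model table | Structured |
Health management table | Structured |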
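The classification in Table 2 can be recorded as a simple lookup so that unstructured entities are routed to text standardization. The sketch below is illustrative only: the entity names are English renderings of the rows in Table 2 (the repeated user table row appears once), and `ENTITY_KIND` and `unstructured_entities` are hypothetical names.

```python
STRUCTURED = (
    "user table",
    "organizational structure table",
    "role table",
    "user-role association table",
    "permission table",
    "permission-role relationship table",
    "function table",
    "permission-function relationship table",
    "user object table",
    "menu table",
    "data dictionary table",
    "basic physical examination information table",
    "social security fund table",
    "knowledge base table",
    "drug data table",
    "outpatient information table",
    "expert model table",
    "health management table",
)
UNSTRUCTURED = ("physical examination report",)

# Map every entity to its data description from Table 2.
ENTITY_KIND = {name: "structured" for name in STRUCTURED}
ENTITY_KIND.update({name: "unstructured" for name in UNSTRUCTURED})


def unstructured_entities():
    """Entities whose descriptive text still needs to be standardized."""
    return [name for name, kind in ENTITY_KIND.items() if kind == "unstructured"]
```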
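Finally, a one-to-one relationship is established between data entities and original data, that is, each original document corresponds to one and only one data entity; the data is then entered into the system, achieving standardized data construction.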
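One way to enforce the one-to-one rule at entry time is to keep a lookup in both directions and reject any registration that would break it. This is a minimal sketch, assuming that original documents and data entities each carry a unique identifier; `OneToOneRegistry` and its methods are hypothetical and do not appear in the patent.

```python
class OneToOneRegistry:
    """Tracks which original document produced which data entity."""

    def __init__(self) -> None:
        self._entity_by_document = {}
        self._document_by_entity = {}

    def register(self, document_id: str, entity_id: str) -> None:
        """Link a document to an entity, refusing any second link on either side."""
        if document_id in self._entity_by_document:
            raise ValueError(f"document {document_id!r} already has an entity")
        if entity_id in self._document_by_entity:
            raise ValueError(f"entity {entity_id!r} already has a source document")
        self._entity_by_document[document_id] = entity_id
        self._document_by_entity[entity_id] = document_id

    def entity_for(self, document_id: str) -> str:
        """Return the entity linked to a document (KeyError if none)."""
        return self._entity_by_document[document_id]
```

In a relational database the same constraint would usually be expressed with unique keys on both columns; the in-memory version here only shows the logic.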
Claims (1)
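1. A method for creating an employee health database, characterized in that:
First, the database is divided into important data and ordinary data, wherein:
important data includes user data, personal health data, and personal physical examination data;
ordinary data includes health knowledge base data, expert model data, and risk assessment information;
the sources of the above data, that is, their storage locations, are then analyzed and defined;
Second, the data entities into which the database is divided are described, and each entity's data is classified by its nature as structured data or unstructured data;
Finally, a one-to-one relationship is established between data entities and original data, that is, each original document corresponds to one and only one data entity; the data is then entered into the system, achieving standardized data construction.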
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111372679.3A CN113901060A (zh) | 2021-11-18 | 2021-11-18 | Method for creating an employee health database |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111372679.3A CN113901060A (zh) | 2021-11-18 | 2021-11-18 | Method for creating an employee health database |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113901060A (zh) | 2022-01-07 |
Family
ID=79194793
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111372679.3A Pending CN113901060A (zh) | Method for creating an employee health database | 2021-11-18 | 2021-11-18 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113901060A (zh) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||