CN105912883A - Structural data extraction method for ICD pacemaker - Google Patents

Publication number
CN105912883A
Authority
CN
China
Prior art keywords
data
extraction method
report file
structural data
pacemaker
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610494115.XA
Other languages
Chinese (zh)
Inventor
陈样新
毛涌泉
罗超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Haoxuan Software Technology Co Ltd
Original Assignee
Guangzhou Haoxuan Software Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Haoxuan Software Technology Co Ltd filed Critical Guangzhou Haoxuan Software Technology Co Ltd
Priority to CN201610494115.XA priority Critical patent/CN105912883A/en
Publication of CN105912883A publication Critical patent/CN105912883A/en
Pending legal-status Critical Current

Classifications

    • G06F19/324
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/93 Document management systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21 Design, administration or maintenance of databases
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/103 Formatting, i.e. changing of presentation of documents

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Electrotherapy Devices (AREA)

Abstract

The invention discloses a structural data extraction method for an ICD pacemaker. The method comprises the following steps: 1) exporting a report file in PDF format from the ICD pacemaker; 2) setting up a CRT parameter extraction rule base according to the report file; 3) reading the file content; 4) scanning the report file and extracting the text content; 5) calling the CRT parameter extraction rule base and programmatically extracting the data that satisfy the extraction rules; 6) writing the data into a database. The method saves labor.

Description

A structural data extraction method for an antitachycardia pacemaker
Technical field
The present invention relates to a structural data extraction method for an antitachycardia pacemaker.
Background
Congestive heart failure is a serious condition with a relatively high mortality rate, and thousands of patients die of heart failure every year. Over the past decade or so, implanting a CRT (cardiac resynchronization therapy) device in the patient has become the most significant non-drug therapy for these patients.
After a CRT device is implanted, the patient must return to the hospital periodically for follow-up, so that a cardiovascular specialist can evaluate the recent operation of the CRT device and, using the device's diagnostic functions, optimize the device parameters and adjust the treatment plan. At a follow-up visit, the specialist's workflow is as follows: using the programmer supplied by the CRT manufacturer, the physician reads the programmed data from the CRT device implanted in the patient, exports it as a follow-up report in PDF format, manually reviews the parameters and indicators in the report, and makes a medical judgment. The exported report is archived so that it can be compared at the patient's next follow-up.
At present, CRT manufacturers at home and abroad allow programmed data to be exported only as PDF files, and do not allow export to file formats such as Excel, CSV, or XML. Because a PDF file is a typical unstructured data format, domestic clinical researchers conducting big-data research in the cardiovascular field can extract the parameters and indicators from these PDF files only by manual transcription. Because there are many patients and the programmed parameters are complex, manual transcription is labor-intensive.
Summary of the invention
The technical problem to be solved by the present invention is to provide a structural data extraction method for an antitachycardia pacemaker that saves labor.
To solve the above problem, the present invention adopts the following technical solution:
A structural data extraction method for an antitachycardia pacemaker, comprising the following steps:
1) exporting a report file in PDF format from the antitachycardia pacemaker;
2) setting up a CRT parameter extraction rule base according to the report file;
3) reading the file content;
4) scanning the report file and extracting the text content;
5) calling the CRT parameter extraction rule base and programmatically extracting the data that satisfy the extraction rules;
6) writing the data into a database.
Preferably, each rule in the CRT parameter extraction rule base corresponds to one indicator or parameter in the report file.
Preferably, step 3) is implemented as follows:
3.1) a Java program opens the report file;
3.2) the program reads the report file.
Preferably, step 4) is implemented as follows:
4.1) the content of the report file is scanned line by line;
4.2) the content of each line is extracted.
Preferably, step 5) is implemented as follows:
5.1) the Java program calls the CRT parameter extraction rule base;
5.2) the extraction rule for each line of content is looked up in the CRT parameter extraction rule base;
5.3) the data in each fully matching line of the report file are extracted according to the extraction rule.
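Steps 4) and 5) can be sketched in Java as follows. This is a minimal illustration under stated assumptions, not the implementation described by the patent: it assumes the PDF text has already been extracted into a string (for example by a third-party PDF library), and it models the rule base as a map from a parameter name to a regular expression with one capture group. The class name `ReportScanner`, the parameter names, and the report labels `Lower Rate` and `LV Threshold` are hypothetical.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical sketch: line-by-line scanning against a rule base. */
final class ReportScanner {

    /**
     * Scans the report text line by line and returns
     * parameter name -> extracted value for every line that fully matches a rule.
     */
    static Map<String, String> scan(String reportText, Map<String, Pattern> ruleBase) {
        Map<String, String> extracted = new LinkedHashMap<>();
        // Step 4: split on any line break and handle one line at a time.
        for (String rawLine : reportText.split("\\R")) {
            String line = rawLine.trim();
            // Step 5.2: look for a rule that applies to this line.
            for (Map.Entry<String, Pattern> rule : ruleBase.entrySet()) {
                Matcher m = rule.getValue().matcher(line);
                // Step 5.3: only a whole-line match yields a value.
                if (m.matches()) {
                    extracted.put(rule.getKey(), m.group(1));
                }
            }
        }
        return extracted;
    }

    public static void main(String[] args) {
        // Assumed rule base; real labels depend on the manufacturer's report layout.
        Map<String, Pattern> rules = new LinkedHashMap<>();
        rules.put("lower_rate_bpm", Pattern.compile("Lower Rate\\s+(\\d+)\\s*bpm"));
        rules.put("lv_threshold_v", Pattern.compile("LV Threshold\\s+(\\d+(?:\\.\\d+)?)\\s*V"));

        String text = "Patient Follow-up Report\nLower Rate 60 bpm\n  LV Threshold 1.25 V\n";
        System.out.println(scan(text, rules));
    }
}
```

Requiring a whole-line match mirrors the "fully matching line" wording of step 5.3); a looser design could use a partial match instead, at the cost of more false positives on free-text lines.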
Preferably, step 6) is implemented as follows:
6.1) the program collects the extracted data;
6.2) the collected data are written into the database.
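The collect-and-write step 6) might look like the following JDBC sketch. It is an assumption-laden illustration rather than the patent's code: the table name `crt_followup`, the column names, and the class `FollowUpWriter` are hypothetical, and a real deployment would need a JDBC driver and an existing table. Values are bound through a `PreparedStatement` so that report text never ends up inside the SQL string.

```java
import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.SQLException;
import java.util.Map;
import java.util.StringJoiner;
import java.util.regex.Pattern;

/** Hypothetical sketch: writing one collected record to an SQL table. */
final class FollowUpWriter {
    // Table and column names are concatenated into SQL, so restrict them to plain identifiers.
    private static final Pattern IDENTIFIER = Pattern.compile("[A-Za-z_][A-Za-z0-9_]*");

    private static void requireIdentifier(String name) {
        if (!IDENTIFIER.matcher(name).matches()) {
            throw new IllegalArgumentException("not a plain SQL identifier: " + name);
        }
    }

    /** Builds a parameterized statement such as INSERT INTO t (a, b) VALUES (?, ?). */
    static String buildInsertSql(String table, Iterable<String> columns) {
        requireIdentifier(table);
        StringJoiner names = new StringJoiner(", ");
        StringJoiner marks = new StringJoiner(", ");
        for (String column : columns) {
            requireIdentifier(column);
            names.add(column);
            marks.add("?");
        }
        if (names.length() == 0) {
            throw new IllegalArgumentException("no columns");
        }
        return "INSERT INTO " + table + " (" + names + ") VALUES (" + marks + ")";
    }

    /** Step 6.2: binds the collected values in column order and executes the insert. */
    static void write(Connection conn, String table, Map<String, String> record) throws SQLException {
        String sql = buildInsertSql(table, record.keySet());
        try (PreparedStatement ps = conn.prepareStatement(sql)) {
            int index = 1;
            for (String value : record.values()) {
                ps.setString(index++, value);
            }
            ps.executeUpdate();
        }
    }
}
```

Storing every value as text keeps the sketch short; a production schema would more likely use typed numeric columns with units recorded separately.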
Preferably, the database is an SQL database, which is powerful, easy to learn, and easy to use.
Preferably, the programming is Java programming, which offers simplicity, object orientation, distributed computing support, robustness, security, platform independence and portability, multithreading, and dynamism, and is powerful and easy to use.
The beneficial effects of the invention are as follows. Computer programming is used to read the text content of the CRT programming report file in PDF format; the text content is extracted and saved to a database, and can be exported in data formats such as Excel and CSV for statistical analysis. Data extraction is efficient and accurate, which solves the problem that quality is difficult to control under manual transcription. Labor is saved and the work of medical staff is made easier, so that the cardiovascular specialist can evaluate the recent operation of the CRT device and compare reports at the patient's next follow-up.
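The export to CSV mentioned above can be illustrated with a short Java sketch. The source does not describe how the export is done, so everything here is an assumption: the class name `CsvExport`, the column names, and the choice of RFC 4180 style quoting (fields containing commas, quotes, or line breaks are wrapped in double quotes, and embedded quotes are doubled).

```java
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

/** Hypothetical sketch: exporting collected records as CSV text. */
final class CsvExport {

    /** Quotes a field only when it contains a comma, a double quote, or a line break. */
    static String escape(String field) {
        if (field.contains(",") || field.contains("\"") || field.contains("\n") || field.contains("\r")) {
            return "\"" + field.replace("\"", "\"\"") + "\"";
        }
        return field;
    }

    /** Writes a header row followed by one row per record; missing values become empty fields. */
    static String toCsv(List<String> columns, List<Map<String, String>> rows) {
        StringBuilder sb = new StringBuilder();
        sb.append(columns.stream().map(CsvExport::escape).collect(Collectors.joining(",")))
          .append("\r\n");
        for (Map<String, String> row : rows) {
            sb.append(columns.stream()
                    .map(column -> escape(row.getOrDefault(column, "")))
                    .collect(Collectors.joining(",")))
              .append("\r\n");
        }
        return sb.toString();
    }
}
```

Passing the column list explicitly fixes the column order, so every exported file has the same layout even when a report is missing some parameters.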
Brief description of the drawings
To illustrate the technical solutions in the embodiments of the present invention more clearly, the accompanying drawing used in the description of the embodiments is briefly introduced below. The drawing described below shows only some embodiments of the present invention; those of ordinary skill in the art can obtain other drawings from it without creative effort.
Fig. 1 is the data extraction flow chart of step 5) in Embodiment 2.
Detailed description of the embodiments
Embodiment 1
A structural data extraction method for an antitachycardia pacemaker, comprising the following steps:
1) exporting a report file in PDF format from the antitachycardia pacemaker;
2) setting up a CRT parameter extraction rule base according to the report file;
3) reading the file content;
4) scanning the report file and extracting the text content;
5) calling the CRT parameter extraction rule base and programmatically extracting the data that satisfy the extraction rules, as shown in Fig. 1;
6) writing the data into a database.
The beneficial effects of this embodiment are as follows. Computer programming is used to read the text content of the CRT programming report file in PDF format; the text content is extracted and saved to a database, and can be exported in data formats such as Excel and CSV for statistical analysis. Data extraction is efficient and accurate, which solves the problem that quality is difficult to control under manual transcription. Labor is saved and the work of medical staff is made easier, so that the cardiovascular specialist can evaluate the recent operation of the CRT device and compare reports at the patient's next follow-up.
Embodiment 2
A structural data extraction method for an antitachycardia pacemaker, comprising the following steps:
1) exporting a report file in PDF format from the antitachycardia pacemaker;
2) setting up a CRT parameter extraction rule base according to the report file;
3) opening the report file with a Java program, and reading the report file with the Java program;
4) scanning the content of the report file line by line, and extracting the content of each line;
5) calling the CRT parameter extraction rule base from the Java program, and looking up the extraction rule for each line of content in the rule base; extracting the data in each fully matching line of the report file according to the extraction rule, as shown in Fig. 1;
6) collecting the extracted data with the Java program, and writing the collected data into an SQL database.
Each rule in the CRT parameter extraction rule base corresponds to one indicator or parameter in the report file. For example, once the rule "body weight=${body weight}(kg)" has been set, when the input text is "Wang Qiang; body weight=89(kg)", the Java program extracts the number "89" according to the rule and returns the result "body weight=89".
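The body-weight rule above can be sketched in Java by compiling the rule template into a regular expression. The source gives only the rule text, the input, and the result; how the rule is interpreted is not specified, so this sketch makes its own assumptions: a rule contains exactly one `${...}` placeholder, the placeholder stands for a decimal number, and the text around it must appear literally. The class name `RuleTemplate` is hypothetical.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical sketch: one extraction rule written as a template with a ${name} placeholder. */
final class RuleTemplate {
    // Finds a placeholder such as ${body weight} and captures the name inside the braces.
    private static final Pattern PLACEHOLDER = Pattern.compile("\\$\\{([^}]+)\\}");

    private final String name;     // parameter name taken from the placeholder
    private final Pattern pattern; // regular expression compiled from the template

    RuleTemplate(String template) {
        Matcher m = PLACEHOLDER.matcher(template);
        if (!m.find()) {
            throw new IllegalArgumentException("template has no ${...} placeholder: " + template);
        }
        this.name = m.group(1);
        // Treat the text on both sides of the placeholder as literal characters.
        String prefix = Pattern.quote(template.substring(0, m.start()));
        String suffix = Pattern.quote(template.substring(m.end()));
        // Assumption: the placeholder value is an integer or decimal number.
        this.pattern = Pattern.compile(prefix + "\\s*(\\d+(?:\\.\\d+)?)\\s*" + suffix);
    }

    /** Returns "name=value" when the line contains text matching the rule, otherwise empty. */
    Optional<String> extract(String line) {
        Matcher m = pattern.matcher(line);
        return m.find() ? Optional.of(name + "=" + m.group(1)) : Optional.empty();
    }

    public static void main(String[] args) {
        RuleTemplate rule = new RuleTemplate("body weight=${body weight}(kg)");
        // Prints: body weight=89
        System.out.println(rule.extract("Wang Qiang; body weight=89(kg)").orElse("no match"));
    }
}
```

Quoting the literal parts with `Pattern.quote` means characters such as the parentheses in "(kg)" are matched as written rather than being read as regular-expression syntax.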
The beneficial effects of this embodiment are as follows. Java programming is used to read the text content of the CRT programming report file in PDF format; the text content is extracted and saved to an SQL database, and can be exported in data formats such as Excel and CSV for statistical analysis. Data extraction is efficient and accurate, which solves the problem that quality is difficult to control under manual transcription. Labor is saved and the work of medical staff is made easier, so that the cardiovascular specialist can evaluate the recent operation of the CRT device and compare reports at the patient's next follow-up.
The above is only a specific embodiment of the present invention, but the protection scope of the present invention is not limited thereto; any change or replacement conceived without creative effort shall fall within the protection scope of the present invention.

Claims (8)

1. A structural data extraction method for an antitachycardia pacemaker, characterized in that it comprises the following steps:
1) exporting a report file in PDF format from the antitachycardia pacemaker;
2) setting up a CRT parameter extraction rule base according to the report file;
3) reading the file content;
4) scanning the report file and extracting the text content;
5) calling the CRT parameter extraction rule base and programmatically extracting the data that satisfy the extraction rules;
6) writing the data into a database.
2. The structural data extraction method for an antitachycardia pacemaker according to claim 1, characterized in that each rule in the CRT parameter extraction rule base corresponds to one indicator or parameter in the report file.
3. The structural data extraction method for an antitachycardia pacemaker according to claim 2, characterized in that step 3) is implemented as follows:
3.1) the report file is opened programmatically;
3.2) the report file is read.
4. The structural data extraction method for an antitachycardia pacemaker according to claim 3, characterized in that step 4) is implemented as follows:
4.1) the content of the report file is scanned line by line;
4.2) the content of each line is extracted.
5. The structural data extraction method for an antitachycardia pacemaker according to claim 4, characterized in that step 5) is implemented as follows:
5.1) a Java program calls the CRT parameter extraction rule base;
5.2) the extraction rule for each line of content is looked up in the CRT parameter extraction rule base;
5.3) the data in each fully matching line of the report file are extracted according to the extraction rule.
6. The structural data extraction method for an antitachycardia pacemaker according to claim 5, characterized in that step 6) is implemented as follows:
6.1) the program collects the extracted data;
6.2) the collected data are written into the database.
7. The structural data extraction method for an antitachycardia pacemaker according to claim 6, characterized in that the database is an SQL database.
8. The structural data extraction method for an antitachycardia pacemaker according to claim 7, characterized in that the programming is Java programming.
CN201610494115.XA 2016-06-30 2016-06-30 Structural data extraction method for ICD pacemaker Pending CN105912883A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610494115.XA CN105912883A (en) 2016-06-30 2016-06-30 Structural data extraction method for ICD pacemaker

Publications (1)

Publication Number Publication Date
CN105912883A true CN105912883A (en) 2016-08-31

Family

ID=56758924

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610494115.XA Pending CN105912883A (en) 2016-06-30 2016-06-30 Structural data extraction method for ICD pacemaker

Country Status (1)

Country Link
CN (1) CN105912883A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101794278A (en) * 2009-09-21 2010-08-04 广东省标准化研究院 Method and software for digitalizing full text of standard document
US7996761B2 (en) * 2004-02-02 2011-08-09 Mantaro Yajima Table format data processing method and table format data processing
CN102819552A (en) * 2012-06-26 2012-12-12 深圳市百能信息技术有限公司 Method and system for automatically examining and verifying Printed Circuit Board (PCB) project files
CN103559415A (en) * 2013-11-18 2014-02-05 深圳市开立科技有限公司 Patient report generating method and device as well as ultrasonic equipment
CN103823838A (en) * 2013-12-18 2014-05-28 江苏省电力公司常州供电公司 Method for inputting and comparing multi-format documents

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111913910A (en) * 2020-06-23 2020-11-10 复旦大学附属中山医院厦门医院 Follow-up file data extraction method and system
CN111913910B (en) * 2020-06-23 2022-10-11 复旦大学附属中山医院厦门医院 Follow-up file data extraction method and system

Similar Documents

Publication Publication Date Title
JP2022523741A (en) ECG processing system for depiction and classification
CN103258306B (en) One kind is transplantable to custom-configure system and implementation method
CN107066814A (en) A kind of traditional Chinese medical science intelligent auxiliary diagnosis system cooperateed with based on the four methods of diagnosis
CN105184074B (en) A kind of medical data extraction and loaded in parallel method based on Multimodal medical image data model
CN109686444A (en) System and method for medical image classification
CN108389606A (en) A kind of the data quality control system and its control method of electronic medical record homepage
CN105023073A (en) Hospital intelligence assessment triage system based on artificial neural network
CN201788510U (en) Dynamic EMR collaborative mining system with particle swarm and extension rough set/concept lattice theories integrated together
CN109727680A (en) A kind of region clinical path management system based on big data technology
CN105701640A (en) Management control system based on external fertilization-embryo transplantation
CN107729450A (en) A kind of intelligent region portable medical integrated data centring system prototype based on metadata
CN109805924A (en) ECG's data compression method and cardiac arrhythmia detection system based on CNN
Fang et al. Electrocardiogram signal classification in the diagnosis of heart disease based on RBF neural network
Lei et al. Hybrid decision support to monitor atrial fibrillation for stroke prevention
CN105912883A (en) Structural data extraction method for ICD pacemaker
CN107563113A (en) Blood used in clinic application closed loop management system and method
CN111554405A (en) Intelligent data extraction and quality evaluation method for evidence-based medicine RCT
CN107863159A (en) A kind of community medical service management system and method
CN104112061A (en) Icu intensive care information system technology
CN107292094A (en) A kind of medical treatment and nursing method
TWI554969B (en) Scalable system and method for medical information collection
CN110837859A (en) Tumor fine classification system and method fusing multi-dimensional medical data
CN116130119A (en) Breast cancer postoperative rehabilitation auxiliary management system
CN109887597A (en) Data base management system based on medical big data
CN106202229A (en) A kind of structural data extraction method for cardiac pacemaker

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20160831