CN115017326B - Medical coding method and device - Google Patents

Publication number
CN115017326B
Authority
CN
China
Prior art keywords
name
information
term
medical dictionary
variable
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210519541.XA
Other languages
Chinese (zh)
Other versions
CN115017326A (en
Inventor
邱毅
张倩倩
何林慧
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qingdao Purisheng Pharmaceutical Technology Co ltd
Original Assignee
Qingdao Purisheng Pharmaceutical Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qingdao Purisheng Pharmaceutical Technology Co ltd
Priority to CN202210519541.XA
Publication of CN115017326A
Application granted
Publication of CN115017326B
Legal status: Active
Anticipated expiration

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/374Thesaurus
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data
    • G16H10/60ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Public Health (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Primary Health Care (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Epidemiology (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

The present disclosure relates to a medical coding method and device. The method comprises the following steps: acquiring a variable name to be encoded; comparing the variable name to be encoded with first term name information of a medical dictionary to obtain a first term name corresponding to the variable name to be encoded, and obtaining a first coding result of the variable name to be encoded according to the first term name; and, when no first coding result of the variable name to be encoded is obtained from the medical dictionary, comparing the variable name to be encoded with extended term information of a synonym library to obtain a second term name corresponding to the variable name to be encoded, and obtaining a second coding result of the variable name to be encoded according to the second term name. Because the method and device use the medical dictionary and the synonym library for automatic medical coding, manual involvement is reduced and medical coding efficiency is improved.

Description

Medical coding method and device
Technical Field
The present disclosure relates generally to the field of medical information technology. More particularly, the present disclosure relates to medical encoding methods and apparatus.
Background
Medical coding is a key step in clinical trial data management. The data to be coded usually comes from free-text entries made when a Case Report Form (CRF) is filled in, including the patient's medical history, disease diagnoses, adverse events, concomitant medications, and so on. Because investigators differ in region, language, nationality, culture, and professional background, the same term is often described in different ways. The purpose of coding is to standardize these terms so that the analysis of drug safety and efficacy data rests on correct and consistent information.
Traditional medical coding is generally performed offline. First, a medical coder obtains the raw data set from the electronic data capture system (EDC system) and hands it to a technician. The technician then uses SAS macros or other tools to code the data automatically against an official medical dictionary (such as WHODrug and/or MedDRA), generates result files (an automatically matched file and an unmatched file), and returns them to the medical coder. The medical coder manually codes the data that could not be coded automatically against the official medical dictionary and passes the data back to the technician, who merges it and forwards it to a medical coding reviewer. If the review passes, the coding work is complete; if the review finds errors, the files go back to the medical coder for correction, after which the technician again merges the data and forwards it for review, until all result files pass. Whenever the official medical dictionary version is upgraded or the EDC data set is updated, the coding work enters this loop again. The process is cumbersome and prone to inconsistent coding results, and manual coding results cannot be reused: when coding starts for a new trial, terms that were already coded manually in other trials but cannot be matched automatically against the official medical dictionary must be coded manually again by medical coders. This considerably increases time and labor costs and reduces medical coding efficiency.
Disclosure of Invention
In order to at least partially solve the technical problems mentioned in the background art, the scheme of the present disclosure provides a medical encoding method and apparatus.
According to a first aspect of the present disclosure, the present disclosure provides a medical encoding method, wherein the method comprises: acquiring the name of a variable to be coded; comparing the variable name to be encoded with first term name information of a medical dictionary to obtain a first term name corresponding to the variable name to be encoded, and obtaining a first encoding result of the variable name to be encoded according to the first term name; the first coding result is a piece of medical dictionary information corresponding to the variable name to be coded in the medical dictionary; under the condition that a first coding result of the variable name to be coded is not obtained in the medical dictionary, comparing the variable name to be coded with the expanded term information of the synonym library to obtain a second term name corresponding to the variable name to be coded, and obtaining a second coding result of the variable name to be coded according to the second term name; the second encoding result is a piece of medical dictionary information corresponding to the second term name in the medical dictionary.
Optionally, comparing the variable name to be encoded with first term name information of a medical dictionary to obtain a first term name corresponding to the variable name to be encoded, and obtaining a first encoding result of the variable name to be encoded according to the first term name includes: matching the variable name to be encoded with first term name information of a medical dictionary to obtain a first term name completely matched with the variable name to be encoded; if the first term name only corresponds to one piece of medical dictionary information, the medical dictionary information corresponding to the first term name is used as the first coding result of the variable name to be coded; and if two or more pieces of medical dictionary information correspond to the first term name, acquiring one piece of medical dictionary information as the first coding result of the variable name to be coded according to a preset matching rule.
Optionally, the synonym library corresponds to the medical dictionary and includes extended term information and second term name information, wherein the extended term information corresponds to the second term name information, and the second term name information corresponds to the first term name information in the medical dictionary.
Optionally, comparing the variable name to be encoded with the extended term information of the synonym library to obtain a second term name corresponding to the variable name to be encoded, and obtaining a second coding result of the variable name to be encoded according to the second term name includes: matching the variable name to be encoded with the extended term information of the synonym library to obtain an extended term name that completely matches the variable name to be encoded; and obtaining the second coding result of the variable name to be encoded in the medical dictionary according to the second term name corresponding to the extended term name.
Optionally, if the medical dictionary is upgraded to a new version of the medical dictionary, the method further comprises: updating the obtained coding result according to the new version of the medical dictionary.
Optionally, updating the obtained encoding results according to the new version of the medical dictionary comprises: searching in the new medical dictionary according to the variable names and/or the term names in the obtained coding results to obtain a third coding result corresponding to the variable names and/or the term names in the obtained coding results, and updating the obtained coding result.
Optionally, updating the obtained encoding results according to the new version of the medical dictionary comprises: comparing the variable names in the obtained coding results with third term name information of the new-version medical dictionary to obtain third term names corresponding to the variable names in the obtained coding results, taking one piece of medical dictionary information corresponding to the third term names as the third coding result, and updating the obtained coding result; and in the case that the third coding result cannot be obtained according to the variable names in the obtained coding results, obtaining a piece of medical dictionary information corresponding to the variable names in the obtained coding results in the new medical dictionary according to the term names corresponding to the variable names as the third coding result, and updating the obtained coding result.
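The two-stage update described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the implementation of the disclosure: the record keys `variable_name`, `term_name`, and `info`, the `needs_review` flag, and the representation of the new-version dictionary as a mapping from term name to one piece of dictionary information are all hypothetical.

```python
def update_coding_results(results, new_dictionary):
    """Re-code stored results against a new dictionary version.

    results: list of dicts with the hypothetical keys
             'variable_name', 'term_name' and 'info'.
    new_dictionary: dict mapping a term name to one piece of
             dictionary information (assumed shape).
    """
    updated = []
    for record in results:
        # First attempt: the variable name itself is a term name
        # in the new dictionary version.
        term = record["variable_name"]
        info = new_dictionary.get(term)
        if info is None:
            # Fallback: use the term name stored with the old result.
            term = record["term_name"]
            info = new_dictionary.get(term)
        if info is None:
            # Neither lookup succeeded: keep the old record and flag it
            # (the flag is an assumption, not part of the source text).
            updated.append({**record, "needs_review": True})
        else:
            updated.append({**record, "term_name": term,
                            "info": info, "needs_review": False})
    return updated
```

Keeping the old record when both lookups fail is a design choice made for the sketch; the text itself only describes the two lookups.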
According to a second aspect of the present disclosure, the present disclosure provides a medical coding device, wherein the device comprises: an acquisition module configured to acquire a variable name to be encoded; a dictionary coding module configured to compare the variable name to be encoded with first term name information of a medical dictionary so as to obtain a first term name corresponding to the variable name to be encoded, and to obtain a first coding result of the variable name to be encoded according to the first term name, the first coding result being a piece of medical dictionary information corresponding to the variable name to be encoded in the medical dictionary; and a synonym library coding module configured to, when the first coding result of the variable name to be encoded is not obtained in the medical dictionary, compare the variable name to be encoded with the extended term information of the synonym library so as to obtain a second term name corresponding to the variable name to be encoded, and to obtain a second coding result of the variable name to be encoded according to the second term name, the second coding result being a piece of medical dictionary information corresponding to the second term name in the medical dictionary.
Optionally, the dictionary coding module is configured to compare the variable name to be coded with first term name information of a medical dictionary in the following manner, so as to obtain a first term name corresponding to the variable name to be coded, and obtain a first coding result of the variable name to be coded according to the first term name: matching the variable name to be encoded with first term name information of a medical dictionary to obtain a first term name completely matched with the variable name to be encoded; if the first term name only corresponds to one piece of medical dictionary information, the medical dictionary information corresponding to the first term name is used as the first coding result of the variable name to be coded; and if two or more pieces of medical dictionary information correspond to the first term name, acquiring one piece of medical dictionary information as the first coding result of the variable name to be coded according to a preset matching rule.
Optionally, the synonym library corresponds to the medical dictionary, and the synonym library includes extended term information and second term name information, wherein the extended term information corresponds to the second term name information, and the second term name information corresponds to the first term name information in the medical dictionary.
Optionally, the synonym library coding module is configured to compare the variable name to be encoded with the extended term information of the synonym library in the following manner, so as to obtain a second term name corresponding to the variable name to be encoded, and obtain a second coding result of the variable name to be encoded according to the second term name: matching the variable name to be encoded with the extended term information of the synonym library to obtain an extended term name that completely matches the variable name to be encoded; and obtaining the second coding result of the variable name to be encoded in the medical dictionary according to the second term name corresponding to the extended term name.
Optionally, the apparatus further comprises an updating module configured to update the obtained encoding results according to the new version of the medical dictionary in case the medical dictionary is upgraded to the new version of the medical dictionary.
Optionally, the updating module is configured to update the obtained encoding result according to the new medical dictionary by: searching in the new medical dictionary according to the variable names and/or the term names in the obtained coding results to obtain a third coding result corresponding to the variable names and/or the term names in the obtained coding results, and updating the original coding result.
Optionally, the updating module is configured to update the obtained encoding result according to the new medical dictionary by: comparing the variable names in the obtained coding results with third term name information of the new-version medical dictionary to obtain third term names corresponding to the variable names in the obtained coding results, taking one piece of medical dictionary information corresponding to the third term names as the third coding result, and updating the obtained coding result; and in the case that the third coding result cannot be obtained according to the variable names in the obtained coding results, obtaining a piece of medical dictionary information corresponding to the variable names in the obtained coding results in the new medical dictionary according to the term names corresponding to the variable names as the third coding result, and updating the obtained coding result.
According to a third aspect of the present disclosure, there is provided an electronic device, wherein the electronic device comprises a memory and a processor, the memory having stored therein a computer program, the processor, when executing the computer program, implementing the method of the first aspect of the present disclosure as described above.
According to a fourth aspect of the present disclosure, there is provided a computer readable storage medium, wherein the storage medium stores a computer program which, when executed, implements the method of the first aspect of the present disclosure described above.
Because the medical coding method and device use the medical dictionary and the synonym library for automatic medical coding, manual involvement is reduced and medical coding efficiency is improved. More importantly, coding through the synonym library increases the reuse rate of coding results. In addition, the method and device automatically update the medical coding results after the medical dictionary is upgraded, which avoids repeated manual work and saves labor and time costs.
Drawings
The above, as well as additional purposes, features, and advantages of exemplary embodiments of the present disclosure will become readily apparent from the following detailed description when read in conjunction with the accompanying drawings. Several embodiments of the present disclosure are illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings and in which like reference numerals refer to similar or corresponding parts and in which:
FIG. 1 is a flow chart illustrating a medical encoding method according to one embodiment of the present disclosure;
FIG. 2 is a schematic block diagram illustrating a medical encoding apparatus according to one embodiment of the present disclosure.
Detailed Description
The following description of the technical solutions in the embodiments of the present disclosure will be made clearly and completely with reference to the accompanying drawings in the embodiments of the present disclosure, and it is apparent that the described embodiments are some embodiments of the present disclosure, but not all embodiments. Based on the embodiments in this disclosure, all other embodiments that a person skilled in the art would obtain without making any inventive effort are within the scope of protection of this disclosure.
Specific embodiments of the present disclosure are described in detail below with reference to the accompanying drawings.
The present disclosure provides a medical encoding method. Referring to FIG. 1, FIG. 1 is a flowchart illustrating a medical encoding method according to one embodiment of the present disclosure. As shown in FIG. 1, the method includes the following steps S101-S103.
Step S101: acquiring the variable name to be encoded.
Step S102: comparing the variable name to be encoded with first term name information of a medical dictionary to obtain a first term name corresponding to the variable name to be encoded, and obtaining a first coding result of the variable name to be encoded according to the first term name; the first coding result is a piece of medical dictionary information corresponding to the variable name to be encoded in the medical dictionary.
Step S103: when the first coding result of the variable name to be encoded is not obtained in the medical dictionary, comparing the variable name to be encoded with the extended term information of the synonym library to obtain a second term name corresponding to the variable name to be encoded, and obtaining a second coding result of the variable name to be encoded according to the second term name; the second coding result is a piece of medical dictionary information corresponding to the second term name in the medical dictionary.
Because the medical coding method and device use the medical dictionary and the synonym library for automatic medical coding, manual involvement is reduced and medical coding efficiency is improved. More importantly, coding through the synonym library increases the reuse rate of coding results.
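Steps S101-S103 can be sketched in Python as a lookup with a fallback. This is only an illustration under assumptions: the function name `code_variable`, the in-memory dictionaries, and the stage labels are hypothetical, and taking the first record stands in for whatever preset matching rule an implementation applies when a term name has several records.

```python
def code_variable(name, dictionary, synonym_library):
    """Return (stage, term_name, info) for one variable name.

    dictionary: dict mapping a term name to a list of dictionary
                records (assumed shape).
    synonym_library: dict mapping an extended term to the second
                term name it stands for (assumed shape).
    """
    # Step S102: exact match against the dictionary's term names.
    records = dictionary.get(name)
    if records:
        # records[0] is a placeholder for the preset matching rule.
        return ("dictionary", name, records[0])

    # Step S103: no first coding result, so try the synonym library.
    second_term = synonym_library.get(name)
    if second_term is not None:
        records = dictionary.get(second_term)
        if records:
            return ("synonym_library", second_term, records[0])

    # Neither source produced a result; left for manual coding.
    return ("unmatched", None, None)
```

Returning the stage alongside the result lets a caller separate automatically coded names from those that still need manual attention.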
In step S101, a variable name to be encoded may be acquired.
According to the embodiment of the disclosure, to standardize medical terms, the variable names to be encoded must be converted through medical coding into uniform, standard medical terms, so that the subsequent analysis of drug safety and efficacy data rests on correct and consistent information. The variable name to be encoded may be one or more of an adverse reaction/adverse event, a disease, a drug name, and so on. It may be obtained from the raw data set of the EDC system, identified and acquired automatically by the system, or entered manually; the manner of acquisition is not particularly limited.
In step S102, the variable name to be encoded may be compared with first term name information of a medical dictionary, so as to obtain a first term name corresponding to the variable name to be encoded, and a first encoding result of the variable name to be encoded is obtained according to the first term name; the first coding result is a piece of medical dictionary information corresponding to the variable name to be coded in the medical dictionary.
According to the embodiment of the disclosure, the variable name to be encoded can be compared with the term name information in the medical dictionary, so that the term name matched with the variable name to be encoded can be obtained, and the medical dictionary information corresponding to the term name is used as the encoding result of the variable name to be encoded.
Preferably, comparing the variable name to be encoded with first term name information of a medical dictionary to obtain a first term name corresponding to the variable name to be encoded, and obtaining a first encoding result of the variable name to be encoded according to the first term name includes: matching the variable name to be encoded with first term name information of a medical dictionary to obtain a first term name completely matched with the variable name to be encoded; if the first term name only corresponds to one piece of medical dictionary information, the medical dictionary information corresponding to the first term name is used as the first coding result of the variable name to be coded; and if two or more pieces of medical dictionary information correspond to the first term name, acquiring one piece of medical dictionary information as the first coding result of the variable name to be coded according to a preset matching rule. It is to be understood that, in the embodiment of the present disclosure, the term name information is a set of term names, that is, a set of all term names in the medical dictionary is referred to as "term name information".
According to an embodiment of the present disclosure, the official medical dictionary may be MedDRA or WHODrug. MedDRA (Medical Dictionary for Regulatory Activities) is the international medical terminology dictionary of the ICH (International Council for Harmonisation of Technical Requirements for Pharmaceuticals for Human Use); its scope includes signs and symptoms, diseases, diagnoses, therapeutic indications, names and qualitative results of physical and laboratory examinations, surgical and medical procedures, medical history, social history, family history, and the like. WHODrug, in full WHODrug Global, is described as the most comprehensive electronic dictionary of medicinal products and comprises four dictionaries: the WHO Drug Dictionary (WHO-DD), the WHO Drug Dictionary Enhanced (WHO-DDE), the WHO Herbal Dictionary (WHO-HD), and the Combined Dictionary. The official medical dictionary is processed into one or more medical dictionary data tables, which are stored to facilitate medical coding. In the embodiments of the present disclosure, the "medical dictionary" refers specifically to the one or more medical dictionary data tables formed by processing an official medical dictionary.
Specifically, taking MedDRA as an example, the official medical dictionary file is obtained and decompressed to obtain its core files, such as the mdhier dictionary file (mdhier.asc) and the llt dictionary file (llt.asc). The information in each core file is extracted line by line, each extracted line is split on a preset separator to obtain the required key information, and the key information is combined according to the correspondence between the items to form and store one or more medical dictionary data tables. The result is the medical dictionary corresponding to MedDRA in the embodiment of the present disclosure (hereinafter referred to as the "MedDRA medical dictionary").
For example:
A line of text in mdhier.asc is: "10000002$10021608$10021605$10027433$11-β-hydroxylase deficiency$abnormal synthesis of various sterols$congenital metabolic abnormal disease$metabolic and nutritional disease$Metab$$10010331$N$". Splitting on the "$" symbol yields 12 character strings in total: 1-"10000002", 2-"10021608", 3-"10021605", 4-"10027433", 5-"11-β-hydroxylase deficiency", 6-"abnormal synthesis of various sterols", 7-"congenital metabolic abnormal disease", 8-"metabolic and nutritional disease", 9-"Metab", 10-"" (empty), 11-"10010331", 12-"N". These are stored in the M_AE_MDHIER table according to the corresponding rules 1-PtCode (preferred term code), 2-HltCode (high level term code), 3-HlgtCode (high level group term code), 4-SocCode (system organ class code), 5-PtName (preferred term name), 6-HltName (high level term name), 7-HlgtName (high level group term name), 8-SocName (system organ class name).
Similarly, a line of text in llt.asc is: "10000001$"gas exchange" pneumonia$10081988$$$$$$$N$$". Splitting on the "$" symbol yields 11 character strings in total: 1-"10000001", 2-""gas exchange" pneumonia", 3-"10081988", 4-"", 5-"", 6-"", 7-"", 8-"", 9-"", 10-"N", 11-"". These are stored in the M_AE_LLT table according to the corresponding rules 1-LltCode (lowest level term code), 2-LltName (lowest level term name), 3-PtCode (preferred term code).
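The splitting described in these two examples can be sketched in Python as follows. This is a minimal illustration, not the disclosed implementation: the helper names are hypothetical, the sample lines are reconstructed from the field lists given in the text (with the β written as "beta"), and mapping empty strings to `None` is one assumed way of recording a field as "empty".

```python
# Field names for the leading columns, following the rules in the text.
MDHIER_FIELDS = ["PtCode", "HltCode", "HlgtCode", "SocCode",
                 "PtName", "HltName", "HlgtName", "SocName"]
LLT_FIELDS = ["LltCode", "LltName", "PtCode"]


def split_asc_line(line, sep="$"):
    """Split one line on the separator, dropping the empty string
    that follows the trailing separator."""
    parts = line.rstrip("\r\n").split(sep)
    if parts and parts[-1] == "":
        parts.pop()
    return parts


def to_row(line, field_names):
    """Map the leading fields of a line onto the given column names.
    Empty or blank fields become None (an assumption)."""
    parts = split_asc_line(line)
    return {name: (value.strip() or None)
            for name, value in zip(field_names, parts)}


# Sample lines reconstructed from the field lists in the text.
mdhier_line = ("10000002$10021608$10021605$10027433$"
               "11-beta-hydroxylase deficiency$"
               "abnormal synthesis of various sterols$"
               "congenital metabolic abnormal disease$"
               "metabolic and nutritional disease$Metab$$10010331$N$")
llt_line = '10000001$"gas exchange" pneumonia$10081988$$$$$$$N$$'

mdhier_row = to_row(mdhier_line, MDHIER_FIELDS)
llt_row = to_row(llt_line, LLT_FIELDS)
```

Because `zip` stops at the shorter sequence, fields beyond the named columns are simply ignored here, which corresponds to discarding strings that are not key information.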
It should be noted that the above examples show only part of the corresponding rules for the key information. Lowest level terms (Llt) are medical synonyms that describe a specific event in the most detailed language; colloquial expressions and language-specific descriptions of events are found only at this level. A preferred term (Pt) expresses a single, unambiguous medical concept and is the basic term for international exchange of medical information; there is no limit on the number of lowest level terms under a preferred term. High level terms (Hlt) group preferred terms in various ways, for example by anatomy, pathology, biology, etiology, or physiological function, to support retrieval and classification. High level group terms (Hlgt) are a broader concept above Hlt. The system organ class (Soc) corresponds to the system classification in other medical terminologies, and a preferred term is linked to at least one Soc. If there is no character, or only a space, between two "$" separators, the extracted information is recorded as "empty" or discarded. Besides the key information above, other character strings extracted from the official medical dictionary may be stored in the corresponding medical dictionary data table or discarded according to preset rules, which is not described further here. The above examples are illustrative only and do not limit the technical solutions of the present disclosure.
It may be understood that, after key information is extracted from the different core files of the official medical dictionary file, different medical dictionary data tables may be formed and stored according to the correspondence between the items of information, as in the above embodiment, or the key information may be integrated into a single medical dictionary data table according to that correspondence; this is not limited herein.
For coding "adverse reaction/adverse event", "disease", and similar variables, the MedDRA medical dictionary may be used. Specifically, the variable name to be encoded is matched against the LltName information (i.e., the first term name information) in the M_AE_LLT table to obtain a lowest level term name (i.e., the first term name) that completely matches the variable name to be encoded. If that name corresponds to only one piece of medical dictionary information, that piece of information, comprising the LltCode and PtCode for the name in the M_AE_LLT table together with the HltCode, HltName, HlgtCode, HlgtName, SocCode, SocName, and so on in the corresponding M_AE_MDHIER table, is used as the coding result of the variable name to be encoded. If the name corresponds to multiple pieces of medical dictionary information, the M_AE_MDHIER rows whose PtName is consistent with the variable name to be encoded are obtained first, then the M_AE_LLT rows whose LltCode is consistent with the PtCode of those M_AE_MDHIER rows are found, and the piece of medical dictionary information finally obtained is used as the coding result. If this matching rule does not yield the required medical dictionary information, the PtCode of the first M_AE_LLT row among the rows whose name is consistent with the variable name to be encoded may be matched against the PtCode in the M_AE_MDHIER table, and the piece of medical dictionary information obtained is used as the coding result. If no coding result is obtained after this processing, the match fails, and the variable name can subsequently be coded through the synonym library.
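One possible reading of this matching rule is sketched below in Python. It is an interpretation offered under assumptions rather than the disclosed implementation: the tables are plain lists of dicts, the function name `code_with_meddra` is hypothetical, and taking the first hierarchy row for a PtCode is an assumption, since the text does not say how to choose when a preferred term is linked to several system organ classes.

```python
def code_with_meddra(name, llt_rows, mdhier_rows):
    """Return one merged record for `name`, or None if matching fails.

    llt_rows:    rows with keys LltCode, LltName, PtCode (M_AE_LLT).
    mdhier_rows: rows with at least PtCode and PtName (M_AE_MDHIER).
    """
    # Exact match of the variable name against LltName.
    candidates = [r for r in llt_rows if r["LltName"] == name]
    if not candidates:
        return None  # caller may fall back to the synonym library

    # Single match, or default used when the preferred rule fails:
    # the first LLT row with that name.
    chosen = candidates[0]

    if len(candidates) > 1:
        # Preferred rule: hierarchy rows whose PtName equals the name,
        # then the LLT row whose LltCode equals one of their PtCodes.
        pt_codes = {m["PtCode"] for m in mdhier_rows
                    if m["PtName"] == name}
        preferred = [r for r in candidates if r["LltCode"] in pt_codes]
        if preferred:
            chosen = preferred[0]

    # Join to the hierarchy table on PtCode (first row: assumption).
    hierarchy = next((m for m in mdhier_rows
                      if m["PtCode"] == chosen["PtCode"]), None)
    if hierarchy is None:
        return None
    return {"LltCode": chosen["LltCode"],
            "LltName": chosen["LltName"], **hierarchy}
```

A `None` return corresponds to the failed match in the text, after which the synonym library is consulted.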
Taking WHODrug as an example, the official medical dictionary file is obtained and decompressed to obtain the core files, such as the DD dictionary file (DD.txt), the DDA dictionary file (DDA.txt), and the INA dictionary file (INA.txt). The information in each core file is extracted line by line, and each extracted line is split according to preset fields to obtain the required key information. The key information is then combined according to the correspondence between the pieces of information to form and store one or more medical dictionary data tables, yielding the medical dictionary corresponding to WHODrug (hereinafter referred to as the "WHODrug medical dictionary") in the embodiment of the disclosure.
For example, for a line of text in DD.txt, "000001010024T20KEN 01041ALDOMET [ methyldopa ]", the first 11 characters "00000101002" are extracted as the drug code (DrugCode) field of the W_DD table, the characters from the 31st character to the end of the line, "ALDOMET [ methyldopa ]", are extracted as the product name (ProductName) field of the W_DD table, and the extracted characters are stored in the W_DD table.
Similarly, for a line of text in DDA.txt, "000017040015S02AA", the first 11 characters "00001704001" are extracted as the drug code field of the W_DDA table, the 13th to 17th characters "S02AA" as the ATCCode field, the 13th to 15th characters "S02" as the ATCCode1 field, the 13th to 15th characters "S02" as the ATCCode2 field, the 13th to 16th characters "S02A" as the ATCCode3 field, and the 13th to 17th characters "S02AA" as the ATCCode4 field, and all the extracted characters are stored in the W_DDA table.
Likewise, for a line of text in INA.txt, "A01AB 4 oral topical anti-infective and antibacterial drug", the first 7 characters "A01AB" are extracted as the ATCCode field of the W_INA table, the 8th character "4" as the LevelCode field of the W_INA table, and the characters from the 9th character to the end of the line, "oral topical anti-infective and antibacterial drug", as the ATCCtext field of the W_INA table.
It should be noted that the above examples only illustrate the method for establishing the medical dictionary corresponding to WHODrug and do not limit the technical solution of the present disclosure. When information is extracted line by line from the medical dictionary file, each space in a line of text counts as one character. The ATC (Anatomical Therapeutic Chemical) classification is the drug classification system employed by WHODrug: it divides all drugs into 14 major classes (ATCCode1) according to the anatomical organ/system on which they act, then subdivides them according to pharmacology/therapeutics (ATCCode2), with ATCCode3 and ATCCode4 further dividing drugs into subclasses according to chemistry/pharmacology/therapeutics.
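The fixed-position extraction described in the examples above can be sketched as follows. This is only an illustration under stated assumptions: the helper `slice_fields` and the `DDA_SPEC` / `INA_SPEC` layouts are hypothetical names, only some of the W_DDA fields are shown, trailing padding is stripped from each field, and the INA sample line is rebuilt with padding to 7 characters because the original spacing is not preserved in the text.

```python
def slice_fields(line, spec):
    """Cut `line` into named fields.

    `spec` maps a field name to (start, end): 1-based, inclusive character
    positions, where end=None means "to the end of the line". Spaces count
    as characters, so positions are applied to the raw, untrimmed line.
    """
    record = {}
    for field, (start, end) in spec.items():
        # Python slices are 0-based and end-exclusive, hence start - 1.
        record[field] = line[start - 1:end].strip()
    return record


# Assumed layouts, taken from the character positions quoted in the examples.
DDA_SPEC = {
    "DrugCode": (1, 11),
    "ATCCode": (13, 17),
    "ATCCode3": (13, 16),
    "ATCCode4": (13, 17),
}
INA_SPEC = {
    "ATCCode": (1, 7),
    "LevelCode": (8, 8),
    "ATCCtext": (9, None),
}

dda_record = slice_fields("000017040015S02AA", DDA_SPEC)

# Reconstructed line: code padded to 7 characters, then level, then text.
ina_line = "A01AB".ljust(7) + "4" + "oral topical anti-infective and antibacterial drug"
ina_record = slice_fields(ina_line, INA_SPEC)
```

Keeping the layout in a small table rather than hard-coding slices makes it easier to adjust positions if a later dictionary release changes the file format.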
It may be understood that, in the above embodiment, after key information is extracted from the different core files of the official medical dictionary file, the key information may be stored as separate medical dictionary data tables according to the corresponding information relationships, or may be integrated into a single medical dictionary data table according to those relationships; this is not limited herein.
For coding "medication/drug" variables, the medical dictionary corresponding to WHODrug may be employed. Specifically, the variable name to be encoded is matched against the ProductName information (i.e., the first term name information) in the W_DD table to obtain a drug name (i.e., the first term name) that completely matches the variable name to be encoded. If the drug name corresponds to only one piece of medical dictionary information, that piece of medical dictionary information, which includes DrugCode, ProductName, PreferredCode, PreferredName, ATCCode1, ATCCode2, ATCCode3, ATCCode4 and the like, is used as the encoding result of the variable name to be encoded. Here, PreferredCode is the code obtained by replacing three digits of the drug code in the W_DD table with "001", and PreferredName is the product name of the first piece of data in the W_DD table whose drug code matches the PreferredCode. If the drug name corresponds to a plurality of pieces of medical dictionary information, for example a plurality of ATCCodes, an appropriate ATCCode may be selected according to the purpose of administration, the indication and the like, and the corresponding whole piece of medical dictionary information is used as the encoding result of the variable name to be encoded. If no ProductName matching the variable name to be encoded is obtained, the medical dictionary encoding fails, and further encoding may subsequently be performed through the synonym library.
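A rough sketch of the WHODrug lookup is given below. Several points are assumptions rather than statements of the disclosure: the description does not say which three digits of the drug code are replaced with "001", so the sketch assumes the last three; the function names, the reduced column set, and the sample rows (including the ATC code) are hypothetical. The function returns every candidate record, leaving the choice among several ATCCodes to a later step based on the purpose of administration or indication.

```python
def preferred_code(drug_code, replacement="001"):
    # ASSUMPTION: the three replaced digits are the LAST three of the code.
    return drug_code[:-3] + replacement


def encode_whodrug(name, dd_rows, dda_rows):
    """Return all candidate records whose ProductName exactly equals `name`."""
    candidates = []
    for row in dd_rows:
        if row["ProductName"] != name:
            continue
        pref = preferred_code(row["DrugCode"])
        # PreferredName: product name of the first W_DD row with that code.
        pref_name = next(
            (r["ProductName"] for r in dd_rows if r["DrugCode"] == pref), None
        )
        atc_codes = [a["ATCCode"] for a in dda_rows if a["DrugCode"] == row["DrugCode"]]
        # One candidate per ATC code; a single candidate with None if no ATC row.
        for atc in atc_codes or [None]:
            candidates.append({
                "DrugCode": row["DrugCode"],
                "ProductName": row["ProductName"],
                "PreferredCode": pref,
                "PreferredName": pref_name,
                "ATCCode": atc,
            })
    return candidates


# Hypothetical sample rows for illustration only.
W_DD = [
    {"DrugCode": "00000101001", "ProductName": "METHYLDOPA"},
    {"DrugCode": "00000101002", "ProductName": "ALDOMET [ methyldopa ]"},
]
W_DDA = [{"DrugCode": "00000101002", "ATCCode": "C02AB"}]

hits = encode_whodrug("ALDOMET [ methyldopa ]", W_DD, W_DDA)
```

An empty list corresponds to the dictionary-encoding failure case; a list with one element is the encoding result; a longer list signals that an ATCCode still has to be chosen.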
In step S103, when the first encoding result of the variable name to be encoded is not obtained in the medical dictionary, the variable name to be encoded may be compared with the expanded term information of the synonym library, so as to obtain a second term name corresponding to the variable name to be encoded, and a second encoding result of the variable name to be encoded may be obtained according to the second term name; the second encoding result is a piece of medical dictionary information corresponding to the second term name in the medical dictionary.
According to an embodiment of the present disclosure, in order to ensure uniformity and normalization of encoding, the synonym library corresponds to the medical dictionary, the synonym library includes extended term information and second term name information, the extended term information corresponds to the second term name information, and the second term name information corresponds to the first term name information in the medical dictionary. The second term name information of the synonym library in the embodiment of the disclosure is derived from the first term name information of the corresponding medical dictionary, so that the synonym library is strongly related to the corresponding medical dictionary, and the synonym library is convenient for subsequent medical coding use.
Specifically, the synonym library of the embodiments of the present disclosure includes at least expanded term information and second term name information, wherein the expanded term information is a set of possible medical term expressions that are not included in the medical dictionary, and the second term name information corresponds to the first term name information in the medical dictionary, such that possible medical terms that are not included in the medical dictionary may be encoded by using the synonym library in conjunction with the medical dictionary. The synonym library of the embodiment of the disclosure mainly serves to link term expressions that are not recorded in the medical dictionary to term names of the medical dictionary, thereby effectively supplementing the medical dictionary, allowing more medical expressions to be encoded automatically, improving the efficiency of medical encoding, and reducing labor and time costs.
It should be noted that, the "first" and "second" in this disclosure are only used to distinguish the same or similar objects, and are not meant to be limiting in order. For example, "the second term name information corresponds to the first term name information in the medical dictionary" is intended to indicate that, by the second term name information of the synonym library, a term name matching the first term name information in the medical dictionary can be found, thereby obtaining a corresponding encoding result.
Preferably, comparing the variable name to be encoded with the expanded term information of the synonym library to obtain a second term name corresponding to the variable name to be encoded, and obtaining a second encoding result of the variable name to be encoded according to the second term name includes: matching the variable name to be encoded with the expansion term information of the synonym library to obtain an expansion term name completely matched with the variable name to be encoded; and obtaining a second coding result of the variable name to be coded in the medical dictionary according to a second term name corresponding to the extended term name. It is understood that in the embodiment of the present disclosure, the extended term information is a set of extended term names, that is, a set of all extended term names in the synonym library is referred to as "extended term information".
Specifically, an appropriate synonym library is selected according to the nature of the variable name to be encoded: for a variable name of the "adverse reaction/adverse event, disease" type, the synonym library corresponding to the MedDRA medical dictionary is selected; for a variable name of the "medication/drug" type, the synonym library corresponding to the WHODrug medical dictionary is selected. The variable name to be encoded is matched against the expanded term information of the selected synonym library, the corresponding second term name is obtained from the matched expanded term information, and the corresponding medical dictionary information is obtained from the corresponding medical dictionary according to the second term name and used as the encoding result of the variable name to be encoded. The method for obtaining the corresponding medical dictionary information from the corresponding medical dictionary according to the second term name is the same as the medical dictionary encoding method described above and is not repeated here.
It can be appreciated that if no encoding result can be obtained through the medical dictionary encoding and the synonym library encoding described above, encoding may be performed manually. Information related to the manually obtained encoding result, including but not limited to the variable name, the term name, and the medical dictionary version ID, is stored in the corresponding entries of the synonym library to supplement and enrich it, which facilitates reuse in subsequent encoding.
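The overall fallback order (medical dictionary, then synonym library, then manual encoding that feeds back into the synonym library) might be sketched like this. The names `encode_with_fallback` and `record_manual_result`, the dict-based synonym library mapping an expanded term to a second term name, and the sample data are all hypothetical; `dictionary_lookup` stands in for whichever dictionary encoding routine applies.

```python
def encode_with_fallback(name, dictionary_lookup, synonyms, manual_queue):
    """Try the dictionary, then the synonym library, else queue for manual coding.

    Returns (record, source) where source is "dictionary", "synonym" or "manual".
    """
    record = dictionary_lookup(name)
    if record is not None:
        return record, "dictionary"

    # Exact match against the expanded terms gives the second term name,
    # which is then encoded with the same dictionary lookup.
    term_name = synonyms.get(name)
    if term_name is not None:
        record = dictionary_lookup(term_name)
        if record is not None:
            return record, "synonym"

    manual_queue.append(name)
    return None, "manual"


def record_manual_result(synonyms, variable_name, term_name):
    """Store a manually chosen term so the same expression is auto-coded next time."""
    synonyms[variable_name] = term_name


# Hypothetical data for illustration.
dictionary = {"Headache": {"TermName": "Headache", "PtCode": "P1"}}
synonyms = {"head ache": "Headache"}
queue = []

first = encode_with_fallback("Headache", dictionary.get, synonyms, queue)
second = encode_with_fallback("head ache", dictionary.get, synonyms, queue)
third = encode_with_fallback("sore head", dictionary.get, synonyms, queue)

# After a coder maps "sore head" to "Headache", the entry is reused.
record_manual_result(synonyms, "sore head", "Headache")
fourth = encode_with_fallback("sore head", dictionary.get, synonyms, queue)
```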
As a preferred embodiment of the present disclosure, when the medical dictionary is built, a version ID (identification code) of the corresponding medical dictionary is generated from the version number of the official medical dictionary and stored in the corresponding medical dictionary data table together with the extracted information.
Further preferably, the synonym library of an embodiment of the present disclosure comprises a synonym main library and at least one synonym item sub-library. When the synonym library is used for encoding, the synonym main library and the synonym item sub-library corresponding to the item to be encoded may be used simultaneously or sequentially according to actual needs, or only one of them may be used as needed; this is not specifically limited.
As a preferred embodiment, when the official medical dictionary is upgraded, a corresponding medical dictionary data table may be built again from the official medical dictionary of the upgraded version, and the version ID of the medical dictionary is updated corresponding to the version number of the official medical dictionary as the medical dictionary of the upgraded version.
In this case, the medical encoding method of the embodiment of the present disclosure further includes: and updating the obtained coding result according to the new medical dictionary.
Preferably, updating the obtained encoding results according to the new version of the medical dictionary comprises: searching in the new medical dictionary according to the variable names and/or the term names in the obtained coding results to obtain a third coding result corresponding to the variable names and/or the term names in the obtained coding results, and updating the obtained coding result.
Specifically, comparing variable names in the obtained coding results with third term name information of the new-version medical dictionary to obtain third term names corresponding to the variable names in the obtained coding results, taking one piece of medical dictionary information corresponding to the third term names as the third coding result, and updating the obtained coding result; and in the case that the third coding result cannot be obtained according to the variable names in the obtained coding results, obtaining a piece of medical dictionary information corresponding to the variable names in the obtained coding results in the new medical dictionary according to the term names corresponding to the variable names as the third coding result, and updating the obtained coding result.
Taking the MedDRA medical dictionary as an example, the variable name in a coding result is compared with the LltName information in the new version of the MedDRA medical dictionary to find a matching lowest level term (LLT) name, and the piece of medical dictionary information corresponding to that LLT name is used as the updated coding result. If no LLT name matching the variable name can be found in the new version of the MedDRA medical dictionary, a corresponding piece of medical dictionary information may be found in the new version of the MedDRA medical dictionary according to the LLT name corresponding to the variable name in the coding result and used as the updated coding result. The specific method for obtaining the corresponding medical dictionary information according to the LLT name is the same as the MedDRA medical dictionary coding method described above and is not repeated here.
Taking the WHODrug medical dictionary as an example, the variable name in a coding result is compared with the ProductName information in the new version of the WHODrug medical dictionary to find a matching drug name, and the piece of medical dictionary information corresponding to that drug name is used as the updated coding result. If no drug name matching the variable name can be found in the new version of the WHODrug medical dictionary, a corresponding piece of medical dictionary information may be found in the new version of the WHODrug medical dictionary according to the drug name corresponding to the variable name in the coding result and used as the updated coding result. The specific method for obtaining the corresponding medical dictionary information according to the drug name is the same as the WHODrug medical dictionary coding method described above and is not repeated here.
Further preferably, when a coding result obtained with the WHODrug medical dictionary is updated, if the ATCCodes corresponding to the same drug name differ between the new and old versions of the WHODrug medical dictionary, a more suitable ATCCode may be selected in the new version of the WHODrug medical dictionary according to the purpose of administration, the indication, and the like, or the ATCCode in the original coding result may be retained (provided that it also exists in the new version of the WHODrug medical dictionary), and a prompt is given to facilitate manual review.
It can be appreciated that if the corresponding encoding result cannot be obtained in the new version of the medical dictionary by the above method, the encoding results that cannot be updated may be re-encoded manually, so that they are synchronized with the new version of the medical dictionary.
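The update procedure for a new dictionary version can be sketched as a two-step lookup. This is an illustrative sketch only: the function `update_result`, the record keys `VariableName`, `TermName` and `VersionId`, and the sample data are assumptions, and `new_lookup` stands in for the encoding routine run against the new version of the dictionary.

```python
def update_result(old_result, new_lookup):
    """Re-encode one stored result against the new dictionary version.

    Looks up the variable name first and the stored term name second.
    Returns (result, updated); updated=False means manual re-encoding is needed.
    """
    for key in ("VariableName", "TermName"):
        new_record = new_lookup(old_result[key])
        if new_record is not None:
            # Keep the original variable name alongside the new dictionary data.
            return {**new_record, "VariableName": old_result["VariableName"]}, True
    return old_result, False


# Hypothetical new-version dictionary keyed by term name.
new_dictionary = {
    "Headache": {"TermName": "Headache", "PtCode": "P9", "VersionId": "v2"},
}

direct = update_result(
    {"VariableName": "Headache", "TermName": "Headache", "PtCode": "P1", "VersionId": "v1"},
    new_dictionary.get,
)
via_term = update_result(
    {"VariableName": "head ache", "TermName": "Headache", "PtCode": "P1", "VersionId": "v1"},
    new_dictionary.get,
)
stale = update_result(
    {"VariableName": "old term", "TermName": "Retired term", "PtCode": "P2", "VersionId": "v1"},
    new_dictionary.get,
)
```

Results returned with `updated=False` would be the ones handed over for manual re-encoding.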
The present disclosure also provides a medical encoding device. The apparatus is for performing the steps in the embodiment of the medical encoding method described above in connection with fig. 1.
Referring to fig. 2, fig. 2 is a schematic block diagram illustrating a medical encoding apparatus 100 according to one embodiment of the present disclosure. The apparatus 100 comprises an acquisition module 101, a dictionary encoding module 102 and a thesaurus encoding module 103. The acquisition module 101 is configured to acquire the variable name to be encoded. The dictionary encoding module 102 is configured to compare the variable name to be encoded with first term name information of a medical dictionary to obtain a first term name corresponding to the variable name to be encoded, and to obtain a first encoding result of the variable name to be encoded according to the first term name; the first encoding result is a piece of medical dictionary information corresponding to the variable name to be encoded in the medical dictionary. The thesaurus encoding module 103 is configured to, in the case where the first encoding result of the variable name to be encoded is not obtained in the medical dictionary, compare the variable name to be encoded with the expanded term information of the synonym library to obtain a second term name corresponding to the variable name to be encoded, and obtain a second encoding result of the variable name to be encoded according to the second term name; the second encoding result is a piece of medical dictionary information corresponding to the second term name in the medical dictionary.
According to an embodiment of the present disclosure, the dictionary encoding module 102 is configured to compare the variable name to be encoded with first term name information of a medical dictionary in the following manner, so as to obtain a first term name corresponding to the variable name to be encoded, and obtain a first encoding result of the variable name to be encoded according to the first term name: matching the variable name to be encoded with first term name information of a medical dictionary to obtain a first term name completely matched with the variable name to be encoded; if the first term name only corresponds to one piece of medical dictionary information, the medical dictionary information corresponding to the first term name is used as the first coding result of the variable name to be coded; and if two or more pieces of medical dictionary information correspond to the first term name, acquiring one piece of medical dictionary information as the first coding result of the variable name to be coded according to a preset matching rule.
According to an embodiment of the present disclosure, the synonym library corresponds to the medical dictionary, and the synonym library includes extended term information and second term name information, the extended term information and the second term name information corresponding to each other, the second term name information corresponding to the first term name information in the medical dictionary.
According to an embodiment of the present disclosure, the thesaurus encoding module 103 is configured to compare the variable name to be encoded with the expanded term information of the synonym library in the following manner, so as to obtain a second term name corresponding to the variable name to be encoded and obtain a second encoding result of the variable name to be encoded according to the second term name: matching the variable name to be encoded with the expanded term information of the synonym library to obtain an expanded term name that completely matches the variable name to be encoded; and obtaining the second encoding result of the variable name to be encoded in the medical dictionary according to the second term name corresponding to the expanded term name.
According to an embodiment of the present disclosure, the apparatus further comprises an updating module configured for updating the obtained encoding results according to the new version of the medical dictionary in case the medical dictionary is updated to the new version of the medical dictionary.
According to an embodiment of the disclosure, the updating module is configured to update the obtained encoding result according to the new version of the medical dictionary by: searching in the new medical dictionary according to the variable names and/or the term names in the obtained coding results to obtain a third coding result corresponding to the variable names and/or the term names in the obtained coding results, and updating the obtained coding result.
Preferably, the updating module is configured to update the obtained encoding result according to the new version of the medical dictionary in the following manner: comparing the variable names in the obtained coding results with third term name information of the new-version medical dictionary to obtain third term names corresponding to the variable names in the obtained coding results, taking one piece of medical dictionary information corresponding to the third term names as the third coding result, and updating the obtained coding result; and in the case that the third coding result cannot be obtained according to the variable names in the obtained coding results, obtaining a piece of medical dictionary information corresponding to the variable names in the obtained coding results in the new medical dictionary according to the term names corresponding to the variable names as the third coding result, and updating the obtained coding result.
It will be appreciated that the specific manner in which the various modules perform the operations in relation to the medical encoding apparatus of the embodiment described above with reference to fig. 2 has been described in detail in connection with the embodiment of the medical encoding method described in connection with fig. 1, and will not be explained in detail here.
The embodiment of the disclosure also provides an electronic device, wherein the electronic device comprises a memory and a processor, the memory stores a computer program, and when the processor executes the computer program, the following steps are realized: acquiring the name of a variable to be coded; comparing the variable name to be encoded with first term name information of a medical dictionary to obtain a first term name corresponding to the variable name to be encoded, and obtaining a first encoding result of the variable name to be encoded according to the first term name; the first coding result is a piece of medical dictionary information corresponding to the variable name to be coded in the medical dictionary; under the condition that a first coding result of the variable name to be coded is not obtained in the medical dictionary, comparing the variable name to be coded with the expanded term information of the synonym library to obtain a second term name corresponding to the variable name to be coded, and obtaining a second coding result of the variable name to be coded according to the second term name; the second encoding result is a piece of medical dictionary information corresponding to the second term name in the medical dictionary.
It will be appreciated that the steps implemented when the processor executes the computer program are substantially identical to the implementation of the steps in the above-described method, and that the detailed description has been given in relation to the embodiments of the medical encoding method and will not be explained in detail here.
In another aspect, the present disclosure provides a computer readable storage medium, wherein the storage medium stores a computer program that, when executed, performs the steps of: acquiring the name of a variable to be coded; comparing the variable name to be encoded with first term name information of a medical dictionary to obtain a first term name corresponding to the variable name to be encoded, and obtaining a first encoding result of the variable name to be encoded according to the first term name; the first coding result is a piece of medical dictionary information corresponding to the variable name to be coded in the medical dictionary; under the condition that a first coding result of the variable name to be coded is not obtained in the medical dictionary, comparing the variable name to be coded with the expanded term information of the synonym library to obtain a second term name corresponding to the variable name to be coded, and obtaining a second coding result of the variable name to be coded according to the second term name; the second encoding result is a piece of medical dictionary information corresponding to the second term name in the medical dictionary.
It will be appreciated that the steps performed when the computer program is executed are substantially identical to the implementation of the steps in the above-described method; they have been described in detail in relation to the embodiments of the medical encoding method and will not be explained in detail here.
The foregoing has described in detail embodiments of the present disclosure, with specific examples being employed herein to illustrate the principles and implementations of the present disclosure, the above examples being provided solely to assist in understanding the methods of the present disclosure and their core ideas; meanwhile, as one of ordinary skill in the art will have variations in the detailed description and the application scope in light of the ideas of the present disclosure, the present disclosure should not be construed as being limited to the above description.
It should be understood that the terms "first" and "second", and the like in the claims, specification and drawings of this disclosure, are used for distinguishing between different objects and not for describing a particular sequential order. The terms "comprises" and "comprising" when used in the specification and claims of this disclosure are taken to specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
It is also to be understood that the terminology used in the description of the present disclosure is for the purpose of describing particular embodiments only, and is not intended to be limiting of the disclosure. As used in this disclosure and in the claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It should be further understood that the term "and/or" as used in the present disclosure and claims refers to any and all possible combinations of one or more of the associated listed items, and includes such combinations.
The embodiments of the present disclosure have been described in detail above, and specific examples have been used herein to explain the principles and implementations of the present disclosure; the description of the above examples is intended only to facilitate an understanding of the method of the present disclosure and its core ideas. Meanwhile, modifications or variations made by those skilled in the art to the specific embodiments and application scope in light of the ideas of the present disclosure fall within the scope of protection of the present disclosure. In view of the foregoing, this description should not be construed as limiting the disclosure.

Claims (9)

1. A method of medical encoding, wherein the method comprises:
acquiring the name of a variable to be coded;
comparing the variable name to be encoded with first term name information of a medical dictionary to obtain a first term name corresponding to the variable name to be encoded, and obtaining a first encoding result of the variable name to be encoded according to the first term name; the first coding result is a piece of medical dictionary information corresponding to the variable name to be coded in the medical dictionary;
when the variable name to be encoded is an adverse reaction, adverse event or disease, the medical dictionary is one or more medical dictionary data tables formed from MedDRA after data processing; wherein the data processing includes: acquiring a MedDRA official medical dictionary file, decompressing it, acquiring the core files, namely an MDHIER dictionary file and an AE_LLT dictionary file, extracting information of the core files line by line, splitting the extracted information according to a preset separator to obtain key information, and storing the key information according to a preset correspondence to obtain an M_AE_LLT table and an M_AE_MDHIER table respectively; the first term name is a lowest level term (LLT) name, and comparing the variable name to be encoded with first term name information of a medical dictionary to obtain a first term name corresponding to the variable name to be encoded, and obtaining a first encoding result of the variable name to be encoded according to the first term name includes: matching the variable name to be encoded with the LLT name information in the M_AE_LLT table to obtain an LLT name that completely matches the variable name to be encoded; if the LLT name corresponds to only one piece of medical dictionary information, using that piece of medical dictionary information, which includes the PtCode and LltCode corresponding to the LLT name in the M_AE_LLT table and the HltCode, HltName, HlgtCode, HlgtName, SocCode and SocName in the corresponding M_AE_MDHIER table, as the first encoding result of the variable name to be encoded; if the LLT name corresponds to a plurality of pieces of medical dictionary information, acquiring an M_AE_MDHIER data set in which PtName in the M_AE_MDHIER table is consistent with the variable name to be encoded, finding the data in which LltCode in the M_AE_LLT table is consistent with PtCode in the M_AE_MDHIER data set, and taking the finally acquired piece of medical dictionary information as the first encoding result of the variable name to be encoded; if the required medical dictionary information cannot be obtained with the above matching rule, matching the PtCode of the first piece of M_AE_LLT data in the data set whose name is consistent with the variable name to be encoded in the M_AE_LLT table against PtCode in the M_AE_MDHIER table, and using the obtained piece of medical dictionary information as the first encoding result of the variable name to be encoded;
when the variable name to be encoded is a medication or drug, the medical dictionary is one or more medical dictionary data tables formed from WHODrug after data processing; wherein the data processing includes: obtaining a WHODrug official medical dictionary file, decompressing it, obtaining the core files, namely a DD dictionary file, a DDA dictionary file and an INA dictionary file, extracting information of the core files line by line, splitting the extracted information according to preset fields to obtain key information, and storing the key information according to a preset correspondence to obtain a W_DD table, a W_DDA table and a W_INA table respectively; the first term name is a drug name, and comparing the variable name to be encoded with first term name information of a medical dictionary to obtain a first term name corresponding to the variable name to be encoded, and obtaining a first encoding result of the variable name to be encoded according to the first term name includes: matching the variable name to be encoded with the drug name information in the W_DD table to obtain a drug name that completely matches the variable name to be encoded; if the drug name corresponds to only one piece of medical dictionary information, using that piece of medical dictionary information, which includes DrugCode, ProductName, PreferredCode, PreferredName, ATCCode1, ATCCode2, ATCCode3 and ATCCode4, as the first encoding result of the variable name to be encoded; wherein the PreferredCode is the code obtained by replacing three digits of the drug code in the W_DD table with "001", and the PreferredName is the product name of the first piece of data in the W_DD table whose drug code matches the PreferredCode; if the drug name corresponds to a plurality of pieces of medical dictionary information, selecting an appropriate ATCCode according to the purpose of administration and the indication, and taking the corresponding whole piece of medical dictionary information as the first encoding result of the variable name to be encoded;
Under the condition that a first coding result of the variable name to be coded is not obtained in the medical dictionary, comparing the variable name to be coded with the expanded term information of the synonym library to obtain a second term name corresponding to the variable name to be coded, and obtaining a second coding result of the variable name to be coded according to the second term name; the second encoding result is a piece of medical dictionary information corresponding to the second term name in the medical dictionary.
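The WHODrug branch of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the `DrugRow` type, the `code_drug` function, the sample rows and the ATC values are all hypothetical, and the sketch assumes that the three digits replaced with '001' are the last three digits of the drug code, which the claim does not state. Where a drug name maps to several rows, the claim leaves the choice to the purpose and indication of use, so the sketch simply accepts an ATC code chosen by the caller.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class DrugRow:
    """One row of a hypothetical in-memory W_DD table."""
    drug_code: str
    product_name: str
    atc_codes: Tuple[str, str, str, str]  # ATCCode1 .. ATCCode4


def preferred_code(drug_code: str) -> str:
    # Assumption: the replaced digits are the LAST three digits of the code.
    return drug_code[:-3] + "001"


def code_drug(name: str, w_dd: List[DrugRow],
              chosen_atc: Optional[str] = None) -> Optional[dict]:
    """Return the first encoding result for a drug name, or None."""
    matches = [row for row in w_dd if row.product_name == name]  # exact match
    if not matches:
        return None
    if len(matches) > 1:
        # Several rows share the name: keep the row carrying the ATC code
        # selected (outside this sketch) from the purpose and indication.
        matches = [row for row in matches if chosen_atc in row.atc_codes]
        if len(matches) != 1:
            return None  # still ambiguous, so no automatic result
    row = matches[0]
    pref = preferred_code(row.drug_code)
    # PreferredName: product name of the first row whose code equals pref.
    pref_name = next((r.product_name for r in w_dd if r.drug_code == pref), None)
    atc1, atc2, atc3, atc4 = row.atc_codes
    return {
        "DrugCode": row.drug_code, "ProductName": row.product_name,
        "PreferredCode": pref, "PreferredName": pref_name,
        "ATCCode1": atc1, "ATCCode2": atc2, "ATCCode3": atc3, "ATCCode4": atc4,
    }


# Placeholder data only; these are not real WHODrug codes or ATC classes.
W_DD = [
    DrugRow("00000101001", "DRUG_A", ("X", "X01", "X01A", "X01AA")),
    DrugRow("00000101005", "BRAND_A", ("X", "X01", "X01A", "X01AA")),
    DrugRow("00000201001", "DRUG_B", ("X", "X01", "X01A", "X01AA")),
    DrugRow("00000201001", "DRUG_B", ("Y", "Y01", "Y01A", "Y01AA")),
]
```

Calling `code_drug("DRUG_B", W_DD)` returns `None` in this sketch because two rows share the name and no ATC code was supplied; passing `chosen_atc="Y01AA"` resolves the ambiguity.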
2. The medical encoding method of claim 1, wherein the synonym library corresponds to the medical dictionary, the synonym library comprising expanded term information and second term name information, the expanded term information corresponding to the second term name information, the second term name information corresponding to the first term name information in the medical dictionary.
3. The medical encoding method according to claim 2, wherein comparing the variable name to be encoded with the expanded term information of the synonym library to obtain a second term name corresponding to the variable name to be encoded, and obtaining a second encoding result of the variable name to be encoded according to the second term name comprises:
matching the variable name to be encoded with the expanded term information of the synonym library to obtain an expanded term name completely matched with the variable name to be encoded;
and obtaining a second coding result of the variable name to be coded in the medical dictionary according to a second term name corresponding to the expanded term name.
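The two-stage lookup of claims 1 to 3 (dictionary first, synonym library only when the dictionary yields nothing) can be sketched as follows. The function name `encode`, the two mappings and the placeholder codes are assumptions made for illustration; the patent does not prescribe a data structure.

```python
from typing import Dict, Optional

# Hypothetical medical dictionary: first term name -> dictionary record.
MEDICAL_DICTIONARY: Dict[str, dict] = {
    "Headache": {"LltCode": "L0001", "PtCode": "P0001"},  # placeholder codes
}

# Hypothetical synonym library: expanded term -> second term name, where the
# second term name is itself a first term name in the medical dictionary.
SYNONYM_LIBRARY: Dict[str, str] = {
    "head pain": "Headache",
}


def encode(name: str,
           dictionary: Dict[str, dict],
           synonyms: Dict[str, str]) -> Optional[dict]:
    """Return a coding result for `name`, or None if neither stage matches."""
    # Stage 1: exact match against the dictionary's first term names.
    record = dictionary.get(name)
    if record is not None:
        return {"source": "dictionary", "term": name, **record}

    # Stage 2: exact match against the expanded terms of the synonym library,
    # then look the mapped second term name up in the dictionary.
    second_term = synonyms.get(name)
    if second_term is None:
        return None
    record = dictionary.get(second_term)
    if record is None:
        return None
    return {"source": "synonym_library", "term": second_term, **record}
```

Recording which stage produced the result (the `source` key here) is a design choice of the sketch, not something the claims require; it makes synonym-derived codes easy to audit.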
4. The medical encoding method of claim 1, wherein if the medical dictionary is upgraded to a new version of the medical dictionary, the method further comprises:
and updating the obtained coding result according to the new medical dictionary.
5. The medical encoding method of claim 4, wherein updating the obtained encoding results according to the new version of the medical dictionary comprises:
searching in the new medical dictionary according to the variable names and/or the term names in the obtained coding results to obtain a third coding result corresponding to the variable names and/or the term names in the obtained coding results, and updating the obtained coding result.
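The update step of claims 4 and 5 can be sketched in Python under some stated assumptions: coding results are held as dictionaries with hypothetical `variable`, `term` and `code` keys, the new dictionary version is a mapping from term name to code, and a result that cannot be found in the new version is kept and flagged for review. The claims do not say what happens in that last case, so the flag is purely an assumption of this sketch.

```python
from typing import Dict, List


def update_results(results: List[dict],
                   new_dictionary: Dict[str, str]) -> List[dict]:
    """Re-code existing results against a new dictionary version."""
    updated = []
    for old in results:
        # Search by the stored term name first, then by the variable name.
        matched = next(
            (key for key in (old["term"], old["variable"])
             if key in new_dictionary),
            None,
        )
        if matched is None:
            # Assumption: keep the old result and mark it for manual review.
            updated.append({**old, "needs_review": True})
        else:
            updated.append({
                "variable": old["variable"],
                "term": matched,
                "code": new_dictionary[matched],  # the "third coding result"
                "needs_review": False,
            })
    return updated


# Placeholder data; the codes are illustrative, not real dictionary codes.
OLD_RESULTS = [
    {"variable": "head pain", "term": "Headache", "code": "P0001"},
    {"variable": "Dizzy", "term": "Dizzy spell", "code": "P0002"},
]
NEW_DICTIONARY = {"Headache": "P9001", "Vertigo": "P9002"}
```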
6. A medical encoding apparatus for implementing the medical encoding method according to any one of claims 1 to 5, wherein the apparatus comprises:
An acquisition module configured to acquire a variable name to be encoded;
the dictionary coding module is configured to compare the variable name to be coded with first term name information of a medical dictionary so as to obtain a first term name corresponding to the variable name to be coded, and obtain a first coding result of the variable name to be coded according to the first term name; the first coding result is a piece of medical dictionary information corresponding to the variable name to be coded in the medical dictionary;
when the variable name to be coded is an adverse reaction, adverse event or disease, the medical dictionary is one or more medical dictionary data tables formed from MedDRA after data processing; wherein the data processing includes: acquiring a MedDRA official medical dictionary file, decompressing it, obtaining the core files (an MDHIER dictionary file and an AE_LLT dictionary file), extracting information from the core files row by row, dividing the extracted information according to a preset separator to obtain key information, and storing the key information according to a preset correspondence to obtain an M_AE_LLT table and an M_AE_MDHIER table respectively; the first term name is a lowest level term (LLT) name, and the dictionary coding module is configured to match the variable name to be coded with the LLT name information in the M_AE_LLT table to obtain the LLT name that completely matches the variable name to be coded; if the LLT name corresponds to only one piece of medical dictionary information, that piece of medical dictionary information, including the PtCode and LltCode corresponding to the LLT name in the M_AE_LLT table and the HltCode, HltName, HlgtCode, HlgtName, SocCode and SocName corresponding to it in the M_AE_MDHIER table, is taken as the first encoding result of the variable name to be encoded; if the LLT name corresponds to a plurality of pieces of medical dictionary information, acquiring the M_AE_MDHIER data set in which PtName in the M_AE_MDHIER table is consistent with the variable name to be encoded, then finding the data in which LltCode in the M_AE_LLT table is consistent with PtCode in the M_AE_MDHIER data set, and taking the finally acquired piece of medical dictionary information as the first encoding result of the variable name to be encoded; if the required medical dictionary information cannot be obtained by the above matching rule, selecting the PtCode of the first piece of M_AE_LLT data in the data set of M_AE_LLT rows whose name is consistent with the variable name to be encoded, matching it with PtCode in the M_AE_MDHIER table, and taking the obtained piece of medical dictionary information as the first encoding result of the variable name to be encoded;
when the variable name to be coded is medication or medicine, the medical dictionary is one or more medical dictionary data tables formed from WHODrug after data processing; wherein the data processing includes: obtaining a WHODrug official medical dictionary file, decompressing it, obtaining the core files (a DD dictionary file, a DDA dictionary file and an INA dictionary file), extracting information from the core files row by row, dividing the extracted information according to preset fields to obtain key information, and storing the key information according to a preset correspondence to obtain a W_DD table, a W_DDA table and a W_INA table respectively; the first term name is a drug name, and the dictionary coding module is configured to match the variable name to be coded with the drug name information in the W_DD table to obtain the drug name that completely matches the variable name to be coded; if the drug name corresponds to only one piece of medical dictionary information, that piece of medical dictionary information, including DrugCode, ProductName, PreferredCode, PreferredName, ATCCode1, ATCCode2, ATCCode3 and ATCCode4, is taken as the first encoding result of the variable name to be encoded; wherein the PreferredCode is the code obtained by replacing three digits of the drug code in the W_DD table with '001', and the PreferredName is the product name of the first piece of data matched with the drug code in the W_DD table; if the drug name corresponds to a plurality of pieces of medical dictionary information, selecting a suitable ATCCode according to the purpose and indication of the medication, and taking the corresponding whole piece of medical dictionary information as the first encoding result of the variable name to be encoded;
the lexicon coding module is configured to compare the variable name to be coded with the expanded term information of the synonym library under the condition that the first coding result of the variable name to be coded is not obtained in the medical dictionary, so as to obtain a second term name corresponding to the variable name to be coded, and obtain a second coding result of the variable name to be coded according to the second term name; the second encoding result is a piece of medical dictionary information corresponding to the second term name in the medical dictionary.
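The MedDRA matching rule recited for the dictionary coding module can be sketched in Python as a three-step decision over the M_AE_LLT and M_AE_MDHIER tables. The row layout (plain dictionaries), the function name `encode_meddra`, the helper `hier` and every code in the sample data are hypothetical. The sketch also assumes that the first M_AE_MDHIER row with a matching PtCode is used when several exist, a point the claim leaves open.

```python
from typing import List, Optional


def encode_meddra(name: str, m_ae_llt: List[dict],
                  m_ae_mdhier: List[dict]) -> Optional[dict]:
    """Return the first encoding result for a term name, or None."""
    llt_rows = [r for r in m_ae_llt if r["LltName"] == name]  # exact match
    if not llt_rows:
        return None
    if len(llt_rows) == 1:
        chosen = llt_rows[0]
    else:
        # Several rows: prefer the row whose LltCode equals the PtCode of an
        # M_AE_MDHIER row whose PtName equals the name being coded.
        pt_codes = {h["PtCode"] for h in m_ae_mdhier if h["PtName"] == name}
        candidates = [r for r in llt_rows if r["LltCode"] in pt_codes]
        # Fallback from the claim: otherwise take the first matching row.
        chosen = candidates[0] if candidates else llt_rows[0]
    # Join on PtCode to pick up the HLT, HLGT and SOC fields.
    hier_row = next(
        (h for h in m_ae_mdhier if h["PtCode"] == chosen["PtCode"]), None)
    if hier_row is None:
        return None
    return {**hier_row, "LltCode": chosen["LltCode"],
            "LltName": chosen["LltName"]}


def hier(pt_code: str, pt_name: str, n: int) -> dict:
    """Build a placeholder M_AE_MDHIER row (illustrative values only)."""
    return {"PtCode": pt_code, "PtName": pt_name,
            "HltCode": f"H{n}", "HltName": f"hlt {n}",
            "HlgtCode": f"G{n}", "HlgtName": f"hlgt {n}",
            "SocCode": f"S{n}", "SocName": f"soc {n}"}


# Placeholder tables; these are not real MedDRA codes.
M_AE_LLT = [
    {"LltCode": "100", "LltName": "Nausea", "PtCode": "100"},
    {"LltCode": "201", "LltName": "Rash", "PtCode": "300"},
    {"LltCode": "200", "LltName": "Rash", "PtCode": "200"},
    {"LltCode": "401", "LltName": "Dizzy", "PtCode": "500"},
    {"LltCode": "402", "LltName": "Dizzy", "PtCode": "600"},
]
M_AE_MDHIER = [
    hier("100", "Nausea", 1), hier("200", "Rash", 2),
    hier("300", "Rash generalised", 3), hier("500", "Dizziness", 5),
    hier("600", "Vertigo", 6),
]
```

In this sample, "Rash" resolves to the row whose LltCode equals the PtCode of the "Rash" hierarchy row, while "Dizzy" has no such row and falls back to the first matching M_AE_LLT row.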
7. The medical encoding device of claim 6, wherein the device further comprises an updating module configured to update the obtained encoding results according to a new version of the medical dictionary in case the medical dictionary is upgraded to the new version of the medical dictionary.
8. An electronic device comprising a memory and a processor, the memory having a computer program stored therein, the processor implementing the method of any of claims 1 to 5 when the computer program is executed.
9. A computer readable storage medium, wherein the storage medium stores a computer program which, when executed, implements the method of any one of claims 1 to 5.
CN202210519541.XA 2022-05-12 2022-05-12 Medical coding method and device Active CN115017326B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210519541.XA CN115017326B (en) 2022-05-12 2022-05-12 Medical coding method and device

Publications (2)

Publication Number Publication Date
CN115017326A (en) 2022-09-06
CN115017326B (en) 2023-08-18

Family

ID=83068704

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210519541.XA Active CN115017326B (en) 2022-05-12 2022-05-12 Medical coding method and device

Country Status (1)

Country Link
CN (1) CN115017326B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115659962B (en) * 2022-12-22 2023-05-05 深圳市斯维尔科技股份有限公司 Engineering list standardization correction method and storage medium

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105069123A (en) * 2015-08-13 2015-11-18 易保互联医疗信息科技(北京)有限公司 Automatic coding method and system for Chinese surgical operation information
CN105069124A (en) * 2015-08-13 2015-11-18 易保互联医疗信息科技(北京)有限公司 Automatic ICD (International Classification of Diseases) coding method and system
CN105184053A (en) * 2015-08-13 2015-12-23 易保互联医疗信息科技(北京)有限公司 Automatic coding method and system for Chinese medical service project information
CN108182972A (en) * 2017-12-15 2018-06-19 上海长江科技发展有限公司 The intelligent coding method and system of Chinese medical diagnosis on disease based on participle network
CN108182207A (en) * 2017-12-15 2018-06-19 上海长江科技发展有限公司 The intelligent coding method and system of Chinese surgical procedure based on participle network
CN110705214A (en) * 2019-08-27 2020-01-17 天津开心生活科技有限公司 Automatic coding method and device
CN112131868A (en) * 2020-09-22 2020-12-25 上海亿普医药科技有限公司 Clinical trial medical coding method
CN112765136A (en) * 2021-04-07 2021-05-07 浙江太美医疗科技股份有限公司 Storage method, upgrading method and device of medical coding dictionary
CN112800213A (en) * 2021-01-26 2021-05-14 北京壹永科技有限公司 Medical text information display method and device and electronic equipment
CN113641714A (en) * 2021-08-31 2021-11-12 平安医疗健康管理股份有限公司 Medical data correction method, device, computer equipment and storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050240439A1 (en) * 2004-04-15 2005-10-27 Artificial Medical Intelligence, Inc, System and method for automatic assignment of medical codes to unformatted data
US10796390B2 (en) * 2006-07-03 2020-10-06 3M Innovative Properties Company System and method for medical coding of vascular interventional radiology procedures
US9147039B2 (en) * 2010-09-15 2015-09-29 Epic Systems Corporation Hybrid query system for electronic medical records

Also Published As

Publication number Publication date
CN115017326A (en) 2022-09-06

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant