WO2022151896A1 - Method, apparatus, electronic device, and computer medium for determining a drug code - Google Patents
- Publication number: WO2022151896A1
- Application number: PCT/CN2021/138298
- Authority: WO, WIPO (PCT)
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H70/00—ICT specially adapted for the handling or processing of medical references
- G16H70/40—ICT specially adapted for the handling or processing of medical references relating to drugs, e.g. their side effects or intended usage
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/38—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/381—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using identifiers, e.g. barcodes, RFIDs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/316—Indexing structures
- G06F16/319—Inverted lists
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/335—Filtering based on additional data, e.g. user or group profiles
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/38—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/383—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
- G06F40/295—Named entity recognition
Definitions
- The present disclosure relates to the field of computer technology, specifically to artificial intelligence technology, and in particular to a method, an apparatus, an electronic device, a computer-readable medium, and a computer program product for determining a drug code.
- ATC stands for Anatomical Therapeutic Chemical.
- In existing approaches, a learned classification algorithm is generally applied to the molecular formula structure of a drug to predict the drug's ATC code.
- Predicting the ATC code of a drug from its molecular formula structure is complicated, its accuracy is not high, and it is not suitable for drugs other than newly developed drugs.
- Embodiments of the present disclosure propose a method, apparatus, electronic device, computer-readable medium, and computer program product for determining a drug code.
- In a first aspect, an embodiment of the present disclosure provides a method for determining a drug code, the method comprising: obtaining an instruction text of a drug; extracting drug key information from the instruction text; obtaining, based on a pre-created code inverted index, at least one code related to the drug key information and the component corresponding to each code; and screening the at least one code based on the drug key information and the component corresponding to each code to obtain the anatomical therapeutic and chemical classification system (ATC) code of the drug.
- The above-mentioned drug key information includes drug components, and screening the at least one code based on the drug key information and the component corresponding to each code to obtain the ATC code of the drug includes: for each code in the at least one code, detecting whether the component corresponding to the code satisfies one of a plurality of rules having a priority order, the plurality of rules being determined based on the drug components; in response to determining that the component corresponding to the code satisfies one of the plurality of rules and that all codes have been detected, determining primary screening candidate codes that include the code; and in response to detecting that the primary screening candidate codes consist of only one code, determining that code to be the ATC code of the drug.
- The above-mentioned plurality of rules, ordered from high to low priority, are as follows: 1) when there are two or more drug components, the component corresponding to the code includes all the drug components of the drug; 2) when there are two or more drug components, the component corresponding to the code includes at least one drug component of the drug and contains the word "compound"; 3) when there are two or more drug components, the component corresponding to the code includes at least one drug component of the drug and does not contain the word "compound"; 4) when there is one drug component, the component corresponding to the code includes that drug component.
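The four prioritized rules can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the disclosed implementation: the names `ENTRIES`, `rule_rank`, and `primary_screen` are hypothetical, components are modeled as lowercase string sets, and keeping only the codes that satisfy the highest-priority rule met by any code is one possible reading of "rules with a priority order".

```python
from typing import Dict, List, Set

# Hypothetical candidate entries; only the two codes and their names come from the text.
ENTRIES: List[Dict] = [
    {"code": "A01AB17", "name": "metronidazole",
     "components": {"metronidazole"}},
    {"code": "A02BD03", "name": "lansoprazole, amoxicillin and metronidazole",
     "components": {"lansoprazole", "amoxicillin", "metronidazole"}},
]


def rule_rank(drug: Set[str], name: str, components: Set[str]) -> int:
    """Return the number (1-4) of the rule the code satisfies, or 0 if none."""
    shared = drug & components
    if len(drug) >= 2:
        if drug <= components:          # rule 1: all drug components are covered
            return 1
        if shared:                      # rules 2 and 3: at least one component shared
            return 2 if "compound" in name.lower() else 3
        return 0
    return 4 if shared else 0           # rule 4: the single drug component is covered


def primary_screen(drug: Set[str], entries: List[Dict]) -> List[str]:
    """Keep the codes that satisfy the highest-priority rule met by any code (assumption)."""
    ranked = [(rule_rank(drug, e["name"], e["components"]), e["code"]) for e in entries]
    satisfied = [(rank, code) for rank, code in ranked if rank > 0]
    if not satisfied:
        return []
    best = min(rank for rank, _ in satisfied)
    return [code for rank, code in satisfied if rank == best]
```

For a drug whose components are lansoprazole, amoxicillin, and metronidazole, only `A02BD03` satisfies rule 1, so it is the single primary screening candidate; for a drug containing only metronidazole, both entries satisfy rule 4 and further screening would be needed.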
- The above-mentioned drug key information includes drug components, and screening the at least one code based on the drug key information and the component corresponding to each code to obtain the anatomical therapeutic and chemical classification system (ATC) code of the drug includes: for each code in the at least one code, detecting whether the component corresponding to the code matches a drug component; in response to determining that the component corresponding to the code matches a drug component and that all codes have been detected, obtaining primary screening candidate codes that include the code; and in response to detecting that the primary screening candidate codes consist of only one code, determining that code to be the ATC code of the drug.
- The above-mentioned drug key information further includes drug indications, and the above-mentioned screening of the at least one code to obtain the ATC code of the drug further includes: in response to detecting that the primary screening candidate codes include multiple codes, determining the disease type corresponding to the drug based on the drug indications; and selecting, from the primary screening candidate codes, the code corresponding to the disease type as the ATC code of the drug.
- The above-mentioned determining of the disease type corresponding to the drug based on the drug indications includes: using a pre-trained classification model to classify the indications, and obtaining the disease type output by the classification model.
- Embodiments of the present disclosure provide an apparatus for determining a drug code, the apparatus comprising: an acquisition unit configured to acquire an instruction text of a drug; an extraction unit configured to extract drug key information from the instruction text; an obtaining unit configured to obtain, based on a pre-created code inverted index, at least one code related to the drug key information and the component corresponding to each code; and a screening unit configured to screen the at least one code based on the drug key information and the component corresponding to each code to obtain the ATC code of the drug.
- The above-mentioned drug key information includes drug components, and the above-mentioned screening unit includes: a detection module configured to detect, for each code in the at least one code, whether the component corresponding to the code satisfies one of a plurality of rules having a priority order; a primary screening module configured to determine, in response to determining that the component corresponding to the code satisfies one of the plurality of rules and that all codes have been detected, primary screening candidate codes that include the code; and a determining module configured to determine, in response to detecting that the primary screening candidate codes consist of only one code, that code to be the anatomical therapeutic and chemical classification system (ATC) code of the drug.
- The above-mentioned plurality of rules, ordered from high to low priority, are as follows: 1) when there are two or more drug components, the component corresponding to the code includes all the drug components of the drug; 2) when there are two or more drug components, the component corresponding to the code includes at least one drug component of the drug and contains the word "compound"; 3) when there are two or more drug components, the component corresponding to the code includes at least one drug component of the drug and does not contain the word "compound"; 4) when there is one drug component, the component corresponding to the code includes that drug component.
- The above-mentioned drug key information includes drug components, and the above-mentioned screening unit includes: a matching module configured to detect, for each code in the at least one code, whether the component corresponding to the code matches a drug component; a response module configured to obtain, in response to determining that the component corresponding to the code matches a drug component and that all codes have been detected, primary screening candidate codes that include the code; and a coding module configured to determine, in response to detecting that the primary screening candidate codes consist of only one code, that code to be the ATC code of the drug.
- The above-mentioned drug key information further includes drug indications, and the screening unit further includes: a classification module configured to determine, in response to detecting that the primary screening candidate codes include multiple codes, the disease type corresponding to the drug based on the drug indications; and a confirmation module configured to select, from the primary screening candidate codes, the code corresponding to the disease type as the ATC code of the drug.
- The above-mentioned classification module is further configured to use a pre-trained classification model to classify the indications and obtain the disease type output by the classification model.
- Embodiments of the present disclosure provide an electronic device, including: one or more processors; and a storage device on which one or more programs are stored; when the one or more programs are executed by the one or more processors, the one or more processors implement the method described in any implementation of the first aspect.
- embodiments of the present disclosure provide a computer-readable medium on which a computer program is stored, and when the program is executed by a processor, implements the method described in any implementation manner of the first aspect.
- an embodiment of the present disclosure provides a computer program product, including a computer program, which, when executed by a processor, implements the method described in any implementation manner of the first aspect.
- In the method and apparatus for determining a drug code provided by the embodiments of the present disclosure: first, the instruction text of the drug is obtained; second, the drug key information in the instruction text is extracted; then, based on a pre-created code inverted index, at least one code related to the drug key information and the component corresponding to each code are obtained; finally, based on the drug key information and the component corresponding to each code, the at least one code is screened to obtain the anatomical therapeutic and chemical classification system (ATC) code of the drug.
- ATC coding can thus be performed automatically from a drug's instruction text through the pre-created code inverted index, which addresses a coding problem that pharmacists commonly face in their work and provides basic coding information for medical information systems.
- FIG. 1 is an exemplary system architecture diagram to which an embodiment of the present disclosure may be applied;
- FIG. 2 is a flowchart of one embodiment of a method for determining a drug code according to the present disclosure
- FIG. 3 is a flow diagram of one embodiment of a method of obtaining an anatomical therapeutic and chemical classification system code for a drug product according to the present disclosure
- FIG. 4 is a flowchart of another embodiment of a method of obtaining an anatomical therapeutic and chemical classification system code for a drug product according to the present disclosure
- FIG. 5 is a schematic structural diagram of an embodiment of an apparatus for determining a drug code according to the present disclosure
- FIG. 6 is a schematic structural diagram of an electronic device suitable for implementing embodiments of the present disclosure.
- FIG. 1 illustrates an exemplary system architecture 100 to which the method of determining a drug code of the present disclosure may be applied.
- the system architecture 100 may include terminal devices 101 , 102 , and 103 , a network 104 and a server 105 .
- the network 104 is a medium used to provide a communication link between the terminal devices 101 , 102 , 103 and the server 105 .
- the network 104 may include various connection types, and may typically include wireless communication links and the like.
- the terminal devices 101, 102, and 103 interact with the server 105 through the network 104 to receive or send messages and the like.
- Various communication client applications may be installed on the terminal devices 101 , 102 and 103 , such as instant messaging tools, email clients, and the like.
- the terminal devices 101, 102, and 103 may be hardware or software; when the terminal devices 101, 102, and 103 are hardware, they may be user devices with communication and control functions, which can communicate with the server 105.
- When the terminal devices 101, 102, and 103 are software, they can be installed in the above-mentioned user equipment; they can be implemented as multiple pieces of software or software modules (for example, software or software modules for providing distributed services), or as a single piece of software or software module. There is no specific limitation here.
- the server 105 may be a server that provides various services, for example, a background server that provides support for the drug processing system on the terminal devices 101 , 102 , and 103 to determine drug codes.
- the backend server can analyze and process the instruction text of the medicine in the network, and feed back the processing result (such as the determined ATC code) to the terminal device.
- the server may be hardware or software.
- the server can be implemented as a distributed server cluster composed of multiple servers, or can be implemented as a single server.
- When the server is software, it can be implemented as multiple pieces of software or software modules (for example, software or software modules for providing distributed services), or as a single piece of software or software module. There is no specific limitation here.
- the method for determining the drug code provided by the embodiments of the present disclosure is generally executed by the server 105 .
- terminal devices, networks and servers in FIG. 1 are merely illustrative. There can be any number of terminal devices, networks and servers according to implementation needs.
- FIG. 2 shows a flow 200 of one embodiment of a method for determining a drug code according to the present disclosure. The method for determining a drug code includes the following steps:
- Step 201 obtaining the instruction text of the medicine.
- the instruction manual of a drug refers to a legal document indicating important information of the drug, and is a legal guide for selecting a drug. Accurately reading and understanding the instruction manual before taking medicine is a prerequisite for safe drug use.
- the instructions of the drug include the name, specification, manufacturer, validity period, usage, dosage, drug ingredients, indications or functions, contraindications, adverse reactions and precautions of the drug.
- the name of the drug includes: generic name, trade name, English name, chemical name, etc. As long as the user knows the generic name of the drug, the user can avoid repeated use of the drug.
- the instruction text of a drug is a text for indicating the contents of the instruction manual of a drug.
- the execution subject on which the method for determining the drug code runs may obtain the instruction text through various means, for example, obtain the instruction text from the terminal in real time, or read the instruction text from the memory, which is not limited in this embodiment.
- Step 202 extracting the key information of the drug in the instruction text.
- the key information of the drug includes the drug ingredients or information related to the drug ingredients, and the information related to the drug ingredients includes: the name of the drug, the indications or functions of the drug, contraindications, adverse reactions, etc.
- the drug component may also be the main component of the drug.
- Natural language processing technology has been widely used in life scenarios that require semantic understanding.
- For example, entity recognition technology can identify entities (such as drug names, disease names, and treatment methods) in a piece of text, so that content such as diagnoses and prescriptions in doctors' orders can be analyzed automatically and medical information can be managed in a structured way.
- As another example, text classification technology can be applied to intelligent triage scenarios: it parses the patient's description of the condition, matches the appropriate clinic based on that description, and improves the efficiency of triage.
- the combination of natural language processing technology and medical scenarios can improve the intelligence of medical scenarios and provide users with a better experience.
- the ingredients of the drug, the name of the drug, the indications or functions of the drug, contraindications, adverse reactions, etc. in the instruction text can be extracted through natural language processing.
- The drug ingredients are generally given in a short natural-language description, for example: "This product is a compound preparation containing, per milliliter, 10 mg of clindamycin hydrochloride (calculated as clindamycin) and 8 mg of metronidazole. Excipients are: glycerol, ethanol."
- a natural language processing model such as a named entity recognition model
- the main components non-auxiliary components or excipients
- The drug components extracted by the natural language processing model include: clindamycin hydrochloride, metronidazole, glycerol, and ethanol.
- A natural language model composed of BERT (Bidirectional Encoder Representations from Transformers) and a CRF (conditional random field) can be used for training, and the trained model can recognize drug component entities.
- The drug key information is obtained through this named entity recognition model.
- the accuracy rate of the recognition result of the named entity recognition model can be close to 90%, which can fully meet the requirements of actual clinical use.
- Based on their different nature and characteristics, medicines include compound medicines and single prescription medicines.
- a single prescription drug refers to a single drug preparation, and a single prescription drug mainly contains one drug ingredient.
- Compound medicine refers to the mixed preparation of two or more kinds of medicines, which can be Chinese medicine, Western medicine or a mixture of Chinese and Western medicines.
- Combination drugs contain two or more drug ingredients.
- the medicine components in the key information of medicines may refer to one type or multiple types.
- Step 203 Obtain at least one code related to the key drug information and components corresponding to each code based on the pre-created code inverted index.
- The code inverted index is an index library created before the drug key information is extracted from the instruction text; it only needs to be created once and can then be used repeatedly.
- the created coding inverted index is determined based on the coding of the drug that needs to be determined.
- The method for determining a drug code provided in this application is used to determine the ATC code of a drug. Therefore, the code inverted index can be built by indexing, group by group, the classification information (ATC Chinese name, ATC English name, ATC code) of the ATC classification standard defined by the World Health Organization; for example, one code inverted index is shown in Table 1.
- Table 1 includes the ATC code and the Chinese name and English name of the chemical substance corresponding to the ATC code, wherein the chemical substance is also a pharmaceutical ingredient, that is, the pharmaceutical ingredient corresponding to the ATC code.
- For example, for the entry whose English name is "ticlatone", the corresponding ATC code is "D01AE08".
- Multiple drug components necessarily correspond to multiple ATC codes, and a single drug component may also correspond to multiple ATC codes.
- For example, the ATC codes corresponding to drugs containing "tegafur" may include "L01BC03" and "L01BC53".
- Search engine software (e.g., Elasticsearch) can be used to index the ATC codes defined by the World Health Organization.
- Once the inverted index has been built, the search engine can look up a text field directly. For example, when searching for classification information whose ATC Chinese name contains a given term (such as metronidazole), all ATC codes whose Chinese name contains "metronidazole" are easily found; the results include: metronidazole, A01AB17; lansoprazole, amoxicillin and metronidazole, A02BD03; and so on. The codes and the components corresponding to the codes can therefore be obtained easily through the search engine software.
- For each drug component, the maximum number of returned codes (or of components corresponding to codes) can be set to n (n > 1); for example, n is set to 10.
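The lookup described above can be illustrated with a small in-memory inverted index in Python. This is only a sketch under stated assumptions: it swaps a plain dictionary in for search engine software such as Elasticsearch, it indexes English names only, and the names `ATC_RECORDS`, `tokenize`, `build_index`, and `lookup` are hypothetical. The three records are the ones quoted in the text.

```python
import re
from collections import defaultdict
from typing import Dict, List, Tuple

# (ATC code, component name) pairs quoted in the text.
ATC_RECORDS: List[Tuple[str, str]] = [
    ("A01AB17", "metronidazole"),
    ("A02BD03", "lansoprazole, amoxicillin and metronidazole"),
    ("D01AE08", "ticlatone"),
]

STOPWORDS = {"and"}  # assumption: connective words are not indexed


def tokenize(name: str) -> List[str]:
    """Split a component name into lowercase terms, dropping stopwords."""
    return [t for t in re.split(r"[^a-z0-9]+", name.lower())
            if t and t not in STOPWORDS]


def build_index(records: List[Tuple[str, str]]) -> Dict[str, List[Tuple[str, str]]]:
    """Map each term to the (code, name) records whose name contains it."""
    index: Dict[str, List[Tuple[str, str]]] = defaultdict(list)
    for code, name in records:
        for term in set(tokenize(name)):
            index[term].append((code, name))
    return index


def lookup(index: Dict[str, List[Tuple[str, str]]],
           component: str, n: int = 10) -> List[Tuple[str, str]]:
    """Return at most n (code, name) records for a single-term component."""
    return index.get(component.lower(), [])[:n]


INDEX = build_index(ATC_RECORDS)
```

Calling `lookup(INDEX, "metronidazole")` returns both the `A01AB17` and `A02BD03` records, mirroring the search result described in the text. Multi-word components such as "clindamycin hydrochloride" would need per-term lookups, which this sketch leaves out.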
- Step 204 Screen at least one code based on the key information of the drug and the components corresponding to each code to obtain the anatomical therapeutic and chemical classification system code of the drug.
- The at least one code may be one code or more than one; the number of codes may be detected first, and when there is only one code, the obtained code is the ATC code.
- The above-mentioned screening of the at least one code based on the drug key information and the component corresponding to each code to obtain the anatomical therapeutic and chemical classification system (ATC) code of the drug includes: for each code in the at least one code, detecting whether the component corresponding to the code matches a drug component; in response to determining that the component corresponding to the code matches a drug component and that all codes have been detected, obtaining primary screening candidate codes that include the code; and in response to detecting that the primary screening candidate codes consist of only one code, determining that code to be the ATC code of the drug.
- the drug components may be expressed in different languages.
- A match can be detected from the similarity of the content (Chinese name or English word) of the two, or it can be determined from the diseases that the two are used to treat.
- For example, a match may be determined when the drug component and the component corresponding to the code are used to treat two or more of the same diseases.
- other methods may also be used to detect whether the components of the drug match the components corresponding to the codes, which are not limited.
- The primary screening candidate codes include every code in the at least one code whose corresponding components match all drug components of the drug.
- Each code whose components match is added to the primary screening candidate codes. After all codes in the at least one code have been detected, the number of codes among the primary screening candidate codes is determined; when there is only one, it is the ATC code of the drug. The primary screening candidate codes are thus obtained simply by matching the drug components against the inverted index results, which is simple to implement and convenient to operate.
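The matching and single-candidate check can be sketched as follows. The sketch assumes that "match" means equality of the normalized component sets; the text also allows similarity-based or disease-based matching, which is not shown. The function names are hypothetical.

```python
def normalize(name):
    """Lowercase and trim a component name so trivial differences do not block a match."""
    return name.strip().lower()

def primary_screen(drug_components, candidates):
    """Keep every code whose components equal the drug's components.

    candidates: list of (code, list of component names) from the inverted index.
    Assumption: a match is exact equality of the normalized component sets.
    """
    wanted = {normalize(c) for c in drug_components}
    return [code for code, components in candidates
            if {normalize(c) for c in components} == wanted]

def atc_if_unique(primary_candidates):
    """Return the code when exactly one candidate survives; otherwise None."""
    return primary_candidates[0] if len(primary_candidates) == 1 else None
```

When `atc_if_unique` returns `None` because several candidates remain, a further criterion such as the drug indications is needed to choose among them.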
- The key drug information further includes drug indications, and screening the at least one code based on the key drug information and the components corresponding to each code, to obtain the anatomical therapeutic and chemical classification system code of the drug, further includes: in response to detecting that the primary screening candidate codes contain multiple codes, determining the disease type corresponding to the drug based on the drug indications; and selecting, from the primary screening candidate codes, the code corresponding to the disease type as the anatomical therapeutic and chemical classification system code of the drug.
- In the method for determining a drug code provided by the embodiments of the present disclosure, the instruction text of the drug is first obtained; next, the key drug information in the instruction text is extracted; then, based on the pre-created code inverted index, at least one code related to the key drug information and the component corresponding to each code are obtained; finally, based on the key drug information and the component corresponding to each code, the at least one code is screened to obtain the anatomical therapeutic and chemical classification system code of the drug.
- ATC coding can be automatically performed on drugs through the pre-created coding inverted index according to the instruction text of the drug, which solves the problems faced by the majority of pharmacists in their work, and provides basic coding information for the medical information system.
- FIG. 3 shows a process 300 of an embodiment of a method for obtaining an anatomical therapeutic and chemical classification system code for a drug according to the present disclosure. The method includes the following steps:
- Step 301: for each code in the at least one code, detect whether the component corresponding to the code satisfies one of multiple rules that have a priority order; if it does, step 302 is executed.
- The multiple rules are determined based on the drug components. The rules are checked in priority order, and once the component corresponding to a code satisfies any one of them, the remaining rules need not be considered.
- The multiple rules, in descending order of priority, are: 1) when the drug has two or more drug components, the components corresponding to the code include all drug components of the drug; 2) when the drug has two or more drug components, the components corresponding to the code include at least one drug component of the drug and contain the word "compound"; 3) when the drug has two or more drug components, the components corresponding to the code include at least one drug component of the drug and do not contain the word "compound"; 4) when the drug has one drug component, the components corresponding to the code include that drug component.
- the content, priority order, and number of each of the above-mentioned multiple rules can be adaptively adjusted based on the drug components in the instruction text of the drug.
- a plurality of rules may only include the above 1) and 4).
- a plurality of rules may only include the above 1)-3).
- Multiple rules with a priority order can be applied to both single-ingredient medicines and compound medicines, with compound medicines taking priority, which improves the reliability and comprehensiveness of checking the components corresponding to the codes.
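The four prioritized rules can be sketched as a single function that returns the number of the first rule satisfied. Two things are assumptions made for illustration: a component counts as "included" when its name is a substring of the code's component text, and the compound marker is taken to be the English word "combinations" (as in "tegafur, combinations") or "compound". The function name `matched_rule` is hypothetical.

```python
def matched_rule(drug_components, code_components):
    """Return the number (1-4) of the highest-priority rule satisfied, or None.

    drug_components: component names taken from the drug's instruction text.
    code_components: the component text stored with one ATC code.
    """
    text = code_components.lower()
    # Assumption: substring containment stands in for component matching.
    present = [c for c in drug_components if c.lower() in text]
    # Assumption: these words mark a compound (combination) entry.
    is_compound = "combinations" in text or "compound" in text

    if len(drug_components) >= 2:
        if len(present) == len(drug_components):
            return 1  # the code lists every component of the drug
        if present and is_compound:
            return 2  # at least one component, marked as compound
        if present:
            return 3  # at least one component, not marked as compound
        return None
    if len(drug_components) == 1 and present:
        return 4  # single-component drug found in the code's components
    return None
```

A code for which `matched_rule` returns `None` is discarded; any other result places the code among the primary screening candidates.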
- Step 302: check whether all codes in the at least one code have been detected; if so, go to step 303; if not, return to step 301.
- The code is the current code, that is, each code of the at least one code taken in sequence.
- If the current code satisfies one of the multiple rules, it is put into the primary screening candidate codes. If the current code does not satisfy any of the multiple rules, it is discarded and the process returns to step 301, where the next code in the at least one code is taken as the current code.
- Step 303: determine the primary screening candidate codes including the code, and then perform step 304.
- The primary screening candidate codes are the ATC codes first found to meet the requirements of the drug instruction text. The components corresponding to each of these codes satisfy one of the multiple rules with a priority order, and the primary screening candidate codes may contain a single code or multiple codes.
- The primary screening candidate codes include every code in the at least one code whose corresponding component satisfies one of the multiple rules with a priority order.
- Step 304: check whether the primary screening candidate codes contain only one code; if so, step 305 is executed.
- If the detection result is that there is only one code, the current primary screening candidate code is determined to be the ATC code of the drug, and no subsequent detection is required.
- The codes among the primary screening candidate codes may be subjected to similarity matching, and one of the most similar primary screening candidate codes is used as the ATC code of the drug.
- Among all the primary screening candidate codes, the code whose corresponding component contains the word "compound" may be used as the ATC code of the drug.
- The code whose corresponding component does not contain the word "compound" may be used as the ATC code of the drug.
- Step 305: determine that the primary screening candidate code is the anatomical therapeutic and chemical classification system code of the drug.
- the anatomical therapeutic and chemical classification system codes of the drugs are determined based on multiple rules corresponding to the drug components, which improves the reliability of the determination of the ATC codes.
- A process 400 of another embodiment of the method for obtaining the anatomical therapeutic and chemical classification system code of a drug according to the present disclosure is shown.
- the method for obtaining an anatomical therapeutic and chemical classification system code for a medicinal product includes the following steps:
- Step 401: for each code in the at least one code, detect whether the component corresponding to the code satisfies one of a plurality of rules with a priority order. If the component corresponding to the code satisfies one of the multiple rules with a priority order, step 402 is executed.
- Step 402: check whether all codes in the at least one code have been detected; if so, go to step 403; if not, return to step 401.
- Step 403: determine the primary screening candidate codes including the code, and then execute step 404.
- Step 404: detect whether the primary screening candidate codes contain only one code. If the detection result is that there is only one code, step 405 is executed. If the detection result is that the primary screening candidate codes contain multiple codes, step 406 is executed.
- Step 405: determine that the primary screening candidate code is the anatomical therapeutic and chemical classification system code of the drug.
- Step 406: based on the drug indications, determine the disease type corresponding to the drug, and then perform step 407.
- A table of correspondence between indications and disease types may be preset; after the drug indications are obtained, the corresponding disease type can be looked up quickly in this table.
- Determining the disease type corresponding to the drug based on the drug indications includes: using a pre-trained classification model to classify the indications, and obtaining the disease type output by the classification model.
- A BERT model can be used to build the classification model, so that it can classify the indications in the drug instruction text and output probability values for the different disease types, for example over 14 disease types.
- For example, the indications in the instructions read "for acne vulgaris, but also for seborrheic dermatitis, rosacea, and folliculitis", and the 14-way classification determines which disease type the drug belongs to.
- These categories are: digestive system, metabolic system, blood and hematopoietic organs, cardiovascular system, dermatology, genitourinary system, sex hormones, anti-infective, anti-tumor and immunological drugs, musculoskeletal system, nervous system, anti-parasitic, respiratory system, and sensory system, a total of 14 disease types. These 14 classifications also correspond to the 14 disease types in the ATC primary classification.
- the classification model outputs the respective confidence scores for the above 14 disease types.
- For example, the classification scores output by the classification model are: digestive system (2%), metabolic system (7%), blood and hematopoietic organs (8%), cardiovascular system (5%), skin disease (80%), genitourinary system (1%), sex hormones (8%), anti-infection (10%), anti-tumor and immunological drugs (2%), musculoskeletal system (2%), nervous system (2%), anti-parasitic (2%), respiratory system (2%), sensory system (2%). The result is that the disease type for the above indications is skin disease.
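Picking the disease type from the model output amounts to taking the label with the highest score. The sketch below hard-codes the example scores quoted above instead of calling a model; the name `predicted_disease_type` is a hypothetical helper, and how the real classification model exposes its scores is not specified in the text.

```python
# Confidence scores (in percent) copied from the example in the text.
SCORES = {
    "digestive system": 2,
    "metabolic system": 7,
    "blood and hematopoietic organs": 8,
    "cardiovascular system": 5,
    "skin disease": 80,
    "genitourinary system": 1,
    "sex hormones": 8,
    "anti-infection": 10,
    "anti-tumor and immunological drugs": 2,
    "musculoskeletal system": 2,
    "nervous system": 2,
    "anti-parasitic": 2,
    "respiratory system": 2,
    "sensory system": 2,
}

def predicted_disease_type(scores):
    """Return the disease type with the highest confidence score."""
    return max(scores, key=scores.get)
```

For the example scores, `predicted_disease_type(SCORES)` selects "skin disease", matching the result stated in the text.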
- By training a classification model that maps drug indications to disease types, it is possible to learn which disease a drug is used to treat from the indication text in the drug instruction manual.
- When the classification model is used for classification, the classification accuracy can exceed 93%.
- the indications extracted from the instruction text are input into the pre-trained classification model, and the disease types output by the classification model can be obtained. Further, by comparing the obtained disease type with the disease type corresponding to each code in the preliminary screening candidate code, the preferred ATC code corresponding to the drug in the preliminary screening candidate code can be obtained.
- the classification model can improve the accuracy of disease type acquisition and ensure the reliability of ATC coding of drugs.
- Step 407: select, from the primary screening candidate codes, the code corresponding to the disease type as the anatomical therapeutic and chemical classification system code of the drug.
- The disease type may be one or multiple; when there is one disease type, the primary screening candidate code corresponding to that disease type is the ATC code of the drug.
- When there are multiple disease types, the primary screening candidate code that corresponds to the most disease types may be used as the ATC code of the drug.
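Step 407 can be sketched by comparing the disease type with the first character of each candidate code, since in ATC the first character identifies the anatomical main group. The mapping from disease type labels to ATC letters below is a partial, assumed mapping for illustration only, and `D06BX01` is used purely as an illustrative second candidate that does not appear in this document; `select_by_disease_type` is a hypothetical helper.

```python
# Assumed, partial mapping from disease type label to ATC first-level letter.
DISEASE_TYPE_TO_ATC_GROUP = {
    "digestive system": "A",
    "skin disease": "D",
    "nervous system": "N",
    "anti-parasitic": "P",
}

def select_by_disease_type(primary_candidates, disease_type):
    """Return the candidate codes whose first letter matches the disease type's group."""
    group = DISEASE_TYPE_TO_ATC_GROUP.get(disease_type)
    if group is None:
        return []  # unknown label: nothing can be selected
    return [code for code in primary_candidates if code.startswith(group)]
```

With candidates `["A01AB17", "D06BX01"]` and the disease type "skin disease", only the code in group D is kept.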
- The primary screening candidate codes among the at least one code are determined based on the drug components, and when the primary screening candidate codes contain multiple codes, the ATC code of the drug is selected from them based on the drug indications. This solves the problem of one drug having multiple ATC codes and ensures that the ATC code is determined accurately.
- the present disclosure provides an embodiment of an apparatus for determining a drug code.
- This apparatus embodiment corresponds to the method embodiment shown in FIG. 2 , and the apparatus can be specifically applied to various electronic devices.
- an embodiment of the present disclosure provides an apparatus 500 for determining a drug code.
- The apparatus 500 includes: an acquiring unit 501, an extracting unit 502, an obtaining unit 503, and a screening unit 504.
- The acquiring unit 501 may be configured to obtain the instruction text of the medicine.
- the extraction unit 502 may be configured to extract the key information of the medicine in the instruction text.
- the obtaining unit 503 may be configured to obtain at least one code related to the key drug information and components corresponding to each code based on a pre-created code inverted index.
- the screening unit 504 may be configured to screen at least one code based on the key information of the drug and the components corresponding to each code to obtain the anatomical therapeutic and chemical classification system code of the drug.
- For the specific processing of the acquiring unit 501, the extracting unit 502, the obtaining unit 503, and the screening unit 504, and the technical effects they bring about, refer to the corresponding embodiments in FIG. 2.
- The key drug information includes drug components, and the screening unit 504 includes: a detection module (not shown in the figure), a preliminary screening module (not shown in the figure), and a determination module (not shown in the figure).
- the detection module may be configured to, for each of the at least one code, detect whether the component corresponding to the code satisfies one of a plurality of rules with a priority order, and the plurality of rules are determined based on the drug components.
- the preliminary screening module may be configured to determine a preliminary screening candidate code including the code in response to determining that the component corresponding to the code satisfies one of the plurality of rules and all codes are detected.
- the determining module may be configured to, in response to detecting that the primary screening candidate coding is only one coding, determine that the primary screening candidate coding is the anatomical therapeutic and chemical classification system coding of the drug product.
- The multiple rules, ordered from high to low priority, are: 1) when the drug has two or more drug components, the components corresponding to the code include all drug components of the drug; 2) when the drug has two or more drug components, the components corresponding to the code include at least one drug component of the drug and contain the word "compound"; 3) when the drug has two or more drug components, the components corresponding to the code include at least one drug component of the drug and do not contain the word "compound"; 4) when the drug has one drug component, the components corresponding to the code include that drug component.
- The key drug information includes drug components, and the screening unit 504 includes: a matching module (not shown in the figure), a response module (not shown in the figure), and an encoding module (not shown in the figure).
- the matching module may be configured to, for each code in the at least one code, detect whether the component corresponding to the code matches the drug component.
- the response module may be configured to obtain a preliminary screening candidate code including the code in response to determining that the component corresponding to the code matches the drug component and all codes are detected.
- the encoding module may be configured to, in response to detecting that there is only one encoding of the preliminary screening candidate encoding, determine that the preliminary screening candidate encoding is the anatomical therapeutic and chemical classification system encoding of the drug product.
- The key drug information further includes drug indications, and the screening unit 504 includes: a classification module (not shown in the figure) and a confirmation module (not shown in the figure).
- The classification module may be configured to, in response to detecting that the primary screening candidate codes contain multiple codes, determine the disease type corresponding to the drug based on the drug indications.
- The confirmation module may be configured to select, from the primary screening candidate codes, the code corresponding to the disease type as the anatomical therapeutic and chemical classification system code of the drug.
- the above-mentioned classification module is further configured to use the pre-trained classification model to classify the indications, and obtain the disease types output by the classification model.
- First, the acquiring unit 501 obtains the instruction text of the drug; next, the extracting unit 502 extracts the key drug information in the instruction text; then, the obtaining unit 503 obtains, based on the pre-created code inverted index, at least one code related to the key drug information and the components corresponding to each code; finally, the screening unit 504 screens the at least one code based on the key drug information and the components corresponding to each code to obtain the anatomical therapeutic and chemical classification system code of the drug.
- ATC coding can be automatically performed on drugs through the pre-created coding inverted index according to the instruction text of the drug, which solves the problems faced by the majority of pharmacists in their work, and provides basic coding information for the medical information system.
- FIG. 6 shows a schematic structural diagram of an electronic device 600 suitable for implementing embodiments of the present disclosure.
- The electronic device 600 may include a processing device (e.g., a central processing unit, a graphics processor, etc.) 601 that can perform various appropriate actions and processes according to a program stored in a read-only memory (ROM) 602 or a program loaded from a storage device 608 into a random access memory (RAM) 603. The RAM 603 also stores various programs and data required for the operation of the electronic device 600.
- the processing device 601, the ROM 602, and the RAM 603 are connected to each other through a bus 604.
- An input/output (I/O) interface 605 is also connected to bus 604 .
- The following devices can be connected to the I/O interface 605: input devices 606 including, for example, a touch screen, touchpad, keyboard, mouse, etc.; output devices 607 including, for example, a liquid crystal display (LCD), speakers, vibrators, etc.; storage devices 608 including, for example, magnetic tapes, hard disks, etc.; and communication devices 609.
- The communication device 609 may allow the electronic device 600 to communicate wirelessly or by wire with other devices to exchange data. While FIG. 6 shows the electronic device 600 having various devices, it should be understood that not all of the illustrated devices are required to be implemented or provided. More or fewer devices may alternatively be implemented or provided. Each block shown in FIG. 6 may represent one device, or may represent multiple devices as required.
- embodiments of the present disclosure include a computer program product comprising a computer program carried on a computer-readable medium, the computer program containing program code for performing the method illustrated in the flowchart.
- the computer program may be downloaded and installed from the network via the communication device 609 , or from the storage device 608 , or from the ROM 602 .
- When the computer program is executed by the processing device 601, the above-described functions defined in the methods of the embodiments of the present disclosure are performed.
- the computer-readable medium of the embodiments of the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two.
- the computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above. More specific examples of computer readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable Programmable read only memory (EPROM or flash memory), fiber optics, portable compact disk read only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the foregoing.
- a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
- a computer-readable signal medium may include a data signal in baseband or propagated as part of a carrier wave, carrying computer-readable program code therein. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
- a computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device .
- the program code contained on the computer-readable medium can be transmitted by any suitable medium, including but not limited to: electric wire, optical cable, RF (Radio Frequency, radio frequency), etc., or any suitable combination of the above.
- the above-mentioned computer-readable medium may be included in the above-mentioned server; or may exist alone without being assembled into the server.
- the above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the server, the server can: obtain the instruction text of the drug; extract the key information of the drug in the instruction text; based on the pre-created code Inverted index to obtain at least one code related to the key information of the drug and components corresponding to each code; based on the key information of the drug and the components corresponding to each code, screen at least one code to obtain the anatomical therapeutic and chemical classification of the drug System code.
- Computer program code for carrying out operations of embodiments of the present disclosure may be written in one or more programming languages, including object-oriented programming languages such as Java, Smalltalk, and C++, as well as conventional procedural programming languages such as the "C" language or similar programming languages.
- the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server.
- The remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (e.g., through the Internet using an Internet service provider).
- Each block in the flowchart or block diagrams may represent a module, program segment, or portion of code that contains one or more executable instructions for implementing the specified logical functions.
- The functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
- Each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented in dedicated hardware-based systems that perform the specified functions or operations, or can be implemented in a combination of dedicated hardware and computer instructions.
- the units involved in the embodiments of the present disclosure may be implemented in software or hardware.
- The described units can also be provided in a processor, which can be described, for example, as: a processor including an acquiring unit, an extracting unit, an obtaining unit, and a screening unit.
- The names of these units do not, under certain circumstances, constitute a limitation on the units themselves; for example, the acquiring unit may also be described as a unit "configured to obtain the instruction text of the medicine".
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Health & Medical Sciences (AREA)
- Library & Information Science (AREA)
- Toxicology (AREA)
- Pharmacology & Pharmacy (AREA)
- Public Health (AREA)
- Primary Health Care (AREA)
- Chemical & Material Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Medicinal Chemistry (AREA)
- Medical Informatics (AREA)
- Epidemiology (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Medical Treatment And Welfare Office Work (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Chinese name | English name | ATC code |
替克拉酮 | ticlatone | D01AE08 |
替克洛可 | teclozan | P01AC04 |
替利定 | tilidine | N02AX01 |
替加氟(喃氟啶) | tegafur | L01BC03 |
替加氟,复方 | tegafur,combinations | L01BC53 |
… | … | … |
Claims (12)
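- A method for determining a drug code, the method comprising: acquiring an instruction text of a drug; extracting key drug information from the instruction text; obtaining, based on a pre-created code inverted index, at least one code related to the key drug information and components corresponding to each code; and screening the at least one code based on the key drug information and the components corresponding to each code, to obtain an anatomical therapeutic and chemical classification system code of the drug.
- The method according to claim 1, wherein the key drug information comprises drug components, and the screening of the at least one code based on the key drug information and the components corresponding to each code, to obtain the anatomical therapeutic and chemical classification system code of the drug, comprises: for each code in the at least one code, detecting whether the component corresponding to the code satisfies one of a plurality of rules having a priority order, the plurality of rules being determined based on the drug components; in response to determining that the component corresponding to the code satisfies one of the plurality of rules and that all codes have been detected, determining primary screening candidate codes including the code; and in response to detecting that the primary screening candidate codes contain only one code, determining that the primary screening candidate code is the anatomical therapeutic and chemical classification system code of the drug.
- The method according to claim 2, wherein the plurality of rules, ordered from high to low priority, are: 1) when the drug has two or more drug components, the components corresponding to the code include all drug components of the drug; 2) when the drug has two or more drug components, the components corresponding to the code include at least one drug component of the drug and contain the word "compound"; 3) when the drug has two or more drug components, the components corresponding to the code include at least one drug component of the drug and do not contain the word "compound"; 4) when the drug has one drug component, the components corresponding to the code include that drug component.
- The method according to claim 1, wherein the key drug information comprises drug components, and the screening of the at least one code based on the key drug information and the components corresponding to each code, to obtain the anatomical therapeutic and chemical classification system code of the drug, comprises: for each code in the at least one code, detecting whether the component corresponding to the code matches the drug components; in response to determining that the component corresponding to the code matches the drug components and that all codes have been detected, obtaining primary screening candidate codes including the code; and in response to detecting that the primary screening candidate codes contain only one code, determining that the primary screening candidate code is the anatomical therapeutic and chemical classification system code of the drug.
- The method according to any one of claims 2-4, wherein the key drug information further comprises drug indications, and the screening of the at least one code based on the key drug information and the components corresponding to each code, to obtain the anatomical therapeutic and chemical classification system code of the drug, further comprises: in response to detecting that the primary screening candidate codes contain multiple codes, determining a disease type corresponding to the drug based on the drug indications; and selecting, from the primary screening candidate codes, the code corresponding to the disease type as the anatomical therapeutic and chemical classification system code of the drug.
- The method according to claim 5, wherein the determining of the disease type corresponding to the drug based on the drug indications comprises: classifying the indications by disease using a pre-trained classification model, to obtain the disease type output by the classification model.
- An apparatus for determining a drug code, the apparatus comprising: an acquiring unit configured to acquire an instruction text of a drug; an extracting unit configured to extract key drug information from the instruction text; an obtaining unit configured to obtain, based on a pre-created code inverted index, at least one code related to the key drug information and components corresponding to each code; and a screening unit configured to screen the at least one code based on the key drug information and the components corresponding to each code, to obtain an anatomical therapeutic and chemical classification system code of the drug.
- The apparatus according to claim 7, wherein the key drug information comprises drug components, and the screening unit comprises: a detection module configured to, for each code in the at least one code, detect whether the component corresponding to the code satisfies one of a plurality of rules having a priority order, the plurality of rules being determined based on the drug components; a preliminary screening module configured to, in response to determining that the component corresponding to the code satisfies one of the plurality of rules and that all codes have been detected, determine primary screening candidate codes including the code; and a determination module configured to, in response to detecting that the primary screening candidate codes contain only one code, determine that the primary screening candidate code is the anatomical therapeutic and chemical classification system code of the drug.
- The apparatus according to claim 8, wherein the key drug information further comprises drug indications, and the screening unit further comprises: a classification module configured to, in response to detecting that the primary screening candidate codes contain multiple codes, determine a disease type corresponding to the drug based on the drug indications; and a confirmation module configured to select, from the primary screening candidate codes, the code corresponding to the disease type as the anatomical therapeutic and chemical classification system code of the drug.
- An electronic device, comprising: one or more processors; and a storage device storing one or more programs, wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the method according to any one of claims 1-6.
- A computer-readable medium storing a computer program, wherein the program, when executed by a processor, implements the method according to any one of claims 1-6.
- A computer program product comprising a computer program, wherein the computer program, when executed by a processor, implements the method according to any one of claims 1-6.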
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2023553759A JP2023550212A (ja) | 2021-01-15 | 2021-12-15 | Method, apparatus, electronic device, and computer medium for determining drug code |
US18/272,315 US20240071630A1 (en) | 2021-01-15 | 2021-12-15 | Method and apparatus for determining drug code, electronic device, and computer medium |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110054078.1 | 2021-01-15 | ||
CN202110054078.1A CN113821649B (zh) | 2021-01-15 | 2021-01-15 | Method, apparatus, electronic device, and computer medium for determining drug code |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022151896A1 true WO2022151896A1 (zh) | 2022-07-21 |
Family
ID=78912354
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/138298 WO2022151896A1 (zh) | Method, apparatus, electronic device, and computer medium for determining drug code | 2021-01-15 | 2021-12-15 |
Country Status (4)
Country | Link |
---|---|
US (1) | US20240071630A1 (zh) |
JP (1) | JP2023550212A (zh) |
CN (1) | CN113821649B (zh) |
WO (1) | WO2022151896A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117349452A (zh) * | 2023-12-04 | 2024-01-05 | 长春中医药大学 | 一种用于中医药物检索的信息服务系统 |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116955497B (zh) * | 2023-04-07 | 2024-07-23 | 广州标点医药信息股份有限公司 | 一种中成药数据的分类方法 |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160180028A1 (en) * | 2013-09-02 | 2016-06-23 | Fujitsu Limited | Information retrieval processing device and method |
CN107480425A (zh) * | 2017-07-14 | 2017-12-15 | 广东医睦科技有限公司 | 一种基于药品编码的药品信息处理方法 |
CN107784611A (zh) * | 2017-04-11 | 2018-03-09 | 平安医疗健康管理股份有限公司 | 药品编码方法及装置 |
CN109408631A (zh) * | 2018-09-03 | 2019-03-01 | 平安医疗健康管理股份有限公司 | 药品数据处理方法、装置、计算机设备和存储介质 |
CN110827948A (zh) * | 2019-10-31 | 2020-02-21 | 北京东软望海科技有限公司 | 用药数据处理方法、装置、电子设备及可读存储介质 |
US20200320139A1 (en) * | 2019-04-04 | 2020-10-08 | Iqvia Inc. | Predictive system for generating clinical queries |
CN111933244A (zh) * | 2020-08-17 | 2020-11-13 | 医渡云(北京)技术有限公司 | 药品数据编码方法、装置、计算机可读介质及电子设备 |
2021
- 2021-01-15 CN CN202110054078.1A, patent CN113821649B (zh), status: Active
- 2021-12-15 WO PCT/CN2021/138298, patent WO2022151896A1 (zh), status: Application Filing
- 2021-12-15 US US18/272,315, patent US20240071630A1 (en), status: Pending
- 2021-12-15 JP JP2023553759A, patent JP2023550212A (ja), status: Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160180028A1 (en) * | 2013-09-02 | 2016-06-23 | Fujitsu Limited | Information retrieval processing device and method |
CN107784611A (zh) * | 2017-04-11 | 2018-03-09 | 平安医疗健康管理股份有限公司 | Drug coding method and apparatus |
CN107480425A (zh) * | 2017-07-14 | 2017-12-15 | 广东医睦科技有限公司 | Drug information processing method based on drug codes |
CN109408631A (zh) * | 2018-09-03 | 2019-03-01 | 平安医疗健康管理股份有限公司 | Drug data processing method and apparatus, computer device, and storage medium |
US20200320139A1 (en) * | 2019-04-04 | 2020-10-08 | Iqvia Inc. | Predictive system for generating clinical queries |
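CN110827948A (zh) * | 2019-10-31 | 2020-02-21 | 北京东软望海科技有限公司 | Medication data processing method and apparatus, electronic device, and readable storage medium |
CN111933244A (zh) * | 2020-08-17 | 2020-11-13 | 医渡云(北京)技术有限公司 | Drug data encoding method and apparatus, computer-readable medium, and electronic device |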
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
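CN117349452A (zh) * | 2023-12-04 | 2024-01-05 | 长春中医药大学 | Information service system for traditional Chinese medicine drug retrieval |
CN117349452B (zh) * | 2023-12-04 | 2024-02-09 | 长春中医药大学 | Information service system for traditional Chinese medicine drug retrieval |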
Also Published As
Publication number | Publication date |
---|---|
US20240071630A1 (en) | 2024-02-29 |
CN113821649B (zh) | 2022-11-08 |
JP2023550212A (ja) | 2023-11-30 |
CN113821649A (zh) | 2021-12-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10755804B2 (en) | Health information system for searching, analyzing and annotating patient data | |
US9619583B2 (en) | Predictive analysis by example | |
WO2022151896A1 (zh) | 确定药品编码的方法、装置、电子设备以及计算机介质 | |
JP2020170516A (ja) | 臨床クエリを生成するための予測システム | |
US20230245005A1 (en) | System and method for detecting drug adverse effects in social media and mobile applications data | |
Jiang et al. | Extracting and standardizing medication information in clinical text–the MedEx-UIMA system | |
WO2018200274A1 (en) | Systems and methods for extracting form information using enhanced natural language processing | |
CN114078597A (zh) | 从文本获得支持的决策树用于医疗健康应用 | |
CN111145847A (zh) | 临床试验数据的录入方法及装置、介质和电子设备 | |
JP2023514023A (ja) | 質問の検索装置、質問の検索方法、デバイス、および記憶媒体 | |
Basu et al. | Call for data standardization: lessons learned and recommendations in an imaging study | |
Alfattni et al. | Extracting drug names and associated attributes from discharge summaries: text mining study | |
CN116992839A (zh) | 病案首页自动生成方法、装置及设备 | |
Li et al. | A patient-screening tool for clinical research based on electronic health records using OpenEHR: development study | |
Zhou et al. | Complementary and Integrative Health Information in the literature: its lexicon and named entity recognition | |
CN113160914A (zh) | 在线问诊方法、装置、电子设备及存储介质 | |
TaftiAhmad | Probing patient messages enhanced by natural language processing: A top-down message corpus analysis | |
Kocabiyikoglu et al. | A spoken drug prescription dataset in French for spoken language understanding | |
Chen et al. | Characterizing the use and contents of free-text family history comments in the Electronic Health Record | |
CN111523309A (zh) | 药品信息归一化的方法、装置、存储介质及电子设备 | |
CN116913548A (zh) | 不良反应数据分析方法、装置、电子设备和存储介质 | |
Aberdeen et al. | An annotation and modeling schema for prescription regimens | |
Zeng et al. | Adapting a natural language processing tool to facilitate clinical trial curation for personalized cancer therapy | |
WO2021159054A1 (en) | Method and system for incorporating patient information | |
Lee et al. | Establishing the Automatic Identification of Clinical Trial Cohorts from Electronic Health Records by Matching Normalized Eligibility Criteria and Patient Clinical Characteristics |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | EP: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 21919085; Country of ref document: EP; Kind code of ref document: A1 |
WWE | WIPO information: entry into national phase | Ref document number: 2023553759; Country of ref document: JP |
WWE | WIPO information: entry into national phase | Ref document number: 18272315; Country of ref document: US |
NENP | Non-entry into the national phase | Ref country code: DE |
WWE | WIPO information: entry into national phase | Ref document number: 11202305186V; Country of ref document: SG |
32PN | EP: public notification in the EP bulletin as address of the addressee cannot be established | Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 24.10.2023) |
122 | EP: PCT application non-entry in European phase | Ref document number: 21919085; Country of ref document: EP; Kind code of ref document: A1 |