CN116955497A - Classification method for Chinese patent medicine data - Google Patents
Classification method for Chinese patent medicine data Download PDFInfo
- Publication number
- CN116955497A CN116955497A CN202310367188.2A CN202310367188A CN116955497A CN 116955497 A CN116955497 A CN 116955497A CN 202310367188 A CN202310367188 A CN 202310367188A CN 116955497 A CN116955497 A CN 116955497A
- Authority
- CN
- China
- Prior art keywords
- chinese patent
- data
- patent medicine
- medicine
- classification
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 239000003814 drug Substances 0.000 title claims abstract description 263
- 238000000034 method Methods 0.000 title claims abstract description 35
- 229940079593 drug Drugs 0.000 claims abstract description 108
- 238000005070 sampling Methods 0.000 claims abstract description 22
- 239000002552 dosage form Substances 0.000 claims description 13
- 238000004458 analytical method Methods 0.000 claims description 10
- 201000010099 disease Diseases 0.000 claims description 7
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 7
- 230000003321 amplification Effects 0.000 claims description 6
- 238000004519 manufacturing process Methods 0.000 claims description 6
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 6
- 238000007689 inspection Methods 0.000 claims 1
- 238000007726 management method Methods 0.000 abstract description 5
- 238000013523 data management Methods 0.000 abstract description 2
- 239000004615 ingredient Substances 0.000 abstract description 2
- 238000007405 data analysis Methods 0.000 abstract 1
- 239000000047 product Substances 0.000 description 30
- 230000008569 process Effects 0.000 description 6
- 238000010606 normalization Methods 0.000 description 5
- SEBFKMXJBCUCAI-UHFFFAOYSA-N NSC 227190 Natural products C1=C(O)C(OC)=CC(C2C(OC3=CC=C(C=C3O2)C2C(C(=O)C3=C(O)C=C(O)C=C3O2)O)CO)=C1 SEBFKMXJBCUCAI-UHFFFAOYSA-N 0.000 description 4
- RXUWDKBZZLIASQ-UHFFFAOYSA-N Puerarin Natural products OCC1OC(Oc2c(O)cc(O)c3C(=O)C(=COc23)c4ccc(O)cc4)C(O)C(O)C1O RXUWDKBZZLIASQ-UHFFFAOYSA-N 0.000 description 4
- HKEAFJYKMMKDOR-VPRICQMDSA-N puerarin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1C1=C(O)C=CC(C2=O)=C1OC=C2C1=CC=C(O)C=C1 HKEAFJYKMMKDOR-VPRICQMDSA-N 0.000 description 4
- SEBFKMXJBCUCAI-HKTJVKLFSA-N silibinin Chemical compound C1=C(O)C(OC)=CC([C@@H]2[C@H](OC3=CC=C(C=C3O2)[C@@H]2[C@H](C(=O)C3=C(O)C=C(O)C=C3O2)O)CO)=C1 SEBFKMXJBCUCAI-HKTJVKLFSA-N 0.000 description 4
- 229960004245 silymarin Drugs 0.000 description 4
- 235000017700 silymarin Nutrition 0.000 description 4
- 208000024172 Cardiovascular disease Diseases 0.000 description 3
- 235000011201 Ginkgo Nutrition 0.000 description 3
- 241000218628 Ginkgo Species 0.000 description 3
- 235000008100 Ginkgo biloba Nutrition 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 239000007924 injection Substances 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 239000003390 Chinese drug Substances 0.000 description 2
- 206010011224 Cough Diseases 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 206010062717 Increased upper airway secretion Diseases 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 208000012902 Nervous system disease Diseases 0.000 description 2
- 230000001142 anti-diarrhea Effects 0.000 description 2
- 239000002775 capsule Substances 0.000 description 2
- 208000026106 cerebrovascular disease Diseases 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000002483 medication Methods 0.000 description 2
- 230000000399 orthopedic effect Effects 0.000 description 2
- 208000026435 phlegm Diseases 0.000 description 2
- 208000023504 respiratory system disease Diseases 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 208000014001 urinary system disease Diseases 0.000 description 2
- 206010000234 Abortion spontaneous Diseases 0.000 description 1
- 208000002874 Acne Vulgaris Diseases 0.000 description 1
- 208000002177 Cataract Diseases 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 206010010774 Constipation Diseases 0.000 description 1
- 206010010904 Convulsion Diseases 0.000 description 1
- 201000004624 Dermatitis Diseases 0.000 description 1
- 208000007882 Gastritis Diseases 0.000 description 1
- 206010019233 Headaches Diseases 0.000 description 1
- 206010020850 Hyperthyroidism Diseases 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 206010026749 Mania Diseases 0.000 description 1
- 208000019695 Migraine disease Diseases 0.000 description 1
- 208000007117 Oral Ulcer Diseases 0.000 description 1
- 208000001132 Osteoporosis Diseases 0.000 description 1
- 208000003251 Pruritus Diseases 0.000 description 1
- 201000004681 Psoriasis Diseases 0.000 description 1
- 206010053615 Thermal burn Diseases 0.000 description 1
- 241000130764 Tinea Species 0.000 description 1
- 208000002474 Tinea Diseases 0.000 description 1
- 208000025865 Ulcer Diseases 0.000 description 1
- 206010000496 acne Diseases 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 230000000202 analgesic effect Effects 0.000 description 1
- 208000022531 anorexia Diseases 0.000 description 1
- 230000001430 anti-depressive effect Effects 0.000 description 1
- 230000003474 anti-emetic effect Effects 0.000 description 1
- 230000003556 anti-epileptic effect Effects 0.000 description 1
- 239000001961 anticonvulsive agent Substances 0.000 description 1
- 239000000935 antidepressant agent Substances 0.000 description 1
- 229940005513 antidepressants Drugs 0.000 description 1
- 239000002111 antiemetic agent Substances 0.000 description 1
- 229940030225 antihemorrhagics Drugs 0.000 description 1
- 208000002399 aphthous stomatitis Diseases 0.000 description 1
- 208000006673 asthma Diseases 0.000 description 1
- 208000010668 atopic eczema Diseases 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 229940124579 cold medicine Drugs 0.000 description 1
- 230000036461 convulsion Effects 0.000 description 1
- 206010061428 decreased appetite Diseases 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- -1 diabetes medication Substances 0.000 description 1
- 208000010643 digestive system disease Diseases 0.000 description 1
- 208000002173 dizziness Diseases 0.000 description 1
- 229940029980 drug used in diabetes Drugs 0.000 description 1
- 230000002526 effect on cardiovascular system Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000002496 gastric effect Effects 0.000 description 1
- 210000001035 gastrointestinal tract Anatomy 0.000 description 1
- 231100000869 headache Toxicity 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 208000014617 hemorrhoid Diseases 0.000 description 1
- 239000002874 hemostatic agent Substances 0.000 description 1
- 208000006454 hepatitis Diseases 0.000 description 1
- 231100000283 hepatitis Toxicity 0.000 description 1
- 206010020718 hyperplasia Diseases 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 208000014674 injury Diseases 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 210000005075 mammary gland Anatomy 0.000 description 1
- 206010027599 migraine Diseases 0.000 description 1
- 208000015994 miscarriage Diseases 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 201000009240 nasopharyngitis Diseases 0.000 description 1
- 210000005036 nerve Anatomy 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 230000035764 nutrition Effects 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 206010039083 rhinitis Diseases 0.000 description 1
- 210000000697 sensory organ Anatomy 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 210000002027 skeletal muscle Anatomy 0.000 description 1
- 208000000995 spontaneous abortion Diseases 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 238000005728 strengthening Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000008736 traumatic injury Effects 0.000 description 1
- 231100000397 ulcer Toxicity 0.000 description 1
- 208000019206 urinary tract infection Diseases 0.000 description 1
- 238000002759 z-score normalization Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H70/00—ICT specially adapted for the handling or processing of medical references
- G16H70/40—ICT specially adapted for the handling or processing of medical references relating to drugs, e.g. their side effects or intended usage
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Pharmacology & Pharmacy (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Chemical & Material Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Toxicology (AREA)
- Epidemiology (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Primary Health Care (AREA)
- Public Health (AREA)
- Medical Treatment And Welfare Office Work (AREA)
- Medical Preparation Storing Or Oral Administration Devices (AREA)
Abstract
The application provides a classification method of Chinese patent medicine data, and belongs to the field of medical data management. The method comprises the following steps: setting classification standards and types of Chinese patent medicines; according to metadata of a Chinese patent medicine database, a data table for data analysis and field types in the data table are obtained; extracting sampling data in the data table; and analyzing the sampled data and matching product classification codes. Wherein, the Chinese patent medicine classification standard and the Chinese patent medicine type are set specifically comprising: formulating the classification standard of the Chinese patent medicine; dividing the Chinese patent medicines into a plurality of major classes and a plurality of subclasses according to the Chinese patent medicine classification standard; setting codes corresponding to the major classes and the sub-classes, and generating product classification codes. The application establishes the classification standard of the Chinese patent medicine by analyzing the indications, main functions, ingredients and clinical application conditions of the Chinese patent medicine, and guides the management and the use of the Chinese patent medicine by combining the classification codes of the products.
Description
Technical Field
The application relates to the field of medical data management, in particular to a classification method of Chinese patent medicine data.
Background
In the prior art, medicines are often classified according to different standards such as components, purposes, dosage forms or pharmacological actions, different classification standards can be selected according to different requirements, but the management of Chinese patent medicines is disordered due to the different classification standards, and medicine information except the classification standards cannot be completely reflected.
The international ATC classification standard is a standard classified according to the structure and function of a drug, and is formulated by the world health organization. ATC classifies medications into 5 classes, separated by letters and numbers. ATC is a hierarchical classification of drug systems. The medicine ATC code is divided into 5 layers, which are respectively called a major class code, a sub class code, a primary sub class code, a secondary sub class code and a name code. Wherein the major class codes are classified according to an anatomical classification method, the sub-class codes and the primary sub-class codes are classified according to a therapeutic classification method, and the secondary sub-class codes are classified according to a chemical and therapeutic mixed classification method. However, due to the variety of Chinese patent medicines, the problem of inconvenient classification of medicines exists in a plurality of Chinese patent medicines due to the variety of the Chinese patent medicines, and the ATC classification method cannot accurately classify the Chinese patent medicines.
Disclosure of Invention
The application aims to solve the problem of inconvenient classification of the Chinese patent medicines by combining the characteristics of the Chinese patent medicines and the clinical application conditions. The aim of the application is achieved by the following technical scheme:
a classification method of Chinese patent medicine data comprises the following steps:
setting classification standards and types of Chinese patent medicines;
acquiring a data table in a database of the Chinese patent medicine;
extracting part of field data in the data table as sampling data;
analyzing the sampling data and matching product classification codes;
wherein, the set Chinese patent medicine classification standard and Chinese patent medicine type specifically comprise: formulating the classification standard of the Chinese patent medicine; dividing the Chinese patent medicines into a plurality of major classes and a plurality of subclasses according to the Chinese patent medicine classification standard; setting codes corresponding to the major classes and the sub-classes, and generating the product classification codes;
wherein, the classification standard of the Chinese patent medicine is as follows: drugs of a single indication are categorized according to the indication; the Chinese patent medicines with single function indications of multiple indications are classified according to the function indications; the Chinese patent medicines with multiple indications and multiple functions are classified according to the medicines of other dosage forms with the same Chinese patent medicine variety; the multi-indication multifunctional main-treatment Chinese patent medicines which cannot be classified by referring to medicines of other dosage forms with the same Chinese patent medicine variety are classified according to the clinical application of manufacturing enterprises.
Further, the number of the major classes is 13, and the number of the sub-classes is 80.
Further, the product classification code is divided into five layers; the first layer code is a product property classification code, and the value of the first layer code is fixed as Z to represent the Chinese patent medicine; the second layer code represents a first class code of the product, and the value range of the first class code is A-V; the third layer code represents a product secondary classification code, and the value range is 0-99; the third class code of the representative product of the fourth layer code has a value range of A-Z or 0 and is 0 when empty; the representative product four-level classification code of the fifth layer code has a value range of 0-9 and is 0 when empty.
Further, the extracting the sampling data in the data table includes:
setting a definition range of the Chinese patent medicine;
acquiring all data of the data table;
and extracting data in the data table according to the definition range of the Chinese patent medicine to serve as the sampling data.
Further, the definition range of the Chinese patent medicine comprises:
all medicines of national approved 'national medicine standard Z';
chinese medicine in imported medicines and medicines similar to Chinese patent medicine in other countries or regions of Asia;
belongs to the Chinese medicine standard character H and other dosage forms with the same components belong to Chinese patent medicines;
belongs to the Chinese medicine standard H, and the main component and a plurality of Chinese patent medicines in China are medicines for treating the same disease clinically;
a medicine containing various uncertain effective components is extracted from natural plants.
Further, the analyzing the sampled data and matching product classification codes specifically includes:
performing data processing on the sampling data;
analyzing the sampling data by referring to the Chinese patent medicine classification standard to obtain an analysis result;
and matching product classification codes according to the analysis result.
Further, the data processing includes data specification processing, experience value processing, data amplification processing, and data integrity check.
Further, before the step of setting the classification standard and the type of the Chinese patent medicine, the method further comprises the steps of: the Chinese patent medicine data are arranged into a data table and stored into a Chinese patent medicine database, and metadata of the Chinese patent medicine database are generated at the same time.
Further, the metadata includes a table name of each data table and description information of data, and a field name, a field type, and data length accuracy limit information of each field in each data table.
Further, the data table in the Chinese patent medicine database is obtained, specifically, the data table in the Chinese patent medicine database is obtained according to the metadata.
Compared with the prior art, the application has the following beneficial effects:
according to the application, the Chinese patent medicine classification standard is formulated by analyzing the indications and main functions of the Chinese patent medicine and combining the component characteristics and clinical application conditions of the Chinese patent medicine, and the main treatment diseases and other information of the Chinese patent medicine are intuitively reflected through the product classification codes, so that the management and the use of the Chinese patent medicine are better guided;
the application is convenient for the management staff to classify and manage the Chinese patent medicine data by establishing the Chinese patent medicine database of the Chinese patent medicine, and carries out data processing on the sampling data in the classifying process, thereby improving the quality and the reliability of the data;
the application solves the problem of data omission caused by fuzzy definition of part of Chinese patent medicines in the classification process by accurately dividing the definition range of the Chinese patent medicines by combining approval paper numbers, clinical application and medicine components and extracting sampling data according to the definition range.
Drawings
In order to more clearly illustrate the embodiments of the application or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the application, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flow chart of a classification method of Chinese patent medicine data according to an embodiment of the application;
FIG. 2 is a flow chart of the method for setting classification criteria and types of Chinese patent medicines according to the embodiment of the application;
fig. 3 is a schematic structural diagram of a product code according to an embodiment of the present application.
Detailed Description
The technical solutions of the present application will be clearly and completely described in connection with the embodiments, and it is obvious that the described embodiments are only some embodiments of the present application, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
Referring to fig. 1, the method for classifying data of Chinese patent medicine provided by the embodiment of the application includes the following steps:
s1, setting classification standards and types of Chinese patent medicines;
s2, acquiring a data table in a database of the Chinese patent medicine;
s3, extracting part of field data in the data table as sampling data;
and S4, analyzing the sampling data and matching product classification codes.
In the application, the management and the use of the Chinese patent medicine are guided by the Chinese patent medicine classification standard and the product classification code, and the information of the main cure diseases and the like of the Chinese patent medicine can be intuitively reflected by the product classification code.
Specifically, referring to fig. 2, step S1 includes:
and S11, setting the classification standard of the Chinese patent medicine.
In the process of formulating the classification standard of the Chinese patent medicine, the application classifies the Chinese patent medicine preferentially through the indication of the medicine; secondly, classifying by the function and the main indications of the medicines when the medicines are medicines with different single indications; when the medicine has multiple indications and multiple functions, the medicine with other dosage forms is referred to for classification or classification according to clinical application of a manufacturing enterprise; and classifying the medicines according to clinical application of the production enterprises when the classification standards are not satisfied.
Accordingly, the classification standard of the Chinese patent medicine formulated in the embodiment specifically comprises: drugs of a single indication are categorized according to the indication; the Chinese patent medicines with single function indications of multiple indications are classified according to the function indications; the Chinese patent medicines with multiple indications and multiple functions are classified according to the medicines of other dosage forms with the same Chinese patent medicine variety; the multi-indication multifunctional main-treatment Chinese patent medicines which cannot be classified by referring to medicines of other dosage forms with the same Chinese patent medicine variety are classified according to the clinical application of manufacturing enterprises.
S12, dividing the Chinese patent medicines into 13 major classes and 80 subclasses according to the Chinese patent medicine classification standard; among them, 13 major classes include: digestive system diseases, qi and blood tonifying, cardiovascular and cerebrovascular diseases, dermatology, pediatrics, gynecology, urinary system diseases, five sense organs, tumor diseases, skeletal muscle system diseases, nervous system diseases, respiratory diseases and others.
The 80 subclasses include: hemorrhoid medication, gastric medication (gastritis, ulcer), intestinal medication, hyperthyroidism medication, weight-reducing medication, hepatitis medication, biliary medication, stomach-invigorating and digestion-promoting medication, antidiarrheal medication, constipation medication, diabetes medication, antiemetic medication, other medication for digestive tract diseases, spontaneous perspiration medication, qi-supplementing medication, blood-supplementing medication, qi-supplementing and blood-supplementing medication, lipid-lowering medication, cerebrovascular disease medication, cardiovascular disease medication, other medication for cardiovascular disease, burn and scald medication, other medication for dermatology, medication for dermatologic infectious disease, skin itch medication, acne medication, tinea medication, eczema medication, psoriasis medication, pediatric convulsion medication, other medication for pediatric use, anorexia medication, pediatric cold medication, pediatric antidiarrheal medication, pediatric supplement nutrition medication, pediatric cough-relieving and phlegm medication, gynecological medication for gynecological medication, other medication for gynecological medication gynecological inflammation medicine, gynecological miscarriage preventing medicine, female climacteric medicine, hyperplasia of mammary glands medicine, calculus medicine, urinary tract infection medicine, waist strengthening and kidney invigorating medicine, prostate medicine, other medicine for urinary disease, throat medicine, oral ulcer medicine, dental medicine, cataract medicine, ophthalmic other medicine, rhinitis medicine, otology medicine, nasal other medicine, laryngeal other medicine, tumor disease type other medicine, osteoporosis medicine, rheumatalgia medicine, traumatic injury medicine, orthopedic other medicine, orthopedics analgesic medicine, nervous system disease other medicine, headache medicine (migraine), shock deficiency and summer heat relieving medicine, nerve soothing and brain nourishing medicine, antiepileptic medicine, dizziness medicine, antidepressant, mania dryness medicine, common cold medicine, heat clearing and detoxicating medicine, cough relieving and phlegm eliminating and asthma relieving medicine, respiratory disease other medicine, other medications and hemostatic agents.
S13, setting codes corresponding to the major classes and the sub-classes, and generating product classification codes;
specifically, referring to fig. 3, step S13 further includes generating a product information code, a manufacturer information code, and a package number code, where the product classification code, the product information code, the manufacturer information code, and the package number code are connected by a symbol "-" to form a product code. The product classification code is 6-bit characters, the product information code is 21-bit characters, the manufacturer information code is 11-bit characters, and the packing number code is 6-bit characters.
The encoding structure of the product classification codes is shown in table 1.
TABLE 1
In this embodiment, the subclasses are divided into one or more subclasses, and the subclasses are divided into one or more minimum classes.
It can be seen that the product classification code is divided into five layers. The first layer code is a product property classification code, and the value of the first layer code is fixed as Z to represent the Chinese patent medicine; the first class classification code of the representative product of the second layer code corresponds to the major class, and the value range of the first class classification code is A-V; the secondary classification code of the representative product of the third layer code corresponds to the subclass and has a value range of 0-99; the third class code of the representative product of the fourth layer code corresponds to the subclass, and the value range is A-Z or 0, and the blank time is 0; the representative product four-level classification code of the fifth layer code corresponds to the minimum class, and the value range is 0-9, and the blank time is 0.
In one embodiment, the codes corresponding to each of the major and minor classes are shown in table 2.
TABLE 2
Further, the step S3 specifically includes: step S31, setting a definition range of the Chinese patent medicine; step S32, acquiring all data of the data table; and step S33, extracting data in the data table according to the definition range of the Chinese patent medicine as the sampling data.
Wherein, the definition range of the Chinese patent medicine comprises five types, wherein,
the first type is that all medicines with the approval clerical number of the national medicine standard Z.
The second type is that of the imported drugs, the chinese drugs in the asian countries or regions are similar to those of the traditional Chinese drugs belonging to the first type.
The third type is that the approval document is the national drug standard H and other dosage forms with the same component belong to the first type of drugs. For example, if the approval document of puerarin injection is national standard character H and the approval document of puerarin tablet belonging to other dosage forms of puerarin injection is national standard character Z, the puerarin injection meets the definition range of the third type; in addition, the approved document of the imported ginkgo leaf preparation is national standard character H, and the approved document of the domestic ginkgo leaf preparation is national standard character Z, so that the imported ginkgo leaf preparation also accords with the definition range of the third type.
The fourth type is a medicine with approval document number of national medicine standard H and main component and domestic Chinese patent medicines for treating the same disease clinically. For example, the approval document of silymarin capsule is national medicine standard H, the main component thereof is silymarin, and the approval document of Yiganling tablet containing 77mg silymarin is national medicine standard Z, then silymarin capsule conforms to the definition range of the fourth type.
The fifth type is to extract medicines containing various uncertain effective ingredients from natural plants, such as galutes extract tablet.
The application solves the problem of data omission caused by fuzzy definition of part of Chinese patent medicines in the classification process by accurately dividing the definition range of the Chinese patent medicines by combining approval paper numbers, clinical application and medicine components and extracting sampling data according to the definition range.
Further, the step S4 specifically includes:
and step S41, performing data processing on the sampling data. The data processing comprises data specification processing, experience value processing, data amplification processing and data overall checking. Wherein, the data normalization process is to convert the data into a certain standard form so as to facilitate the subsequent processing and analysis. Common data normalization methods include min-max normalization, z-score normalization, decimal scale normalization, and the like. The data normalization processing can eliminate the dimension influence in the data, so that different indexes have comparability, and the accuracy and the interpretability of the data are improved. Empirical value processing refers to processing and modifying data using expert experience or domain knowledge. Experience value processing is commonly used for data missing and outlier processing, missing data can be filled in by means of average value, median, mode and the like, and outliers can be identified and removed by means of box diagram and the like. The data amplification processing means that the sample capacity is increased by transforming and expanding the original data, thereby improving the reliability and generalization capability of the data. Common data amplification methods include upsampling, downsampling, interpolation, balanced sampling, and the like. The data amplification treatment can effectively reduce the risk of overfitting and improve the generalization capability of the model. By the data processing means, the quality and the reliability of data can be improved, so that the effects of subsequent analysis and modeling are improved.
S42, analyzing the sampling data by referring to the Chinese patent medicine classification standard to obtain an analysis result; wherein, the analysis result at least comprises indication, main function, main component, approval document, specification, dosage form, production enterprises and clinical application information of the Chinese patent medicine. In addition, the analysis results include the drug common name, administration route, unit price, quantity, amount, amplified quantity, and amplified amount.
And step S43, matching product classification codes according to the analysis result.
Further, before step S1, the classification method of the present application further includes: the Chinese patent medicine data are arranged into a data table and stored into a Chinese patent medicine database, and metadata of the Chinese patent medicine database are generated at the same time. Specifically, in the process of arranging the data of the Chinese patent medicine into a data table, a table is firstly generated, and the table is provided with a unified table head, hospital codes and the fid of the Chinese patent medicine, wherein the fid represents the unique code of each medicine; and then checking total data, including checking whether the count data is complete, whether the count form is consistent with the past and whether the count number is similar to the past, wherein the count data comprises the following fields: medicine name, manufacturer, specification, quantity, and stock.
Further, in order to facilitate statistics and classification of data by the chinese patent drug database operator, metadata of the chinese patent drug database includes a table name of each data table and description information of data, and a field name, a field type, and data length accuracy limitation information of each field in each data table. According to the method, metadata can provide comprehensive description and description of data and objects in the Chinese patent medicine database, and convenience is brought to Chinese patent medicine database administrators and developers to manage and maintain the Chinese patent medicine database. Further, step S2 is specifically to obtain a data table in a database of the Chinese patent medicine according to the metadata. By utilizing the metadata, the data and the objects in the Chinese patent medicine database can be quickly searched, the structure and the attribute of the data and the objects and the relation among the data and the objects can be known, and therefore the data and the objects in the Chinese patent medicine database can be better understood and used.
The foregoing is merely illustrative and explanatory of the application, as it is well within the scope of the application as claimed, as it relates to various modifications, additions and substitutions for those skilled in the art, without departing from the inventive concept and without departing from the scope of the application as defined in the accompanying claims.
Claims (10)
1. A classification method of Chinese patent medicine data is characterized in that: the method comprises the following steps:
setting classification standards and types of Chinese patent medicines;
acquiring a data table in a database of the Chinese patent medicine;
extracting part of field data in the data table as sampling data;
analyzing the sampling data and matching product classification codes;
wherein, the set Chinese patent medicine classification standard and Chinese patent medicine type specifically comprise: formulating the classification standard of the Chinese patent medicine; dividing the Chinese patent medicines into a plurality of major classes and a plurality of subclasses according to the Chinese patent medicine classification standard; setting codes corresponding to the major classes and the sub-classes, and generating the product classification codes;
wherein, the classification standard of the Chinese patent medicine is as follows: drugs of a single indication are categorized according to the indication; the Chinese patent medicines with single function indications of multiple indications are classified according to the function indications; the Chinese patent medicines with multiple indications and multiple functions are classified according to the medicines of other dosage forms with the same Chinese patent medicine variety; the multi-indication multifunctional main-treatment Chinese patent medicines which cannot be classified by referring to medicines of other dosage forms with the same Chinese patent medicine variety are classified according to the clinical application of manufacturing enterprises.
2. The method for classifying data of a chinese patent medicine according to claim 1, wherein: the number of the major classes is 13, and the number of the sub-classes is 80.
3. The method for classifying data of a chinese patent medicine according to claim 1, wherein: the product classification codes are divided into five layers; the first layer code is a product property classification code, and the value of the first layer code is fixed as Z to represent the Chinese patent medicine; the second layer code represents a first class code of the product, and the value range of the first class code is A-V; the third layer code represents a product secondary classification code, and the value range is 0-99; the third class code of the representative product of the fourth layer code has a value range of A-Z or 0 and is 0 when empty; the representative product four-level classification code of the fifth layer code has a value range of 0-9 and is 0 when empty.
4. The method for classifying data of a chinese patent medicine according to claim 1, wherein: the extracting the sampling data in the data table comprises the following steps:
setting a definition range of the Chinese patent medicine;
acquiring all data of the data table;
and extracting data in the data table according to the definition range of the Chinese patent medicine to serve as the sampling data.
5. The method for classifying data according to claim 4, wherein: the definition range of the Chinese patent medicine comprises:
all medicines of national approved 'national medicine standard Z';
chinese medicine in imported medicines and medicines similar to Chinese patent medicine in other countries or regions of Asia;
belongs to the Chinese medicine standard character H and other dosage forms with the same components belong to Chinese patent medicines;
belongs to the Chinese medicine standard H, and the main component and a plurality of Chinese patent medicines in China are medicines for treating the same disease clinically;
a medicine containing various uncertain effective components is extracted from natural plants.
6. The method for classifying data of a chinese patent medicine according to claim 1, wherein: the analyzing the sampled data and matching product classification codes specifically comprises the following steps:
performing data processing on the sampling data;
analyzing the sampling data by referring to the Chinese patent medicine classification standard to obtain an analysis result;
and matching product classification codes according to the analysis result.
7. The method for classifying data according to claim 6, wherein: the data processing includes data specification processing, experience value processing, data amplification processing and data overall inspection.
8. The method for classifying data of a chinese patent medicine according to claim 1, wherein: before the classification standard and the type of the Chinese patent medicine are set, the method further comprises the following steps: the Chinese patent medicine data are arranged into a data table and stored into a Chinese patent medicine database, and metadata of the Chinese patent medicine database are generated at the same time.
9. The method for classifying data according to claim 8, wherein: the metadata includes a table name of each data table and description information of data, and a field name, a field type, and data length accuracy limit information of each field in each data table.
10. The method for classifying data according to claim 8, wherein: the data table in the Chinese patent medicine database is obtained, and specifically, the data table in the Chinese patent medicine database is obtained according to the metadata.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310367188.2A CN116955497A (en) | 2023-04-07 | 2023-04-07 | Classification method for Chinese patent medicine data |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310367188.2A CN116955497A (en) | 2023-04-07 | 2023-04-07 | Classification method for Chinese patent medicine data |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116955497A true CN116955497A (en) | 2023-10-27 |
Family
ID=88460759
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310367188.2A Pending CN116955497A (en) | 2023-04-07 | 2023-04-07 | Classification method for Chinese patent medicine data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116955497A (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107784058A (en) * | 2017-04-11 | 2018-03-09 | 平安医疗健康管理股份有限公司 | Drug data processing method and processing device |
CN109408631A (en) * | 2018-09-03 | 2019-03-01 | 平安医疗健康管理股份有限公司 | Drug data processing method, device, computer equipment and storage medium |
CN111475686A (en) * | 2020-03-17 | 2020-07-31 | 平安科技(深圳)有限公司 | Medicine classification method and device, storage medium and intelligent equipment |
CN113821649A (en) * | 2021-01-15 | 2021-12-21 | 北京沃东天骏信息技术有限公司 | Method, device, electronic equipment and computer medium for determining medicine code |
CN113821641A (en) * | 2021-09-29 | 2021-12-21 | 平安医疗健康管理股份有限公司 | Method, device, equipment and storage medium for medicine classification based on weight distribution |
KR102350145B1 (en) * | 2021-07-14 | 2022-01-12 | 주식회사 원스글로벌 | Method and Apparatus for Matching Medicine Code |
-
2023
- 2023-04-07 CN CN202310367188.2A patent/CN116955497A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107784058A (en) * | 2017-04-11 | 2018-03-09 | 平安医疗健康管理股份有限公司 | Drug data processing method and processing device |
CN109408631A (en) * | 2018-09-03 | 2019-03-01 | 平安医疗健康管理股份有限公司 | Drug data processing method, device, computer equipment and storage medium |
CN111475686A (en) * | 2020-03-17 | 2020-07-31 | 平安科技(深圳)有限公司 | Medicine classification method and device, storage medium and intelligent equipment |
CN113821649A (en) * | 2021-01-15 | 2021-12-21 | 北京沃东天骏信息技术有限公司 | Method, device, electronic equipment and computer medium for determining medicine code |
KR102350145B1 (en) * | 2021-07-14 | 2022-01-12 | 주식회사 원스글로벌 | Method and Apparatus for Matching Medicine Code |
CN113821641A (en) * | 2021-09-29 | 2021-12-21 | 平安医疗健康管理股份有限公司 | Method, device, equipment and storage medium for medicine classification based on weight distribution |
Non-Patent Citations (2)
Title |
---|
江雅琴;: "中成药分类及探究现状", 中国实用医药, no. 33 * |
陈敏等: "湖北省药品分类与编码标准体系研究", 中国卫生信息管理杂志, no. 5, pages 2 - 4 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Camargo Jr et al. | Continuous versus intermittent beta‐agonists for acute asthma | |
CN108447536A (en) | Medication information processing method based on genius morbi and disease degree information | |
CN107480425A (en) | A kind of medicine information processing method based on medicine coding | |
Westfall | Herbal healing in pregnancy: women's experiences | |
CN106446527A (en) | Drug use information identifier processing method | |
CN108766518A (en) | Analysis method between a kind of the physiological pathology state and drug of knowledge based library rule | |
CN110223784A (en) | Clinical test patient's matching process | |
Liu et al. | Current situation of International Organization for Standardization/Technical Committee 249 international standards of traditional Chinese medicine | |
CN105373911A (en) | Traditional Chinese medicine prescription dispensing sequence control system | |
CN111883263A (en) | Method, device, equipment and storage medium for assisting in judging drug effect of traditional Chinese medicine prescription | |
Xiong et al. | Design and evaluation of a prescription drug monitoring program for Chinese patent medicine based on knowledge graph | |
CN116955497A (en) | Classification method for Chinese patent medicine data | |
CN109817300B (en) | Medicine-taking rule generation method based on artificial intelligence | |
Totelin | Hippocratic corpus | |
CN107169278A (en) | A kind of data administering method and medical information system | |
Kang et al. | Appropriateness of ICNP in Korean home care nursing | |
CN115017326B (en) | Medical coding method and device | |
CN114093455A (en) | Internet hospital rational medication engine construction method based on artificial intelligence | |
CN110245272A (en) | A kind of blood disease case database generation system | |
CN114417014A (en) | Basic knowledge system construction method taking human as center | |
Chirila et al. | Named Entity Recognition and Classification for Medical Prospectuses. | |
EP4226383A1 (en) | A system and a way to automatically monitor clinical trials - virtual monitor (vm) and a way to record medical history | |
Zhang et al. | Research on TCM Diabetes Assisted Diagnosis and Treatment Plan Integrating Association Mining and Quantitative Calculation | |
Zhu et al. | TCDO: A community-based ontology for integrative representation and analysis of traditional Chinese drugs and their properties | |
Yan et al. | Design of knowledge graph of traditional Chinese medicine prescription and knowledge analysis of implicit relationship |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |