WO2019136807A1 - Medical data relationship image acquisition method and apparatus, terminal device and storage medium - Google Patents

Medical data relationship image acquisition method and apparatus, terminal device and storage medium Download PDF

Info

Publication number
WO2019136807A1
WO2019136807A1 PCT/CN2018/077476 CN2018077476W WO2019136807A1 WO 2019136807 A1 WO2019136807 A1 WO 2019136807A1 CN 2018077476 W CN2018077476 W CN 2018077476W WO 2019136807 A1 WO2019136807 A1 WO 2019136807A1
Authority
WO
WIPO (PCT)
Prior art keywords
medical data
target
chart
degree
data relationship
Prior art date
Application number
PCT/CN2018/077476
Other languages
French (fr)
Chinese (zh)
Inventor
鲁宁
薛振坤
梅健健
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2019136807A1 publication Critical patent/WO2019136807A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Definitions

  • the present application relates to the field of data processing, and in particular, to a method, an apparatus, a terminal device, and a storage medium for acquiring a medical data relationship image.
  • the identification of the current disease is usually determined by the doctor according to the patient's symptoms, depends on the doctor's professional and experience, and can not enable the user to know his or her health according to his or her symptoms.
  • the medical data such as symptoms, diseases and treatments provided on the medical website are displayed independently, and the relationship between the various medical data (ie, the degree of relationship) is not provided, so that the user cannot base the medical data according to the medical data. Keep abreast of your health status.
  • the embodiment of the present application provides a medical data relationship image acquisition method, device, terminal device and storage medium to solve the problem of traditional medical data independent display.
  • an embodiment of the present application provides a method for acquiring a medical data relationship image, including:
  • the medical data relationship degree is graphically converted by using an E-charts tool to obtain a medical data relationship image.
  • the embodiment of the present application provides a medical data relationship image acquiring apparatus, including:
  • a target medical data acquisition module configured to acquire at least one target medical data
  • a medical data relationship obtaining module configured to perform correlation analysis on the target medical data by using an Apriori algorithm, and obtain a medical data relationship degree
  • the medical data relation image obtaining module is configured to perform a graph conversion on the medical data relationship degree by using an E-charts tool to obtain a medical data relationship image.
  • an embodiment of the present application provides a terminal device, including a memory, a processor, and computer readable instructions stored in the memory and executable on the processor, where the processor executes the computer The following steps are implemented when reading the instruction:
  • the medical data relationship degree is graphically converted by using an E-charts tool to obtain a medical data relationship image.
  • an embodiment of the present application provides a computer readable medium storing computer readable instructions, where the computer readable instructions are executed by a processor to implement the following steps:
  • the medical data relationship degree is graphically converted by using an E-charts tool to obtain a medical data relationship image.
  • the medical data relation image acquisition method, device, terminal device and storage medium provided by the embodiments of the present application, at least one target medical data is first acquired, and then the Apriori algorithm is used to perform correlation analysis on the target medical data, and the medical data relationship degree is obtained. To help users judge the probability of illness and prevent it in time. Finally, the E-charts tool is used to graphically convert the relationship between medical data and obtain medical data relationship images so that users can visually see the relationship between disease characteristics and symptom characteristics, which helps users to understand their possible diseases according to their own symptoms. The condition.
  • Embodiment 1 is a flowchart of a medical data relationship image acquisition method provided in Embodiment 1.
  • FIG. 2 is a specific schematic diagram of step S10 of FIG. 1.
  • FIG. 3 is a specific schematic diagram of step S12 of FIG. 2.
  • FIG. 4 is a specific schematic diagram of step S20 of FIG. 1.
  • FIG. 5 is a specific schematic diagram of step S30 of FIG. 1.
  • FIG. 6 is a schematic block diagram of a medical data relation image acquiring device provided in Embodiment 2.
  • FIG. 7 is a schematic diagram of a terminal device provided in Embodiment 4.
  • FIG. 1 is a flow chart showing a method of acquiring a medical data relationship image in the present embodiment.
  • the medical data relation image acquisition method can quickly collect a large amount of medical data from the network to perform relationship analysis based on the collected medical data.
  • the medical data relationship image acquisition method can be specifically applied to the medical knowledge base management system, which is used for analyzing the relationship characteristics of the symptoms input by the user, and recommending a list of suspected disease characteristics for the user, which can effectively assist the user to understand. Self-health status.
  • the medical data relationship image acquisition method includes the following steps:
  • S10 Acquire at least one target medical data.
  • the target medical data is data for performing model training.
  • the target medical data includes, but is not limited to, medical data such as symptom characteristics, disease characteristics, and the like in the present embodiment.
  • the target medical data is acquired for subsequent model training.
  • S20 The Apriori algorithm is used to analyze the relevance of the target medical data to obtain the relationship of the medical data.
  • Apriori algorithm is a representative algorithm for association rule mining.
  • Apriori algorithm is widely used in various industries, with less calculation, easy to understand, and can effectively mine the potential rules between data.
  • the medical data relationship is used to reflect the degree of association between disease characteristics and symptom characteristics.
  • the Apriori algorithm is used to analyze the target medical data, and the N frequent items (N>1) are generated by continuous circulation until the frequent sets are obtained, so that the medical data relationship degree is obtained, and the user can quickly learn the health of the user. Situation, timely prevention.
  • S30 The E-charts tool is used to perform graph conversion on medical data relationship degree, and obtain medical data relationship image.
  • the medical data relation image refers to an image obtained by graphically converting the degree of medical data relationship.
  • ECharts Enterprise Charts
  • the underlying version relies on the lightweight Canvas library ZRender. Users provide intuitive, vivid, highly personalized data visualization charts. The Echarts tool can effectively improve the visualization of data and achieve higher interactivity.
  • At least one target medical data is acquired first, and then the Apriori algorithm is used to perform correlation analysis on the target medical data, and the degree of medical data relationship is obtained to assist the user in judging the probability of disease and prevent it in time.
  • the E-charts tool is used to convert the medical data relationship degree and obtain the medical data relationship image, so that the user can visually see the relationship between the data, assist the user in analysis and consultation, and improve work efficiency.
  • step S10 at least one target medical data is acquired, which specifically includes the following steps:
  • the target webpage address is a webpage address corresponding to the target medical data to be obtained in advance. For example: 39 health website. Support for obtaining target medical data based on the target web page address by obtaining the target web page address to enable subsequent crawling of the target medical data using the web crawler technology.
  • the crawler tool is used to crawl the webpage corresponding to the target webpage address to obtain at least one original medical data.
  • the crawler tool is a tool for automatically crawling the webpage content corresponding to the webpage address according to certain rules, such as a Python crawler tool.
  • the crawler tool is used to crawl the webpage content corresponding to the webpage address to obtain at least one original medical data, and each of the original medical data includes an actual disease characteristic and a corresponding at least one symptom characteristic, such as a cold (actual disease characteristic). Corresponding to symptoms such as runny nose, headache and fever.
  • the crawler tool is used to crawl the webpage corresponding to the target webpage address, and no manual search is needed, which is beneficial to improving the efficiency of data collection.
  • the crawler tool performs crawling data on the target webpage address in a periodic crawling manner, so that the original medical data is time-series, so that the target medical features acquired by the subsequent model training are also sequential.
  • S13 Perform data cleaning on at least one original medical data to obtain at least one target medical data.
  • data cleaning refers to the method of processing raw medical data according to certain rules to obtain pure target medical data.
  • Target medical data refers to pure data processed in accordance with data cleaning rules.
  • the data cleaning rules include, but are not limited to, removing duplicate data. Since each raw medical data contains actual disease characteristics and corresponding at least one symptom characteristic, when cleaning according to the data cleaning rule, two or more original medical data having the same disease characteristics and symptom characteristics can be combined into one. Raw medical data.
  • the quality of the target medical data can be effectively improved, and the repeated original medical data is removed, so that when the target medical data is subsequently used for training, it is not necessary. Re-training the repeated raw medical data can effectively reduce the training time, save time and improve training efficiency.
  • the target webpage address is obtained first, and then the webpage corresponding to the target webpage address is crawled by the crawler tool to obtain at least one original medical data, and no manual search is needed, which is beneficial to improving the efficiency of data collection. Finally, data cleaning is performed on at least one original medical data, and at least one target medical data is acquired, so that when the target medical data is used for training, the repeated data is not required to be retrained, thereby effectively reducing training time, saving time, and improving training efficiency. .
  • step S12 the crawler tool is used to crawl the webpage corresponding to the target webpage address to obtain at least one original medical data, and the original medical data includes a disease characteristic and corresponding at least one symptom.
  • the crawler tool is used to crawl the webpage corresponding to the target webpage address to obtain at least one original medical data, and the original medical data includes a disease characteristic and corresponding at least one symptom.
  • S121 Using a crawler tool, crawling at least one access address linked by the target webpage address according to a breadth-first algorithm or a depth-first algorithm, where each of the access addresses corresponds to a webpage.
  • the depth-first algorithm can be used to start from the start page, and one link is tracked and tracked. After the line is processed, the next start page is transferred to continue tracking the link.
  • the target webpage address corresponds to a target webpage, and the target webpage includes at least one accessing address linked to the starting page and the at least one starting page, and each accessing address corresponds to a visiting webpage.
  • the crawler tool includes a web page extraction tool and a web page download tool.
  • the webpage extraction tool is a tool for extracting an access address, and step S121 specifically uses a webpage extraction tool to crawl at least one access address linked by the target webpage address.
  • the webpage downloading tool is a tool for downloading a webpage corresponding to an access address.
  • the breadth-first algorithm can also be used to continuously crawl a new webpage address from the current page into the message queue to be downloaded, and stop executing the crawler task until the preset stop condition is satisfied.
  • the breadth-first algorithm refers to inserting the link found by the newly downloaded webpage directly into the end of the queue to be crawled, that is, the web crawler first crawls all the webpages in the start page, and then selects one of the links. The webpage continues to crawl all pages linked in this page.
  • S122 Store at least one access address in a queue of pending messages.
  • each access address crawled in step S121 is stored in the queue to be downloaded according to the chronological order of the crawling, so that when step S123 is performed, the crawling may be based on the webpage address in the message queue to be downloaded.
  • the message queue to be downloaded processes the access address according to the advanced first-in-first method, so that the crawl access address and the original medical data can be crawled asynchronously based on the access address, which is beneficial to improving the efficiency of obtaining the original medical data.
  • the webpage extraction tool in the crawler tool first grabs all the webpage content in the start page, and then selects at least one access address linked by the startpage, and continues to crawl the webpage of the visitor address link.
  • the start page is a target webpage address.
  • the crawler tool is used to perform data extraction on the webpage corresponding to each access address in the message queue to obtain original medical data.
  • the webpage downloading tool automatically downloads all the medical data in the webpage corresponding to the accessing address according to each access address of the message queue to be downloaded.
  • a plurality of webpage addresses including the original medical data are stored in the message queue to be downloaded, and the webpage downloading tool of the crawler tool sequentially obtains the accessing address from the to-be-downloaded message queue and downloads the webpage corresponding to the accessing address.
  • Raw medical data Specifically, the crawler tool obtains an access address from the head of the message queue to be downloaded and downloads the webpage corresponding to the access address, stores the downloaded original medical data in the database, and unregisters the corresponding webpage address in the message queue to be downloaded.
  • the webpage downloading tool in the crawler tool automatically crawls the webpage address including the original medical data from the Internet according to the crawler task set by the user, and does not need manual search, which is beneficial to improving data collection efficiency.
  • Raw medical data includes, but is not limited to, symptom characteristics and disease characteristics, and may also include a visiting department.
  • the symptom table, the disease table, and the department table are also stored according to the identification fields of the original medical data (ie, the symptoms, the disease, and the visiting department).
  • the data in each table is labeled.
  • A1 labeled code
  • A2 represents the second symptom in the symptom store
  • ie A2 fever
  • the crawler tool is used to crawl at least one access address linked by the target webpage address according to the breadth-first algorithm or the depth-first algorithm, and the obtained access address is stored in the message queue to be downloaded, and then the crawler is used.
  • the tool downloads the original medical data based on the access address obtained in the message queue to be downloaded, so that the original medical data downloaded by the access address is processed asynchronously, which is beneficial to improving the efficiency of obtaining the original medical data.
  • obtaining the original medical data asynchronously by using the webpage extraction tool and the webpage downloading tool is beneficial to improving the efficiency of acquiring the original medical data.
  • step S20 the Apriori algorithm is used to perform correlation analysis on the target medical data, and the degree of medical data relationship is obtained, which specifically includes the following steps:
  • S21 Acquire at least one first candidate set based on the target medical data, and determine a corresponding first support degree according to the number of occurrences of each first candidate set.
  • the first candidate set refers to a set generated based on the target medical data in the first cycle of the algorithm.
  • the first support degree refers to the number of times each first candidate set appears in the target medical data. Specifically, the symptom feature and the disease feature in the medical record marked after the labeling code are input into the Apriori algorithm for analysis, and the first candidate set is obtained, and the corresponding first number is determined according to the number of occurrences of each first candidate set. Support.
  • the first candidate set is ⁇ A1 ⁇ , ⁇ A2 ⁇ , ⁇ A3 ⁇ , ⁇ B2 ⁇ , then ⁇ A1 ⁇ , ⁇ A2 ⁇ , ⁇ A3 ⁇ , ⁇ B2 ⁇ , the first support set corresponding to the first candidate set They are 2, 2, 1, and 2, respectively.
  • runny nose, fever, headache, and cold are the target medical data in this embodiment.
  • S22 Select a first candidate set whose first support degree is greater than or equal to the preset support degree as the first frequent set.
  • the preset support degree may be preset by the user, or may be the minimum support degree selected by the first support degree generated in each round of the loop as the preset support degree.
  • the first frequent set refers to a set of items corresponding to the first support set and having a first support degree greater than a preset support degree. Specifically, the first candidate set whose first support degree is greater than or equal to the preset support degree is selected as the first frequent set. In the example of step S21, if the preset support degree is 1, the first frequent set is ⁇ A1 ⁇ . , ⁇ A2 ⁇ , ⁇ A3 ⁇ , ⁇ B2 ⁇ ; if the preset support is 2, the first frequent set is ⁇ A1 ⁇ , ⁇ A2 ⁇ , ⁇ B2 ⁇ .
  • the updated first candidate set refers to the item set used to generate the updated first frequent set in the Kth iteration.
  • the updated first support refers to the number of times each updated first candidate set appears in the target medical data.
  • the natural connection theorem is the generated N-term set. If there are two N-item sets, when two N-items have N-1 items in the same set, the natural connection can be made. For example, there are two sets of 3 items: ⁇ A1, A2, B2 ⁇ and ⁇ A1, A2, A3 ⁇ . Since these two items are the same, they are naturally connected. They can be joined to generate a set of 4 items ⁇ A1, A2, A3, B2 ⁇ .
  • Another example is two sets of 3 items ⁇ A1, A2, B2 ⁇ and ⁇ A1, A4, B1 ⁇ . These two items are not connectable because they have no two identical elements.
  • the value of the parameter K is the same as the value of the parameter N.
  • step S21 if the first candidate set is ⁇ A1 ⁇ , ⁇ A2 ⁇ , ⁇ A3 ⁇ , ⁇ B2 ⁇ , the first frequent set is ⁇ A1 ⁇ , ⁇ A2 ⁇ , ⁇ A3 ⁇ , ⁇ B2 ⁇ , then
  • the updated first candidate set obtained according to the natural connection theorem is ⁇ A1, A2 ⁇ , ⁇ A1, A3 ⁇ , ⁇ A2, A3 ⁇ , ⁇ A2, B2 ⁇ , ⁇ A3, B2 ⁇ , which corresponds to the updated first
  • the support is 2, 1, 1, 2, 1.
  • the pruning algorithm refers to an algorithm that obtains a frequent set by judging whether the support degree is greater than a preset support degree. Specifically, if the preset support degree is 2, according to the pruning algorithm, if the first support degree updated at this time is greater than or equal to the preset support degree, the updated first support set corresponding to the first support level is used as an update. The first frequent set.
  • the updated first candidate set obtained according to the natural connection theorem is ⁇ A1, A2 ⁇ , ⁇ A1, A3 ⁇ , ⁇ A2, A3 ⁇ , ⁇ A2, B2 ⁇ , ⁇ A3, B2 ⁇ , corresponding to the updated
  • the first support degree is 2, 1, 1, 2, 1, according to the pruning algorithm, and the first frequent set updated at this time is ⁇ A1, A2 ⁇ , ⁇ A2, B2 ⁇ .
  • the iterative processing of the updated first candidate set and the updated first frequent set is continued according to the natural connection theorem and the pruning theorem until the updated first frequent set is an empty set.
  • the updated first frequent set is an empty set, that is, the updated first support degree is less than the preset support degree, and the first candidate set that is not updated may be used as the updated first frequent set, that is, the updated first frequent set. For an empty set, the iterative processing is aborted at this time.
  • the confidence of the subset corresponding to the first frequent set of the last update is calculated, and compared with the preset reliability to obtain the final strong association rule, and the support degree corresponding to the strong association rule is taken as the medical data relationship degree.
  • the Confidence interval of a probability sample is its confidence, which is an interval estimate of a certain overall parameter of this sample, and support is support.
  • the first frequent set of the last acquired update is ⁇ A1, A2 ⁇ , ⁇ A2, B2 ⁇
  • their subset is obtained as ⁇ A1 ⁇ , ⁇ A2 ⁇ , ⁇ A1, A2 ⁇ , ⁇ B2 ⁇ , ⁇ A2, B2 ⁇
  • the rules obtained are as follows: A1->A2 ⁇ B2 (indicating symptom A1, symptom A2 and disease B2 can be introduced), A2->A1 ⁇ B2 (indicated by symptom A2, can give symptoms A1 and disease B2), A1 ⁇ A2->B2 (expressed by symptom A1, A2 can launch disease B2), B2->A1 ⁇ A2 (represented by disease B2, can be launched Symptoms A1, A2), A2 ⁇ B2->A1 (indicating symptom A2 and disease B2, symptom A1 can be introduced).
  • the confidence level corresponding to each rule is compared with the pre-set reliability, and the rule with greater than or equal to the pre-set reliability is selected as the strong association rule, and the confidence corresponding to the strong association rule is the medical data relationship degree.
  • the pre-set reliability is pre-customized by the developer.
  • the first candidate set is obtained according to the disease feature and the symptom feature, so that the first support degree is determined according to the number of occurrences of each first candidate set, and then the first support degree is greater than or equal to the pre-selection.
  • the first candidate set of the support degree is set as the first frequent set, and the first frequent set and the first candidate set are iteratively processed according to the natural connection theorem and the pruning theorem, and the updated first candidate set and the updated first candidate set are obtained.
  • the first support level and the updated first frequent set until the updated first frequent set is an empty set, the medical data relationship degree is determined based on the first frequent set of the last update.
  • the updated first frequent set is obtained by the natural connection theorem and the pruning theorem to reduce the amount of calculation and improve the acquisition efficiency of the medical data relationship.
  • step S30 the E-charts tool is used to perform graph conversion on the medical data relationship degree, and the medical data relationship image is obtained, which specifically includes the following steps:
  • S31 Acquire a chart configuration request, and the chart configuration request includes a chart ID.
  • the chart ID is an identifier that can uniquely identify the chart conversion function corresponding to different chart types stored in the E-charts tool.
  • the chart configuration request is a request by the user to make a relationship image. Specifically, after selecting the relationship image creation, the user displays all the chart types in the chart type configuration interface, and the user can determine the type of the relationship image to be selected through the chart type configuration request, and the operation process is simple and convenient, which is beneficial to improving the user experience.
  • the chart conversion tool stores conversion functions corresponding to different types of charts, and each conversion function corresponds to a conversion function identifier, and the corresponding conversion function method can be called by the conversion function identifier.
  • the E-charts tool includes, but is not limited to, a pie chart, a histogram and a line chart, a bar chart, a column-linked pie chart, a line line chart, a scatter scatter chart, a pie pie chart, and a pie2 nested ring.
  • the conversion function method corresponding to the chart type such as map map, parallel parallel coordinate, graph relation graph, and graphLes circle type relationship graph.
  • the chart conversion function in the E-charts tool is called to convert the medical data relationship degree, load the corresponding medical data relationship image, and display the medical data relationship image in the visualization area.
  • the E-charts tool obtains the medical data relationship degree and updates the relationship image in real time, so that the medical data relation image has the advantages of strong timing and high reliability.
  • the chart configuration request is acquired, and the chart configuration request includes a chart ID to obtain a chart conversion function corresponding to the chart ID in the E-charts tool based on the chart ID, and finally, the chart conversion function is invoked to perform the medical data relationship degree.
  • Chart conversion access to medical data relationship images, so that users can more intuitively see the relationship between medical data, assist with agent analysis and consulting, and improve work efficiency.
  • Fig. 6 is a block diagram showing the principle of the medical data relation image acquiring apparatus corresponding to the medical data relation image obtaining method in the first embodiment.
  • the medical data relationship image acquisition device includes a target medical data acquisition module 10, a medical data relationship degree acquisition module 20, and a medical data relationship image acquisition module 30.
  • the implementation functions of the target medical data acquisition module 10, the medical data relationship degree acquisition module 20, and the medical data relationship image acquisition module 30 correspond one-to-one with the steps corresponding to the medical data relationship image acquisition method in the embodiment. To avoid redundancy, the implementation The examples are not detailed one by one.
  • the target medical data obtaining module 10 is configured to acquire at least one target medical data.
  • the medical data relationship obtaining module 20 is configured to perform correlation analysis on the target medical data by using the Apriori algorithm, and obtain the medical data relationship degree.
  • the medical data relation image obtaining module 30 is configured to perform chart conversion on the medical data relationship degree by using the E-charts tool, and obtain a medical data relationship image.
  • the target medical data acquisition module 10 includes a target webpage address acquisition unit 11, an original medical data acquisition unit 12, and a target medical data acquisition unit 13.
  • the target webpage address obtaining unit 11 is configured to obtain a target webpage address.
  • the original medical data obtaining unit 12 is configured to crawl the webpage corresponding to the target webpage address by using the crawler tool to obtain at least one original medical data.
  • the target medical data acquiring unit 13 is configured to perform data cleaning on the at least one original medical data to acquire at least one target medical data.
  • the original medical data acquisition unit 12 includes an access address acquisition sub-unit 121, an access address storage sub-unit 122, and an original medical data acquisition sub-unit 123.
  • the access address obtaining sub-unit 121 is configured to use a crawler tool to crawl at least one access address linked by the target webpage address according to a breadth-first algorithm or a depth-first algorithm, where each of the access addresses corresponds to a webpage.
  • the access address storage sub-unit 122 stores at least one access address in the message queue to be downloaded.
  • the original medical data acquisition sub-unit 123 uses the crawler tool to perform data extraction on the webpage corresponding to each access address in the download message queue to obtain the original medical data.
  • the medical data relationship degree obtaining module 20 includes a first candidate set and a first support degree obtaining unit 21, a first frequent set obtaining unit 22, and a medical data relationship degree acquiring unit 23.
  • the first candidate set and the first support obtaining unit 21 are configured to acquire at least one first candidate set based on the target medical data, and determine a corresponding first support according to the number of occurrences of each first candidate set.
  • the first frequent set obtaining unit 22 is configured to select a first candidate set whose first support degree is greater than or equal to the preset support degree as the first frequent set.
  • the medical data relationship obtaining unit 23 is configured to perform iterative processing on the first frequent set and the first candidate set according to the natural connection theorem and the pruning algorithm, and obtain the updated first candidate set, the updated first support degree, and the updated first A frequent set, until the updated first frequent set is an empty set, then the medical data relationship is determined based on the first frequent set of the last update.
  • the medical data relation image acquisition module 30 includes a chart configuration request acquisition unit 31, a chart conversion function acquisition unit 32, and a medical data relationship image acquisition unit 33.
  • the chart configuration request obtaining unit 31 is configured to acquire a chart configuration request, and the chart configuration request includes a chart ID.
  • the chart conversion function acquisition unit 32 is configured to acquire a chart conversion function corresponding to the chart ID in the E-charts tool.
  • the medical data relation image obtaining unit 33 is configured to invoke a chart conversion function to perform chart conversion on the medical data relationship degree, and acquire a medical data relationship image.
  • the embodiment provides a computer readable storage medium, where the computer readable storage medium is stored by the processor, and the medical data relationship image acquisition method in Embodiment 1 is implemented to avoid duplication. , no longer repeat them here.
  • the computer readable instructions are executed by the processor, the functions of the modules/units in the medical data relation image obtaining device in Embodiment 2 are implemented. To avoid repetition, details are not described herein again.
  • the computer readable storage medium can include any entity or device capable of carrying the computer readable instruction code, a recording medium, a USB flash drive, a removable hard drive, a magnetic disk, an optical disk, a computer memory, a read only memory (ROM, Read-Only) Memory), random access memory (RAM), electrical carrier signals, telecommunications signals, and software distribution media.
  • FIG. 7 is a schematic diagram of a terminal device according to an embodiment of the present application.
  • the terminal device 70 of this embodiment includes a processor 71, a memory 72, and computer readable instructions 73 stored in the memory 72 and operable on the processor 71.
  • the processor 71 executes the computer readable instructions 73
  • the steps of the medical data relation image acquisition method in the above-described Embodiment 1 are implemented, such as steps S10 to S30 shown in FIG.
  • the processor 71 executes the computer readable instructions 73
  • the functions of the modules/units of the medical data relation image acquiring device in the second embodiment are implemented, for example, the target medical data acquiring module 10 and the medical data relationship acquiring module shown in FIG. 20 and medical data relate to the functionality of image acquisition module 30.
  • computer readable instructions 73 may be partitioned into one or more modules/units, one or more modules/units being stored in memory 72 and executed by processor 71 to complete the application.
  • the one or more modules/units may be an instruction segment of a series of computer readable instructions 703 capable of performing a particular function for describing the execution of computer readable instructions 73 in the terminal device 70.
  • the computer readable instructions 73 may be divided into a target medical data acquisition module 10, a medical data relationship degree acquisition module 20, and a medical data relationship image acquisition module 30.
  • the specific functions of the modules are as described in Embodiment 2, and are different here. A narrative.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Measuring And Recording Apparatus For Diagnosis (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

Disclosed are a medical data relationship image acquisition method and apparatus, a terminal device and a storage medium. The medical data relationship image acquisition method comprises: acquiring at least one piece of target medical data; carrying out association analysis on the target medical data using an Apriori algorithm to acquire a degree of medical data relationship; and carrying out chart transformation on the degree of medical data relationship using an E-charts tool to acquire a medical data relationship image. The medical data relationship image acquisition method can enable a user to intuitively see a relationship between a disease characteristic and a symptom characteristic, thus facilitating the user understanding his/her possible disease condition according to his/her symptoms.

Description

医疗数据关系图像获取方法、装置、终端设备及存储介质Medical data relationship image acquisition method, device, terminal device and storage medium
本专利申请以2018年1月12日提交的申请号为201810031265.6,名称为“医疗数据关系图像获取方法、装置、终端设备及存储介质”的中国发明专利申请为基础,并要求其优先权。This patent application is based on the Chinese Patent Application No. 201810031265.6 filed on January 12, 2018, entitled "Medical Data Relation Image Acquisition Method, Apparatus, Terminal Equipment and Storage Medium", and requires priority.
技术领域Technical field
本申请涉及数据处理领域,尤其涉及一种医疗数据关系图像获取方法、装置、终端设备及存储介质。The present application relates to the field of data processing, and in particular, to a method, an apparatus, a terminal device, and a storage medium for acquiring a medical data relationship image.
背景技术Background technique
当前疾病的识别通常由医生根据病人的症状确定,需依赖于医生的专业和经验,无法使用户根据自身的症状情况及时了解自身的健康情况。用户在查询医疗网站时,医疗网站中提供的症状、疾病和治疗等医疗数据都是独立展示,没有提供各种医疗数据之间的关联关系(即关系度),使得用户根据无法根据这些医疗数据及时了解自己的健康状态。The identification of the current disease is usually determined by the doctor according to the patient's symptoms, depends on the doctor's professional and experience, and can not enable the user to know his or her health according to his or her symptoms. When the user queries the medical website, the medical data such as symptoms, diseases and treatments provided on the medical website are displayed independently, and the relationship between the various medical data (ie, the degree of relationship) is not provided, so that the user cannot base the medical data according to the medical data. Keep abreast of your health status.
发明内容Summary of the invention
本申请实施例提供一种医疗数据关系图像获取方法、装置、终端设备及存储介质,以解决传统医疗数据独立展示的问题。The embodiment of the present application provides a medical data relationship image acquisition method, device, terminal device and storage medium to solve the problem of traditional medical data independent display.
第一方面,本申请实施例提供一种医疗数据关系图像获取方法,包括:In a first aspect, an embodiment of the present application provides a method for acquiring a medical data relationship image, including:
获取至少一个目标医疗数据;Obtaining at least one target medical data;
采用Apriori算法对所述目标医疗数据进行关联性分析,获取医疗数据关系度;Performing correlation analysis on the target medical data by using the Apriori algorithm, and obtaining medical data relationship degree;
采用E-charts工具对所述医疗数据关系度进行图表转换,获取医疗数据关系图像。The medical data relationship degree is graphically converted by using an E-charts tool to obtain a medical data relationship image.
第二方面,本申请实施例提供一种医疗数据关系图像获取装置,包括:In a second aspect, the embodiment of the present application provides a medical data relationship image acquiring apparatus, including:
目标医疗数据获取模块,用于获取至少一个目标医疗数据;a target medical data acquisition module, configured to acquire at least one target medical data;
医疗数据关系度获取模块,用于采用Apriori算法对所述目标医疗数据进行关联性分析,获取医疗数据关系度;a medical data relationship obtaining module, configured to perform correlation analysis on the target medical data by using an Apriori algorithm, and obtain a medical data relationship degree;
医疗数据关系图像获取模块,用于采用E-charts工具对所述医疗数据关系度进行图 表转换,获取医疗数据关系图像。The medical data relation image obtaining module is configured to perform a graph conversion on the medical data relationship degree by using an E-charts tool to obtain a medical data relationship image.
第三方面,本申请实施例提供一种终端设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现如下步骤:In a third aspect, an embodiment of the present application provides a terminal device, including a memory, a processor, and computer readable instructions stored in the memory and executable on the processor, where the processor executes the computer The following steps are implemented when reading the instruction:
获取至少一个目标医疗数据;Obtaining at least one target medical data;
采用Apriori算法对所述目标医疗数据进行关联性分析,获取医疗数据关系度;Performing correlation analysis on the target medical data by using the Apriori algorithm, and obtaining medical data relationship degree;
采用E-charts工具对所述医疗数据关系度进行图表转换,获取医疗数据关系图像。The medical data relationship degree is graphically converted by using an E-charts tool to obtain a medical data relationship image.
第四方面,本申请实施例提供一种计算机可读介质,所述计算机可读介质存储有计算机可读指令,所述计算机可读指令被处理器执行时实现如下步骤:In a fourth aspect, an embodiment of the present application provides a computer readable medium storing computer readable instructions, where the computer readable instructions are executed by a processor to implement the following steps:
获取至少一个目标医疗数据;Obtaining at least one target medical data;
采用Apriori算法对所述目标医疗数据进行关联性分析,获取医疗数据关系度;Performing correlation analysis on the target medical data by using the Apriori algorithm, and obtaining medical data relationship degree;
采用E-charts工具对所述医疗数据关系度进行图表转换,获取医疗数据关系图像。The medical data relationship degree is graphically converted by using an E-charts tool to obtain a medical data relationship image.
本申请实施例提供的医疗数据关系图像获取方法、装置、终端设备及存储介质中,先获取至少一个目标医疗数据,然后,采用Apriori算法对目标医疗数据进行关联性分析,获取医疗数据关系度,以辅助用户判断患病概率,及时预防。最后,采用E-charts工具对医疗数据关系度进行图表转换,获取医疗数据关系图像,以使用户能够直观看到疾病特征和症状特征之间的关系,有利于用户根据自身症状了解其可能的患病情况。In the medical data relation image acquisition method, device, terminal device and storage medium provided by the embodiments of the present application, at least one target medical data is first acquired, and then the Apriori algorithm is used to perform correlation analysis on the target medical data, and the medical data relationship degree is obtained. To help users judge the probability of illness and prevent it in time. Finally, the E-charts tool is used to graphically convert the relationship between medical data and obtain medical data relationship images so that users can visually see the relationship between disease characteristics and symptom characteristics, which helps users to understand their possible diseases according to their own symptoms. The condition.
附图说明DRAWINGS
为了更清楚地说明本申请实施例的技术方案,下面将对本申请实施例的描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings used in the description of the embodiments of the present application will be briefly described below. It is obvious that the drawings in the following description are only some embodiments of the present application. Other drawings may also be obtained from those of ordinary skill in the art based on these drawings without the inventive labor.
图1是实施例1中提供的医疗数据关系图像获取方法的一流程图。1 is a flowchart of a medical data relationship image acquisition method provided in Embodiment 1.
图2是图1中步骤S10的一具体示意图。FIG. 2 is a specific schematic diagram of step S10 of FIG. 1.
图3是图2中步骤S12的一具体示意图。FIG. 3 is a specific schematic diagram of step S12 of FIG. 2.
图4是图1中步骤S20的一具体示意图。FIG. 4 is a specific schematic diagram of step S20 of FIG. 1.
图5是图1中步骤S30的一具体示意图。FIG. 5 is a specific schematic diagram of step S30 of FIG. 1.
图6是实施例2中提供的医疗数据关系图像获取装置的一原理框图。6 is a schematic block diagram of a medical data relation image acquiring device provided in Embodiment 2.
图7是实施例4中提供的终端设备的一示意图。FIG. 7 is a schematic diagram of a terminal device provided in Embodiment 4.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application are clearly and completely described in the following with reference to the drawings in the embodiments of the present application. It is obvious that the described embodiments are a part of the embodiments of the present application, and not all of the embodiments. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present application without departing from the inventive scope are the scope of the present application.
实施例1Example 1
图1示出本实施例中医疗数据关系图像获取方法的流程图。该医疗数据关系图像获取方法可快速从网络中采集到大量的医疗数据,以便基于采集到的医疗数据进行关系度分析。该医疗数据关系图像获取方法可具体应用在医疗知识库管理系统这一数据管理系统中,用于对用户所输入的症状特征进行关系度分析,为用户推荐疑似疾病特征列表,能够有效辅助用户了解自身健康状况。如图1所示,该医疗数据关系图像获取方法包括如下步骤:FIG. 1 is a flow chart showing a method of acquiring a medical data relationship image in the present embodiment. The medical data relation image acquisition method can quickly collect a large amount of medical data from the network to perform relationship analysis based on the collected medical data. The medical data relationship image acquisition method can be specifically applied to the medical knowledge base management system, which is used for analyzing the relationship characteristics of the symptoms input by the user, and recommending a list of suspected disease characteristics for the user, which can effectively assist the user to understand. Self-health status. As shown in FIG. 1, the medical data relationship image acquisition method includes the following steps:
S10:获取至少一个目标医疗数据。S10: Acquire at least one target medical data.
其中,目标医疗数据是用于进行模型训练的数据。该目标医疗数据包括但不限于本实施例中的症状特征、疾病特征等医疗数据。本实施例中,通过获取目标医疗数据,以便后续进行模型训练。Among them, the target medical data is data for performing model training. The target medical data includes, but is not limited to, medical data such as symptom characteristics, disease characteristics, and the like in the present embodiment. In this embodiment, the target medical data is acquired for subsequent model training.
S20:采用Apriori算法对目标医疗数据进行关联性分析,获取医疗数据关系度。S20: The Apriori algorithm is used to analyze the relevance of the target medical data to obtain the relationship of the medical data.
其中,Apriori算法是一种用于关联规则挖掘(Association rule mining)的代表性算法。Apriori算法广泛应用在各大行业中,计算量少,容易理解,能够有效挖掘数据间的潜在规则。医疗数据关系度是用于反映疾病特征和症状特征间的关联程度。具体地,采用Apriori算法对目标医疗数据进行关联性分析,通过不断循环产生N项频繁集(N>1)直到没有频繁集出现,以来获取医疗数据关系度,辅助用户能够快速了解到自身的健康情况,及时预防。Among them, Apriori algorithm is a representative algorithm for association rule mining. Apriori algorithm is widely used in various industries, with less calculation, easy to understand, and can effectively mine the potential rules between data. The medical data relationship is used to reflect the degree of association between disease characteristics and symptom characteristics. Specifically, the Apriori algorithm is used to analyze the target medical data, and the N frequent items (N>1) are generated by continuous circulation until the frequent sets are obtained, so that the medical data relationship degree is obtained, and the user can quickly learn the health of the user. Situation, timely prevention.
S30:采用E-charts工具对医疗数据关系度进行图表转换,获取医疗数据关系图像。S30: The E-charts tool is used to perform graph conversion on medical data relationship degree, and obtain medical data relationship image.
其中,医疗数据关系图像是指对医疗数据关系度进行图表转换所得到的图像。ECharts(Enterprise Charts,商业级数据图表)是一个纯Javascript的图表库,可以流畅的运行在PC和移动设备上,兼容当前绝大部分浏览器,底层依赖轻量级的Canvas类库ZRender,能够给用户提供直观,生动,可高度个性化定制的数据可视化图表。采用Echarts工具可有效提高数据的可视化程度,交互性更高。The medical data relation image refers to an image obtained by graphically converting the degree of medical data relationship. ECharts (Enterprise Charts) is a pure Javascript chart library that runs smoothly on PCs and mobile devices. It is compatible with most browsers. The underlying version relies on the lightweight Canvas library ZRender. Users provide intuitive, vivid, highly personalized data visualization charts. The Echarts tool can effectively improve the visualization of data and achieve higher interactivity.
本实施例中,先获取至少一个目标医疗数据,然后,采用Apriori算法对目标医疗数 据进行关联性分析,获取医疗数据关系度,以辅助用户判断患病概率,及时预防。最后,采用E-charts工具对医疗数据关系度进行图表转换,获取医疗数据关系图像,以使用户能够直观看到各数据间的关系,辅助用户分析和咨询等工作,提高工作效率。In this embodiment, at least one target medical data is acquired first, and then the Apriori algorithm is used to perform correlation analysis on the target medical data, and the degree of medical data relationship is obtained to assist the user in judging the probability of disease and prevent it in time. Finally, the E-charts tool is used to convert the medical data relationship degree and obtain the medical data relationship image, so that the user can visually see the relationship between the data, assist the user in analysis and consultation, and improve work efficiency.
在一具体实施方式中,如图2所示,步骤S10中,即获取至少一个目标医疗数据,具体包括如下步骤:In a specific embodiment, as shown in FIG. 2, in step S10, at least one target medical data is acquired, which specifically includes the following steps:
S11:获取目标网页地址。S11: Get the target webpage address.
其中,目标网页地址是预先定义好所要获取目标医疗数据对应的网页地址。例如:39健康网站。通过获取到目标网页地址,以使后续采用网络爬虫技术爬取目标医疗数据时,为基于目标网页地址获取目标医疗数据提供支持。The target webpage address is a webpage address corresponding to the target medical data to be obtained in advance. For example: 39 health website. Support for obtaining target medical data based on the target web page address by obtaining the target web page address to enable subsequent crawling of the target medical data using the web crawler technology.
S12:采用爬虫工具爬取目标网页地址对应的网页,获取至少一个原始医疗数据。S12: The crawler tool is used to crawl the webpage corresponding to the target webpage address to obtain at least one original medical data.
其中,爬虫工具是按照一定的规则自动爬取网页地址所对应的网页内容的工具,例如Python爬虫工具。具体地,采用爬虫工具爬取目标网页地址所对应的网页内容,以获取至少一个原始医疗数据,每一原始医疗数据都包含实际疾病特征和对应的至少一个症状特征,例如感冒(实际疾病特征)对应流鼻涕、头疼和发烧等症状特征。本实施例中,采用爬虫工具爬取目标网页地址对应的网页,无需人工搜索,有利于提高数据采集的效率。Among them, the crawler tool is a tool for automatically crawling the webpage content corresponding to the webpage address according to certain rules, such as a Python crawler tool. Specifically, the crawler tool is used to crawl the webpage content corresponding to the webpage address to obtain at least one original medical data, and each of the original medical data includes an actual disease characteristic and a corresponding at least one symptom characteristic, such as a cold (actual disease characteristic). Corresponding to symptoms such as runny nose, headache and fever. In this embodiment, the crawler tool is used to crawl the webpage corresponding to the target webpage address, and no manual search is needed, which is beneficial to improving the efficiency of data collection.
本实施例中,爬虫工具会采取周期性爬取的方式对目标网页地址进行爬取数据,以使原始医疗数据具有时序性,以使后续模型训练获取到的目标医疗特征也具有时序性。In this embodiment, the crawler tool performs crawling data on the target webpage address in a periodic crawling manner, so that the original medical data is time-series, so that the target medical features acquired by the subsequent model training are also sequential.
S13:对至少一个原始医疗数据进行数据清洗,获取至少一个目标医疗数据。S13: Perform data cleaning on at least one original medical data to obtain at least one target medical data.
其中,数据清洗是指对原始医疗数据按照一定规则进行处理,获取纯净的目标医疗数据的方法。目标医疗数据是指按照数据清洗规则进行处理得到的纯净的数据。该数据清洗规则包括但不限于去除重复的数据。由于每一原始医疗数据均包含实际疾病特征和对应的至少一个症状特征,依据数据清洗规则进行清洗时,可将实际疾病特征和症状特征均相同的两个或两个以上原始医疗数据合并为一原始医疗数据。本实施例中,通过对原始医疗数据进行数据清洗,以获取目标医疗数据,能够有效提升目标医疗数据的质量,并且,去除重复的原始医疗数据,以使后续采用目标医疗数据进行训练时,无需对重复的原始医疗数据进行再次训练,能够有效减少训练时长,节省时间,提高训练效率。Among them, data cleaning refers to the method of processing raw medical data according to certain rules to obtain pure target medical data. Target medical data refers to pure data processed in accordance with data cleaning rules. The data cleaning rules include, but are not limited to, removing duplicate data. Since each raw medical data contains actual disease characteristics and corresponding at least one symptom characteristic, when cleaning according to the data cleaning rule, two or more original medical data having the same disease characteristics and symptom characteristics can be combined into one. Raw medical data. In this embodiment, by performing data cleaning on the original medical data to obtain target medical data, the quality of the target medical data can be effectively improved, and the repeated original medical data is removed, so that when the target medical data is subsequently used for training, it is not necessary. Re-training the repeated raw medical data can effectively reduce the training time, save time and improve training efficiency.
本实施例中,先获取目标网页地址,然后采用爬虫工具爬取目标网页地址对应的网页,以获取至少一个原始医疗数据,无需人工搜索,有利于提高数据采集的效率。最后对至少一个原始医疗数据进行数据清洗,获取至少一个目标医疗数据,以使后续采用目标医疗数据进行训练时,无需对重复的数据进行再次训练,能够有效减少训练时长,节省时间,提 高训练效率。In this embodiment, the target webpage address is obtained first, and then the webpage corresponding to the target webpage address is crawled by the crawler tool to obtain at least one original medical data, and no manual search is needed, which is beneficial to improving the efficiency of data collection. Finally, data cleaning is performed on at least one original medical data, and at least one target medical data is acquired, so that when the target medical data is used for training, the repeated data is not required to be retrained, thereby effectively reducing training time, saving time, and improving training efficiency. .
在一具体实施方式中,如图3所示,步骤S12中,即采用爬虫工具爬取目标网页地址对应的网页,获取至少一个原始医疗数据,原始医疗数据包括一个疾病特征和对应的至少一个症状特征,具体包括如下步骤:In a specific embodiment, as shown in FIG. 3, in step S12, the crawler tool is used to crawl the webpage corresponding to the target webpage address to obtain at least one original medical data, and the original medical data includes a disease characteristic and corresponding at least one symptom. Features include the following steps:
S121:采用爬虫工具,依据广度优先算法或深度优先算法爬取所述目标网页地址所链接的至少一个访问地址,每一所述访问地址对应一网页。S121: Using a crawler tool, crawling at least one access address linked by the target webpage address according to a breadth-first algorithm or a depth-first algorithm, where each of the access addresses corresponds to a webpage.
本实施例中,可以采用深度优先算法从起始页开始,一个链接一个链接跟踪下去,处理完这条线路之后再转入下一个起始页,继续追踪链接。目标网页地址对应一目标网页,该目标网页包括起始页和至少一个起始页所链接的至少一个访问地址,每一访问地址对应一访问网页。爬虫工具包括网页提取工具和网页下载工具。网页提取工具是用于提取访问地址的工具,步骤S121具体是采用网页提取工具爬取目标网页地址所链接的至少一个访问地址。网页下载工具是用于下载访问地址对应的网页的工具。In this embodiment, the depth-first algorithm can be used to start from the start page, and one link is tracked and tracked. After the line is processed, the next start page is transferred to continue tracking the link. The target webpage address corresponds to a target webpage, and the target webpage includes at least one accessing address linked to the starting page and the at least one starting page, and each accessing address corresponds to a visiting webpage. The crawler tool includes a web page extraction tool and a web page download tool. The webpage extraction tool is a tool for extracting an access address, and step S121 specifically uses a webpage extraction tool to crawl at least one access address linked by the target webpage address. The webpage downloading tool is a tool for downloading a webpage corresponding to an access address.
本实施例中,还可以采用广度优先算法不断从当前页面上爬取新的网页地址放入待下载消息队列中,直到预设停止条件满足时停止执行爬虫任务。其中,广度优先算法是指将新下载网页发现的链接直接插入到待抓取消息队列的末尾,也就是指网络爬虫会先抓取起始页中的所有网页,然后在选择其中的一个链接的网页,继续抓取在此网页中链接的所有网页。In this embodiment, the breadth-first algorithm can also be used to continuously crawl a new webpage address from the current page into the message queue to be downloaded, and stop executing the crawler task until the preset stop condition is satisfied. The breadth-first algorithm refers to inserting the link found by the newly downloaded webpage directly into the end of the queue to be crawled, that is, the web crawler first crawls all the webpages in the start page, and then selects one of the links. The webpage continues to crawl all pages linked in this page.
S122:将至少一个访问地址存储在待处理消息队列中。S122: Store at least one access address in a queue of pending messages.
具体地,将步骤S121中爬取到的每一访问地址依据爬取到的时间先后顺序存储在待下载消息队列中,以便在执行步骤S123时,可基于待下载消息队列中的网页地址进行爬取数据。待下载消息队列依据先进先入的方式对访问地址进行处理,可使爬取访问地址和基于访问地址爬取原始医疗数据异步处理,有利于提高获取原始医疗数据效率。Specifically, each access address crawled in step S121 is stored in the queue to be downloaded according to the chronological order of the crawling, so that when step S123 is performed, the crawling may be based on the webpage address in the message queue to be downloaded. Take data. The message queue to be downloaded processes the access address according to the advanced first-in-first method, so that the crawl access address and the original medical data can be crawled asynchronously based on the access address, which is beneficial to improving the efficiency of obtaining the original medical data.
具体地,爬虫工具中的网页提取工具会先抓取起始页中的所有网页内容,然后再选择起始页所链接的至少一个访问地址,继续抓取此访问地址链接的网页。本实施例中,起始页为目标网页地址。Specifically, the webpage extraction tool in the crawler tool first grabs all the webpage content in the start page, and then selects at least one access address linked by the startpage, and continues to crawl the webpage of the visitor address link. In this embodiment, the start page is a target webpage address.
S123:采用爬虫工具对待处理消息队列中的每一访问地址对应的网页进行数据提取,获取原始医疗数据。S123: The crawler tool is used to perform data extraction on the webpage corresponding to each access address in the message queue to obtain original medical data.
具体地,采用网页下载工具根据待下载消息队列的每一访问地址自动下载该访问地址对应的网页中所有医疗数据。本实施例中,待下载消息队列中存储有多个包含原始医疗数据的网页地址,爬虫工具的网页下载工具依序从待下载消息队列中逐一获取访问地址并下 载该访问地址对应的网页中的原始医疗数据。具体地,爬虫工具从待下载消息队列的队头获取到一访问地址并对该访问地址对应的网页进行下载,将下载的原始医疗数据存储在数据库后,注销待下载消息队列中相应的网页地址,重复上述步骤直至待下载消息队列中不存在访问地址,以获取爬虫工具爬取的所有网页中的原始医疗数据。本实施例中,爬虫工具中的网页下载工具根据用户设置的爬虫任务自动从互联网上爬取包含原始医疗数据的网页地址,无需人工搜索,有利于提高数据采集效率。Specifically, the webpage downloading tool automatically downloads all the medical data in the webpage corresponding to the accessing address according to each access address of the message queue to be downloaded. In this embodiment, a plurality of webpage addresses including the original medical data are stored in the message queue to be downloaded, and the webpage downloading tool of the crawler tool sequentially obtains the accessing address from the to-be-downloaded message queue and downloads the webpage corresponding to the accessing address. Raw medical data. Specifically, the crawler tool obtains an access address from the head of the message queue to be downloaded and downloads the webpage corresponding to the access address, stores the downloaded original medical data in the database, and unregisters the corresponding webpage address in the message queue to be downloaded. Repeat the above steps until there is no access address in the message queue to be downloaded to get the original medical data in all web pages crawled by the crawler tool. In this embodiment, the webpage downloading tool in the crawler tool automatically crawls the webpage address including the original medical data from the Internet according to the crawler task set by the user, and does not need manual search, which is beneficial to improving data collection efficiency.
原始医疗数据包括但不限于症状特征和疾病特征,还可以包括就诊科室。本实施例中,在获取原始医疗数据之后,还会按照原始医疗数据的标识字段(即症状、疾病和就诊科室)进行存储得到症状表、疾病表和科室表。然后,再对各表中的数据进行标注,例如A1(标注代码)代表症状库中的第一个症状即A1=流鼻涕,A2代表症状库中的第二个症状即A2=发烧,B2代表疾病库第二个疾病即B2=感冒。最后,将每一就诊记录对照上述的标注代码进行标注,例如(流鼻涕,发烧,感冒)=(A1,A2,B2)。Raw medical data includes, but is not limited to, symptom characteristics and disease characteristics, and may also include a visiting department. In this embodiment, after the original medical data is acquired, the symptom table, the disease table, and the department table are also stored according to the identification fields of the original medical data (ie, the symptoms, the disease, and the visiting department). Then, the data in each table is labeled. For example, A1 (labeled code) represents the first symptom in the symptom database, ie A1 = runny nose, and A2 represents the second symptom in the symptom store, ie A2 = fever, B2 represents The second disease of the disease library is B2 = cold. Finally, each visit record is labeled with the above-mentioned label code, for example (runny nose, fever, cold) = (A1, A2, B2).
本实施例中,采用爬虫工具,依据广度优先算法或深度优先算法爬取所述目标网页地址所链接的至少一个访问地址,并将获取到的访问地址存储在待下载消息队列中,再采用爬虫工具基于待下载消息队列中获取的访问地址下载原始医疗数据,使得访问地址下载的原始医疗数据异步处理,有利于提高原始医疗数据的获取效率。本实施例中,通过采用网页提取工具和网页下载工具异步处理获取原始医疗数据,有利于提高获取原始医疗数据的效率。In this embodiment, the crawler tool is used to crawl at least one access address linked by the target webpage address according to the breadth-first algorithm or the depth-first algorithm, and the obtained access address is stored in the message queue to be downloaded, and then the crawler is used. The tool downloads the original medical data based on the access address obtained in the message queue to be downloaded, so that the original medical data downloaded by the access address is processed asynchronously, which is beneficial to improving the efficiency of obtaining the original medical data. In this embodiment, obtaining the original medical data asynchronously by using the webpage extraction tool and the webpage downloading tool is beneficial to improving the efficiency of acquiring the original medical data.
如图4所示,步骤S20中,即采用Apriori算法对目标医疗数据进行关联性分析,获取医疗数据关系度,具体包括如下步骤:As shown in FIG. 4, in step S20, the Apriori algorithm is used to perform correlation analysis on the target medical data, and the degree of medical data relationship is obtained, which specifically includes the following steps:
S21:基于目标医疗数据,获取至少一个第一候选集,根据每一第一候选集出现的次数确定对应的第一支持度。S21: Acquire at least one first candidate set based on the target medical data, and determine a corresponding first support degree according to the number of occurrences of each first candidate set.
其中,第一候选集是指在算法的第1次循环中基于目标医疗数据生成的一项集。第一支持度是指每一第一候选集在目标医疗数据中出现的次数。具体地,将对照标注代码进行标注后的就诊记录中的症状特征和疾病特征,输入到Apriori算法中进行分析,获取第一候选集,根据每一第一候选集出现的次数确定对应的第一支持度。例如:获取到的就诊记录如下:(流鼻涕,发烧,感冒)=(A1,A2,B2)、(流鼻涕,发烧,头疼,感冒)=(A1,A2,A3,B2),则获取的第一候选集为{A1},{A2},{A3},{B2},则{A1},{A2},{A3},{B2}这四个第一候选集对应的第一支持度分别为2,2,1,2。其中,流鼻涕,发烧,头疼,感冒即为本实施例中的目标医疗数据。The first candidate set refers to a set generated based on the target medical data in the first cycle of the algorithm. The first support degree refers to the number of times each first candidate set appears in the target medical data. Specifically, the symptom feature and the disease feature in the medical record marked after the labeling code are input into the Apriori algorithm for analysis, and the first candidate set is obtained, and the corresponding first number is determined according to the number of occurrences of each first candidate set. Support. For example, the medical records obtained are as follows: (runny nose, fever, cold) = (A1, A2, B2), (runny nose, fever, headache, cold) = (A1, A2, A3, B2), then obtained The first candidate set is {A1}, {A2}, {A3}, {B2}, then {A1}, {A2}, {A3}, {B2}, the first support set corresponding to the first candidate set They are 2, 2, 1, and 2, respectively. Among them, runny nose, fever, headache, and cold are the target medical data in this embodiment.
S22:选取第一支持度大于或等于预设支持度的第一候选集作为第一频繁集。S22: Select a first candidate set whose first support degree is greater than or equal to the preset support degree as the first frequent set.
其中,预设支持度可以是用户预先设定好的,也可以是通过选取每一轮循环中产生的第一支持度中最小的第一支持度作为预设支持度。第一频繁集是指第一候选集对应的第一支持度大于预设支持度的项集。具体地,选取选取第一支持度大于或等于预设支持度的第一候选集作为第一频繁集,如步骤S21示例中,若预设支持度为1,则第一频繁集为{A1},{A2},{A3},{B2};若预设支持度为2,则第一频繁集为{A1},{A2},{B2}。The preset support degree may be preset by the user, or may be the minimum support degree selected by the first support degree generated in each round of the loop as the preset support degree. The first frequent set refers to a set of items corresponding to the first support set and having a first support degree greater than a preset support degree. Specifically, the first candidate set whose first support degree is greater than or equal to the preset support degree is selected as the first frequent set. In the example of step S21, if the preset support degree is 1, the first frequent set is {A1}. , {A2}, {A3}, {B2}; if the preset support is 2, the first frequent set is {A1}, {A2}, {B2}.
S23:根据自然连接定理和剪枝算法对第一频繁集和第一候选集进行迭代处理,获取更新的第一候选集、更新的第一支持度和更新的第一频繁集,直至更新的第一频繁集为空集时,则基于上一次更新的第一频繁集确定医疗数据关系度。S23: Perform iterative processing on the first frequent set and the first candidate set according to the natural connection theorem and the pruning algorithm, and obtain the updated first candidate set, the updated first support degree, and the updated first frequent set until the updated first When a frequent set is an empty set, the medical data relationship is determined based on the first frequent set of the last update.
其中,更新的第一候选集是指第K次迭代中用于生成更新的第一频繁集的项集。更新的第一支持度是指每一更新的第一候选集在目标医疗数据中出现的次数。自然连接定理为生成的N项集中,若有两个N项集,当两个N项集中有N-1项个元素相同时,则可进行自然连接。例如有两个3项集:{A1,A2,B2}和{A1,A2,A3},这两个3项集由于有2项元素相同,因此是可进行自然连接。它们可以连接生成4项集{A1,A2,A3,B2}。又如两个3项集{A1,A2,B2}和{A1,A4,B1},这两个3项集由于没有2项相同的元素,因此是不可连接的。本实施例中,参数K的值与参数N的值相同。The updated first candidate set refers to the item set used to generate the updated first frequent set in the Kth iteration. The updated first support refers to the number of times each updated first candidate set appears in the target medical data. The natural connection theorem is the generated N-term set. If there are two N-item sets, when two N-items have N-1 items in the same set, the natural connection can be made. For example, there are two sets of 3 items: {A1, A2, B2} and {A1, A2, A3}. Since these two items are the same, they are naturally connected. They can be joined to generate a set of 4 items {A1, A2, A3, B2}. Another example is two sets of 3 items {A1, A2, B2} and {A1, A4, B1}. These two items are not connectable because they have no two identical elements. In this embodiment, the value of the parameter K is the same as the value of the parameter N.
如步骤S21示例中,若第一候选集为{A1},{A2},{A3},{B2},第一频繁集为{A1},{A2},{A3},{B2},则根据自然连接定理得到的更新的第一候选集为{A1,A2},{A1,A3},{A2,A3},{A2,B2},{A3,B2},其对应的更新的第一支持度为2,1,1,2,1。As in the example of step S21, if the first candidate set is {A1}, {A2}, {A3}, {B2}, the first frequent set is {A1}, {A2}, {A3}, {B2}, then The updated first candidate set obtained according to the natural connection theorem is {A1, A2}, {A1, A3}, {A2, A3}, {A2, B2}, {A3, B2}, which corresponds to the updated first The support is 2, 1, 1, 2, 1.
剪枝算法是指通过判断支持度是否大于预设支持度而获得频繁集的算法。具体地,假设预设的支持度为2,根据剪枝算法,若此时更新的第一支持度大于或等于预设支持度,则将更新的第一支持度对应的第一候选集作为更新的第一频繁集。例如,根据自然连接定理得到的更新的第一候选集为{A1,A2},{A1,A3},{A2,A3},{A2,B2},{A3,B2},其对应的更新的第一支持度为2,1,1,2,1,则根据剪枝算法,此时更新的第一频繁集为{A1,A2},{A2,B2}。The pruning algorithm refers to an algorithm that obtains a frequent set by judging whether the support degree is greater than a preset support degree. Specifically, if the preset support degree is 2, according to the pruning algorithm, if the first support degree updated at this time is greater than or equal to the preset support degree, the updated first support set corresponding to the first support level is used as an update. The first frequent set. For example, the updated first candidate set obtained according to the natural connection theorem is {A1, A2}, {A1, A3}, {A2, A3}, {A2, B2}, {A3, B2}, corresponding to the updated The first support degree is 2, 1, 1, 2, 1, according to the pruning algorithm, and the first frequent set updated at this time is {A1, A2}, {A2, B2}.
继续根据自然连接定理和剪枝定理对更新的第一候选集和更新的第一频繁集进行迭代处理,直至更新的第一频繁集为空集。The iterative processing of the updated first candidate set and the updated first frequent set is continued according to the natural connection theorem and the pruning theorem until the updated first frequent set is an empty set.
其中,更新的第一频繁集为空集是指更新的第一支持度都小于预设支持度,则没有更新的第一候选集可以作为更新的第一频繁集,即更新的第一频繁集为空集,此时中止迭代处理。The updated first frequent set is an empty set, that is, the updated first support degree is less than the preset support degree, and the first candidate set that is not updated may be used as the updated first frequent set, that is, the updated first frequent set. For an empty set, the iterative processing is aborted at this time.
计算上一次更新的第一频繁集对应的子集的置信度,并与预设置信度进行比较,以获取最终的强关联规则,将强关联规则对应的支持度作为医疗数据关系度。其中,一个概率样本的置信区间(Confidence interval)即为其置信度,是对这个样本的某个总体参数的区间估计,support为支持度。如步骤S21的实例中,设上一次获取到的更新的第一频繁集为{A1,A2},{A2,B2},基于该第一频繁集可以得到他们的子集为{A1},{A2},{A1,A2},{B2},{A2,B2},则得到的规则如下:A1->A2^B2(表示由症状A1,可以推出症状A2和疾病B2),A2->A1^B2(表示由症状A2,可以推出症状A1和疾病B2),A1^A2->B2(表示由症状A1,A2可推出疾病B2),B2->A1^A2(表示由疾病B2,可以推出症状A1,A2),A2^B2->A1(表示由症状A2和疾病B2,可推出症状A1)。然后,再根据公式support(A∪B)/suport(A)分别求出每条规则的置信度。本实施例中,则每条规则的置信度分别为support(A1,A2,B2)/support(A1)=1,support(A1,A2,B2)/support(A2)=1,support(A1,A2,B2)/support(A1,A2)=1,support(A1,A2,B2)/support(B2)=1,support(A1,A2,B2)/support(A2,B2)=1。将每条规则对应的置信度与预设置信度进行比较,选取大于或等于预设置信度的规则作为强关联规则,该强关联规则对应的置信度即为医疗数据关系度。其中,预设置信度为开发人员预先自定义的。The confidence of the subset corresponding to the first frequent set of the last update is calculated, and compared with the preset reliability to obtain the final strong association rule, and the support degree corresponding to the strong association rule is taken as the medical data relationship degree. Among them, the Confidence interval of a probability sample is its confidence, which is an interval estimate of a certain overall parameter of this sample, and support is support. In the example of step S21, the first frequent set of the last acquired update is {A1, A2}, {A2, B2}, and based on the first frequent set, their subset is obtained as {A1}, { A2}, {A1, A2}, {B2}, {A2, B2}, the rules obtained are as follows: A1->A2^B2 (indicating symptom A1, symptom A2 and disease B2 can be introduced), A2->A1 ^B2 (indicated by symptom A2, can give symptoms A1 and disease B2), A1^A2->B2 (expressed by symptom A1, A2 can launch disease B2), B2->A1^A2 (represented by disease B2, can be launched Symptoms A1, A2), A2^B2->A1 (indicating symptom A2 and disease B2, symptom A1 can be introduced). Then, according to the formula support(A∪B)/suport(A), the confidence of each rule is obtained separately. In this embodiment, the confidence level of each rule is support (A1, A2, B2) / support (A1) = 1, support (A1, A2, B2) / support (A2) = 1, support (A1, A2, B2) / support (A1, A2) = 1, support (A1, A2, B2) / support (B2) = 1, support (A1, A2, B2) / support (A2, B2) = 1. The confidence level corresponding to each rule is compared with the pre-set reliability, and the rule with greater than or equal to the pre-set reliability is selected as the strong association rule, and the confidence corresponding to the strong association rule is the medical data relationship degree. Among them, the pre-set reliability is pre-customized by the developer.
本实施例中,先基于疾病特征和症状特征,获取对应的第一候选集,以便根据每一第一候选集出现的次数确定对应的第一支持度,然后选取第一支持度大于或等于预设支持度的第一候选集作为第一频繁集,根据自然连接定理和剪枝定理对所述第一频繁集和所述第一候选集进行迭代处理,获取更新的第一候选集、更新的第一支持度和更新的第一频繁集,直至更新的第一频繁集为空集时,则基于上一次更新的第一频繁集确定所述医疗数据关系度。通过自然连接定理和剪枝定理来获取更新的第一频繁集,以减少计算量,提高医疗数据关系度的获取效率。In this embodiment, the first candidate set is obtained according to the disease feature and the symptom feature, so that the first support degree is determined according to the number of occurrences of each first candidate set, and then the first support degree is greater than or equal to the pre-selection. The first candidate set of the support degree is set as the first frequent set, and the first frequent set and the first candidate set are iteratively processed according to the natural connection theorem and the pruning theorem, and the updated first candidate set and the updated first candidate set are obtained. The first support level and the updated first frequent set, until the updated first frequent set is an empty set, the medical data relationship degree is determined based on the first frequent set of the last update. The updated first frequent set is obtained by the natural connection theorem and the pruning theorem to reduce the amount of calculation and improve the acquisition efficiency of the medical data relationship.
在一具体实施方式中,如图5所示,步骤S30中,即采用E-charts工具对医疗数据关系度进行图表转换,获取医疗数据关系图像,具体包括如下步骤:In a specific embodiment, as shown in FIG. 5, in step S30, the E-charts tool is used to perform graph conversion on the medical data relationship degree, and the medical data relationship image is obtained, which specifically includes the following steps:
S31:获取图表配置请求,图表配置请求包括图表ID。S31: Acquire a chart configuration request, and the chart configuration request includes a chart ID.
其中,图表ID是能够唯一识别E-charts工具中所存储的不同图表类型所对应的图表转换函数的标识。图表配置请求是用户选择制作关系图像的请求。具体地,用户在选择关系图像制作之后,会在图表类型配置界面显示所有图表类型,用户可通过图表类型配置请求确定所要选择的关系图像的类型,操作过程简单方便,有利于提升用户体验。Among them, the chart ID is an identifier that can uniquely identify the chart conversion function corresponding to different chart types stored in the E-charts tool. The chart configuration request is a request by the user to make a relationship image. Specifically, after selecting the relationship image creation, the user displays all the chart types in the chart type configuration interface, and the user can determine the type of the relationship image to be selected through the chart type configuration request, and the operation process is simple and convenient, which is beneficial to improving the user experience.
S32:获取E-charts工具中与图表ID相对应的图表转换函数。S32: Obtain a chart conversion function corresponding to the chart ID in the E-charts tool.
具体地,图表转换工具中存储有不同类型图表所对应的转换函数,每个转换函数都对应一转换函数标识,通过转换函数标识,可调用对应的转换函数方法。本实施例中,E-charts工具中包括但不限于饼图、柱状图和折线图、,条形图、柱状联动饼图、line折线图、scatter散点图、pie饼图、pie2嵌套环形、map地图、parallel平行坐标、graph关系图和graphLes圆圈型关系图等图表类型所对应的转换函数方法。Specifically, the chart conversion tool stores conversion functions corresponding to different types of charts, and each conversion function corresponds to a conversion function identifier, and the corresponding conversion function method can be called by the conversion function identifier. In this embodiment, the E-charts tool includes, but is not limited to, a pie chart, a histogram and a line chart, a bar chart, a column-linked pie chart, a line line chart, a scatter scatter chart, a pie pie chart, and a pie2 nested ring. The conversion function method corresponding to the chart type such as map map, parallel parallel coordinate, graph relation graph, and graphLes circle type relationship graph.
S33:调用图表转换函数对医疗数据关系度进行图表转换,获取医疗数据关系图像。S33: Call the chart conversion function to perform chart conversion on the medical data relationship degree, and obtain a medical data relationship image.
具体地,调用E-charts工具中的图表转换函数对医疗数据关系度进行转换,加载对应的医疗数据关系图像,并在可视化区域中显示该医疗数据关系图像。本实施例中,该E-charts工具会实时获取医疗数据关系度并更新关系图像,以使医疗数据关系图像具有时序性强,可靠性高的优点。Specifically, the chart conversion function in the E-charts tool is called to convert the medical data relationship degree, load the corresponding medical data relationship image, and display the medical data relationship image in the visualization area. In this embodiment, the E-charts tool obtains the medical data relationship degree and updates the relationship image in real time, so that the medical data relation image has the advantages of strong timing and high reliability.
本实施例中,获取图表配置请求,图表配置请求包括图表ID,以便基于图表ID,获取E-charts工具中与图表ID相对应的图表转换函数,最后,调用图表转换函数对医疗数据关系度进行图表转换,获取医疗数据关系图像,以使用户能够更加直观的看到医疗数据间的关系度,辅助坐席分析和咨询等工作,提高工作效率。In this embodiment, the chart configuration request is acquired, and the chart configuration request includes a chart ID to obtain a chart conversion function corresponding to the chart ID in the E-charts tool based on the chart ID, and finally, the chart conversion function is invoked to perform the medical data relationship degree. Chart conversion, access to medical data relationship images, so that users can more intuitively see the relationship between medical data, assist with agent analysis and consulting, and improve work efficiency.
应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本申请实施例的实施过程构成任何限定。It should be understood that the size of the sequence of the steps in the above embodiments does not mean that the order of execution is performed. The order of execution of each process should be determined by its function and internal logic, and should not be construed as limiting the implementation process of the embodiments of the present application.
实施例2Example 2
图6示出与实施例1中医疗数据关系图像获取方法一一对应的医疗数据关系图像获取装置的原理框图。如图6所示,该医疗数据关系图像获取装置包括目标医疗数据获取模块10、医疗数据关系度获取模块20和医疗数据关系图像获取模块30。其中,目标医疗数据获取模块10、医疗数据关系度获取模块20和医疗数据关系图像获取模块30的实现功能与实施例中医疗数据关系图像获取方法对应的步骤一一对应,为避免赘述,本实施例不一一详述。Fig. 6 is a block diagram showing the principle of the medical data relation image acquiring apparatus corresponding to the medical data relation image obtaining method in the first embodiment. As shown in FIG. 6, the medical data relationship image acquisition device includes a target medical data acquisition module 10, a medical data relationship degree acquisition module 20, and a medical data relationship image acquisition module 30. The implementation functions of the target medical data acquisition module 10, the medical data relationship degree acquisition module 20, and the medical data relationship image acquisition module 30 correspond one-to-one with the steps corresponding to the medical data relationship image acquisition method in the embodiment. To avoid redundancy, the implementation The examples are not detailed one by one.
目标医疗数据获取模块10,用于获取至少一个目标医疗数据。The target medical data obtaining module 10 is configured to acquire at least one target medical data.
医疗数据关系度获取模块20,用于采用Apriori算法对目标医疗数据进行关联性分析,获取医疗数据关系度。The medical data relationship obtaining module 20 is configured to perform correlation analysis on the target medical data by using the Apriori algorithm, and obtain the medical data relationship degree.
医疗数据关系图像获取模块30,用于采用E-charts工具对医疗数据关系度进行图表转换,获取医疗数据关系图像。The medical data relation image obtaining module 30 is configured to perform chart conversion on the medical data relationship degree by using the E-charts tool, and obtain a medical data relationship image.
优选地,目标医疗数据获取模块10包括目标网页地址获取单元11、原始医疗数据获取单元12和目标医疗数据获取单元13。Preferably, the target medical data acquisition module 10 includes a target webpage address acquisition unit 11, an original medical data acquisition unit 12, and a target medical data acquisition unit 13.
目标网页地址获取单元11,用于获取目标网页地址。The target webpage address obtaining unit 11 is configured to obtain a target webpage address.
原始医疗数据获取单元12,用于采用爬虫工具爬取目标网页地址对应的网页,获取至少一个原始医疗数据。The original medical data obtaining unit 12 is configured to crawl the webpage corresponding to the target webpage address by using the crawler tool to obtain at least one original medical data.
目标医疗数据获取单元13,用于对至少一个原始医疗数据进行数据清洗,获取至少一个目标医疗数据。The target medical data acquiring unit 13 is configured to perform data cleaning on the at least one original medical data to acquire at least one target medical data.
优选地,原始医疗数据获取单元12包括访问地址获取子单元121、访问地址存储子单元122和原始医疗数据获取子单元123。Preferably, the original medical data acquisition unit 12 includes an access address acquisition sub-unit 121, an access address storage sub-unit 122, and an original medical data acquisition sub-unit 123.
访问地址获取子单元121,用于采用爬虫工具,依据广度优先算法或深度优先算法爬取所述目标网页地址所链接的至少一个访问地址,每一所述访问地址对应一网页。The access address obtaining sub-unit 121 is configured to use a crawler tool to crawl at least one access address linked by the target webpage address according to a breadth-first algorithm or a depth-first algorithm, where each of the access addresses corresponds to a webpage.
访问地址存储子单元122,将至少一个访问地址存储在待下载消息队列中。The access address storage sub-unit 122 stores at least one access address in the message queue to be downloaded.
原始医疗数据获取子单元123,采用爬虫工具对待下载消息队列中的每一访问地址对应的网页进行数据提取,获取原始医疗数据。The original medical data acquisition sub-unit 123 uses the crawler tool to perform data extraction on the webpage corresponding to each access address in the download message queue to obtain the original medical data.
优选地,医疗数据关系度获取模块20包括第一候选集和第一支持度获取单元21、第一频繁集获取单元22和医疗数据关系度获取单元23。Preferably, the medical data relationship degree obtaining module 20 includes a first candidate set and a first support degree obtaining unit 21, a first frequent set obtaining unit 22, and a medical data relationship degree acquiring unit 23.
第一候选集和第一支持度获取单元21,用于基于目标医疗数据,获取至少一个第一候选集,根据每一第一候选集出现的次数确定对应的第一支持度。The first candidate set and the first support obtaining unit 21 are configured to acquire at least one first candidate set based on the target medical data, and determine a corresponding first support according to the number of occurrences of each first candidate set.
第一频繁集获取单元22,用于选取第一支持度大于或等于预设支持度的第一候选集作为第一频繁集。The first frequent set obtaining unit 22 is configured to select a first candidate set whose first support degree is greater than or equal to the preset support degree as the first frequent set.
医疗数据关系度获取单元23,用于根据自然连接定理和剪枝算法对第一频繁集和第一候选集进行迭代处理,获取更新的第一候选集、更新的第一支持度和更新的第一频繁集,直至更新的第一频繁集为空集时,则基于上一次更新的第一频繁集确定医疗数据关系度。The medical data relationship obtaining unit 23 is configured to perform iterative processing on the first frequent set and the first candidate set according to the natural connection theorem and the pruning algorithm, and obtain the updated first candidate set, the updated first support degree, and the updated first A frequent set, until the updated first frequent set is an empty set, then the medical data relationship is determined based on the first frequent set of the last update.
优选地,医疗数据关系图像获取模块30包括图表配置请求获取单元31、图表转换函数获取单元32和医疗数据关系图像获取单元33。Preferably, the medical data relation image acquisition module 30 includes a chart configuration request acquisition unit 31, a chart conversion function acquisition unit 32, and a medical data relationship image acquisition unit 33.
图表配置请求获取单元31,用于获取图表配置请求,图表配置请求包括图表ID。The chart configuration request obtaining unit 31 is configured to acquire a chart configuration request, and the chart configuration request includes a chart ID.
图表转换函数获取单元32,用于获取E-charts工具中与图表ID相对应的图表转换函数。The chart conversion function acquisition unit 32 is configured to acquire a chart conversion function corresponding to the chart ID in the E-charts tool.
医疗数据关系图像获取单元33,用于调用图表转换函数对医疗数据关系度进行图表转换,获取医疗数据关系图像。The medical data relation image obtaining unit 33 is configured to invoke a chart conversion function to perform chart conversion on the medical data relationship degree, and acquire a medical data relationship image.
实施例3Example 3
本实施例提供一计算机可读存储介质,该计算机可读存储介质上存储有计算机可读指 令,该计算机可读指令被处理器执行时实现实施例1中医疗数据关系图像获取方法,为避免重复,这里不再赘述。或者,该计算机可读指令被处理器执行时实现实施例2中医疗数据关系图像获取装置中各模块/单元的功能,为避免重复,这里不再赘述。The embodiment provides a computer readable storage medium, where the computer readable storage medium is stored by the processor, and the medical data relationship image acquisition method in Embodiment 1 is implemented to avoid duplication. , no longer repeat them here. Alternatively, when the computer readable instructions are executed by the processor, the functions of the modules/units in the medical data relation image obtaining device in Embodiment 2 are implemented. To avoid repetition, details are not described herein again.
该计算机可读存储介质可以包括:能够携带所述计算机可读指令代码的任何实体或装置、记录介质、U盘、移动硬盘、磁碟、光盘、计算机存储器、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、电载波信号、电信信号以及软件分发介质等。The computer readable storage medium can include any entity or device capable of carrying the computer readable instruction code, a recording medium, a USB flash drive, a removable hard drive, a magnetic disk, an optical disk, a computer memory, a read only memory (ROM, Read-Only) Memory), random access memory (RAM), electrical carrier signals, telecommunications signals, and software distribution media.
实施例4Example 4
图7是本申请一实施例提供的终端设备的示意图。如图7所示,该实施例的终端设备70包括:处理器71、存储器72以及存储在存储器72中并可在处理器71上运行的计算机可读指令73。处理器71执行计算机可读指令73时实现上述实施例1中医疗数据关系图像获取方法的步骤,例如图1所示的步骤S10至S30。或者,处理器71执行计算机可读指令73时实现上述实施例2中医疗数据关系图像获取装置的各模块/单元的功能,例如图6所示目标医疗数据获取模块10、医疗数据关系度获取模块20和医疗数据关系图像获取模块30的功能。FIG. 7 is a schematic diagram of a terminal device according to an embodiment of the present application. As shown in FIG. 7, the terminal device 70 of this embodiment includes a processor 71, a memory 72, and computer readable instructions 73 stored in the memory 72 and operable on the processor 71. When the processor 71 executes the computer readable instructions 73, the steps of the medical data relation image acquisition method in the above-described Embodiment 1 are implemented, such as steps S10 to S30 shown in FIG. Alternatively, when the processor 71 executes the computer readable instructions 73, the functions of the modules/units of the medical data relation image acquiring device in the second embodiment are implemented, for example, the target medical data acquiring module 10 and the medical data relationship acquiring module shown in FIG. 20 and medical data relate to the functionality of image acquisition module 30.
示例性的,计算机可读指令73可以被分割成一个或多个模块/单元,一个或者多个模块/单元被存储在存储器72中,并由处理器71执行,以完成本申请。一个或多个模块/单元可以是能够完成特定功能的一系列计算机可读指令703的指令段,该指令段用于描述计算机可读指令73在终端设备70中的执行过程。例如,计算机可读指令73可以被分割成目标医疗数据获取模块10、医疗数据关系度获取模块20和医疗数据关系图像获取模块30,各模块具体功能如如实施例2所述,在此不一一赘述。Illustratively, computer readable instructions 73 may be partitioned into one or more modules/units, one or more modules/units being stored in memory 72 and executed by processor 71 to complete the application. The one or more modules/units may be an instruction segment of a series of computer readable instructions 703 capable of performing a particular function for describing the execution of computer readable instructions 73 in the terminal device 70. For example, the computer readable instructions 73 may be divided into a target medical data acquisition module 10, a medical data relationship degree acquisition module 20, and a medical data relationship image acquisition module 30. The specific functions of the modules are as described in Embodiment 2, and are different here. A narrative.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
以上所述实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围,均应包含在本申请的保护范围之内。The above-mentioned embodiments are only used to explain the technical solutions of the present application, and are not limited thereto; although the present application has been described in detail with reference to the foregoing embodiments, those skilled in the art should understand that they can still implement the foregoing embodiments. The technical solutions described in the examples are modified or equivalently replaced with some of the technical features; and the modifications or substitutions do not deviate from the spirit and scope of the technical solutions of the embodiments of the present application, and should be included in Within the scope of protection of this application.

Claims (20)

  1. 一种医疗数据关系图像获取方法,其特征在于,包括:A medical data relationship image acquisition method, comprising:
    获取至少一个目标医疗数据;Obtaining at least one target medical data;
    采用Apriori算法对所述目标医疗数据进行关联性分析,获取医疗数据关系度;Performing correlation analysis on the target medical data by using the Apriori algorithm, and obtaining medical data relationship degree;
    采用E-charts工具对所述医疗数据关系度进行图表转换,获取医疗数据关系图像。The medical data relationship degree is graphically converted by using an E-charts tool to obtain a medical data relationship image.
  2. 如权利要求1所述的医疗数据关系图像获取方法,其特征在于,所述获取至少一个目标医疗数据,包括:The method of acquiring a medical data relationship image according to claim 1, wherein the obtaining the at least one target medical data comprises:
    获取目标网页地址;Get the landing page address;
    采用爬虫工具爬取所述目标网页地址对应的网页,获取至少一个原始医疗数据;Using a crawler tool to crawl the webpage corresponding to the target webpage address to obtain at least one original medical data;
    对至少一个所述原始医疗数据进行数据清洗,获取至少一个目标医疗数据。Performing data cleaning on at least one of the original medical data to acquire at least one target medical data.
  3. 如权利要求2所述的医疗数据关系图像获取方法,其特征在于,所述采用爬虫工具爬取所述目标网页地址对应的网页,获取至少一个原始医疗数据,包括:The medical data relationship image acquisition method according to claim 2, wherein the crawling tool crawls the webpage corresponding to the target webpage address to obtain at least one original medical data, including:
    采用爬虫工具,依据广度优先算法或深度优先算法爬取所述目标网页地址所链接的至少一个访问地址,每一所述访问地址对应一网页;Using a crawler tool, crawling at least one access address linked by the target webpage address according to a breadth-first algorithm or a depth-first algorithm, each of the access addresses corresponding to a webpage;
    将至少一个所述访问地址存储在待处理消息队列中;Storing at least one of the access addresses in a queue of pending messages;
    采用爬虫工具对所述待处理消息队列中的每一所述访问地址对应的网页进行数据提取,获取所述原始医疗数据。And using the crawler tool to perform data extraction on the webpage corresponding to each of the access addresses in the to-be-processed message queue, to obtain the original medical data.
  4. 如权利要求1所述的医疗数据关系图像获取方法,其特征在于,所述采用Apriori算法对所述目标医疗数据进行关联性分析,获取医疗数据关系度,包括:The medical data relationship image acquisition method according to claim 1, wherein the Apriori algorithm performs correlation analysis on the target medical data to obtain a medical data relationship degree, including:
    基于所述目标医疗数据,获取至少一个第一候选集,根据每一所述第一候选集出现的次数确定对应的第一支持度;And acquiring, according to the target medical data, at least one first candidate set, and determining a corresponding first support degree according to the number of occurrences of each of the first candidate sets;
    选取第一支持度大于或等于预设支持度的所述第一候选集作为第一频繁集;Selecting the first candidate set whose first support degree is greater than or equal to the preset support degree as the first frequent set;
    根据自然连接定理和剪枝算法对所述第一频繁集和所述第一候选集进行迭代处理,获取更新的第一候选集、更新的第一支持度和更新的第一频繁集,直至更新的第一频繁集为空集时,则基于上一次更新的第一频繁集确定所述医疗数据关系度。And the first frequent set and the first candidate set are iteratively processed according to the natural connection theorem and the pruning algorithm, and the updated first candidate set, the updated first support degree, and the updated first frequent set are obtained until the update When the first frequent set is an empty set, the medical data relationship degree is determined based on the first frequent set of the last update.
  5. 如权利要求1所述的医疗数据关系图像获取方法,其特征在于,采用E-charts工具对所述医疗数据关系度进行图表转换,获取医疗数据关系图像,包括:The medical data relationship image acquisition method according to claim 1, wherein the medical data relationship degree is graphically converted by using an E-charts tool to obtain a medical data relationship image, including:
    获取图表配置请求;所述图表配置请求包括图表ID;Obtaining a chart configuration request; the chart configuration request includes a chart ID;
    获取所述E-charts工具中与所述图表ID相对应的图表转换函数;Obtaining a chart conversion function corresponding to the chart ID in the E-charts tool;
    调用所述图表转换函数对所述医疗数据关系度进行图表转换,获取所述医疗数据关系图像。The chart conversion function is invoked to perform a chart conversion on the medical data relationship degree, and the medical data relationship image is acquired.
  6. 一种医疗数据关系图像获取装置,其特征在于,包括:A medical data relationship image acquiring device, comprising:
    目标医疗数据获取模块,用于获取至少一个目标医疗数据;a target medical data acquisition module, configured to acquire at least one target medical data;
    医疗数据关系度获取模块,用于采用Apriori算法对所述目标医疗数据进行关联性分析,获取医疗数据关系度;a medical data relationship obtaining module, configured to perform correlation analysis on the target medical data by using an Apriori algorithm, and obtain a medical data relationship degree;
    医疗数据关系图像获取模块,用于采用E-charts工具对所述医疗数据关系度进行图表转换,获取医疗数据关系图像。The medical data relation image obtaining module is configured to perform chart conversion on the medical data relationship degree by using an E-charts tool to obtain a medical data relationship image.
  7. 如权利要求6所述的医疗数据关系图像获取装置,其特征在于,所述医疗数据关系度获取模块,包括The medical data relationship image acquisition apparatus according to claim 6, wherein the medical data relationship degree acquisition module includes
    第一候选集和第一支持度获取单元,用于基于所述目标医疗数据,获取至少一个第一候选集,根据每一所述第一候选集出现的次数确定对应的第一支持度;a first candidate set and a first support obtaining unit, configured to acquire at least one first candidate set based on the target medical data, and determine a corresponding first support degree according to the number of occurrences of each of the first candidate set;
    第一频繁集获取单元,用于选取第一支持度大于或等于预设支持度的所述第一候选集作为第一频繁集;a first frequent set obtaining unit, configured to select the first candidate set whose first support degree is greater than or equal to a preset support degree as the first frequent set;
    医疗数据关系度获取单元,用于根据自然连接定理和剪枝算法对所述第一频繁集和所述第一候选集进行迭代处理,获取更新的第一候选集、更新的第一支持度和更新的第一频繁集,直至更新的第一频繁集为空集时,则基于上一次更新的第一频繁集确定所述医疗数据关系度。a medical data relationship obtaining unit, configured to iteratively process the first frequent set and the first candidate set according to a natural connection theorem and a pruning algorithm, to obtain an updated first candidate set, updated first support degree, and The updated first frequent set, until the updated first frequent set is an empty set, then determining the medical data relationship based on the first frequent set of the last update.
  8. 如权利要求6所述的医疗数据关系图像获取装置,其特征在于,所述医疗数据关系图像获取模块,包括:The medical data relationship image obtaining apparatus according to claim 6, wherein the medical data relation image obtaining module comprises:
    图表配置请求获取单元,用于获取图表配置请求;所述图表配置请求包括图表ID;a chart configuration request obtaining unit, configured to acquire a chart configuration request; the chart configuration request includes a chart ID;
    图表转换函数获取单元,用于获取所述E-charts工具中与所述图表ID相对应的图表转换函数;a chart conversion function obtaining unit, configured to acquire a chart conversion function corresponding to the chart ID in the E-charts tool;
    医疗数据关系图像获取单元,用于调用所述图表转换函数对所述医疗数据关系度进行图表转换,获取所述医疗数据关系图像。The medical data relation image obtaining unit is configured to invoke the chart conversion function to perform chart conversion on the medical data relationship degree, and acquire the medical data relationship image.
  9. 如权利要求6所述的医疗数据关系图像获取装置,其特征在于,所述目标医疗数据获取模块,包括:The medical data relation image obtaining apparatus according to claim 6, wherein the target medical data acquiring module comprises:
    目标网页地址获取单元,用于获取目标网页地址;a target webpage address obtaining unit, configured to obtain a target webpage address;
    原始医疗数据获取单元,用于采用爬虫工具爬取所述目标网页地址对应的网页,获取至少一个原始医疗数据;The original medical data obtaining unit is configured to crawl the webpage corresponding to the target webpage address by using a crawler tool to obtain at least one original medical data;
    目标医疗数据获取单元,用于对至少一个所述原始医疗数据进行数据清洗,获取至少一个目标医疗数据。The target medical data acquiring unit is configured to perform data cleaning on the at least one of the original medical data to acquire at least one target medical data.
  10. 如权利要求6所述的医疗数据关系图像获取装置,其特征在于,所述原始医疗数据获取单元,包括:The medical data relation image obtaining apparatus according to claim 6, wherein the original medical data acquiring unit comprises:
    访问地址获取子单元,用于采用爬虫工具,依据广度优先算法或深度优先算法爬取所述目标网页地址所链接的至少一个访问地址,每一所述访问地址对应一网页;An access address obtaining sub-unit, configured to: crawl the at least one access address linked by the target webpage address according to a breadth-first algorithm or a depth-first algorithm, where each of the access addresses corresponds to a webpage;
    访问地址存储子单元,用于将至少一个所述访问地址存储在待处理消息队列中;An access address storage subunit, configured to store at least one of the access addresses in a queue of pending messages;
    原始医疗数据获取子单元,用于采用爬虫工具对所述待处理消息队列中的每一所述访问地址对应的网页进行数据提取,获取所述原始医疗数据。The original medical data acquisition sub-unit is configured to perform data extraction on the webpage corresponding to each of the access addresses in the to-be-processed message queue by using a crawler tool to obtain the original medical data.
  11. 一种终端设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,其特征在于,所述处理器执行所述计算机可读指令时实现如下步骤:A terminal device comprising a memory, a processor, and computer readable instructions stored in the memory and operable on the processor, wherein the processor executes the computer readable instructions as follows step:
    获取至少一个目标医疗数据;Obtaining at least one target medical data;
    采用Apriori算法对所述目标医疗数据进行关联性分析,获取医疗数据关系度;Performing correlation analysis on the target medical data by using the Apriori algorithm, and obtaining medical data relationship degree;
    采用E-charts工具对所述医疗数据关系度进行图表转换,获取医疗数据关系图像。The medical data relationship degree is graphically converted by using an E-charts tool to obtain a medical data relationship image.
  12. 如权利要求11所述的终端设备,其特征在于,所述获取至少一个目标医疗数据,包括:The terminal device according to claim 11, wherein the obtaining the at least one target medical data comprises:
    获取目标网页地址;Get the landing page address;
    采用爬虫工具爬取所述目标网页地址对应的网页,获取至少一个原始医疗数据;Using a crawler tool to crawl the webpage corresponding to the target webpage address to obtain at least one original medical data;
    对至少一个所述原始医疗数据进行数据清洗,获取至少一个目标医疗数据。Performing data cleaning on at least one of the original medical data to acquire at least one target medical data.
  13. 如权利要求12所述的终端设备,其特征在于,所述采用爬虫工具爬取所述目标网页地址对应的网页,获取至少一个原始医疗数据,包括:The terminal device according to claim 12, wherein the crawling tool crawls the webpage corresponding to the target webpage address to obtain at least one original medical data, including:
    采用爬虫工具,依据广度优先算法或深度优先算法爬取所述目标网页地址所链接的至少一个访问地址,每一所述访问地址对应一网页;Using a crawler tool, crawling at least one access address linked by the target webpage address according to a breadth-first algorithm or a depth-first algorithm, each of the access addresses corresponding to a webpage;
    将至少一个所述访问地址存储在待处理消息队列中;Storing at least one of the access addresses in a queue of pending messages;
    采用爬虫工具对所述待处理消息队列中的每一所述访问地址对应的网页进行数据提取,获取所述原始医疗数据。And using the crawler tool to perform data extraction on the webpage corresponding to each of the access addresses in the to-be-processed message queue, to obtain the original medical data.
  14. 如权利要求11所述的终端设备,其特征在于,所述采用Apriori算法对所述目标医疗数据进行关联性分析,获取医疗数据关系度,包括:The terminal device according to claim 11, wherein the Apriori algorithm performs correlation analysis on the target medical data to obtain a medical data relationship degree, including:
    基于所述目标医疗数据,获取至少一个第一候选集,根据每一所述第一候选集出现的 次数确定对应的第一支持度;And acquiring, according to the target medical data, at least one first candidate set, and determining a corresponding first support degree according to the number of occurrences of each of the first candidate sets;
    选取第一支持度大于或等于预设支持度的所述第一候选集作为第一频繁集;Selecting the first candidate set whose first support degree is greater than or equal to the preset support degree as the first frequent set;
    根据自然连接定理和剪枝算法对所述第一频繁集和所述第一候选集进行迭代处理,获取更新的第一候选集、更新的第一支持度和更新的第一频繁集,直至更新的第一频繁集为空集时,则基于上一次更新的第一频繁集确定所述医疗数据关系度。And the first frequent set and the first candidate set are iteratively processed according to the natural connection theorem and the pruning algorithm, and the updated first candidate set, the updated first support degree, and the updated first frequent set are obtained until the update When the first frequent set is an empty set, the medical data relationship degree is determined based on the first frequent set of the last update.
  15. 如权利要求11所述的终端设备,其特征在于,采用E-charts工具对所述医疗数据关系度进行图表转换,获取医疗数据关系图像,包括:The terminal device according to claim 11, wherein the medical data relationship degree is graphically converted by using an E-charts tool to obtain a medical data relationship image, including:
    获取图表配置请求;所述图表配置请求包括图表ID;Obtaining a chart configuration request; the chart configuration request includes a chart ID;
    获取所述E-charts工具中与所述图表ID相对应的图表转换函数;Obtaining a chart conversion function corresponding to the chart ID in the E-charts tool;
    调用所述图表转换函数对所述医疗数据关系度进行图表转换,获取所述医疗数据关系图像。The chart conversion function is invoked to perform a chart conversion on the medical data relationship degree, and the medical data relationship image is acquired.
  16. 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机可读指令,其特征在于,所述计算机可读指令被处理器执行时实现如下步骤:A computer readable storage medium storing computer readable instructions, wherein the computer readable instructions, when executed by a processor, implement the following steps:
    获取至少一个目标医疗数据;Obtaining at least one target medical data;
    采用Apriori算法对所述目标医疗数据进行关联性分析,获取医疗数据关系度;Performing correlation analysis on the target medical data by using the Apriori algorithm, and obtaining medical data relationship degree;
    采用E-charts工具对所述医疗数据关系度进行图表转换,获取医疗数据关系图像。The medical data relationship degree is graphically converted by using an E-charts tool to obtain a medical data relationship image.
  17. 如权利要求16所述的计算机可读存储介质,其特征在于,所述获取至少一个目标医疗数据,包括:The computer readable storage medium of claim 16, wherein the obtaining the at least one target medical data comprises:
    获取目标网页地址;Get the landing page address;
    采用爬虫工具爬取所述目标网页地址对应的网页,获取至少一个原始医疗数据;Using a crawler tool to crawl the webpage corresponding to the target webpage address to obtain at least one original medical data;
    对至少一个所述原始医疗数据进行数据清洗,获取至少一个目标医疗数据。Performing data cleaning on at least one of the original medical data to acquire at least one target medical data.
  18. 如权利要求17所述的计算机可读存储介质,其特征在于,所述采用爬虫工具爬取所述目标网页地址对应的网页,获取至少一个原始医疗数据,包括:The computer readable storage medium according to claim 17, wherein the crawling tool crawls the webpage corresponding to the target webpage address to obtain at least one original medical data, including:
    采用爬虫工具,依据广度优先算法或深度优先算法爬取所述目标网页地址所链接的至少一个访问地址,每一所述访问地址对应一网页;Using a crawler tool, crawling at least one access address linked by the target webpage address according to a breadth-first algorithm or a depth-first algorithm, each of the access addresses corresponding to a webpage;
    将至少一个所述访问地址存储在待处理消息队列中;Storing at least one of the access addresses in a queue of pending messages;
    采用爬虫工具对所述待处理消息队列中的每一所述访问地址对应的网页进行数据提取,获取所述原始医疗数据。And using the crawler tool to perform data extraction on the webpage corresponding to each of the access addresses in the to-be-processed message queue, to obtain the original medical data.
  19. 如权利要求16所述的计算机可读存储介质,其特征在于,所述采用Apriori算法对所述目标医疗数据进行关联性分析,获取医疗数据关系度,包括:The computer readable storage medium according to claim 16, wherein the Apriori algorithm performs correlation analysis on the target medical data to obtain a medical data relationship degree, including:
    基于所述目标医疗数据,获取至少一个第一候选集,根据每一所述第一候选集出现的次数确定对应的第一支持度;And acquiring, according to the target medical data, at least one first candidate set, and determining a corresponding first support degree according to the number of occurrences of each of the first candidate sets;
    选取第一支持度大于或等于预设支持度的所述第一候选集作为第一频繁集;Selecting the first candidate set whose first support degree is greater than or equal to the preset support degree as the first frequent set;
    根据自然连接定理和剪枝算法对所述第一频繁集和所述第一候选集进行迭代处理,获取更新的第一候选集、更新的第一支持度和更新的第一频繁集,直至更新的第一频繁集为空集时,则基于上一次更新的第一频繁集确定所述医疗数据关系度。And the first frequent set and the first candidate set are iteratively processed according to the natural connection theorem and the pruning algorithm, and the updated first candidate set, the updated first support degree, and the updated first frequent set are obtained until the update When the first frequent set is an empty set, the medical data relationship degree is determined based on the first frequent set of the last update.
  20. 如权利要求16所述的计算机可读存储介质,其特征在于,采用E-charts工具对所述医疗数据关系度进行图表转换,获取医疗数据关系图像,包括:The computer readable storage medium according to claim 16, wherein the graphical conversion of the medical data relationship degree by using an E-charts tool to obtain a medical data relationship image comprises:
    获取图表配置请求;所述图表配置请求包括图表ID;Obtaining a chart configuration request; the chart configuration request includes a chart ID;
    获取所述E-charts工具中与所述图表ID相对应的图表转换函数;Obtaining a chart conversion function corresponding to the chart ID in the E-charts tool;
    调用所述图表转换函数对所述医疗数据关系度进行图表转换,获取所述医疗数据关系图像。The chart conversion function is invoked to perform a chart conversion on the medical data relationship degree, and the medical data relationship image is acquired.
PCT/CN2018/077476 2018-01-12 2018-02-28 Medical data relationship image acquisition method and apparatus, terminal device and storage medium WO2019136807A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810031265.6A CN108461127B (en) 2018-01-12 2018-01-12 Medical data relation image acquisition method and device, terminal equipment and storage medium
CN201810031265.6 2018-01-12

Publications (1)

Publication Number Publication Date
WO2019136807A1 true WO2019136807A1 (en) 2019-07-18

Family

ID=63221426

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/077476 WO2019136807A1 (en) 2018-01-12 2018-02-28 Medical data relationship image acquisition method and apparatus, terminal device and storage medium

Country Status (2)

Country Link
CN (1) CN108461127B (en)
WO (1) WO2019136807A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109358602A (en) * 2018-10-23 2019-02-19 山东中创软件商用中间件股份有限公司 A kind of failure analysis methods, device and relevant device
CN112349399B (en) * 2020-11-12 2022-05-24 杭州火树科技有限公司 Operation omission automatic detection method based on correlation algorithm
CN112908484A (en) * 2021-01-18 2021-06-04 吾征智能技术(北京)有限公司 System, equipment and storage medium for analyzing diseases by cross-modal fusion

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104715013A (en) * 2015-01-26 2015-06-17 南京邮电大学 Hadoop-based user health data analysis method and system
CN105893765A (en) * 2016-04-06 2016-08-24 成都数联易康科技有限公司 Echarts-based hierarchical diagnosis and treatment analysis and data visualization system
CN106055908A (en) * 2016-06-13 2016-10-26 武汉理工大学 Personal medical information recommending method and system based on cloud computation
CN106202883A (en) * 2016-06-28 2016-12-07 成都中医药大学 A kind of method setting up disease cloud atlas based on big data analysis
US20170310733A1 (en) * 2012-10-01 2017-10-26 Dexcom, Inc. Analyte data retriever

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104794234B (en) * 2015-05-06 2019-02-15 华北电力大学 Data processing method and device for fellow peers' evaluation
CN105022783A (en) * 2015-06-03 2015-11-04 南京邮电大学 Hadoop based user service security system and method
CN107563857B (en) * 2017-08-31 2020-10-09 齐鲁工业大学 Customer purchasing behavior analysis method based on logic reasoning negative association rule pruning technology

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170310733A1 (en) * 2012-10-01 2017-10-26 Dexcom, Inc. Analyte data retriever
CN104715013A (en) * 2015-01-26 2015-06-17 南京邮电大学 Hadoop-based user health data analysis method and system
CN105893765A (en) * 2016-04-06 2016-08-24 成都数联易康科技有限公司 Echarts-based hierarchical diagnosis and treatment analysis and data visualization system
CN106055908A (en) * 2016-06-13 2016-10-26 武汉理工大学 Personal medical information recommending method and system based on cloud computation
CN106202883A (en) * 2016-06-28 2016-12-07 成都中医药大学 A kind of method setting up disease cloud atlas based on big data analysis

Also Published As

Publication number Publication date
CN108461127B (en) 2020-10-20
CN108461127A (en) 2018-08-28

Similar Documents

Publication Publication Date Title
US10395004B2 (en) Hypothesis-driven, real-time analysis of physiological data streams using textual representations
US9460085B2 (en) Testing and training a question-answering system
Oh et al. Learning to exploit invariances in clinical time-series data using sequence transformer networks
WO2021012519A1 (en) Artificial intelligence-based question and answer method and apparatus, computer device, and storage medium
WO2019136807A1 (en) Medical data relationship image acquisition method and apparatus, terminal device and storage medium
CN110837550A (en) Knowledge graph-based question and answer method and device, electronic equipment and storage medium
WO2019136806A1 (en) Medical model training method and apparatus, medical identification method and apparatus, device, and medium
EP3956763A1 (en) Systems and methods for semi-automated data transformation and presentation of content through adapted user interface
US20100118267A1 (en) Finding sequential matches in eye tracking data
JP6783375B2 (en) Methods and equipment for information interaction
Beheshti et al. Enabling the analysis of cross-cutting aspects in ad-hoc processes
US11663057B2 (en) Analytics framework for selection and execution of analytics in a distributed environment
EP2191399A1 (en) System and method for analyzing electronic data records
US11232134B2 (en) Customized visualization based intelligence augmentation
Bhowmick et al. VOGUE: Towards A Visual Interaction-aware Graph Query Processing Framework.
JP2022163051A (en) Compound property prediction model training method and device, electronic device, storage medium, and computer program
CN103578067A (en) Association device and method for diagnosis and treatment report
JP2023020910A (en) Method for constructing medicine synergism prediction model, prediction method, and corresponding apparatus
Hill et al. Which feature location technique is better?
CN110457304A (en) Data cleaning method, device, electronic equipment and readable storage medium storing program for executing
Fhima et al. Lirot. ai: a novel platform for crowd-sourcing retinal image segmentations
Howe et al. Smoothing the ROI Curve for Scientific Data Management Applications.
US11710068B2 (en) Labeling a dataset
US20240086394A1 (en) Multimodal retrieval and execution monitoring using rich recipe representation
JP5600826B1 (en) Unstructured data processing system, unstructured data processing method and program

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18899720

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 19/10/2020)

122 Ep: pct application non-entry in european phase

Ref document number: 18899720

Country of ref document: EP

Kind code of ref document: A1