WO2020119383A1 - Medical insurance supervision method, device, apparatus and computer readable storage medium - Google Patents

Medical insurance supervision method, device, apparatus and computer readable storage medium Download PDF

Info

Publication number
WO2020119383A1
WO2020119383A1 PCT/CN2019/118810 CN2019118810W WO2020119383A1 WO 2020119383 A1 WO2020119383 A1 WO 2020119383A1 CN 2019118810 W CN2019118810 W CN 2019118810W WO 2020119383 A1 WO2020119383 A1 WO 2020119383A1
Authority
WO
WIPO (PCT)
Prior art keywords
preset
medical insurance
medical
insured
suspected abnormal
Prior art date
Application number
PCT/CN2019/118810
Other languages
French (fr)
Chinese (zh)
Inventor
陈明东
黄越
胥畅
Original Assignee
平安医疗健康管理股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安医疗健康管理股份有限公司 filed Critical 平安医疗健康管理股份有限公司
Publication of WO2020119383A1 publication Critical patent/WO2020119383A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/08Insurance

Definitions

  • This application relates to the technical field of medical insurance, in particular to a medical insurance supervision method, device, device and computer-readable storage medium.
  • the main purpose of this application is to provide a method of medical insurance supervision, which aims to solve the above technical problems of medical insurance supervision that are difficult, low in accuracy, and prone to regulatory loopholes, so as to reduce the difficulty of medical insurance supervision and improve the accuracy and rigor of medical insurance supervision .
  • this application provides a medical insurance supervision method, including the following steps:
  • a preset model extracting preset fields in the medical consultation data, wherein the medical consultation data includes information of the insured, medical records and diagnosis and treatment paths;
  • the preset field is analyzed based on the preset model, and a suspected abnormal case is extracted according to the analysis result.
  • the present application further proposes a medical insurance supervision device
  • the medical insurance supervision device includes: a memory, a processor, and computer-readable instructions stored on the memory and operable on the processor, the When the computer-readable instructions are executed by the processor, the steps of the medical insurance supervision method are implemented.
  • the medical insurance supervision method includes the following steps: obtaining medical data of the insured, wherein the medical data includes information of the insured, medical records, and Diagnosis and treatment path; according to a preset model, extract the preset fields in the visit data; analyze the preset fields based on the preset model, and extract suspected abnormal cases according to the analysis results.
  • the present application also proposes a medical insurance supervision device, the medical insurance supervision device includes a data acquisition module, a field extraction module and an abnormality analysis module, wherein the data acquisition module is used to obtain medical data of the insured, Wherein, the medical consultation data includes information of the insured, medical records and diagnosis and treatment paths; the field extraction module is used to extract preset fields in the medical consultation data according to a preset model; and the abnormality analysis module is used to The preset model analyzes the preset field and extracts a suspected abnormal case according to the analysis result.
  • the present application further proposes a computer-readable storage medium having medical insurance supervision readable instructions stored on the computer-readable storage medium, the medical insurance supervision readable instructions being executed by a processor to implement a medical insurance supervision method Steps, the medical insurance supervision method includes the following steps: obtaining medical data of the insured, wherein the medical data includes information of the insured, medical records and diagnosis and treatment paths; according to a preset model, extracting Set a field; analyze the preset field based on the preset model, and extract a suspected abnormal case according to the analysis result.
  • the medical insurance supervision method includes the following steps: obtaining medical data of the insured; extracting preset fields in the medical data according to a preset model, wherein the medical data includes information of the insured and medical records And diagnosis and treatment path; analyze the preset field based on the preset model, and extract the suspected abnormal case according to the analysis result.
  • the preset fields are analyzed based on the preset model.
  • the suspected abnormal case has a larger difference from the normal case. According to the analysis result, the suspected abnormal case can be automatically extracted from many cases for further processing.
  • the corresponding regulatory requirements can be achieved, thereby achieving automatic supervision of medical insurance without manual supervision, reducing the difficulty of supervision, greatly saving human resources, and improving the accuracy and rigor of medical insurance supervision , Effectively avoid the occurrence of regulatory loopholes.
  • FIG. 1 is a schematic flowchart of a first embodiment of a medical insurance supervision method for application
  • FIG. 2 is a schematic flowchart of a second embodiment of a medical insurance supervision application method
  • FIG. 3 is a schematic flowchart of a third embodiment of a medical insurance supervision application method
  • FIG. 4 is a detailed flowchart of step S200 in the fourth embodiment of the medical insurance supervision method of the application.
  • FIG. 5 is a detailed flowchart of step S240 in the fifth embodiment of the medical insurance supervision method of the application.
  • FIG. 6 is a detailed flowchart of step S300 in the sixth embodiment of the medical insurance supervision method of the application.
  • FIG. 7 is a schematic structural diagram of a medical insurance supervision device in a hardware operating environment involved in an embodiment of the present application.
  • the main solution of the embodiment of the present application is to analyze the medical data of the insured based on a preset model and extract suspected abnormal cases.
  • This application provides a solution to analyze the insured's medical data based on a preset model and extract suspected abnormal cases to achieve automatic supervision of medical insurance costs, reduce the difficulty of supervision, and improve the accuracy of supervision And rigor.
  • the first embodiment of the present application provides a medical insurance supervision method.
  • the medical insurance supervision method includes the following steps:
  • Step S100 Obtain medical data of the insured, wherein the medical data includes information of the insured, medical records and diagnosis and treatment paths;
  • medical insurance costs are settled directly when the insured swipes a card to pay for medical treatment, so as to facilitate the medical treatment of the insured and improve the circulation efficiency of hospitals and other medical institutions.
  • Medical insurance costs cover all or part of the insured's medical expenses.
  • the consultation data specifically include the information of the insured, the condition record and the diagnosis and treatment path, etc.
  • the condition record helps to monitor the development of the insured's condition and determine whether the diagnosis and treatment path adopted is reasonable and effective according to the development of the condition.
  • diagnosis and treatment The path can specifically include the insured's inspection items and drug programs.
  • the insured's visit data can be uploaded to the user's system (such as the Human Resources and Social Security Bureau) after the treatment (including outpatient treatment and inpatient treatment), or can be uploaded to the user's system regularly during the treatment In order to discover the possible loss of reimbursement or medical insurance costs in time to avoid the eventual loss of medical insurance costs.
  • the user's system such as the Human Resources and Social Security Bureau
  • the insured's visit data can be uploaded to the user's system regularly during the treatment In order to discover the possible loss of reimbursement or medical insurance costs in time to avoid the eventual loss of medical insurance costs.
  • Step S200 According to the preset model, extract the preset fields in the consultation data
  • the medical data usually includes basic information such as the insured's name and age, the corresponding inspection information for various examinations performed by the insured, and medical record information such as medical symptoms and treatment plans recorded by the doctor.
  • the preset fields included in the basic information can often be extracted directly from the form in a relatively simple manner, and the form of the inspection information and medical record information is varied, and the presets often need to be extracted based on natural language processing and the like Fields, for further analysis, the extraction of preset fields will be further elaborated later.
  • the preset field is determined according to the preset model, and the preset model is determined according to the specific item to be supervised.
  • Step S300 Analyze the preset field based on the preset model, and extract the suspected abnormal case according to the analysis result.
  • the preset field is analyzed based on the preset model, and the suspected abnormal case is extracted according to the analysis result.
  • the analysis of preset fields is mainly deviation analysis.
  • Deviation refers to abnormal or irregular cases in the case to be analyzed, or cases where the consistency between the observation results and the results predicted by the preset model is poor.
  • the goal of deviation analysis is to find out abnormal cases with meaningful differences from the reference object.
  • Specific deviation analysis methods include clustering method, sequence anomaly method, nearest neighbor method, multidimensional data analysis method, etc., which will be detailed later Elaborate.
  • the preset fields analyze the deviation of a particular insured relative to other insureds. If the deviation of the insured is relatively large compared to other insureds, the case is extracted as a suspected abnormal case, and further abnormalities are to be confirmed Cases, recalculation of medical insurance costs, etc.
  • the medical insurance supervision method includes the following steps: obtaining medical data of the insured, wherein the medical data includes information of the insured, medical records, and diagnosis and treatment paths; according to a preset model, extracting preset fields in the medical data ; Analyze the preset fields based on the preset model, and extract suspected abnormal cases based on the analysis results.
  • the preset fields are analyzed based on the preset model.
  • the suspected abnormal case has a larger deviation from the normal case. According to the analysis result, the suspected abnormal case can be automatically extracted from many cases for further processing.
  • the corresponding regulatory requirements can be achieved, thereby achieving automatic supervision of medical insurance without manual supervision, reducing the difficulty of supervision, greatly saving human resources, and improving the accuracy and rigor of medical insurance supervision , Effectively avoid the occurrence of regulatory loopholes.
  • the medical insurance supervision method further includes The following steps:
  • Step S400 output a suspected abnormal case
  • Step S500 Receive confirmation information corresponding to the suspected abnormal case
  • Step S610 When the confirmation information confirms that the suspected abnormal case is abnormal, the medical insurance fee is amended.
  • the further confirmation may be automatic confirmation or manual confirmation, or a combination of automatic confirmation and manual confirmation.
  • the range of the preset field corresponding to the obvious abnormality can be set, and the case can be directly confirmed by the supervisory readable command as an abnormal case, and corresponding confirmation information can be generated.
  • the relevant personnel will manually confirm and give the corresponding confirmation information.
  • Step S620 When the confirmation information denies that the suspected abnormal case is abnormal, optimize the preset model according to the suspected abnormal case, where the optimization of the preset model is implemented based on machine learning.
  • the preset model is optimized based on the method of machine learning to improve the accuracy and reliability of medical insurance supervision.
  • the step of extracting the preset fields in the consultation data according to the preset model includes:
  • Step S210 cleaning the medical data
  • Step S220 Obtain the canonical text and non-normative text in the cleaned medical data, respectively;
  • Step S230 Extract the preset fields in the specification text according to the preset model
  • Step S240 Analyze the non-standard text based on the recurrent neural network, and extract the preset fields in the non-standard text according to the analysis results of the preset model and the non-standard text.
  • data cleaning refers to checking and correcting the identifiable errors in the data, including checking the consistency of the data, removing duplicate data, correcting invalid and erroneous data, etc., in order to improve the efficiency of the execution of subsequent steps.
  • the standard text refers to the relatively fixed information such as the name and age of the insured person recorded in specific items such as tables and forms.
  • Non-standard text refers to relatively complex information handwritten by doctors and other personnel such as examination information and medical record information.
  • the preset fields can be directly extracted according to the requirements of the preset model, while for the non-canonical text, the preset fields are often extracted based on the natural language processing method and based on recurrent neural network analysis .
  • non-standard text is analyzed based on a recurrent neural network, and the non-standard text is extracted from the analysis results of the preset model and the non-standard text
  • the steps of the preset fields include:
  • Step S241 Represent non-standard text as a sequence of vectors
  • Step S242 Based on the semantic content and semantic distance of the vector sequence, encode the vector sequence into a sentence vector matrix based on the bidirectional recurrent neural network;
  • Step S243 According to the preset model, the attention mechanism is used to compress the sentence vector matrix into a sentence vector, and the preset field in the sentence vector is extracted.
  • the non-normative text to be extracted is represented by a vector sequence, combined with the semantic content and semantic distance of the vector sequence, the vector sequence is encoded into a sentence vector matrix based on the bidirectional recurrent neural network model.
  • the semantic content reflects the meaning of the text itself, while the semantic distance reflects the correlation between the texts, which can be expressed by correlation functions or correlation coefficients.
  • each row can be understood as a word vector, and the word vector is sensitive to the context of the sentence.
  • the attention vector mechanism is used to compress the sentence vector matrix into sentence vectors, and the required preset fields are extracted therefrom, so that the consultation data is matched to the corresponding standardized preset fields for subsequent analysis .
  • the attention mechanism makes the neural network have the ability to focus on its input (or feature) subset: select a specific input. Attention can be applied to any type of input regardless of its shape.
  • the attention mechanism is a resource allocation scheme that solves the problem of information overload, so as to allocate computing resources to more important tasks.
  • the step of analyzing the preset field based on the preset model and extracting the suspected abnormal case according to the analysis result includes:
  • Step S310 Calculate the distance metric of the preset field in the case corresponding to each insured.
  • Step S320 Calculate the outlier degree of the distance metric of the insured's preset field relative to the distance metric of other insured's preset fields;
  • Step S330 comparing the outlier degree and the preset outlier degree
  • Step S340 When the outlier degree is greater than the preset outlier degree, mark the case corresponding to the insured as a suspected abnormal case.
  • each insured's visit data can be regarded as a point in the statistical distribution graph, and whether there is a suspected abnormal case is determined according to the distribution position of the point.
  • the preset fields may include at least one of symptoms of illness, examination items, medication plan, length of hospitalization, and length of hospitalization interval.
  • the preset fields may include at least one of symptoms of illness, examination items, medication plan, length of hospitalization, and length of hospitalization interval.
  • decomposed hospitalization means that the insured person deliberately splits one hospitalization into multiple hospitalizations in order to obtain more reimbursement expenses, so as to reimburse them separately, resulting in the loss of medical insurance funds.
  • the distance metric can be obtained based on the absolute distance (Manhattan distance), Euclidean distance, and Mahalanobis distance of the preset fields.
  • the preset field can be selected as the range, interquartile range, interquartile range, mean difference, standard deviation, etc.
  • the threshold for the number of days between intervals can be set to 10 to 15 days.
  • the threshold for insured persons whose number of days between consecutive hospitalizations is less than the threshold it can be classified as a suspected abnormal case. Further, by analyzing the medical condition in the medical data of the suspected abnormal case and the diagnosis and treatment path including the inspection items and medication plan, etc., the suspected abnormal case is further confirmed. If the same insured person is hospitalized for the same disease within a short period of time, or the disease path of the two hospitalizations is the same or belongs to a continuation relationship, it is determined to be a case of suspected decomposition hospitalization.
  • the possible false hospitalizations are monitored.
  • false hospitalization refers to the insured person making hospital records without hospitalization in order to obtain more reimbursement expenses in order to defraud the reimbursement expenses.
  • the anomaly detection algorithm adopted by the preset model in identifying suspected abnormal cases is an index-based algorithm, that is, given a data set, the index-based algorithm uses a multi-dimensional index structure to find the phase of each case to be analyzed within a certain radius O case.
  • M is the maximum number of objects within a certain radius of the case to be analyzed. If the M+l adjacent case of the case to be analyzed is found, the case to be analyzed is not a suspected abnormal case.
  • the complexity of the index-based algorithm it has good scalability.
  • the situation in which the insured person is admitted to the hospital for treatment when the patient's condition does not reach the admission treatment indication is supervised.
  • the analysis is performed through a nested loop algorithm, that is, a nested loop algorithm and an index-based algorithm. Because the loop algorithm and the nested algorithm have the same complexity, and at the same time avoid the construction of the index structure, in order to minimize the number of data input and output, the memory buffer space is divided into two halves, and the data set is divided into several logical blocks. By carefully selecting the order in which the logic blocks are loaded into each buffer area, the efficiency of input and output is effectively improved.
  • the situation of splitting the charges of a project is monitored.
  • the unit-based algorithm analyzes the insured's medical data. By dividing the data space into units with side length equal to d/(2*k1/2). Each unit has two layers surrounding it. The thickness of the first layer is one unit, and the thickness of the second layer is (2*k1/2-1).
  • the algorithm counts outliers on a unit-by-unit basis rather than on an object-by-object basis. For a given unit, three counts are accumulated: the number of objects in the unit, the number of objects in the unit and the first layer, and the number of objects in the unit and two layers.
  • the algorithm changes the detection of outlier data for each element of the data set to the detection of outlier data for each unit, thereby improving the efficiency of the algorithm and reducing the complexity of the algorithm. It performs anomaly detection in this way: if the number of objects in the unit and the first layer is greater than a preset number, all objects in the unit are not abnormal; if the number of objects in the unit and the two layers is less than or equal to the above-mentioned preset number , All objects in the unit are abnormal; otherwise, some data in the unit may be abnormal. In order to detect these abnormal points, you need to join the processing object by object.
  • FIG. 7 is a schematic structural diagram of a terminal of a hardware operating environment involved in the embodiment of the present application, that is, a medical insurance supervision device.
  • the terminal may be a server, a PC, or a smart phone, tablet computer, e-book reader, MP3 (Moving Picture Experts Group Audio Layer III, motion picture expert compression standard audio layer 3 player, MP4 (Moving Picture Experts Group Audio Layer IV, the standard audio layer for motion picture experts compression 3) Players, portable computers and other mobile terminal devices with display functions.
  • MP3 Moving Picture Experts Group Audio Layer III, motion picture expert compression standard audio layer 3 player
  • MP4 Moving Picture Experts Group Audio Layer IV, the standard audio layer for motion picture experts compression 3
  • portable computers and other mobile terminal devices with display functions.
  • the terminal may include: a processor 1001, such as a CPU, a network interface 1004, a user interface 1003, a memory 1005, and a communication bus 1002.
  • the communication bus 1002 is used to implement connection communication between these components.
  • the user interface 1003 may include a display (Display), an input unit such as a keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface and a wireless interface.
  • the network interface 1004 may optionally include a standard wired interface and a wireless interface (such as a WI-FI interface).
  • the memory 1005 may be a high-speed RAM memory or a stable memory (non-volatile memory), such as disk storage.
  • the memory 1005 may optionally be a storage device independent of the foregoing processor 1001.
  • the terminal may also include a camera, RF (Radio Frequency (radio frequency) circuits, sensors, audio circuits, WiFi modules, etc.
  • sensors such as light sensors, motion sensors and other sensors.
  • the light sensor may include an ambient light sensor and a proximity sensor, wherein the ambient light sensor may adjust the brightness of the display screen according to the brightness of the ambient light, and the proximity sensor may turn off the display screen and/or when the mobile terminal moves to the ear Backlight.
  • the gravity acceleration sensor can detect the magnitude of acceleration in various directions (generally three axes), and can detect the magnitude and direction of gravity when at rest.
  • the mobile terminal can be used for applications that recognize the posture of mobile terminals (such as horizontal and vertical screen switching, Related games, magnetometer posture calibration), vibration recognition related functions (such as pedometer, tap), etc.
  • the mobile terminal can also be configured with other sensors such as gyroscopes, barometers, hygrometers, thermometers, infrared sensors, etc. No longer.
  • terminal structure shown in FIG. 7 does not constitute a limitation on the terminal, and may include more or less components than those shown in the figure, or a combination of certain components, or a different component arrangement.
  • the memory 1005 as a computer storage medium may include an operating system, a network communication module, a user interface module, and medical insurance supervision readable instructions.
  • the network interface 1004 is mainly used to connect to the back-end server and perform data communication with the back-end server;
  • the user interface 1003 is mainly used to connect to the client (user end) and perform data communication with the client;
  • the processor 1001 can be used to call the medical insurance supervision readable instructions stored in the memory 1005, and perform the following operations:
  • the visit data includes the insured's information, medical record and diagnosis and treatment path;
  • the processor 1001 may be used to call the medical insurance supervision readable instructions stored in the memory 1005, analyze the preset field based on the preset model, and extract the suspected abnormal case according to the analysis result, and then perform the following operations:
  • the medical insurance fee is amended.
  • the processor 1001 may be used to call the medical insurance supervision readable instructions stored in the memory 1005, and after receiving the confirmation information corresponding to the suspected abnormal case, perform the following operations:
  • the preset model is optimized according to the suspected abnormal case, wherein the optimization of the preset model is implemented based on machine learning.
  • the processor 1001 may be used to call the medical insurance supervision readable instructions stored in the memory 1005, and the operation of extracting the preset fields in the medical data according to the preset model includes:
  • the non-standard text is analyzed based on the recurrent neural network, and the preset fields in the non-standard text are extracted according to the analysis results of the preset model and the non-standard text.
  • the processor 1001 may be used to call the medical insurance supervision readable instructions stored in the memory 1005, analyze the non-standard text based on the recurrent neural network, and extract the pre-code in the non-standard text according to the analysis result of the preset model and the non-standard text
  • the operations for setting fields include:
  • the vector sequence is encoded into a sentence vector matrix based on the bidirectional recurrent neural network
  • the attention mechanism is used to compress the sentence vector matrix into sentence vectors, and the preset fields in the sentence vectors are extracted.
  • the processor 1001 may be used to call the medical insurance supervision readable instructions stored in the memory 1005, analyze the preset field based on the preset model, and extract the suspected abnormal case according to the analysis result includes:
  • the outlier degree is greater than the preset outlier degree, the case corresponding to the insured is marked as a suspected abnormal case.
  • the processor 1001 may be used to call the medical insurance supervision readable instructions stored in the memory 1005, and the preset fields include at least one of the symptom of the condition, the examination item, the medication plan, the length of hospitalization, and the length of the interval between hospitalizations.
  • an embodiment of the present application also proposes a medical insurance supervision device.
  • the medical insurance supervision device includes:
  • the data acquisition module is used to obtain the medical data of the insured, wherein the medical data includes the information of the insured, the medical record and the path of diagnosis and treatment;
  • the field extraction module is used to extract the preset fields in the consultation data according to the preset model
  • the abnormality analysis module is used to analyze the preset fields based on the preset model and extract suspected abnormal cases according to the analysis results.
  • the medical insurance supervision device also includes:
  • Case output module to output suspected abnormal cases
  • Information receiving module used to receive confirmation information corresponding to suspected abnormal cases
  • the expense revision module is used to revise the medical insurance expenses when the confirmation information confirms that the suspected abnormal case is abnormal.
  • the medical insurance supervision device also includes:
  • the model optimization module is used to optimize the preset model according to the suspected abnormal case when the confirmation information denies that the suspected abnormal case is abnormal, wherein the optimization of the preset model is implemented based on machine learning.
  • the field extraction module includes:
  • the data cleaning unit is used to clean up the medical data
  • the text classification unit is used to obtain the canonical text and non-normative text in the cleaned visit data, respectively;
  • the field extraction unit is used to extract the preset fields in the specification text according to the preset model
  • the field extraction unit is also used to analyze the non-standard text based on the recurrent neural network, and extract the preset fields in the non-standard text according to the analysis results of the preset model and the non-standard text.
  • the field extraction unit includes:
  • Vector sequence subunit to represent non-standard text as a sequence of vectors
  • Vector matrix subunit used to encode the vector sequence into a sentence vector matrix based on the bidirectional recurrent neural network based on the semantic content and semantic distance of the vector sequence;
  • the vector compression subunit is used to compress the sentence vector matrix into a sentence vector according to a preset model, and extract a preset field in the sentence vector.
  • the abnormality analysis module includes:
  • the distance measurement unit is used to calculate the distance measurement of the preset field in the case corresponding to each insured;
  • the outlier degree unit is used to calculate the outlier degree of the distance measure of the insured's preset field relative to the distance measure of the other insured's preset field;
  • the comparison unit is used to compare the outlier degree and the preset outlier degree
  • the marking unit is used to mark the case corresponding to the insured as a suspected abnormal case when the outlier degree is greater than the preset outlier degree.
  • the preset fields include at least one of illness symptom, examination item, medication plan, length of hospitalization and length of hospitalization interval.
  • the embodiments of the present application also propose a computer-readable storage medium, which may be a non-volatile readable storage medium; the computer-readable storage medium stores medical insurance regulatory readable instructions, and the medical insurance regulatory readable The instruction performs the following operations when executed by the processor:
  • the medical data includes the information of the insured, the medical record and the path of diagnosis and treatment;
  • the medical insurance supervision readable instruction when executed by the processor, after analyzing the preset field based on the preset model and extracting the suspected abnormal case according to the analysis result, the following operations are also performed:
  • the medical insurance fee is amended.
  • the medical insurance supervision readable instruction when executed by the processor, after receiving the confirmation information corresponding to the suspected abnormal case, it also performs the following operations:
  • the preset model is optimized according to the suspected abnormal case, wherein the optimization of the preset model is implemented based on machine learning.
  • the operation of extracting the preset field in the consultation data according to the preset model includes:
  • the non-standard text is analyzed based on the recurrent neural network, and the preset fields in the non-standard text are extracted according to the analysis results of the preset model and the non-standard text.
  • the non-standard text is analyzed based on the recurrent neural network, and according to the analysis results of the preset model and the non-standard text, the operation of extracting the preset fields in the non-standard text includes:
  • the vector sequence is encoded into a sentence vector matrix based on the bidirectional recurrent neural network
  • the attention mechanism is used to compress the sentence vector matrix into sentence vectors, and the preset fields in the sentence vectors are extracted.
  • the operation of analyzing the preset field based on the preset model and extracting the suspected abnormal case according to the analysis result includes:
  • the outlier degree is greater than the preset outlier degree, the case corresponding to the insured is marked as a suspected abnormal case.
  • the preset field includes at least one of the condition symptom, the examination item, the medication plan, the length of hospitalization and the length of the interval between hospitalizations.
  • the methods in the above embodiments can be implemented by means of software plus a necessary general hardware platform, and of course, can also be implemented by hardware, but in many cases the former is better Implementation.
  • the technical solution of the present application can be embodied in the form of a software product in essence or part that contributes to the existing technology, and the computer software product is stored in a storage medium (such as ROM/RAM as described above) , Magnetic disk, optical disk), including several instructions to make a terminal device (which can be a mobile phone, computer, server, air conditioner, or network equipment, etc.) to perform the method described in each embodiment of the present application.

Landscapes

  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Engineering & Computer Science (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Technology Law (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

A medical insurance supervision method, a device, an apparatus and a computer readable storage medium, the medical insurance supervision method being implemented on the basis of intelligent decision making. The medical insurance supervision method comprises the following steps: acquiring treatment data of an insured individual (S100); extracting a preset field in the treatment data according to a preset data model (S200); and analyzing the preset field on the basis of the preset model and extracting a suspected abnormal case according to the analysis result (S300). The described method helps to reduce the difficulty of medical insurance supervision, thereby improving the accuracy and strictness of medical insurance supervision.

Description

医保监管方法、设备、装置及计算机可读存储介质 Medical insurance supervision method, equipment, device and computer readable storage medium The
本申请要求于2018年12月13日提交中国专利局、申请号为201811530813.6、发明名称为“医保监管方法、设备、装置及计算机可读存储介质”的中国专利申请的优先权,其全部内容通过引用结合在申请中。This application requires the priority of the Chinese patent application submitted to the China Patent Office on December 13, 2018, with the application number 201811530813.6 and the invention titled "Medical Insurance Supervision Methods, Equipment, Devices, and Computer-readable Storage Media." The reference is incorporated in the application.
技术领域Technical field
本申请涉及医保技术领域,尤其涉及一种医保监管方法、设备、装置及计算机可读存储介质。This application relates to the technical field of medical insurance, in particular to a medical insurance supervision method, device, device and computer-readable storage medium.
背景技术Background technique
随着社会和经济的发展,国家的医保政策逐渐优化,越来越多医疗过程中所产生的费用可以得到报销。然而,部分被保险人为了获得更多的报销,存在伪造病情、拆分费用、过度治疗等违规行为,造成了医保经费的流失。目前,医保监管主要依赖于人工审核的方式,然而由于被保险人众多、就诊数据庞大且专业化强等因素,导致人工监管的难度很大,不仅需要大量的专业化的人力资源,而且监管的准确性也较低,还很容易产生监管漏洞。With the development of society and economy, the national medical insurance policy is gradually optimized, and more and more expenses incurred in the medical process can be reimbursed. However, in order to obtain more reimbursement, some insured persons have violated regulations such as falsifying the medical condition, splitting expenses, and excessive treatment, resulting in the loss of medical insurance funds. At present, medical insurance supervision mainly relies on the method of manual review. However, due to factors such as a large number of insured persons, huge medical data and strong specialization, manual supervision is very difficult. It not only requires a large amount of professional human resources, but also The accuracy is also low, and it is easy to produce regulatory loopholes.
发明内容Summary of the invention
本申请的主要目的在于提供一种医保监管方法,旨在解决上述医保监管难度大、准确性低、容易产生监管漏洞的技术问题,以降低医保监管的难度,改善医保监管的准确性和严密性。The main purpose of this application is to provide a method of medical insurance supervision, which aims to solve the above technical problems of medical insurance supervision that are difficult, low in accuracy, and prone to regulatory loopholes, so as to reduce the difficulty of medical insurance supervision and improve the accuracy and rigor of medical insurance supervision .
为实现上述目的,本申请提供一种医保监管方法,包括以下步骤:To achieve the above purpose, this application provides a medical insurance supervision method, including the following steps:
获取被保险人的就诊数据;Obtain medical data of the insured;
根据预设模型,提取所述就诊数据中的预设字段,其中,所述就诊数据包括被保险人信息、病情记录和诊疗路径;According to a preset model, extracting preset fields in the medical consultation data, wherein the medical consultation data includes information of the insured, medical records and diagnosis and treatment paths;
基于所述预设模型分析所述预设字段,并根据分析结果提取疑似异常案例。The preset field is analyzed based on the preset model, and a suspected abnormal case is extracted according to the analysis result.
为实现上述目的,本申请进一步提出一种医保监管设备,所述医保监管设备包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机可读指令,所述计算机可读指令被所述处理器执行时实现医保监管方法的步骤,所述医保监管方法包括以下步骤:获取被保险人的就诊数据,其中,所述就诊数据包括被保险人信息、病情记录和诊疗路径;根据预设模型,提取所述就诊数据中的预设字段;基于所述预设模型分析所述预设字段,并根据分析结果提取疑似异常案例。To achieve the above purpose, the present application further proposes a medical insurance supervision device, the medical insurance supervision device includes: a memory, a processor, and computer-readable instructions stored on the memory and operable on the processor, the When the computer-readable instructions are executed by the processor, the steps of the medical insurance supervision method are implemented. The medical insurance supervision method includes the following steps: obtaining medical data of the insured, wherein the medical data includes information of the insured, medical records, and Diagnosis and treatment path; according to a preset model, extract the preset fields in the visit data; analyze the preset fields based on the preset model, and extract suspected abnormal cases according to the analysis results.
为实现上述目的,本申请还提出一种医保监管装置,所述医保监管装置包括数据获取模块、字段提取模块和异常分析模块,其中,所述数据获取模块用以获取被保险人的就诊数据,其中,所述就诊数据包括被保险人信息、病情记录和诊疗路径;所述字段提取模块用以根据预设模型,提取所述就诊数据中的预设字段;所述异常分析模块用以基于所述预设模型分析所述预设字段,并根据分析结果提取疑似异常案例。In order to achieve the above purpose, the present application also proposes a medical insurance supervision device, the medical insurance supervision device includes a data acquisition module, a field extraction module and an abnormality analysis module, wherein the data acquisition module is used to obtain medical data of the insured, Wherein, the medical consultation data includes information of the insured, medical records and diagnosis and treatment paths; the field extraction module is used to extract preset fields in the medical consultation data according to a preset model; and the abnormality analysis module is used to The preset model analyzes the preset field and extracts a suspected abnormal case according to the analysis result.
为实现上述目的,本申请进一步提出一种计算机可读存储介质,所述计算机可读存储介质上存储有医保监管可读指令,所述医保监管可读指令被处理器执行时实现医保监管方法的步骤,所述医保监管方法包括以下步骤:获取被保险人的就诊数据,其中,所述就诊数据包括被保险人信息、病情记录和诊疗路径;根据预设模型,提取所述就诊数据中的预设字段;基于所述预设模型分析所述预设字段,并根据分析结果提取疑似异常案例。In order to achieve the above object, the present application further proposes a computer-readable storage medium having medical insurance supervision readable instructions stored on the computer-readable storage medium, the medical insurance supervision readable instructions being executed by a processor to implement a medical insurance supervision method Steps, the medical insurance supervision method includes the following steps: obtaining medical data of the insured, wherein the medical data includes information of the insured, medical records and diagnosis and treatment paths; according to a preset model, extracting Set a field; analyze the preset field based on the preset model, and extract a suspected abnormal case according to the analysis result.
在本申请技术方案中,医保监管方法包括以下步骤:获取被保险人的就诊数据;根据预设模型,提取就诊数据中的预设字段,其中,所述就诊数据包括被保险人信息、病情记录和诊疗路径;基于预设模型分析预设字段,并根据分析结果提取疑似异常案例。通过在被保险人的就诊数据中提取出预设字段,从而剔除就诊数据中的冗余信息,保留关键信息,降低后续预设模型的分析难度。进一步的,基于预设模型对预设字段进行分析,通常,疑似异常案例相对正常案例具有较大的差别,根据分析结果,能够自动从众多案例中提取出疑似异常案例,以待进一步的处理。通过设置不同的预设模型,可以实现相应的监管要求,从而实现了医保的自动监管,而无需人工监管,降低了监管难度,大大节约了人力资源,同时提高了医保监管的准确性和严密性,有效避免了监管漏洞的产生。In the technical solution of the present application, the medical insurance supervision method includes the following steps: obtaining medical data of the insured; extracting preset fields in the medical data according to a preset model, wherein the medical data includes information of the insured and medical records And diagnosis and treatment path; analyze the preset field based on the preset model, and extract the suspected abnormal case according to the analysis result. By extracting the preset fields in the insured's visit data, the redundant information in the visit data is removed, the key information is retained, and the analysis difficulty of the subsequent preset model is reduced. Further, the preset fields are analyzed based on the preset model. Generally, the suspected abnormal case has a larger difference from the normal case. According to the analysis result, the suspected abnormal case can be automatically extracted from many cases for further processing. By setting different preset models, the corresponding regulatory requirements can be achieved, thereby achieving automatic supervision of medical insurance without manual supervision, reducing the difficulty of supervision, greatly saving human resources, and improving the accuracy and rigor of medical insurance supervision , Effectively avoid the occurrence of regulatory loopholes.
附图说明BRIEF DESCRIPTION
图1为本申请医保监管方法第一实施例的流程示意图;FIG. 1 is a schematic flowchart of a first embodiment of a medical insurance supervision method for application;
图2为本申请医保监管方法第二实施例的流程示意图;2 is a schematic flowchart of a second embodiment of a medical insurance supervision application method;
图3为本申请医保监管方法第三实施例的流程示意图;FIG. 3 is a schematic flowchart of a third embodiment of a medical insurance supervision application method;
图4为本申请医保监管方法第四实施例中步骤S200的细化流程示意图;FIG. 4 is a detailed flowchart of step S200 in the fourth embodiment of the medical insurance supervision method of the application; FIG.
图5为本申请医保监管方法第五实施例中步骤S240的细化流程示意图;FIG. 5 is a detailed flowchart of step S240 in the fifth embodiment of the medical insurance supervision method of the application; FIG.
图6为本申请医保监管方法第六实施例中步骤S300的细化流程示意图;FIG. 6 is a detailed flowchart of step S300 in the sixth embodiment of the medical insurance supervision method of the application; FIG.
图7是本申请实施例方案涉及的硬件运行环境的医保监管设备的结构示意图。FIG. 7 is a schematic structural diagram of a medical insurance supervision device in a hardware operating environment involved in an embodiment of the present application.
本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The implementation, functional characteristics and advantages of the present application will be further described in conjunction with the embodiments and with reference to the drawings.
具体实施方式detailed description
应当理解,此处所描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。It should be understood that the specific embodiments described herein are only used to explain the present application, and are not used to limit the present application.
本申请实施例的主要解决方案是:基于预设模型对被保险人的就诊数据进行分析,提取出其中的疑似异常案例。The main solution of the embodiment of the present application is to analyze the medical data of the insured based on a preset model and extract suspected abnormal cases.
由于现有技术中通常采用人工方式实现医保监管,监管难度大,需要大量的专业化的人力资源,且人工监管的准确性低,还很容易产生监管漏洞。In the prior art, medical insurance supervision is usually implemented manually, which is difficult to supervise, requires a large amount of professional human resources, and has low accuracy of manual supervision. It is also easy to produce supervision loopholes.
本申请提供一种解决方案,基于预设模型对被保险人的就诊数据进行分析,提取出其中的疑似异常案例,以实现医保费用的自动监管,降低了监管难度,同时改善了监管的准确性和严密性。This application provides a solution to analyze the insured's medical data based on a preset model and extract suspected abnormal cases to achieve automatic supervision of medical insurance costs, reduce the difficulty of supervision, and improve the accuracy of supervision And rigor.
本申请第一实施例提供一种医保监管方法,如图1所示,医保监管方法包括以下步骤:The first embodiment of the present application provides a medical insurance supervision method. As shown in FIG. 1, the medical insurance supervision method includes the following steps:
步骤S100、获取被保险人的就诊数据,其中,就诊数据包括被保险人信息、病情记录和诊疗路径;Step S100: Obtain medical data of the insured, wherein the medical data includes information of the insured, medical records and diagnosis and treatment paths;
通常,医保费用是在就诊的过程中,当被保险人刷卡付费时直接结算的,以方便一般被保险人的就医,提高医院等医疗机构的流转效率。医保费用全部或部分覆盖被保险人的医疗费,为了对医保费用进行监管,有必要获取被保险人的就诊数据,进一步根据就诊数据确定是否存在违规报销的异常情况。其中,就诊数据具体包括被保险人信息、病情记录和诊疗路径等,病情记录有助于监测被保险人的病情发展,并根据其病情发展确定所采用的诊疗路径是否合理和有效,其中,诊疗路径具体可以包括被保险人的检查项目和药物方案等。被保险人的就诊数据可以在本次治疗(包括门诊治疗和住院治疗)结束后上传到用户(例如人社局等)的系统中,也可以在本次治疗的过程中定时上传到用户的系统中,以便及时发现可能存在违规报销或医保费用的流失因素,避免医保费用最终的流失。Generally, medical insurance costs are settled directly when the insured swipes a card to pay for medical treatment, so as to facilitate the medical treatment of the insured and improve the circulation efficiency of hospitals and other medical institutions. Medical insurance costs cover all or part of the insured's medical expenses. In order to supervise the medical insurance costs, it is necessary to obtain the insured's medical data, and further determine whether there are any abnormal reimbursements based on the medical data. Among them, the consultation data specifically include the information of the insured, the condition record and the diagnosis and treatment path, etc. The condition record helps to monitor the development of the insured's condition and determine whether the diagnosis and treatment path adopted is reasonable and effective according to the development of the condition. Among them, diagnosis and treatment The path can specifically include the insured's inspection items and drug programs. The insured's visit data can be uploaded to the user's system (such as the Human Resources and Social Security Bureau) after the treatment (including outpatient treatment and inpatient treatment), or can be uploaded to the user's system regularly during the treatment In order to discover the possible loss of reimbursement or medical insurance costs in time to avoid the eventual loss of medical insurance costs.
步骤S200、根据预设模型,提取就诊数据中的预设字段;Step S200: According to the preset model, extract the preset fields in the consultation data;
就诊数据中通常包括被保险人的姓名、年龄等基本信息,被保险人所作各种检查对应的检查信息,以及医生所记录的病情症状、治疗方案等病历信息。其中,基本信息中包含的预设字段往往可以采用相对简单的方式直接从表单中提取出来,而检查信息和病历信息的形式变化较为多样,往往需要基于自然语言处理等方式提取出其中的预设字段,以待进一步的分析,后文中还将对预设字段的提取进一步详细阐述。预设字段是根据预设模型确定的,而预设模型是根据所要监管的具体项目所确定的。例如,当需要对是否存在分解住院、虚假住院等违规住院的项目进行监管时,则有必要提取出与被保险人的住院情况相关的预设字段。根据预设模型,提取就诊数据中的预设字段,以待进一步的分析。The medical data usually includes basic information such as the insured's name and age, the corresponding inspection information for various examinations performed by the insured, and medical record information such as medical symptoms and treatment plans recorded by the doctor. Among them, the preset fields included in the basic information can often be extracted directly from the form in a relatively simple manner, and the form of the inspection information and medical record information is varied, and the presets often need to be extracted based on natural language processing and the like Fields, for further analysis, the extraction of preset fields will be further elaborated later. The preset field is determined according to the preset model, and the preset model is determined according to the specific item to be supervised. For example, when it is necessary to supervise whether there are illegal hospitalization items such as decomposition hospitalization and false hospitalization, it is necessary to extract preset fields related to the insured's hospitalization. According to the preset model, the preset fields in the consultation data are extracted for further analysis.
步骤S300、基于预设模型分析预设字段,并根据分析结果提取疑似异常案例。Step S300: Analyze the preset field based on the preset model, and extract the suspected abnormal case according to the analysis result.
在提取出预设字段之后,基于预设模型对预设字段进行分析,并根据分析结果提取出疑似异常案例。通常,对预设字段的分析主要为偏差分析,偏差是指待分析案例中反常的、不合常规的案例,或者观测结果与预设模型所预测的结果之间一致性较差的案例。偏差分析的目标是寻找出与参照对象之间存在有意义的差别的异常案例,具体的偏差分析方法包括聚类法、序列异常法、最近邻法、多维数据分析法等,后文中还将详细阐述。根据预设字段,分析一特定被保险人相对其它被保险人的偏差,若该被保险人相对其它被保险人的偏差很大,则提取该案例为疑似异常案例,以待执行进一步的确认异常案例、重新计算医保费用等处理。After the preset field is extracted, the preset field is analyzed based on the preset model, and the suspected abnormal case is extracted according to the analysis result. In general, the analysis of preset fields is mainly deviation analysis. Deviation refers to abnormal or irregular cases in the case to be analyzed, or cases where the consistency between the observation results and the results predicted by the preset model is poor. The goal of deviation analysis is to find out abnormal cases with meaningful differences from the reference object. Specific deviation analysis methods include clustering method, sequence anomaly method, nearest neighbor method, multidimensional data analysis method, etc., which will be detailed later Elaborate. According to the preset fields, analyze the deviation of a particular insured relative to other insureds. If the deviation of the insured is relatively large compared to other insureds, the case is extracted as a suspected abnormal case, and further abnormalities are to be confirmed Cases, recalculation of medical insurance costs, etc.
在本实施例中,医保监管方法包括以下步骤:获取被保险人的就诊数据,其中,就诊数据包括被保险人信息、病情记录和诊疗路径;根据预设模型,提取就诊数据中的预设字段;基于预设模型分析预设字段,并根据分析结果提取疑似异常案例。通过在被保险人的就诊数据中提取出预设字段,从而剔除就诊数据中的冗余信息,保留关键信息,降低后续预设模型的分析难度。进一步的,基于预设模型对预设字段进行分析,通常,疑似异常案例相对正常案例具有较大的偏差,根据分析结果,能够自动从众多案例中提取出疑似异常案例,以待进一步的处理。通过设置不同的预设模型,可以实现相应的监管要求,从而实现了医保的自动监管,而无需人工监管,降低了监管难度,大大节约了人力资源,同时提高了医保监管的准确性和严密性,有效避免了监管漏洞的产生。In this embodiment, the medical insurance supervision method includes the following steps: obtaining medical data of the insured, wherein the medical data includes information of the insured, medical records, and diagnosis and treatment paths; according to a preset model, extracting preset fields in the medical data ; Analyze the preset fields based on the preset model, and extract suspected abnormal cases based on the analysis results. By extracting the preset fields in the insured's visit data, the redundant information in the visit data is removed, the key information is retained, and the analysis difficulty of the subsequent preset model is reduced. Further, the preset fields are analyzed based on the preset model. Usually, the suspected abnormal case has a larger deviation from the normal case. According to the analysis result, the suspected abnormal case can be automatically extracted from many cases for further processing. By setting different preset models, the corresponding regulatory requirements can be achieved, thereby achieving automatic supervision of medical insurance without manual supervision, reducing the difficulty of supervision, greatly saving human resources, and improving the accuracy and rigor of medical insurance supervision , Effectively avoid the occurrence of regulatory loopholes.
基于上述第一实施例,如图2所示,在本申请的第二实施例中,在基于预设模型分析预设字段,并根据分析结果提取疑似异常案例的步骤之后,医保监管方法还包括以下步骤:Based on the above first embodiment, as shown in FIG. 2, in the second embodiment of the present application, after analyzing the preset fields based on the preset model and extracting the suspected abnormal case according to the analysis result, the medical insurance supervision method further includes The following steps:
步骤S400、输出疑似异常案例;Step S400, output a suspected abnormal case;
步骤S500、接收与疑似异常案例对应的确认信息;Step S500: Receive confirmation information corresponding to the suspected abnormal case;
步骤S610、当确认信息确认疑似异常案例异常时,修正医保费用。Step S610: When the confirmation information confirms that the suspected abnormal case is abnormal, the medical insurance fee is amended.
在本实施例中,当提取到疑似异常案例时,通过输出疑似异常案例,进一步确认该案例是否为异常案例,以提高监管的准确性。进一步的确认可以是自动确认或者人工确认,也可以是自动确认和人工确认的结合。例如,对于明显异常的疑似异常案例,可以通过设置对应于明显异常的预设字段的范围,由监管可读指令直接确认该案例为异常案例,并产生相应的确认信息。对于其它案例,则由相关人员进行人工确认,并给出相应的确认信息。当接收到与疑似异常案例对应的确认信息时,若确认信息确认该疑似异常案例异常,则修正医保费用。具体的,可以对相关责任单位或责任人出具扣费依据和扣费通知,并直接进行医保费用的扣除,以提高处理效率。为了加强监管,还可以定时出具一定时段内的异常案例的统计结果,包括异常案例对应的相关责任单位或责任人,以便进一步对违规严重的责任单位或责任人进行处理。In this embodiment, when a suspected abnormal case is extracted, it is further confirmed whether the case is an abnormal case by outputting the suspected abnormal case to improve the accuracy of supervision. The further confirmation may be automatic confirmation or manual confirmation, or a combination of automatic confirmation and manual confirmation. For example, for a suspected abnormal case of obvious abnormality, the range of the preset field corresponding to the obvious abnormality can be set, and the case can be directly confirmed by the supervisory readable command as an abnormal case, and corresponding confirmation information can be generated. For other cases, the relevant personnel will manually confirm and give the corresponding confirmation information. When receiving the confirmation information corresponding to the suspected abnormal case, if the confirmation information confirms that the suspected abnormal case is abnormal, the medical insurance fee is amended. Specifically, you can issue a deduction basis and a deduction notice to the relevant responsible unit or responsible person, and directly deduct medical insurance expenses to improve processing efficiency. In order to strengthen supervision, the statistical results of abnormal cases within a certain period of time can also be issued regularly, including the relevant responsible units or persons responsible for the abnormal cases, so as to further deal with the serious violations of the responsible units or persons responsible.
基于上述第二实施例,如图3所示,在本申请的第三实施例中,在接收与疑似异常案例对应的确认信息的步骤之后,还包括以下步骤:Based on the second embodiment described above, as shown in FIG. 3, in the third embodiment of the present application, after the step of receiving confirmation information corresponding to the suspected abnormal case, the following steps are further included:
步骤S620、当确认信息否认疑似异常案例异常时,根据疑似异常案例优化预设模型,其中,预设模型的优化基于机器学习实现。Step S620: When the confirmation information denies that the suspected abnormal case is abnormal, optimize the preset model according to the suspected abnormal case, where the optimization of the preset model is implemented based on machine learning.
对于相对复杂的疑似异常案例,通过自动或人工的进一步确认后,若该疑似异常案例并非异常,则表明预设模型存在一定的缺陷。根据该疑似异常案例,或者在积累到一定数目的初步判断错误的疑似异常案例之后,基于机器学习的方法优化预设模型,以改善医保监管的准确性和可靠性。当然,也可以基于预设模型,对医保方案进行适当的改革,以满足被保险人实际的保险需求。For relatively complicated suspected abnormal cases, after further confirmation by automatic or manual, if the suspected abnormal case is not abnormal, it indicates that the preset model has certain defects. According to the suspected abnormal case, or after accumulating a certain number of suspected abnormal cases with preliminary judgment errors, the preset model is optimized based on the method of machine learning to improve the accuracy and reliability of medical insurance supervision. Of course, it is also possible to make appropriate reforms to the medical insurance scheme based on preset models to meet the actual insurance needs of the insured.
基于上述各实施例,如图4所示,在本申请的第四实施例中,根据预设模型,提取就诊数据中的预设字段的步骤包括:Based on the above embodiments, as shown in FIG. 4, in the fourth embodiment of the present application, the step of extracting the preset fields in the consultation data according to the preset model includes:
步骤S210、清洗就诊数据;Step S210, cleaning the medical data;
步骤S220、分别获取清洗后的就诊数据中的规范文本和非规范文本;Step S220: Obtain the canonical text and non-normative text in the cleaned medical data, respectively;
步骤S230、根据预设模型,提取规范文本中的预设字段;Step S230: Extract the preset fields in the specification text according to the preset model;
步骤S240、基于循环神经网络分析非规范文本,并根据预设模型和非规范文本的分析结果,提取非规范文本中的预设字段。Step S240: Analyze the non-standard text based on the recurrent neural network, and extract the preset fields in the non-standard text according to the analysis results of the preset model and the non-standard text.
其中,数据清洗是指对数据中可识别的错误进行检查并纠正,包括检查数据的一致性、剔除重复数据、纠正无效和错误数据等,以提高后续步骤的执行效率。对于清洗后的就诊数据,分别获取其中的规范文本和非规范文本,具体的,规范文本是指记录在例如表格、表单等具体条目中的被保险人的姓名、年龄等相对固定的信息,而非规范文本是指检查信息、病历信息等由医生等人员手写的相对复杂的信息。对于规范文本,可以根据预设模型所需,直接提取出其中的预设字段,而对于非规范文本,则往往基于自然语言处理的方法,基于循环神经网络分析,对其中的预设字段进行提取。Among them, data cleaning refers to checking and correcting the identifiable errors in the data, including checking the consistency of the data, removing duplicate data, correcting invalid and erroneous data, etc., in order to improve the efficiency of the execution of subsequent steps. Obtain the standardized text and non-standard text of the medical data after cleaning. Specifically, the standard text refers to the relatively fixed information such as the name and age of the insured person recorded in specific items such as tables and forms. Non-standard text refers to relatively complex information handwritten by doctors and other personnel such as examination information and medical record information. For the canonical text, the preset fields can be directly extracted according to the requirements of the preset model, while for the non-canonical text, the preset fields are often extracted based on the natural language processing method and based on recurrent neural network analysis .
基于上述第四实施例,如图5所示,在本申请的第五实施例中,基于循环神经网络分析非规范文本,并根据预设模型和非规范文本的分析结果,提取非规范文本中的预设字段的步骤包括:Based on the fourth embodiment described above, as shown in FIG. 5, in the fifth embodiment of the present application, non-standard text is analyzed based on a recurrent neural network, and the non-standard text is extracted from the analysis results of the preset model and the non-standard text The steps of the preset fields include:
步骤S241、将非规范文本表示为向量序列;Step S241: Represent non-standard text as a sequence of vectors;
步骤S242、根据向量序列的语义内容和语义距离,基于双向循环神经网络将向量序列编码为句子向量矩阵;Step S242: Based on the semantic content and semantic distance of the vector sequence, encode the vector sequence into a sentence vector matrix based on the bidirectional recurrent neural network;
步骤S243、根据预设模型,采用注意力机制压缩句子向量矩阵为句子向量,并提取句子向量中的预设字段。Step S243: According to the preset model, the attention mechanism is used to compress the sentence vector matrix into a sentence vector, and the preset field in the sentence vector is extracted.
具体的,将待提取的非规范文本用一个向量序列表示,结合向量序列的语义内容和语义距离,基于双向循环神经网络模型将向量序列编码为句子向量矩阵。语义内容反映了文本本身的意思,而语义距离则反映了文本之间的关联性,可以通过相关函数或相关系数来表示。对于该句子向量矩阵,其中每一行可以理解为词向量,词向量对于句子的上下文敏感。进一步的,根据预设模型,采用注意力机制将句子向量矩阵压缩为句子向量,从中提取出所需的预设字段,从而将就诊数据匹配到相应的标准化的预设字段中,以待后续分析。其中,注意力机制使得神经网络具备专注于其输入(或特征)子集的能力:选择特定的输入。注意力可以应用于任何类型的输入而不管其形状如何。在计算能力有限情况下,注意力机制是解决信息超载问题的主要手段的一种资源分配方案,从而将计算资源分配给更重要的任务。Specifically, the non-normative text to be extracted is represented by a vector sequence, combined with the semantic content and semantic distance of the vector sequence, the vector sequence is encoded into a sentence vector matrix based on the bidirectional recurrent neural network model. The semantic content reflects the meaning of the text itself, while the semantic distance reflects the correlation between the texts, which can be expressed by correlation functions or correlation coefficients. For the sentence vector matrix, each row can be understood as a word vector, and the word vector is sensitive to the context of the sentence. Further, according to the preset model, the attention vector mechanism is used to compress the sentence vector matrix into sentence vectors, and the required preset fields are extracted therefrom, so that the consultation data is matched to the corresponding standardized preset fields for subsequent analysis . Among them, the attention mechanism makes the neural network have the ability to focus on its input (or feature) subset: select a specific input. Attention can be applied to any type of input regardless of its shape. In the case of limited computing power, the attention mechanism is a resource allocation scheme that solves the problem of information overload, so as to allocate computing resources to more important tasks.
基于上述各实施例,如图6所示,在本申请的第六实施例中,基于预设模型分析预设字段,并根据分析结果提取疑似异常案例的步骤包括:Based on the above embodiments, as shown in FIG. 6, in the sixth embodiment of the present application, the step of analyzing the preset field based on the preset model and extracting the suspected abnormal case according to the analysis result includes:
步骤S310、计算各被保险人对应的案例中预设字段的距离度量;Step S310: Calculate the distance metric of the preset field in the case corresponding to each insured;
步骤S320、计算被保险人的预设字段的距离度量相对其它被保险人的预设字段的距离度量的离群程度;Step S320: Calculate the outlier degree of the distance metric of the insured's preset field relative to the distance metric of other insured's preset fields;
步骤S330、比对离群程度和预设离群程度;Step S330, comparing the outlier degree and the preset outlier degree;
步骤S340、当离群程度大于预设离群程度时,标记被保险人对应的案例为疑似异常案例。Step S340: When the outlier degree is greater than the preset outlier degree, mark the case corresponding to the insured as a suspected abnormal case.
采用统计分布的方式对被保险人的就诊数据进行分析,其中,每个被保险人的就诊数据可以视为统计分布图中的一个点,根据点的分布位置确定是否存在疑似异常案例。Statistical analysis is used to analyze the insured's visit data. Among them, each insured's visit data can be regarded as a point in the statistical distribution graph, and whether there is a suspected abnormal case is determined according to the distribution position of the point.
进一步的,对应于不同的监管需求,预设字段可以包括病情症状、检查项目、药物方案、住院时长和住院间隔时长中的至少一种。后文中将以几种具体的监管需求为例,分别采用不同的偏差分析或离群分析方法,进行详细阐述。Further, corresponding to different regulatory requirements, the preset fields may include at least one of symptoms of illness, examination items, medication plan, length of hospitalization, and length of hospitalization interval. In the following text, several specific regulatory requirements will be used as examples, and different deviation analysis or outlier analysis methods will be used to elaborate in detail.
在一具体示例中,对可能存在的分解住院情况进行监管。其中,分解住院是指被保险人为了获得更多的报销费用,故意将一次住院拆分成多次住院,以分别报销,从而造成医保经费的流失。在监管分解住院这一异常情形时,可以根据分解住院的特点,对相应的预设字段进行偏差分析或离群分析。距离度量具体可以根据预设字段的绝对距离(曼哈顿距离)、欧式距离和马氏距离等得到。预设字段可以选取为被保险人两次连续住院之间的间隔天数的极差、四分位间距、八分位间距、均差、标准差等,从而初步筛选出疑似分解住院的案例。其中,数据变异指标的值越大,则表示数据变异越大,散布越广。间隔天数的阈值可设为10~15天,对于连续两次住院之间的间隔天数小于阈值的被保险人,可列为疑似异常案例。进一步的,通过分析疑似异常案例的就诊数据中的病情情况和包括检查项目、用药方案等的诊疗路径等,进一步确认疑似异常案例。如果同一参保人在很短的时间内因同一疾病住院,或者两次住院的诊疗路径疾病一致或属于延续关系,则判定其为疑似分解住院的案例。In a specific example, the possible decomposition of hospitalization is monitored. Among them, decomposed hospitalization means that the insured person deliberately splits one hospitalization into multiple hospitalizations in order to obtain more reimbursement expenses, so as to reimburse them separately, resulting in the loss of medical insurance funds. When supervising the decomposition of hospitalization, an abnormal situation, you can perform deviation analysis or outlier analysis on the corresponding preset fields according to the characteristics of decomposition hospitalization. The distance metric can be obtained based on the absolute distance (Manhattan distance), Euclidean distance, and Mahalanobis distance of the preset fields. The preset field can be selected as the range, interquartile range, interquartile range, mean difference, standard deviation, etc. of the number of days between two consecutive hospitalizations of the insured, so as to preliminarily screen out the cases of suspected decomposition hospitalization. Among them, the larger the value of the data variability index, the greater the data variability and the wider the spread. The threshold for the number of days between intervals can be set to 10 to 15 days. For insured persons whose number of days between consecutive hospitalizations is less than the threshold, it can be classified as a suspected abnormal case. Further, by analyzing the medical condition in the medical data of the suspected abnormal case and the diagnosis and treatment path including the inspection items and medication plan, etc., the suspected abnormal case is further confirmed. If the same insured person is hospitalized for the same disease within a short period of time, or the disease path of the two hospitalizations is the same or belongs to a continuation relationship, it is determined to be a case of suspected decomposition hospitalization.
在另一具体示例中,对可能存在的虚假住院情况进行监管。其中,虚假住院是指被保险人为了获得更多的报销费用,没有住院而制造住院记录,以骗取报销费用。例如,当存在内科住院15天以上或外科住院30天以上的情况,却没有任何治疗方案时,则对应于虚假住院。预设模型在识别疑似异常案例时所采用的异常检测算法为基于索引的算法,即给定一个数据集合,基于索引的算法采用多维索引结构来查找每个待分析案例在一定半径范围内的相邻案例。假设M为待分析案例的一定半径范围内的最大对象数目。如果待分析案例的第M+l个相邻案例被发现,则该待分析案例就不是疑似异常案例。考虑到基于索引的算法的复杂度,其具有良好的扩展性。In another specific example, the possible false hospitalizations are monitored. Among them, false hospitalization refers to the insured person making hospital records without hospitalization in order to obtain more reimbursement expenses in order to defraud the reimbursement expenses. For example, when there is a medical hospitalization for more than 15 days or a surgical hospitalization for more than 30 days, but there is no treatment plan, it corresponds to a false hospitalization. The anomaly detection algorithm adopted by the preset model in identifying suspected abnormal cases is an index-based algorithm, that is, given a data set, the index-based algorithm uses a multi-dimensional index structure to find the phase of each case to be analyzed within a certain radius O case. Suppose M is the maximum number of objects within a certain radius of the case to be analyzed. If the M+l adjacent case of the case to be analyzed is found, the case to be analyzed is not a suspected abnormal case. Considering the complexity of the index-based algorithm, it has good scalability.
在又一具体示例中,对被保险人病情未达到入院治疗指征时即收入院治疗的情况进行监管。具体的,通过嵌套循环算法,即嵌套一循环算法和基于索引的算法进行分析。由于循环算法和嵌套算法有相同的复杂度,同时避免了索引结构的构建,以最小化数据输入输出的次数,把内存的缓冲空间分为两半,把数据集合分为若干个逻辑块。通过精心选择逻辑块装入每个缓冲区域的顺序,有效提高了输入输出的效率。In yet another specific example, the situation in which the insured person is admitted to the hospital for treatment when the patient's condition does not reach the admission treatment indication is supervised. Specifically, the analysis is performed through a nested loop algorithm, that is, a nested loop algorithm and an index-based algorithm. Because the loop algorithm and the nested algorithm have the same complexity, and at the same time avoid the construction of the index structure, in order to minimize the number of data input and output, the memory buffer space is divided into two halves, and the data set is divided into several logical blocks. By carefully selecting the order in which the logic blocks are loaded into each buffer area, the efficiency of input and output is effectively improved.
在再一具体示例中,对一个项目的收费进行拆分的情况进行监管。具体的,基于单元的算法对被保险人的就诊数据进行分析。通过将数据空间划分为边长等于d/(2*k1/2)的单元。每个单元有两个层围绕着它。第一层的厚度是一个单元,而第二层的厚度是(2*k1/2-1)。该算法逐个单元对异常点计数,而不是逐个对象进行计数。对于一给定的单元,累计三个计数:单元中对象的数目、单元和第一层中对象的数目以及单元和两个层次中的对象的数目。该算法将对数据集的每一个元素进行异常点数据的检测改为对每一个单元进行异常点数据的检测,从而提高了算法的效率,降低了算法复杂度。它是这样进行异常检测的:若单元和第一层中对象的数目大于一预设数目,单元中的所有对象都不是异常;若单元和两个层次中的对象的数目小于等于上述预设数目,单元中的所有对象都是异常;否则,单元中的某一些数据可能是异常。为了检测这些异常点,需要逐个对象加入处理。In yet another specific example, the situation of splitting the charges of a project is monitored. Specifically, the unit-based algorithm analyzes the insured's medical data. By dividing the data space into units with side length equal to d/(2*k1/2). Each unit has two layers surrounding it. The thickness of the first layer is one unit, and the thickness of the second layer is (2*k1/2-1). The algorithm counts outliers on a unit-by-unit basis rather than on an object-by-object basis. For a given unit, three counts are accumulated: the number of objects in the unit, the number of objects in the unit and the first layer, and the number of objects in the unit and two layers. The algorithm changes the detection of outlier data for each element of the data set to the detection of outlier data for each unit, thereby improving the efficiency of the algorithm and reducing the complexity of the algorithm. It performs anomaly detection in this way: if the number of objects in the unit and the first layer is greater than a preset number, all objects in the unit are not abnormal; if the number of objects in the unit and the two layers is less than or equal to the above-mentioned preset number , All objects in the unit are abnormal; otherwise, some data in the unit may be abnormal. In order to detect these abnormal points, you need to join the processing object by object.
如图7所示,图7是本申请实施例方案涉及的硬件运行环境的终端,即医保监管设备的结构示意图。As shown in FIG. 7, FIG. 7 is a schematic structural diagram of a terminal of a hardware operating environment involved in the embodiment of the present application, that is, a medical insurance supervision device.
本申请实施例终端可以是服务器、PC,也可以是智能手机、平板电脑、电子书阅读器、MP3(Moving Picture Experts Group Audio Layer III,动态影像专家压缩标准音频层面3)播放器、MP4(Moving Picture Experts Group Audio Layer IV,动态影像专家压缩标准音频层面3)播放器、便携计算机等具有显示功能的可移动式终端设备。In this embodiment of the present application, the terminal may be a server, a PC, or a smart phone, tablet computer, e-book reader, MP3 (Moving Picture Experts Group Audio Layer III, motion picture expert compression standard audio layer 3 player, MP4 (Moving Picture Experts Group Audio Layer IV, the standard audio layer for motion picture experts compression 3) Players, portable computers and other mobile terminal devices with display functions.
如图7所示,该终端可以包括:处理器1001,例如CPU,网络接口1004,用户接口1003,存储器1005,通信总线1002。其中,通信总线1002用于实现这些组件之间的连接通信。用户接口1003可以包括显示屏(Display)、输入单元比如键盘(Keyboard),可选用户接口1003还可以包括标准的有线接口、无线接口。网络接口1004可选的可以包括标准的有线接口、无线接口(如WI-FI接口)。存储器1005可以是高速RAM存储器,也可以是稳定的存储器(non-volatile memory),例如磁盘存储器。存储器1005可选的还可以是独立于前述处理器1001的存储装置。As shown in FIG. 7, the terminal may include: a processor 1001, such as a CPU, a network interface 1004, a user interface 1003, a memory 1005, and a communication bus 1002. Among them, the communication bus 1002 is used to implement connection communication between these components. The user interface 1003 may include a display (Display), an input unit such as a keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface and a wireless interface. The network interface 1004 may optionally include a standard wired interface and a wireless interface (such as a WI-FI interface). The memory 1005 may be a high-speed RAM memory or a stable memory (non-volatile memory), such as disk storage. The memory 1005 may optionally be a storage device independent of the foregoing processor 1001.
可选地,终端还可以包括摄像头、RF(Radio Frequency,射频)电路,传感器、音频电路、WiFi模块等等。其中,传感器比如光传感器、运动传感器以及其他传感器。具体地,光传感器可包括环境光传感器及接近传感器,其中,环境光传感器可根据环境光线的明暗来调节显示屏的亮度,接近传感器可在移动终端移动到耳边时,关闭显示屏和/或背光。作为运动传感器的一种,重力加速度传感器可检测各个方向上(一般为三轴)加速度的大小,静止时可检测出重力的大小及方向,可用于识别移动终端姿态的应用(比如横竖屏切换、相关游戏、磁力计姿态校准)、振动识别相关功能(比如计步器、敲击)等;当然,移动终端还可配置陀螺仪、气压计、湿度计、温度计、红外线传感器等其他传感器,在此不再赘述。Optionally, the terminal may also include a camera, RF (Radio Frequency (radio frequency) circuits, sensors, audio circuits, WiFi modules, etc. Among them, sensors such as light sensors, motion sensors and other sensors. Specifically, the light sensor may include an ambient light sensor and a proximity sensor, wherein the ambient light sensor may adjust the brightness of the display screen according to the brightness of the ambient light, and the proximity sensor may turn off the display screen and/or when the mobile terminal moves to the ear Backlight. As a type of motion sensor, the gravity acceleration sensor can detect the magnitude of acceleration in various directions (generally three axes), and can detect the magnitude and direction of gravity when at rest. It can be used for applications that recognize the posture of mobile terminals (such as horizontal and vertical screen switching, Related games, magnetometer posture calibration), vibration recognition related functions (such as pedometer, tap), etc. Of course, the mobile terminal can also be configured with other sensors such as gyroscopes, barometers, hygrometers, thermometers, infrared sensors, etc. No longer.
本领域技术人员可以理解,图7中示出的终端结构并不构成对终端的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。A person skilled in the art may understand that the terminal structure shown in FIG. 7 does not constitute a limitation on the terminal, and may include more or less components than those shown in the figure, or a combination of certain components, or a different component arrangement.
如图7所示,作为一种计算机存储介质的存储器1005中可以包括操作系统、网络通信模块、用户接口模块以及医保监管可读指令。As shown in FIG. 7, the memory 1005 as a computer storage medium may include an operating system, a network communication module, a user interface module, and medical insurance supervision readable instructions.
在图7所示的终端中,网络接口1004主要用于连接后台服务器,与后台服务器进行数据通信;用户接口1003主要用于连接客户端(用户端),与客户端进行数据通信;而处理器1001可以用于调用存储器1005中存储的医保监管可读指令,并执行以下操作:In the terminal shown in FIG. 7, the network interface 1004 is mainly used to connect to the back-end server and perform data communication with the back-end server; the user interface 1003 is mainly used to connect to the client (user end) and perform data communication with the client; and the processor 1001 can be used to call the medical insurance supervision readable instructions stored in the memory 1005, and perform the following operations:
获取被保险人的就诊数据,其中,就诊数据包括被保险人信息、病情记录和诊疗路径;Obtain the insured's visit data, where the visit data includes the insured's information, medical record and diagnosis and treatment path;
根据预设模型,提取就诊数据中的预设字段;According to the preset model, extract the preset fields in the consultation data;
基于预设模型分析预设字段,并根据分析结果提取疑似异常案例。Analyze the preset fields based on the preset model, and extract suspected abnormal cases based on the analysis results.
进一步的,处理器1001可以用于调用存储器1005中存储的医保监管可读指令,在基于预设模型分析预设字段,并根据分析结果提取疑似异常案例的操作之后,还执行以下操作:Further, the processor 1001 may be used to call the medical insurance supervision readable instructions stored in the memory 1005, analyze the preset field based on the preset model, and extract the suspected abnormal case according to the analysis result, and then perform the following operations:
输出疑似异常案例;Output suspected abnormal cases;
接收与疑似异常案例对应的确认信息;Receive confirmation information corresponding to suspected abnormal cases;
当确认信息确认疑似异常案例异常时,修正医保费用。When the confirmation information confirms that the suspected abnormal case is abnormal, the medical insurance fee is amended.
进一步的,处理器1001可以用于调用存储器1005中存储的医保监管可读指令,在接收与疑似异常案例对应的确认信息的操作之后,还执行以下操作:Further, the processor 1001 may be used to call the medical insurance supervision readable instructions stored in the memory 1005, and after receiving the confirmation information corresponding to the suspected abnormal case, perform the following operations:
当确认信息否认疑似异常案例异常时,根据疑似异常案例优化预设模型,其中,预设模型的优化基于机器学习实现。When the confirmation information denies that the suspected abnormal case is abnormal, the preset model is optimized according to the suspected abnormal case, wherein the optimization of the preset model is implemented based on machine learning.
进一步的,处理器1001可以用于调用存储器1005中存储的医保监管可读指令,根据预设模型,提取就诊数据中的预设字段的操作包括:Further, the processor 1001 may be used to call the medical insurance supervision readable instructions stored in the memory 1005, and the operation of extracting the preset fields in the medical data according to the preset model includes:
清洗就诊数据;Clean the visit data;
分别获取清洗后的就诊数据中的规范文本和非规范文本;Obtain the canonical text and non-normative text in the cleaned medical data, respectively;
根据预设模型,提取规范文本中的预设字段;According to the preset model, extract the preset fields in the specification text;
基于循环神经网络分析非规范文本,并根据预设模型和非规范文本的分析结果,提取非规范文本中的预设字段。The non-standard text is analyzed based on the recurrent neural network, and the preset fields in the non-standard text are extracted according to the analysis results of the preset model and the non-standard text.
进一步的,处理器1001可以用于调用存储器1005中存储的医保监管可读指令,基于循环神经网络分析非规范文本,并根据预设模型和非规范文本的分析结果,提取非规范文本中的预设字段的操作包括:Further, the processor 1001 may be used to call the medical insurance supervision readable instructions stored in the memory 1005, analyze the non-standard text based on the recurrent neural network, and extract the pre-code in the non-standard text according to the analysis result of the preset model and the non-standard text The operations for setting fields include:
将非规范文本表示为向量序列;Represent non-standard text as a sequence of vectors;
根据向量序列的语义内容和语义距离,基于双向循环神经网络将向量序列编码为句子向量矩阵;According to the semantic content and semantic distance of the vector sequence, the vector sequence is encoded into a sentence vector matrix based on the bidirectional recurrent neural network;
根据预设模型,采用注意力机制压缩句子向量矩阵为句子向量,并提取句子向量中的预设字段。According to the preset model, the attention mechanism is used to compress the sentence vector matrix into sentence vectors, and the preset fields in the sentence vectors are extracted.
进一步的,处理器1001可以用于调用存储器1005中存储的医保监管可读指令,基于预设模型分析预设字段,并根据分析结果提取疑似异常案例的操作包括:Further, the processor 1001 may be used to call the medical insurance supervision readable instructions stored in the memory 1005, analyze the preset field based on the preset model, and extract the suspected abnormal case according to the analysis result includes:
计算各被保险人对应的案例中预设字段的距离度量;Calculate the distance metric of the preset field in the case corresponding to each insured;
计算被保险人的预设字段的距离度量相对其它被保险人的预设字段的距离度量的离群程度;Calculate the outlier degree of the distance measure of the insured's preset field relative to the distance measure of other insured's preset fields;
比对离群程度和预设离群程度;Compare the outlier degree with the preset outlier degree;
当离群程度大于预设离群程度时,标记被保险人对应的案例为疑似异常案例。When the outlier degree is greater than the preset outlier degree, the case corresponding to the insured is marked as a suspected abnormal case.
进一步的,处理器1001可以用于调用存储器1005中存储的医保监管可读指令,预设字段包括病情症状、检查项目、药物方案、住院时长和住院间隔时长中的至少一种。Further, the processor 1001 may be used to call the medical insurance supervision readable instructions stored in the memory 1005, and the preset fields include at least one of the symptom of the condition, the examination item, the medication plan, the length of hospitalization, and the length of the interval between hospitalizations.
此外,本申请实施例还提出一种医保监管装置,医保监管装置包括:In addition, an embodiment of the present application also proposes a medical insurance supervision device. The medical insurance supervision device includes:
数据获取模块,用以获取被保险人的就诊数据,其中,就诊数据包括被保险人信息、病情记录和诊疗路径;The data acquisition module is used to obtain the medical data of the insured, wherein the medical data includes the information of the insured, the medical record and the path of diagnosis and treatment;
字段提取模块,用以根据预设模型,提取就诊数据中的预设字段;The field extraction module is used to extract the preset fields in the consultation data according to the preset model;
异常分析模块,用以基于预设模型分析预设字段,并根据分析结果提取疑似异常案例。The abnormality analysis module is used to analyze the preset fields based on the preset model and extract suspected abnormal cases according to the analysis results.
进一步的,医保监管装置还包括:Further, the medical insurance supervision device also includes:
案例输出模块,用以输出疑似异常案例;Case output module to output suspected abnormal cases;
信息接收模块,用以接收与疑似异常案例对应的确认信息;Information receiving module, used to receive confirmation information corresponding to suspected abnormal cases;
费用修正模块,用以当确认信息确认疑似异常案例异常时,修正医保费用。The expense revision module is used to revise the medical insurance expenses when the confirmation information confirms that the suspected abnormal case is abnormal.
进一步的,医保监管装置还包括:Further, the medical insurance supervision device also includes:
模型优化模块,用以当确认信息否认疑似异常案例异常时,根据疑似异常案例优化预设模型,其中,预设模型的优化基于机器学习实现。The model optimization module is used to optimize the preset model according to the suspected abnormal case when the confirmation information denies that the suspected abnormal case is abnormal, wherein the optimization of the preset model is implemented based on machine learning.
进一步的,字段提取模块包括:Further, the field extraction module includes:
数据清洗单元,用以清洗就诊数据;The data cleaning unit is used to clean up the medical data;
文本分类单元,用以分别获取清洗后的就诊数据中的规范文本和非规范文本;The text classification unit is used to obtain the canonical text and non-normative text in the cleaned visit data, respectively;
字段提取单元,用以根据预设模型,提取规范文本中的预设字段;The field extraction unit is used to extract the preset fields in the specification text according to the preset model;
字段提取单元还用以基于循环神经网络分析非规范文本,并根据预设模型和非规范文本的分析结果,提取非规范文本中的预设字段。The field extraction unit is also used to analyze the non-standard text based on the recurrent neural network, and extract the preset fields in the non-standard text according to the analysis results of the preset model and the non-standard text.
进一步的,字段提取单元包括:Further, the field extraction unit includes:
向量序列子单元,用以将非规范文本表示为向量序列;Vector sequence subunit to represent non-standard text as a sequence of vectors;
向量矩阵子单元,用以根据向量序列的语义内容和语义距离,基于双向循环神经网络将向量序列编码为句子向量矩阵;Vector matrix subunit, used to encode the vector sequence into a sentence vector matrix based on the bidirectional recurrent neural network based on the semantic content and semantic distance of the vector sequence;
向量压缩子单元,用以根据预设模型,采用注意力机制压缩句子向量矩阵为句子向量,并提取句子向量中的预设字段。The vector compression subunit is used to compress the sentence vector matrix into a sentence vector according to a preset model, and extract a preset field in the sentence vector.
进一步的,异常分析模块包括:Further, the abnormality analysis module includes:
距离度量单元,用以计算各被保险人对应的案例中预设字段的距离度量;The distance measurement unit is used to calculate the distance measurement of the preset field in the case corresponding to each insured;
离群程度单元,用以计算被保险人的预设字段的距离度量相对其它被保险人的预设字段的距离度量的离群程度;The outlier degree unit is used to calculate the outlier degree of the distance measure of the insured's preset field relative to the distance measure of the other insured's preset field;
比对单元,用以比对离群程度和预设离群程度;The comparison unit is used to compare the outlier degree and the preset outlier degree;
标记单元,用以当离群程度大于预设离群程度时,标记被保险人对应的案例为疑似异常案例。The marking unit is used to mark the case corresponding to the insured as a suspected abnormal case when the outlier degree is greater than the preset outlier degree.
进一步的,预设字段包括病情症状、检查项目、药物方案、住院时长和住院间隔时长中的至少一种。Further, the preset fields include at least one of illness symptom, examination item, medication plan, length of hospitalization and length of hospitalization interval.
此外,本申请实施例还提出一种计算机可读存储介质,计算机可读存储介质可以为非易失性可读存储介质;计算机可读存储介质上存储有医保监管可读指令,医保监管可读指令被处理器执行时实现如下操作:In addition, the embodiments of the present application also propose a computer-readable storage medium, which may be a non-volatile readable storage medium; the computer-readable storage medium stores medical insurance regulatory readable instructions, and the medical insurance regulatory readable The instruction performs the following operations when executed by the processor:
获取被保险人的就诊数据,其中,就诊数据包括被保险人信息、病情记录和诊疗路径;Obtain the medical data of the insured, where the medical data includes the information of the insured, the medical record and the path of diagnosis and treatment;
根据预设模型,提取就诊数据中的预设字段;According to the preset model, extract the preset fields in the consultation data;
基于预设模型分析预设字段,并根据分析结果提取疑似异常案例。Analyze the preset fields based on the preset model, and extract suspected abnormal cases based on the analysis results.
进一步的,医保监管可读指令被处理器执行时,在基于预设模型分析预设字段,并根据分析结果提取疑似异常案例的操作之后,还执行以下操作:Further, when the medical insurance supervision readable instruction is executed by the processor, after analyzing the preset field based on the preset model and extracting the suspected abnormal case according to the analysis result, the following operations are also performed:
输出疑似异常案例;Output suspected abnormal cases;
接收与疑似异常案例对应的确认信息;Receive confirmation information corresponding to suspected abnormal cases;
当确认信息确认疑似异常案例异常时,修正医保费用。When the confirmation information confirms that the suspected abnormal case is abnormal, the medical insurance fee is amended.
进一步的,医保监管可读指令被处理器执行时,在接收与疑似异常案例对应的确认信息的操作之后,还执行以下操作:Further, when the medical insurance supervision readable instruction is executed by the processor, after receiving the confirmation information corresponding to the suspected abnormal case, it also performs the following operations:
当确认信息否认疑似异常案例异常时,根据疑似异常案例优化预设模型,其中,预设模型的优化基于机器学习实现。When the confirmation information denies that the suspected abnormal case is abnormal, the preset model is optimized according to the suspected abnormal case, wherein the optimization of the preset model is implemented based on machine learning.
进一步的,医保监管可读指令被处理器执行时,根据预设模型,提取就诊数据中的预设字段的操作包括:Further, when the medical insurance supervision readable instruction is executed by the processor, the operation of extracting the preset field in the consultation data according to the preset model includes:
清洗就诊数据;Clean the visit data;
分别获取清洗后的就诊数据中的规范文本和非规范文本;Obtain the canonical text and non-normative text in the cleaned medical data, respectively;
根据预设模型,提取规范文本中的预设字段;According to the preset model, extract the preset fields in the specification text;
基于循环神经网络分析非规范文本,并根据预设模型和非规范文本的分析结果,提取非规范文本中的预设字段。The non-standard text is analyzed based on the recurrent neural network, and the preset fields in the non-standard text are extracted according to the analysis results of the preset model and the non-standard text.
进一步的,医保监管可读指令被处理器执行时,基于循环神经网络分析非规范文本,并根据预设模型和非规范文本的分析结果,提取非规范文本中的预设字段的操作包括:Further, when the medical insurance supervision readable instructions are executed by the processor, the non-standard text is analyzed based on the recurrent neural network, and according to the analysis results of the preset model and the non-standard text, the operation of extracting the preset fields in the non-standard text includes:
将非规范文本表示为向量序列;Represent non-standard text as a sequence of vectors;
根据向量序列的语义内容和语义距离,基于双向循环神经网络将向量序列编码为句子向量矩阵;According to the semantic content and semantic distance of the vector sequence, the vector sequence is encoded into a sentence vector matrix based on the bidirectional recurrent neural network;
根据预设模型,采用注意力机制压缩句子向量矩阵为句子向量,并提取句子向量中的预设字段。According to the preset model, the attention mechanism is used to compress the sentence vector matrix into sentence vectors, and the preset fields in the sentence vectors are extracted.
进一步的,医保监管可读指令被处理器执行时,基于预设模型分析预设字段,并根据分析结果提取疑似异常案例的操作包括:Further, when the medical insurance supervision readable instruction is executed by the processor, the operation of analyzing the preset field based on the preset model and extracting the suspected abnormal case according to the analysis result includes:
计算各被保险人对应的案例中预设字段的距离度量;Calculate the distance metric of the preset field in the case corresponding to each insured;
计算被保险人的预设字段的距离度量相对其它被保险人的预设字段的距离度量的离群程度;Calculate the outlier degree of the distance measure of the insured's preset field relative to the distance measure of other insured's preset fields;
比对离群程度和预设离群程度;Compare the outlier degree with the preset outlier degree;
当离群程度大于预设离群程度时,标记被保险人对应的案例为疑似异常案例。When the outlier degree is greater than the preset outlier degree, the case corresponding to the insured is marked as a suspected abnormal case.
进一步的,医保监管可读指令被处理器执行时,预设字段包括病情症状、检查项目、药物方案、住院时长和住院间隔时长中的至少一种。Further, when the medical insurance supervision readable instruction is executed by the processor, the preset field includes at least one of the condition symptom, the examination item, the medication plan, the length of hospitalization and the length of the interval between hospitalizations.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者系统不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者系统所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者系统中还存在另外的相同要素。It should be noted that in this article, the terms "include", "include" or any other variant thereof are intended to cover non-exclusive inclusion, so that a process, method, article or system that includes a series of elements includes not only those elements, It also includes other elements that are not explicitly listed, or include elements inherent to this process, method, article, or system. Without more restrictions, the element defined by the sentence "include one..." does not exclude that there are other identical elements in the process, method, article or system that includes the element.
上述本申请实施例序号仅仅为了描述,不代表实施例的优劣。The sequence numbers of the above embodiments of the present application are for description only, and do not represent the advantages and disadvantages of the embodiments.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在如上所述的一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本申请各个实施例所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand that the methods in the above embodiments can be implemented by means of software plus a necessary general hardware platform, and of course, can also be implemented by hardware, but in many cases the former is better Implementation. Based on this understanding, the technical solution of the present application can be embodied in the form of a software product in essence or part that contributes to the existing technology, and the computer software product is stored in a storage medium (such as ROM/RAM as described above) , Magnetic disk, optical disk), including several instructions to make a terminal device (which can be a mobile phone, computer, server, air conditioner, or network equipment, etc.) to perform the method described in each embodiment of the present application.
以上仅为本申请的优选实施例,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。The above are only the preferred embodiments of the present application, and do not limit the scope of the patent of the present application. Any equivalent structure or equivalent process transformation made by the description and drawings of this application, or directly or indirectly used in other related technical fields , The same reason is included in the scope of patent protection in this application.

Claims (20)

  1. 一种医保监管方法,其特征在于,所述医保监管方法包括以下步骤: A medical insurance supervision method, characterized in that the medical insurance supervision method includes the following steps:
    获取被保险人的就诊数据,其中,所述就诊数据包括被保险人信息、病情记录和诊疗路径;Obtain the medical data of the insured, wherein the medical data includes the information of the insured, the medical record and the path of diagnosis and treatment;
    根据预设模型,提取所述就诊数据中的预设字段;According to a preset model, extract preset fields in the medical data;
    基于所述预设模型分析所述预设字段,并根据分析结果提取疑似异常案例。The preset field is analyzed based on the preset model, and a suspected abnormal case is extracted according to the analysis result.
  2. 如权利要求1所述的医保监管方法,其特征在于,在基于所述预设模型分析所述预设字段,并根据分析结果提取疑似异常案例的步骤之后,所述医保监管方法还包括以下步骤:The medical insurance supervision method according to claim 1, wherein after the step of analyzing the predetermined field based on the predetermined model and extracting a suspected abnormal case according to the analysis result, the medical insurance supervision method further comprises the following steps :
    输出所述疑似异常案例;Output the suspected abnormal case;
    接收与所述疑似异常案例对应的确认信息;Receive confirmation information corresponding to the suspected abnormal case;
    当所述确认信息确认所述疑似异常案例异常时,修正医保费用。When the confirmation information confirms that the suspected abnormal case is abnormal, the medical insurance fee is revised.
  3. 如权利要求2所述的医保监管方法,其特征在于,在接收与所述疑似异常案例对应的确认信息的步骤之后,还包括以下步骤:The medical insurance supervision method according to claim 2, wherein after the step of receiving confirmation information corresponding to the suspected abnormal case, the method further includes the following steps:
    当所述确认信息否认所述疑似异常案例异常时,根据所述疑似异常案例优化所述预设模型,其中,所述预设模型的优化基于机器学习实现。When the confirmation information denies that the suspected abnormal case is abnormal, the preset model is optimized according to the suspected abnormal case, wherein the optimization of the preset model is implemented based on machine learning.
  4. 如权利要求1所述的医保监管方法,其特征在于,根据预设模型,提取所述就诊数据中的预设字段的步骤包括:The medical insurance supervision method according to claim 1, wherein the step of extracting a preset field in the medical data according to a preset model includes:
    清洗所述就诊数据;Clean the visit data;
    分别获取清洗后的就诊数据中的规范文本和非规范文本;Obtain the canonical text and non-normative text in the cleaned medical data, respectively;
    根据预设模型,提取所述规范文本中的预设字段;Extract the preset fields in the specification text according to the preset model;
    基于循环神经网络分析所述非规范文本,并根据所述预设模型和所述非规范文本的分析结果,提取所述非规范文本中的预设字段。The non-normative text is analyzed based on the recurrent neural network, and the preset fields in the non-normative text are extracted according to the analysis results of the preset model and the non-normative text.
  5. 如权利要求4所述的医保监管方法,其特征在于,基于循环神经网络分析所述非规范文本,并根据所述预设模型和所述非规范文本的分析结果,提取所述非规范文本中的预设字段的步骤包括:The medical insurance supervision method according to claim 4, wherein the non-standard text is analyzed based on a recurrent neural network, and the non-standard text is extracted according to the analysis result of the preset model and the non-standard text The steps of the preset fields include:
    将所述非规范文本表示为向量序列;Express the non-standard text as a sequence of vectors;
    根据向量序列的语义内容和语义距离,基于双向循环神经网络将所述向量序列编码为句子向量矩阵;According to the semantic content and semantic distance of the vector sequence, the vector sequence is encoded into a sentence vector matrix based on a bidirectional recurrent neural network;
    根据所述预设模型,采用注意力机制压缩所述句子向量矩阵为句子向量,并提取所述句子向量中的预设字段。According to the preset model, an attention mechanism is used to compress the sentence vector matrix into a sentence vector, and a preset field in the sentence vector is extracted.
  6. 如权利要求1所述的医保监管方法,其特征在于,基于所述预设模型分析所述预设字段,并根据分析结果提取疑似异常案例的步骤包括:The medical insurance supervision method according to claim 1, wherein the step of analyzing the preset field based on the preset model and extracting the suspected abnormal case according to the analysis result includes:
    计算各被保险人对应的案例中预设字段的距离度量;Calculate the distance metric of the preset field in the case corresponding to each insured;
    计算被保险人的预设字段的距离度量相对其它被保险人的预设字段的距离度量的离群程度;Calculate the outlier degree of the distance measure of the insured's preset field relative to the distance measure of other insured's preset fields;
    比对所述离群程度和预设离群程度;Compare the outlier degree with the preset outlier degree;
    当所述离群程度大于所述预设离群程度时,标记所述被保险人对应的案例为疑似异常案例。When the outlier degree is greater than the preset outlier degree, the case corresponding to the insured is marked as a suspected abnormal case.
  7. 如权利要求1所述的医保监管方法,其特征在于,所述预设字段包括病情症状、检查项目、药物方案、住院时长和住院间隔时长中的至少一种。The medical insurance supervision method according to claim 1, wherein the preset field includes at least one of a disease symptom, an examination item, a drug plan, a length of hospitalization, and a length of interval between hospitalizations.
  8. 一种医保监管设备,其特征在于,所述医保监管设备包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机可读指令,所述计算机可读指令被所述处理器执行时实现以下步骤:A medical insurance supervision device, characterized in that the medical insurance supervision device includes: a memory, a processor, and computer-readable instructions stored on the memory and operable on the processor, the computer-readable instructions are When the processor executes, the following steps are realized:
    获取被保险人的就诊数据,其中,所述就诊数据包括被保险人信息、病情记录和诊疗路径;Obtain the medical data of the insured, wherein the medical data includes the information of the insured, the medical record and the path of diagnosis and treatment;
    根据预设模型,提取所述就诊数据中的预设字段;According to a preset model, extract preset fields in the medical data;
    基于所述预设模型分析所述预设字段,并根据分析结果提取疑似异常案例。The preset field is analyzed based on the preset model, and a suspected abnormal case is extracted according to the analysis result.
  9. 如权利要求8所述的医保监管设备,其特征在于,所述在基于所述预设模型分析所述预设字段,并根据分析结果提取疑似异常案例的步骤之后,所述医保监管方法还包括以下步骤:The medical insurance supervision device according to claim 8, wherein after the step of analyzing the preset field based on the preset model and extracting a suspected abnormal case according to the analysis result, the medical insurance supervision method further includes The following steps:
    输出所述疑似异常案例;Output the suspected abnormal case;
    接收与所述疑似异常案例对应的确认信息;Receive confirmation information corresponding to the suspected abnormal case;
    当所述确认信息确认所述疑似异常案例异常时,修正医保费用。When the confirmation information confirms that the suspected abnormal case is abnormal, the medical insurance fee is revised.
  10. 如权利要求9所述的医保监管设备,其特征在于,所述在接收与所述疑似异常案例对应的确认信息的步骤之后,还包括以下步骤:The medical insurance supervision device of claim 9, wherein after the step of receiving confirmation information corresponding to the suspected abnormal case, the method further includes the following steps:
    当所述确认信息否认所述疑似异常案例异常时,根据所述疑似异常案例优化所述预设模型,其中,所述预设模型的优化基于机器学习实现。When the confirmation information denies that the suspected abnormal case is abnormal, the preset model is optimized according to the suspected abnormal case, wherein the optimization of the preset model is implemented based on machine learning.
  11. 如权利要求8所述的医保监管设备,其特征在于,所述根据预设模型,提取所述就诊数据中的预设字段的步骤包括:The medical insurance supervision device according to claim 8, wherein the step of extracting a preset field in the medical data according to a preset model includes:
    清洗所述就诊数据;Clean the visit data;
    分别获取清洗后的就诊数据中的规范文本和非规范文本;Obtain the canonical text and non-normative text in the cleaned medical data, respectively;
    根据预设模型,提取所述规范文本中的预设字段;Extract the preset fields in the specification text according to the preset model;
    基于循环神经网络分析所述非规范文本,并根据所述预设模型和所述非规范文本的分析结果,提取所述非规范文本中的预设字段。The non-normative text is analyzed based on the recurrent neural network, and the preset fields in the non-normative text are extracted according to the analysis results of the preset model and the non-normative text.
  12. 如权利要求11所述的医保监管设备,其特征在于,基于循环神经网络分析所述非规范文本,并根据所述预设模型和所述非规范文本的分析结果,提取所述非规范文本中的预设字段的步骤包括:The medical insurance supervision device according to claim 11, wherein the non-standard text is analyzed based on a recurrent neural network, and the non-standard text is extracted from the analysis results of the preset model and the non-standard text The steps of the preset fields include:
    将所述非规范文本表示为向量序列;Express the non-standard text as a sequence of vectors;
    根据向量序列的语义内容和语义距离,基于双向循环神经网络将所述向量序列编码为句子向量矩阵;According to the semantic content and semantic distance of the vector sequence, the vector sequence is encoded into a sentence vector matrix based on a bidirectional recurrent neural network;
    根据所述预设模型,采用注意力机制压缩所述句子向量矩阵为句子向量,并提取所述句子向量中的预设字段。According to the preset model, an attention mechanism is used to compress the sentence vector matrix into a sentence vector, and a preset field in the sentence vector is extracted.
  13. 如权利要求11所述的医保监管设备,其特征在于,基于所述预设模型分析所述预设字段,并根据分析结果提取疑似异常案例的步骤包括:The medical insurance supervision device according to claim 11, wherein the step of analyzing the preset field based on the preset model and extracting the suspected abnormal case according to the analysis result includes:
    计算各被保险人对应的案例中预设字段的距离度量;Calculate the distance metric of the preset field in the case corresponding to each insured;
    计算被保险人的预设字段的距离度量相对其它被保险人的预设字段的距离度量的离群程度;Calculate the outlier degree of the distance measure of the insured's preset field relative to the distance measure of other insured's preset fields;
    比对所述离群程度和预设离群程度;Compare the outlier degree with the preset outlier degree;
    当所述离群程度大于所述预设离群程度时,标记所述被保险人对应的案例为疑似异常案例。When the outlier degree is greater than the preset outlier degree, the case corresponding to the insured is marked as a suspected abnormal case.
  14. 一种医保监管装置,其特征在于,所述医保监管装置包括:A medical insurance supervision device, characterized in that the medical insurance supervision device includes:
    数据获取模块,用以获取被保险人的就诊数据;The data acquisition module is used to obtain the medical data of the insured;
    字段提取模块,用以根据预设模型,提取所述就诊数据中的预设字段;A field extraction module, used to extract a preset field in the medical data according to a preset model;
    异常分析模块,用以基于所述预设模型分析所述预设字段,并根据分析结果提取疑似异常案例。The abnormality analysis module is configured to analyze the preset field based on the preset model and extract a suspected abnormal case according to the analysis result.
  15. 如权利要求14所述的医保监管装置,其特征在于,医保监管装置还包括:The medical insurance supervision device of claim 14, wherein the medical insurance supervision device further comprises:
    案例输出模块,用以输出疑似异常案例;Case output module to output suspected abnormal cases;
    信息接收模块,用以接收与疑似异常案例对应的确认信息;Information receiving module, used to receive confirmation information corresponding to suspected abnormal cases;
    费用修正模块,用以当确认信息确认疑似异常案例异常时,修正医保费用。The expense revision module is used to revise the medical insurance expenses when the confirmation information confirms that the suspected abnormal case is abnormal.
  16. 如权利要求15所述的医保监管装置,其特征在于,医保监管装置还包括:The medical insurance supervision device of claim 15, wherein the medical insurance supervision device further comprises:
    模型优化模块,用以当确认信息否认疑似异常案例异常时,根据疑似异常案例优化预设模型,其中,预设模型的优化基于机器学习实现。The model optimization module is used to optimize the preset model according to the suspected abnormal case when the confirmation information denies that the suspected abnormal case is abnormal, wherein the optimization of the preset model is implemented based on machine learning.
  17. 如权利要求14所述的医保监管装置,其特征在于,字段提取模块包括:The medical insurance supervision device of claim 14, wherein the field extraction module includes:
    数据清洗单元,用以清洗就诊数据;The data cleaning unit is used to clean up the medical data;
    文本分类单元,用以分别获取清洗后的就诊数据中的规范文本和非规范文本;The text classification unit is used to obtain the canonical text and non-normative text in the cleaned visit data, respectively;
    字段提取单元,用以根据预设模型,提取规范文本中的预设字段;The field extraction unit is used to extract the preset fields in the specification text according to the preset model;
    字段提取单元还用以基于循环神经网络分析非规范文本,并根据预设模型和非规范文本的分析结果,提取非规范文本中的预设字段。The field extraction unit is also used to analyze the non-standard text based on the recurrent neural network, and extract the preset fields in the non-standard text according to the analysis results of the preset model and the non-standard text.
  18. 如权利要求17所述的医保监管装置,其特征在于,字段提取单元包括:The medical insurance supervision device according to claim 17, wherein the field extraction unit includes:
    向量序列子单元,用以将非规范文本表示为向量序列;Vector sequence subunit to represent non-standard text as a sequence of vectors;
    向量矩阵子单元,用以根据向量序列的语义内容和语义距离,基于双向循环神经网络将向量序列编码为句子向量矩阵;Vector matrix subunit, used to encode the vector sequence into a sentence vector matrix based on the bidirectional recurrent neural network based on the semantic content and semantic distance of the vector sequence;
    向量压缩子单元,用以根据预设模型,采用注意力机制压缩句子向量矩阵为句子向量,并提取句子向量中的预设字段。The vector compression subunit is used to compress the sentence vector matrix into a sentence vector according to a preset model, and extract a preset field in the sentence vector.
  19. 如权利要求14所述的医保监管装置,其特征在于,异常分析模块包括:The medical insurance supervision device of claim 14, wherein the abnormality analysis module includes:
    距离度量单元,用以计算各被保险人对应的案例中预设字段的距离度量;The distance measurement unit is used to calculate the distance measurement of the preset field in the case corresponding to each insured;
    离群程度单元,用以计算被保险人的预设字段的距离度量相对其它被保险人的预设字段的距离度量的离群程度;The outlier degree unit is used to calculate the outlier degree of the distance measure of the insured's preset field relative to the distance measure of the other insured's preset field;
    比对单元,用以比对离群程度和预设离群程度;The comparison unit is used to compare the outlier degree and the preset outlier degree;
    标记单元,用以当离群程度大于预设离群程度时,标记被保险人对应的案例为疑似异常案例。The marking unit is used to mark the case corresponding to the insured as a suspected abnormal case when the outlier degree is greater than the preset outlier degree.
  20. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质上存储有医保监管可读指令,所述医保监管可读指令被处理器执行时实现如下的步骤:A computer-readable storage medium, characterized in that medical insurance supervision readable instructions are stored on the computer readable storage medium, and the medical insurance supervision readable instructions are executed by a processor to implement the following steps:
    获取被保险人的就诊数据,其中,所述就诊数据包括被保险人信息、病情记录和诊疗路径;Obtain the medical data of the insured, wherein the medical data includes the information of the insured, the medical record and the diagnosis and treatment path
    根据预设模型,提取所述就诊数据中的预设字段;According to the preset model, extract the preset fields in the medical data;
    基于所述预设模型分析所述预设字段,并根据分析结果提取疑似异常案例。 The preset field is analyzed based on the preset model, and a suspected abnormal case is extracted according to the analysis result.
PCT/CN2019/118810 2018-12-13 2019-11-15 Medical insurance supervision method, device, apparatus and computer readable storage medium WO2020119383A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811530813.6 2018-12-13
CN201811530813.6A CN109636645A (en) 2018-12-13 2018-12-13 Medical insurance monitoring and managing method, unit and computer readable storage medium

Publications (1)

Publication Number Publication Date
WO2020119383A1 true WO2020119383A1 (en) 2020-06-18

Family

ID=66073787

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/118810 WO2020119383A1 (en) 2018-12-13 2019-11-15 Medical insurance supervision method, device, apparatus and computer readable storage medium

Country Status (2)

Country Link
CN (1) CN109636645A (en)
WO (1) WO2020119383A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20230222597A1 (en) * 2022-01-10 2023-07-13 International Business Machines Corporation Predictive and Prescriptive Analytics for Managing High-Cost Claimants in Healthcare

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110910976A (en) * 2019-10-12 2020-03-24 平安国际智慧城市科技股份有限公司 Medical record detection method, device, equipment and storage medium
CN110941717B (en) * 2019-11-22 2023-08-11 深圳马可孛罗科技有限公司 Passenger ticket rule analysis method and device, electronic equipment and computer readable medium
CN111626884B (en) * 2020-06-01 2023-07-18 中国联合网络通信集团有限公司 Method and device for preventing insurance fraud
CN111986035A (en) * 2020-08-31 2020-11-24 平安医疗健康管理股份有限公司 Medical insurance service auditing method, device, equipment and storage medium
CN112380201A (en) * 2020-11-10 2021-02-19 中国人寿保险股份有限公司 Insurance information reporting method and device
CN113436027B (en) * 2021-06-30 2022-11-22 山大地纬软件股份有限公司 Medical insurance reimbursement abnormal data detection method and system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102013084A (en) * 2010-12-14 2011-04-13 江苏大学 System and method for detecting fraudulent transactions in medical insurance outpatient services
CN104408547A (en) * 2014-10-30 2015-03-11 浙江网新恒天软件有限公司 Data-mining-based detection method for medical insurance fraud behavior
CN105279382A (en) * 2015-11-10 2016-01-27 成都数联易康科技有限公司 Medical insurance abnormal data on-line intelligent detection method
US20160063636A1 (en) * 2014-08-28 2016-03-03 Cerner Innovation, Inc. Predictive insurance transaction error system
CN107610761A (en) * 2017-09-30 2018-01-19 电子科技大学 A kind of clinical path analysis method based on medical insurance data
CN108921710A (en) * 2018-06-08 2018-11-30 东莞迪赛软件技术有限公司 The method and system of medical insurance abnormality detection

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106156766B (en) * 2015-03-25 2020-02-18 阿里巴巴集团控股有限公司 Method and device for generating text line classifier
CN107609980A (en) * 2017-09-07 2018-01-19 平安医疗健康管理股份有限公司 Medical data processing method, device, computer equipment and storage medium
CN108052593B (en) * 2017-12-12 2020-09-22 山东科技大学 Topic keyword extraction method based on topic word vector and network structure
CN108596770B (en) * 2017-12-29 2022-04-01 山大地纬软件股份有限公司 Medical insurance fraud detection device and method based on outlier analysis
CN108536754A (en) * 2018-03-14 2018-09-14 四川大学 Electronic health record entity relation extraction method based on BLSTM and attention mechanism

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102013084A (en) * 2010-12-14 2011-04-13 江苏大学 System and method for detecting fraudulent transactions in medical insurance outpatient services
US20160063636A1 (en) * 2014-08-28 2016-03-03 Cerner Innovation, Inc. Predictive insurance transaction error system
CN104408547A (en) * 2014-10-30 2015-03-11 浙江网新恒天软件有限公司 Data-mining-based detection method for medical insurance fraud behavior
CN105279382A (en) * 2015-11-10 2016-01-27 成都数联易康科技有限公司 Medical insurance abnormal data on-line intelligent detection method
CN107610761A (en) * 2017-09-30 2018-01-19 电子科技大学 A kind of clinical path analysis method based on medical insurance data
CN108921710A (en) * 2018-06-08 2018-11-30 东莞迪赛软件技术有限公司 The method and system of medical insurance abnormality detection

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20230222597A1 (en) * 2022-01-10 2023-07-13 International Business Machines Corporation Predictive and Prescriptive Analytics for Managing High-Cost Claimants in Healthcare

Also Published As

Publication number Publication date
CN109636645A (en) 2019-04-16

Similar Documents

Publication Publication Date Title
WO2020119383A1 (en) Medical insurance supervision method, device, apparatus and computer readable storage medium
WO2020119403A1 (en) Hospitalization data abnormity detection method, apparatus and device, and readable storage medium
WO2020019403A1 (en) Electricity consumption abnormality detection method, apparatus and device, and readable storage medium
WO2020119402A1 (en) Irrelevant medication identification method and apparatus, terminal and computer-readable storage medium
WO2020078058A1 (en) Medical data abnormality identification method and device, terminal, and storage medium
WO2013035902A1 (en) Health care system based on video in remote health care solution and method for providing health care service
WO2020215681A1 (en) Indication information generation method and apparatus, terminal, and storage medium
WO2020119116A1 (en) Medical insurance auditing method, apparatus and device based on data analysis, and storage medium
WO2015008936A1 (en) Diagnostic apparatus using habit, diagnosis management apparatus, and diagnostic method using same
WO2021060899A1 (en) Training method for specializing artificial intelligence model in institution for deployment, and apparatus for training artificial intelligence model
WO2020010677A1 (en) Method for acquiring consecutive missing values, data analysis device, terminal, and storage medium
WO2020119385A1 (en) Method, device, equipment, and medium for prescription generation monitoring based on data analysis
WO2020119131A1 (en) Medication scheme abnormality identification method and device, terminal, and readable storage medium
WO2020087981A1 (en) Method and apparatus for generating risk control audit model, device and readable storage medium
WO2020119384A1 (en) Medical insurance abnormity detection method, apparatus and device based on big data analysis, and medium
WO2020119118A1 (en) Abnormal data processing method, apparatus and device, and computer readable storage medium
WO2021101105A2 (en) System and method for classifying subjects of medical specialty materials
WO2020108111A1 (en) Medical insurance fraud behavior identification method, apparatus, device and readable storage medium
WO2019107804A1 (en) Method for predicting drug-drug or drug-food interaction by using structural information of drug
WO2013027869A1 (en) Method for providing health care service for providing self-aware reminder for health care motivation
WO2018004111A1 (en) User terminal, service providing device, body temperature information collection and service providing system using smart thermometer, and service providing method using same
WO2021012508A1 (en) Ai image recognition method, apparatus and device, and storage medium
WO2019240343A1 (en) System for providing customized self-management information according to user state
WO2022050551A1 (en) Legal service provision system and method therefor
WO2020119175A1 (en) Method for monitoring medical expense abnormalities, monitoring server and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19896232

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 15.10.2021)

122 Ep: pct application non-entry in european phase

Ref document number: 19896232

Country of ref document: EP

Kind code of ref document: A1