US11182411B2 - Combined data driven and knowledge driven analytics - Google Patents
Combined data driven and knowledge driven analytics Download PDFInfo
- Publication number
- US11182411B2 US11182411B2 US16/145,869 US201816145869A US11182411B2 US 11182411 B2 US11182411 B2 US 11182411B2 US 201816145869 A US201816145869 A US 201816145869A US 11182411 B2 US11182411 B2 US 11182411B2
- Authority
- US
- United States
- Prior art keywords
- data
- machine learning
- knowledge based
- learning model
- knowledge
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active, expires
Links
- 238000010801 machine learning Methods 0.000 claims abstract description 65
- 238000000034 method Methods 0.000 claims abstract description 48
- 238000012545 processing Methods 0.000 claims description 22
- 238000012549 training Methods 0.000 claims description 20
- 238000003745 diagnosis Methods 0.000 claims description 16
- 238000013528 artificial neural network Methods 0.000 claims description 6
- 238000005065 mining Methods 0.000 claims description 3
- 238000001914 filtration Methods 0.000 claims 1
- 238000007418 data mining Methods 0.000 description 10
- 238000013459 approach Methods 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 6
- 238000010586 diagram Methods 0.000 description 5
- 230000008569 process Effects 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 238000013500 data storage Methods 0.000 description 3
- 206010028980 Neoplasm Diseases 0.000 description 2
- 208000035977 Rare disease Diseases 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 238000012517 data analytics Methods 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000013150 knee replacement Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000003068 static effect Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 206010020772 Hypertension Diseases 0.000 description 1
- 208000001953 Hypotension Diseases 0.000 description 1
- 101150044251 OGT gene Proteins 0.000 description 1
- 230000036772 blood pressure Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000004883 computer application Methods 0.000 description 1
- 230000000875 corresponding effect Effects 0.000 description 1
- 238000003066 decision tree Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 208000012866 low blood pressure Diseases 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 208000010125 myocardial infarction Diseases 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000004393 prognosis Methods 0.000 description 1
- 108090000623 proteins and genes Proteins 0.000 description 1
- 230000004044 response Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/70—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/313—Selection or weighting of terms for indexing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/335—Filtering based on additional data, e.g. user or group profiles
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/01—Probabilistic graphical models, e.g. probabilistic networks
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H10/00—ICT specially adapted for the handling or processing of patient-related medical or healthcare data
- G16H10/60—ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
Definitions
- Implementations of the present disclosure relate to data analytics.
- data analytics can improve the accuracy or diagnosis, identification, prognosis, or other predictions in a variety of environments.
- These techniques can include hard coded decision trees, automatic machine learning, or other uses of data to provide a predictive outcome.
- FIG. 1 is a schematic diagram of an embodiment of systems to generate and apply a machine learning model, which can be used in accordance with some embodiments.
- FIG. 2 is an example interface generated by an analytics system, in accordance with some embodiments.
- FIG. 3 is a flow diagram of an embodiment of a method of generating and applying a machine learning model, in accordance with some embodiments.
- FIG. 4 is a flow diagram of an embodiment of a method of applying a machine learning model, in accordance with some embodiments.
- FIG. 5 is an illustration showing an example computing device which may implement the embodiments described herein.
- Described herein are systems that utilize knowledge driven and data driven analytics to improve the quality of analysis performed. Separately, both knowledge driven and data driven analytics can provide useful information. However, each can have drawbacks based on the type of information and how it is used. Systems described herein can take benefits of each type of analytical approach to generate models with improved predictive capabilities.
- Knowledge driven approaches to analytics use accumulated knowledge from a number of sources to generate an output.
- knowledge based data may come in the form of text books, industry papers, journal articles, online information, databases, or other repositories where knowledge of one or more subjects is stored.
- knowledge driven analytics can be understood as codifying the knowledge stored in such repositories into an expert system.
- an industry expert can be used to generate the expert system or it can be learned automatically based on relations of terms in text. While this type of knowledge driven approach is useful, it is limited by the amount of knowledge available and the type of decisions it can make.
- a question-answer type expert system learned automatically from textual data can lack the semantics necessary to make active decisions based on new data.
- Data driven analytics can provide additional insight by generating a machine learning model based on example data.
- a machine learning system can take in a large number of hospital admissions records tagged with particular outcomes and generate a model that can predict outcomes based on those records. The model can then be applied to new admissions records to predict an outcome of newly admitted patients.
- data driven approaches overcome some of the shortcomings of knowledge driven systems, they may suffer from a complementary set of shortcomings.
- the machine learning approach disregards the available insight from existing domain knowledge.
- research and learning from experts in fields is disregarded when generating a model. Given sufficient data, such knowledge could in principle be learned automatically. However, the resources taken to train such a model could be significant.
- machine learning models can make mistakes based on statistical anomalies. For example, even with a large set of data, rare occurrences within the system can cause spurious connections to form within the model. For example, in the medical context, a machine learning model may connect a rare disease with another condition if by random distribution the other condition happens to be present among the few examples with the rare disease. While a knowledge-driven approach will generate a system that does not connect knee replacements to a rare form of cancer, if those patients having the rare form of cancer also had a knee replacement, the machine learning model may connect the two. This can result in inaccurate output from the machine learning model.
- the machine learning model may have a number of elements that generate an output based on new input. However, there may be no clear connection between one element of the machine learning model and how that effects that output based on certain inputs.
- the described systems and methods improve the operation of computer systems by generating models that use data-driven analytics, but also the knowledge available in a field. While generally described with reference to medical data, the systems and methods described can also be applied to other fields. For example, machine learning models can be generated by similar systems to predict mortgage defaults, stock prices, student admissions, astrophysics identification, high energy physics predictions, fleet maintenance, or other fields with sufficient example data to train a machine learning system and sufficient recorded knowledge to improve those models.
- Machine learning systems described herein utilize both data-driven and knowledge based approaches to generate models that exploit the advantages of each.
- knowledge based text data are mined to find co-occurrences of terms.
- medical texts can be mined to find co-occurrences of various medical diagnoses.
- the mining process generates a subset of existing medical knowledge.
- a set of training data is used to identify other pertinent information.
- the training data can include a large number of medical diagnoses, procedures, and outcomes.
- the pertinent information identified from the training set also includes co-occurrences of diagnoses.
- a machine learning model is generated to predict outcomes from the input data.
- the machine learning model can use co-occurrences of diagnoses from both the medical texts and the example data to make fewer mistakes compared with a data-driven only model. Unlike purely knowledge-driven approaches, this automatically maps real-world data available at the hospital to patient outcomes. The machine learning model can than receive new data and generate a predictive outcome based on the new data. The model can also provide new insights from data associations that may not be recorded in the knowledge based data.
- FIG. 1 is a diagram showing a model generation system 100 and an analytics system 160 .
- the model generation system 100 generates a predictive model 165 .
- the predictive model 165 can then be used by the analytics system 160 to generate a prediction 175 based on a characteristic data 170 .
- the characteristic data 170 can be an admission record for a patient at a hospital and the prediction 175 can be a likelihood of certain outcomes for the patient.
- the prediction 175 can be a likelihood of readmission within a certain amount of time, an identification of another potential diagnosis, or other information that can be used by a healthcare worker to improve care for a patient.
- the model generation system 100 includes knowledge data 140 and example data 150 .
- the knowledge data 140 can be data that includes a record of expert knowledge or other information about a relevant topic.
- the knowledge data 140 can be in the form of human-readable text (for example, textbooks or research articles), or it can be in some other suitable format (for example, a computerized database that includes known associations between genes and medical disorders).
- knowledge data 140 can include medical textbooks, journal articles, internet articles, or other data sources that include a record of knowledge about a certain field.
- the example data 150 can include examples of records associated with the field.
- the example data 150 can include a large set of hospital admission data.
- the admission data can include one or more medical diagnosis codes, procedure codes, patient data, or the like.
- the example data 150 can also include one or more flags to be used when training a predictive model 165 .
- admission data for patients can be flagged as resulting in certain outcomes, such as readmission with a certain period of time after release of a patient.
- the model generation system 100 includes a data mining service 110 .
- the data mining service 110 generates a set of data from the knowledge data 140 that can be used to train a machine learning model.
- the data mining service 110 identifies diagnostic codes, outcomes, diseases, conditions, procedures, or other medical data within the knowledge data 140 in order to form groups of useable data for training.
- the data mining service 110 may weight one or more of the identified data elements within the knowledge data 140 differently based on the context. For example, the data mining service can perform analysis to change weighting of identified relationships between different diagnosis codes based on textual analysis.
- the relationship may have a lower weight than a one where the two terms are in the same sentence.
- Other rules and analysis techniques can also be used. For example, structural rules using sentence templates can be used to identify those instances where the terms are highly correlated or have a specific relationship of interest. As one example, instances where the terms are related with “cause,” “effect,” “leads to,” or the like between the words may be given a higher weight.
- the data mining service 110 can therefore generate a set of relevant facts known in in the field.
- these facts may be sets of terms.
- the terms can be limited to those that are related to diagnosis codes, procedure codes, or the like.
- the model generation system 100 can train a machine learning model without data mining service 110 .
- the model generation system 100 can train a machine learning model from raw knowledge data 140 .
- the model generation system 100 also includes combination service 120 that combines the knowledge data 140 and the example data 150 .
- the combination service 120 can use the output of the data mining service 110 and the example dat 1 150 to generate a set of data for use by the model generation service 130 .
- the combination service 120 can replicate certain instances from either the knowledge data 140 or the example data 150 to reflect relative weighting of such instances. For example, if an instance in knowledge data 140 is identified by the data mining service 110 as having a strong correlation between certain words, it can be weighted higher than other instances. In order to have the model generation system 100 consider the instance at a higher weight, it can be provided to the model generation service 130 multiple times. In some embodiments, the combination service 120 can provide additional weighting to different instances.
- all of the knowledge data 140 or all of the example data 150 can be weighted higher than the other set of data.
- the combination service 120 can be used to confirm or reject models output by the model generation service 130 .
- the model generation service 130 trains a machine learning model based on the knowledge data 140 and the example data 150 as provided through one or more of the data mining service 110 and the combination service 120 .
- a number of different machine learning techniques can be used by the model generation service 130 .
- the model generation service 130 performs topic modeling to identify groups of terms in the data that are related. The grouping of terms can then be used to determine a likelihood of certain terms when other terms are present.
- the model generation service 130 can treat data instances from the knowledge data 140 and example data 150 as a set of diagnostic, procedure, or other medical codes. The model generation service 130 can then determine different groupings of those codes, which can be referred to as topics.
- the terms in the groups of codes can have an associated probability indicating how related they are to that particular group.
- the model generation service 130 can change those probabilities based on how often different terms co-occur in the data.
- the predictive model 165 converges and those probabilities become indicative of the likelihood of the co-occurrence of terms in the groups.
- example data 150 by the model generation service 130 without the use of knowledge data 140 can result in misidentification of certain groupings due to limitations of the data-set.
- mining the knowledge data 140 and combining the data with the example data 150 prevents spurious correlations in the data by providing additional information for training.
- using example data 150 to train the predictive model 165 can identify correlations that are not present in the knowledge data if there are sufficient co-occurrences in the data.
- the model generation service 130 can perform different types of machine learning model training. As discussed above, the model generation service 130 can perform topic modeling. In some embodiments, the topic modeling may be performed as latent dirichlet allocation (LDA), probabilistic latent semantic analysis, or other topic modeling. In some embodiments, the model generation service 130 can generate a neural network or other types of model as well. For example, the example data 150 can have one or more flags for different outcomes and the model generation service 130 can be trained to predict those outcomes based on the inputs of new data. For example, in some embodiments, the model generation service 130 can train a predictive model 165 to determine the likelihood of readmission to a hospital in view of admissions data for a new patient.
- LDA latent dirichlet allocation
- the model generation service 130 can generate a neural network or other types of model as well.
- the example data 150 can have one or more flags for different outcomes and the model generation service 130 can be trained to predict those outcomes based on the inputs of new data.
- the model generation service 130 can train a predictive model
- the predictive model 165 generated by the model generation system 100 can be used by an analytics system 160 to predict outcomes based on characteristic data 170 .
- the analytics system can receive characteristic data 170 from an internal or external system.
- the characteristic data 170 could be hospital admission data for a new patient.
- the analytics system 160 can be hosted on the same computer system as the model generation system 100 , or it can be hosted on a different device.
- the model generation system 100 can be hosted in a server system and the predictive model and analytics system 160 can be hosted on a local system.
- the analytics system 160 can be hosted on a personal computer, laptop, tablet, phone, or other computing device in a room of a hospital where an output can be provided to a medical practitioner.
- the analytics system 160 can apply the predictive model 165 to the received characteristic data 170 in order to generate a prediction output 175 .
- the predictive model 165 is a neural network that receives characteristic data 170 and extracts features to generate an output.
- the predictive model 165 is a topic model that includes groups of terms in a number of topics.
- the analytics system 160 can extract terms present in the characteristic data 170 and identify additional terms that may be related to the characteristic data 170 based on how correlated the terms in characteristic data 170 . For example, for a term in the characteristic data, there can be a corresponding term associated with one or more topics of the predictive model 165 .
- the analytics system 160 can determine other related terms. As applied to each of the terms in the characteristic data 170 , the analytics system can generate a set of terms that are predicted to also be associated with the source of the characteristic data. This can be used to generate a prediction output 175 .
- the characteristic data 170 can include a number of diagnosis, procedures, or other information about a patient.
- the information can be received through admission data at a hospital.
- the analytics system 160 can then use the predictive model 165 to determine other diagnosis, procedures, or outcomes that have high probability of co-occurrence with the characteristic data 170 for the patient.
- the characteristic data 170 indicates a high probability of co-occurrence with a readmission, death, heart attack, or other negative consequence, that outcome can be provided to a medical practitioner to provide guidance for further treatment of the patient.
- the prediction outcome 175 can provide predicted outcomes, potentially related conditions, or other information.
- the analytics system 160 can then provide the prediction output 175 to the medical practitioner.
- the analytics system 160 can provide an output as an alert for high risk patients or as an indication of the likelihood of certain events.
- the analytics system 160 can provide an output as a probability that a patient will be readmitted to the hospital within a period of time based on the application of the predictive model 165 to the characteristic data 170 .
- the analytics system 160 can provide the prediction outcome 175 in a computer application, an email, automated text or telephone calls, printed output on admission charts for the patient, or in other formats through which to inform a medical practitioner of the output. While the outputs are discussed with respect to medical environments, in other fields additional relevant outputs could be provided. For example, the likelihood of a mechanical failure in a system, likelihood of student success at a college, or other predictions to inform an expert of the analysis results.
- FIG. 2 illustrates an example interface 200 generated in response to a predictive outcome for a medical admission.
- the example interface 200 can include admission information 210 , diagnostic and procedure codes 220 , and a predictive outcome 230 .
- admission information 210 and the diagnostic and procedure codes 220 can be used as characteristic data 170 by an analytics system 160 .
- the analytics system 160 can apply the predictive model 165 to the diagnostic and procedure codes 220 to generate a prediction about an outcome for a patient.
- the admission information 210 and the diagnostic and procedure codes 220 may be used to generate an outcome.
- the data in the admission information 210 can be modified as an input to a machine learning model.
- a patient's blood pressure can be characterized as high or low and used as a characteristic to provide to the machine learning model.
- Other characterizations can also be made such as calculating a person's body mass index, high or low heart rate, or other data for input to the machine learning model.
- only the diagnostic or procedure codes 210 may be provided to the machine learning model.
- an analytics system can provide an indication 230 of a potential outcome for a patient.
- the example prediction is that there is an 80% chance of readmission within the next month for the patient. As presented to a medical practitioner, this can help convince the medical practitioner to further analyze the medical records of the patient, keep the patient admitted, perform additional tests, or otherwise improve treatment of a patient.
- fewer or additional details may be provided than as shown in the interface 200 . For example, in some embodiments, the probability of an outcome may not be provided and instead just an indication to perform further tests can be given. Furthermore, in some embodiments, additional potential diagnosis or recommended procedures can be provided as part of the predicted outcome 230 .
- FIG. 3 is a flow chart 300 illustrating example operations of a model generation system.
- the processes described with reference to FIG. 3 may be performed by a model generation system 100 as described with reference to FIG. 1 .
- the model generation system receives a set of example data and a set of knowledge based data.
- the example data may be representative of real world outcomes for a system that the model generation system is modeling.
- the knowledge based data can be textual or non-textual information describing the field in which that system operates.
- the example data can include patient admission charts or similar documents that describe characteristics of a patient.
- the example data can also include an outcome that results during or after treatment of the patient, additional diagnosis for the patient that are later discovered, or other tags that can be used to train a model.
- the knowledge based data can include medical textbooks, online articles, published journal articles, a combination of text sources, non-text sources such as computer databases, or the like.
- the model generation system can combine the set of example data and the set of knowledge based data to generate a set of combined data.
- Combining the data can include analyzing the knowledge based data to identify co-occurrences of different terms. For example, in the medical context, diagnosis or procedures that co-occur within the texts can be identified. In some embodiments, those terms can then be converted to diagnosis codes or procedure codes that match with the example data.
- the combined set of data can be organized into groups that can be used to train a model. In some embodiments, certain groups can be duplicated in the combined data set to increase their weighting within the generated model. In some embodiments, other weighting techniques can be used when training the model.
- a model generation system trains a machine learning model based on the combined dataset.
- this can include topic modelling to identify certain terms that have higher probabilities of co-occurrences within documents of the combined data set.
- training the system can include training a neural network, or other machine learning model, based on the data and one or more tags for certain inputs.
- training a machine learning model can include generating a model based on the example data portion of a data set and identifying support for the generated model within the knowledge based portion of the data set.
- the knowledge based data can be used by a model generation system to check the accuracy of a model generated by the example data.
- the system can apply the machine learning model to a new set of data. Applying the model to the new set of data can generate a predictive outcome based on that set of data.
- the new set of data can be an admission chart for a new patient at the hospital.
- the machine learning model can then use the characteristics in the admission chart to generate an outcome.
- the outcome can include a likelihood of readmission, a likelihood that the patient has a particular diagnosis, or the likelihood of other medical outcomes.
- FIG. 4 is a flow chart 400 illustrating example operations of an analytics system.
- the processes described with reference to FIG. 4 may be performed by an analytics system 160 as described with reference to FIG. 1 .
- Flow chart 400 is described within the context of applying an analytics system to medical records, however, similar operations could be used to analyze data in other fields.
- the analytics system receives a hospital admissions record of a subject.
- the admissions record can include patient data, diagnosis codes, procedure codes, or additional information.
- the analytics system can generate a set of characteristic data of the subject based at least in part on the hospital admission record.
- the characteristic data can include data that is input to a machine learning model to generate an outcome.
- generating the characteristic data can include generating characteristics based on the admissions chart.
- patient data can be analyzed to identify one or more characteristics such as high body mass index, high blood pressure, low blood pressure, or other diagnostic or characteristics data that can be derived from the admission chart, but may not be included in the admissions chart.
- the analytics system can apply a machine learning model to the characteristic data of the subject.
- the machine learning model can be trained with example data and knowledge based data.
- the machine learning model could be one trained as described above with reference to flow chart 300 or by model generation system 100 .
- the analytics system can generate a useful output. For example, if the machine learning model is a topic model, the output of the machine learning model can be a probability that certain diagnostic codes also apply to the patient. In some embodiments, a probability of readmission, death, or other information can also be provided by the analytics system.
- FIG. 5 illustrates a diagrammatic representation of a machine in the example form of a computer system 500 within which a set of instructions, for causing the machine to perform any one or more of the methodologies discussed herein, may be executed.
- the machine may be connected (e.g., networked) to other machines in a local area network (LAN), an intranet, an extranet, or the Internet.
- the machine may operate in the capacity of a server or a client machine in a client-server network environment, or as a peer machine in a peer-to-peer (or distributed) network environment.
- the machine may be a personal computer (PC), a tablet PC, a set-top box (STB), a Personal Digital Assistant (PDA), a cellular telephone, a web appliance, a server, a network router, a switch or bridge, a hub, an access point, a network access control device, or any machine capable of executing a set of instructions (sequential or otherwise) that specify actions to be taken by that machine.
- PC personal computer
- PDA Personal Digital Assistant
- STB set-top box
- a cellular telephone a web appliance
- server a network router, a switch or bridge, a hub, an access point, a network access control device, or any machine capable of executing a set of instructions (sequential or otherwise) that specify actions to be taken by that machine.
- machine shall also be taken to include any collection of machines that individually or jointly execute a set (or multiple sets) of instructions to perform any one or more of the methodologies discussed herein.
- computer system 500 may be representative of a server computer system, such as
- the exemplary computer system 500 includes a processing device 502 , a main memory 504 (e.g., read-only memory (ROM), flash memory, dynamic random access memory (DRAM), a static memory 506 (e.g., flash memory, static random access memory (SRAM), etc.), and a data storage device 518 , which communicate with each other via a bus 530 .
- main memory 504 e.g., read-only memory (ROM), flash memory, dynamic random access memory (DRAM), a static memory 506 (e.g., flash memory, static random access memory (SRAM), etc.
- SRAM static random access memory
- Any of the signals provided over various buses described herein may be time multiplexed with other signals and provided over one or more common buses.
- the interconnection between circuit components or blocks may be shown as buses or as single signal lines. Each of the buses may alternatively be one or more single signal lines and each of the single signal lines may alternatively be buses.
- Processing device 502 represents one or more general-purpose processing devices such as a microprocessor, central processing unit, or the like. More particularly, the processing device may be complex instruction set computing (CISC) microprocessor, reduced instruction set computer (RISC) microprocessor, very long instruction word (VLIW) microprocessor, or processor implementing other instruction sets, or processors implementing a combination of instruction sets. Processing device 502 may also be one or more special-purpose processing devices such as an application specific integrated circuit (ASIC), a field programmable gate array (FPGA), a digital signal processor (DSP), network processor, or the like. The processing device 502 is configured to execute processing logic 526 , which may be one example of system 400 shown in FIG. 4 , for performing the operations and steps discussed herein.
- processing logic 526 may be one example of system 400 shown in FIG. 4 , for performing the operations and steps discussed herein.
- the data storage device 518 may include a machine-readable storage medium 528 , on which is stored one or more set of instructions 522 (e.g., software) embodying any one or more of the methodologies of functions described herein, including instructions to cause the processing device 502 to execute model generation system 100 or analytics system 160 .
- the instructions 522 may also reside, completely or at least partially, within the main memory 504 or within the processing device 502 during execution thereof by the computer system 500 ; the main memory 504 and the processing device 502 also constituting machine-readable storage media.
- the instructions 522 may further be transmitted or received over a network 520 via the network interface device 508 .
- the data storage device, the memory, the network, the processing device, and other components may store and/or access the data, including the example data and knowledge-based data.
- This data may be stored in raw form or in a preprocessed form, depending on the application.
- the machine-readable storage medium 528 may also be used to store instructions to perform a method for analyzing log data received from networked devices, as described herein. While the machine-readable storage medium 528 is shown in an exemplary embodiment to be a single medium, the term “machine-readable storage medium” should be taken to include a single medium or multiple media (e.g., a centralized or distributed database, or associated caches and servers) that store the one or more sets of instructions.
- a machine-readable medium includes any mechanism for storing information in a form (e.g., software, processing application) readable by a machine (e.g., a computer).
- the machine-readable medium may include, but is not limited to, magnetic storage medium (e.g., floppy diskette); optical storage medium (e.g., CD-ROM); magneto-optical storage medium; read-only memory (ROM); random-access memory (RAM); erasable programmable memory (e.g., EPROM and EEPROM); flash memory; or another type of medium suitable for storing electronic instructions.
- magnetic storage medium e.g., floppy diskette
- optical storage medium e.g., CD-ROM
- magneto-optical storage medium e.g., magneto-optical storage medium
- ROM read-only memory
- RAM random-access memory
- EPROM and EEPROM erasable programmable memory
- flash memory or another type of medium suitable for storing electronic instructions.
- some embodiments may be practiced in distributed computing environments where the machine-readable medium is stored on and or executed by more than one computer system.
- the information transferred between computer systems may either be pulled or pushed across the communication medium connecting the computer systems.
- Embodiments of the claimed subject matter include, but are not limited to, various operations described herein. These operations may be performed by hardware components, software, firmware, or a combination thereof.
- the term “or” is intended to mean an inclusive “or” rather than an exclusive “or”. That is, unless specified otherwise, or clear from context, “X includes A or B” is intended to mean any of the natural inclusive permutations. That is, if X includes A; X includes B; or X includes both A and B, then “X includes A or B” is satisfied under any of the foregoing instances.
- the articles “a” and “an” as used in this application and the appended claims should generally be construed to mean “one or more” unless specified otherwise or clear from context to be directed to a singular form.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Medical Informatics (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Public Health (AREA)
- Computational Linguistics (AREA)
- Databases & Information Systems (AREA)
- Epidemiology (AREA)
- Primary Health Care (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Biomedical Technology (AREA)
- Computational Mathematics (AREA)
- Probability & Statistics with Applications (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Optimization (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biophysics (AREA)
- Mathematical Analysis (AREA)
- Molecular Biology (AREA)
- Algebra (AREA)
- Pathology (AREA)
- Medical Treatment And Welfare Office Work (AREA)
Abstract
Description
Claims (18)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US16/145,869 US11182411B2 (en) | 2018-09-28 | 2018-09-28 | Combined data driven and knowledge driven analytics |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US16/145,869 US11182411B2 (en) | 2018-09-28 | 2018-09-28 | Combined data driven and knowledge driven analytics |
Publications (2)
Publication Number | Publication Date |
---|---|
US20200104412A1 US20200104412A1 (en) | 2020-04-02 |
US11182411B2 true US11182411B2 (en) | 2021-11-23 |
Family
ID=69945981
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/145,869 Active 2039-08-23 US11182411B2 (en) | 2018-09-28 | 2018-09-28 | Combined data driven and knowledge driven analytics |
Country Status (1)
Country | Link |
---|---|
US (1) | US11182411B2 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11379727B2 (en) * | 2019-11-25 | 2022-07-05 | Shanghai United Imaging Intelligence Co., Ltd. | Systems and methods for enhancing a distributed medical network |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20210082575A1 (en) * | 2019-09-18 | 2021-03-18 | Cerner Innovation, Inc. | Computerized decision support tool for post-acute care patients |
US11443045B2 (en) * | 2020-05-05 | 2022-09-13 | Booz Allen Hamilton Inc. | Methods and systems for explaining a decision process of a machine learning model |
CN113724861A (en) * | 2021-09-06 | 2021-11-30 | 汤学民 | Preliminary diagnosis generation method and device based on deep learning and computer equipment |
Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140201126A1 (en) * | 2012-09-15 | 2014-07-17 | Lotfi A. Zadeh | Methods and Systems for Applications for Z-numbers |
US20140222746A1 (en) * | 2013-02-01 | 2014-08-07 | Worcester Polytechnic Institute | Inquiry skills tutoring system |
US20140279837A1 (en) * | 2013-03-15 | 2014-09-18 | BeulahWorks, LLC | Knowledge capture and discovery system |
US9092802B1 (en) * | 2011-08-15 | 2015-07-28 | Ramakrishna Akella | Statistical machine learning and business process models systems and methods |
US20170006135A1 (en) * | 2015-01-23 | 2017-01-05 | C3, Inc. | Systems, methods, and devices for an enterprise internet-of-things application development platform |
US20180315507A1 (en) * | 2017-04-27 | 2018-11-01 | Yale-New Haven Health Services Corporation | Prediction of adverse events in patients undergoing major cardiovascular procedures |
US20190005200A1 (en) * | 2017-06-28 | 2019-01-03 | General Electric Company | Methods and systems for generating a patient digital twin |
US20190131016A1 (en) * | 2016-04-01 | 2019-05-02 | 20/20 Genesystems Inc. | Methods and compositions for aiding in distinguishing between benign and maligannt radiographically apparent pulmonary nodules |
US20190147128A1 (en) * | 2016-06-14 | 2019-05-16 | 360 Knee Systems Pty Ltd | Graphical representation of a dynamic knee score for a knee surgery |
US20200050586A1 (en) * | 2017-07-31 | 2020-02-13 | Splunk Inc. | Query execution at a remote heterogeneous data store of a data fabric service |
US20200104733A1 (en) * | 2018-09-27 | 2020-04-02 | Palo Alto Research Center Incorporated | Generation of human readable explanations of data driven analytics |
US10671703B1 (en) * | 2015-03-26 | 2020-06-02 | Cerner Innovation, Inc. | Maintaining stability of health services entities treating influenza |
-
2018
- 2018-09-28 US US16/145,869 patent/US11182411B2/en active Active
Patent Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9092802B1 (en) * | 2011-08-15 | 2015-07-28 | Ramakrishna Akella | Statistical machine learning and business process models systems and methods |
US20140201126A1 (en) * | 2012-09-15 | 2014-07-17 | Lotfi A. Zadeh | Methods and Systems for Applications for Z-numbers |
US20140222746A1 (en) * | 2013-02-01 | 2014-08-07 | Worcester Polytechnic Institute | Inquiry skills tutoring system |
US20140279837A1 (en) * | 2013-03-15 | 2014-09-18 | BeulahWorks, LLC | Knowledge capture and discovery system |
US20170006135A1 (en) * | 2015-01-23 | 2017-01-05 | C3, Inc. | Systems, methods, and devices for an enterprise internet-of-things application development platform |
US10671703B1 (en) * | 2015-03-26 | 2020-06-02 | Cerner Innovation, Inc. | Maintaining stability of health services entities treating influenza |
US20190131016A1 (en) * | 2016-04-01 | 2019-05-02 | 20/20 Genesystems Inc. | Methods and compositions for aiding in distinguishing between benign and maligannt radiographically apparent pulmonary nodules |
US20190147128A1 (en) * | 2016-06-14 | 2019-05-16 | 360 Knee Systems Pty Ltd | Graphical representation of a dynamic knee score for a knee surgery |
US20180315507A1 (en) * | 2017-04-27 | 2018-11-01 | Yale-New Haven Health Services Corporation | Prediction of adverse events in patients undergoing major cardiovascular procedures |
US20190005200A1 (en) * | 2017-06-28 | 2019-01-03 | General Electric Company | Methods and systems for generating a patient digital twin |
US20200050586A1 (en) * | 2017-07-31 | 2020-02-13 | Splunk Inc. | Query execution at a remote heterogeneous data store of a data fabric service |
US20200104733A1 (en) * | 2018-09-27 | 2020-04-02 | Palo Alto Research Center Incorporated | Generation of human readable explanations of data driven analytics |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11379727B2 (en) * | 2019-11-25 | 2022-07-05 | Shanghai United Imaging Intelligence Co., Ltd. | Systems and methods for enhancing a distributed medical network |
Also Published As
Publication number | Publication date |
---|---|
US20200104412A1 (en) | 2020-04-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Suresh et al. | A framework for understanding sources of harm throughout the machine learning life cycle | |
Peterson et al. | Classification accuracy as a substantive quantity of interest: Measuring polarization in westminster systems | |
US11182411B2 (en) | Combined data driven and knowledge driven analytics | |
CN110020660A (en) | Use the integrity assessment of the unstructured process of artificial intelligence (AI) technology | |
US20160110502A1 (en) | Human and Machine Assisted Data Curation for Producing High Quality Data Sets from Medical Records | |
CN111584021B (en) | Medical records information verification method and device, electronic equipment and storage medium | |
US20200104733A1 (en) | Generation of human readable explanations of data driven analytics | |
US11748384B2 (en) | Determining an association rule | |
KR102517743B1 (en) | Method and system for investment indicators related with stock item based on artificial intelligence | |
Suresh et al. | Understanding potential sources of harm throughout the machine learning life cycle | |
Loyola et al. | UNSL at eRisk 2021: A Comparison of Three Early Alert Policies for Early Risk Detection. | |
CN118230952A (en) | Psychological assessment method and system based on BPRS (Business Process reference System) concise psychosis table | |
Ardimento et al. | Predicting bug-fix time: Using standard versus topic-based text categorization techniques | |
CN113722507A (en) | Hospital cost prediction method and device based on knowledge graph and computer equipment | |
Amador-Domínguez et al. | A case-based reasoning model powered by deep learning for radiology report recommendation | |
CN116701752A (en) | News recommendation method and device based on artificial intelligence, electronic equipment and medium | |
Sippl et al. | Data-based stakeholder identification in technical change management | |
Denvir et al. | The Devil in the Detail: Mitigating the Constitutional & Rule of Law Risks Associated with the Use of Artificial Intelligence in the Legal Domain | |
CN113987351A (en) | Artificial intelligence based intelligent recommendation method and device, electronic equipment and medium | |
Painuly et al. | Natural Language Processing Techniques for e-Healthcare Supply Chain Management System | |
Galli et al. | AI Approaches to Predictive Justice: A Critical Assessment | |
Suresh et al. | A Framework of Potential Sources of Harm Throughout the Machine Learning Life Cycle | |
Maraver et al. | Automatic identification of scientific publications describing digital reconstructions of neural morphology | |
US20240184974A1 (en) | Artificial intelligence-based systems and methods for textual identification and feature generation | |
US20240145050A1 (en) | Phenotyping of clinical notes using natural language processing models |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
AS | Assignment |
Owner name: PALO ALTO RESEARCH CENTER INCORPORATED, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:BART, EVGENIY;REEL/FRAME:053772/0447 Effective date: 20180927 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
AS | Assignment |
Owner name: XEROX CORPORATION, CONNECTICUT Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:PALO ALTO RESEARCH CENTER INCORPORATED;REEL/FRAME:064038/0001 Effective date: 20230416 |
|
AS | Assignment |
Owner name: CITIBANK, N.A., AS COLLATERAL AGENT, NEW YORK Free format text: SECURITY INTEREST;ASSIGNOR:XEROX CORPORATION;REEL/FRAME:064760/0389 Effective date: 20230621 |
|
AS | Assignment |
Owner name: XEROX CORPORATION, CONNECTICUT Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE REMOVAL OF US PATENTS 9356603, 10026651, 10626048 AND INCLUSION OF US PATENT 7167871 PREVIOUSLY RECORDED ON REEL 064038 FRAME 0001. ASSIGNOR(S) HEREBY CONFIRMS THE ASSIGNMENT;ASSIGNOR:PALO ALTO RESEARCH CENTER INCORPORATED;REEL/FRAME:064161/0001 Effective date: 20230416 |
|
AS | Assignment |
Owner name: JEFFERIES FINANCE LLC, AS COLLATERAL AGENT, NEW YORK Free format text: SECURITY INTEREST;ASSIGNOR:XEROX CORPORATION;REEL/FRAME:065628/0019 Effective date: 20231117 |
|
AS | Assignment |
Owner name: XEROX CORPORATION, CONNECTICUT Free format text: TERMINATION AND RELEASE OF SECURITY INTEREST IN PATENTS RECORDED AT RF 064760/0389;ASSIGNOR:CITIBANK, N.A., AS COLLATERAL AGENT;REEL/FRAME:068261/0001 Effective date: 20240206 Owner name: CITIBANK, N.A., AS COLLATERAL AGENT, NEW YORK Free format text: SECURITY INTEREST;ASSIGNOR:XEROX CORPORATION;REEL/FRAME:066741/0001 Effective date: 20240206 |