US20190311807A1 - Systems and methods for responding to healthcare inquiries - Google Patents

Systems and methods for responding to healthcare inquiries Download PDF

Info

Publication number
US20190311807A1
US20190311807A1 US16/265,799 US201916265799A US2019311807A1 US 20190311807 A1 US20190311807 A1 US 20190311807A1 US 201916265799 A US201916265799 A US 201916265799A US 2019311807 A1 US2019311807 A1 US 2019311807A1
Authority
US
United States
Prior art keywords
user
medical
diagnosis
engine
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US16/265,799
Inventor
Anitha Kannan
Murali RAVURI
Vitor Rodrigues
Vignesh Venkataraman
Geoffrey TSO
Neal KHOSLA
Neil Hunt
Xavier Amatriain
Manish Chablani
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Curai Inc
Original Assignee
Curai Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Curai Inc filed Critical Curai Inc
Priority to US16/265,799 priority Critical patent/US20190311807A1/en
Priority to US16/266,784 priority patent/US10847265B2/en
Priority to PCT/US2019/023381 priority patent/WO2019194980A1/en
Priority to EP19782274.5A priority patent/EP3776247A4/en
Assigned to Curai, Inc. reassignment Curai, Inc. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: CHABLANI, MANISH, KANNAN, ANITHA, TSO, Geoffrey, VENKATARAMAN, VIGNESH, KHOSLA, NEAL, HUNT, NEIL, Amatriain, Xavier, RAVURI, Murali, RODRIGUES, VITOR
Publication of US20190311807A1 publication Critical patent/US20190311807A1/en
Priority to US17/903,645 priority patent/US20230105969A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data
    • G16H10/20ICT specially adapted for the handling or processing of patient-related medical or healthcare data for electronic clinical trials or questionnaires
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/30ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/20Ensemble learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/022Knowledge engineering; Knowledge acquisition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/027Frames
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data
    • G16H10/60ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H20/00ICT specially adapted for therapies or health-improving plans, e.g. for handling prescriptions, for steering therapy or for monitoring patient compliance
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H40/00ICT specially adapted for the management or administration of healthcare resources or facilities; ICT specially adapted for the management or operation of medical equipment or devices
    • G16H40/60ICT specially adapted for the management or administration of healthcare resources or facilities; ICT specially adapted for the management or operation of medical equipment or devices for the operation of medical equipment or devices
    • G16H40/67ICT specially adapted for the management or administration of healthcare resources or facilities; ICT specially adapted for the management or operation of medical equipment or devices for the operation of medical equipment or devices for remote operation
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/20ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/70ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H70/00ICT specially adapted for the handling or processing of medical references
    • G16H70/60ICT specially adapted for the handling or processing of medical references relating to pathologies
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H80/00ICT specially adapted for facilitating communication between medical practitioners or patients, e.g. for collaborative diagnosis, therapy or health monitoring
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Definitions

  • the present disclosure relates generally to medical decision support systems and, more particularly, to systems and methods for responding to healthcare inquiries.
  • the decision is provided as a ranked list of diagnosis, also called “differential diagnosis.”
  • Most expert rule-based systems encode knowledge as a bipartite graph with nodes representing findings and diagnoses, and the edges representing likelihoods of a finding given a diagnosis and/or a diagnosis given a finding.
  • Internist-1 Two well-known expert rule-based systems are Internist-1 (see e.g., Miller, R. A., H. E. Pople, and J. D. Myers, “Internist-1, an Experimental Computer-Based Diagnostic Consultant for General Internal Medicine,” The New England Journal of Medicine , vol. 307, no. 8 (Aug. 19, 1982): 468-76) and DXplain® (see e.g., Barnett, G. Octo, James J. Cimino, Jon A. Hupp, and Edward P. Hoffer, “DXplain: An Evolving Diagnostic Decision-Support System,” JAMA , vol. 258, no. 1 (Jul. 3, 1987): 67-74).
  • DXplain® see e.g., Barnett, G. Octo, James J. Cimino, Jon A. Hupp, and Edward P. Hoffer, “DXplain: An Evolving Diagnostic Decision-Support System,” JAMA , vol. 258, no. 1 (Jul. 3, 1987)
  • Internist-1 and DXplain® are related to QMR (see e.g., Miller, R. A., H. E. Pople, and J. D. Myers, “Internist-1, an Experimental Computer-Based Diagnostic Consultant for General Internal Medicine,” The New England Journal of Medicine , vol. 307, no. 8 (Aug. 19, 1982): 468-76) since Internist-1 is a precursor and DXplain® is a recent evolution.
  • the relation between a finding and a disease is encoded into two numbers: “Evoking Strength” encodes the likelihood of a disease given a finding while “Frequency” describes how likely a symptom is to occur given a disease manifestation.
  • a medical decision support system not only needs to have a model that predicts a diagnosis given a set of inputs, but also needs to provide an efficient way to navigate the path towards an optimal diagnosis by asking the right questions. That is easier to implement in rule-based systems, but gets more complicated for more complex diagnosis models.
  • the techniques may be realized as a method for responding to a healthcare inquiry from a user, according to a set of instructions stored on a memory of a computing device and executed by a processor of the computing device, the method comprising the steps of: classifying an intent of the user based on the healthcare inquiry; instantiating a conversational engine based on the intent; eliciting, by the conversational engine, information from the user; and presenting one or more medical recommendations to the user based at least in part on the information.
  • the eliciting step comprises using, by the conversation engine, an entropy minimization process to determine a next question to present to the user, such that subsequent information provided by the user in response to the next question minimizes the number of medical recommendations.
  • the entropy minimization process is weighted to optimize diagnosis that identifies worst outcomes for early medical intervention on the user.
  • the entropy minimization process is weighted to optimize diagnosis for treating the user's symptoms or disease clusters that maximizes a diagnostic value of the user's response to the proposed treatments.
  • the presenting step comprises invoking a knowledge base and a diagnosis engine.
  • the knowledge base represents normalized medical concepts, which include at least one of: entities representing findings, symptoms, and conditions; modifiers representing anatomical location, severity, and temporal modifiers; weighted relations between the entities; relations between the entities and the modifiers; a representation of models and engines as an instance of a graph with the entities and the relations; a mapping of a medical text to a knowledge base representation by using an entity recognition module; and additional knowledge sources.
  • the entity recognition component module is configured to translate health-related text into medical entities and modifiers.
  • the diagnosis engine is at least one of: a first diagnosis engine based on rules in a knowledge based codifying probabilistic relationships between symptoms/findings and diseases; a second diagnosis engine based on first machine-learned models deriving relations between symptoms/findings and diseases from historical medical records; a third diagnosis engine based on second machine-learned models deriving both probabilities and relationships from historic medical records; and a fourth diagnosis engine based on third machine-learned models learned from mixed data that includes at least one of synthetic data generated by a pre-existing expert system, electronic medical records, manual cases, labelled cases from the diagnosis engine.
  • the historical records are either obtained from anonymized databases or generated by interactions with the user in a recursive manner.
  • the first, second, third, and fourth diagnosis engines act in ensemble; each of the first, second, third, and fourth diagnosis engines operates upon a current state of knowledge independently, and offers possible responses along with a confidence and value estimate; and an ensemble arbitrator chooses a response out of the possible responses or a collection of responses that is best for the user or circumstance given a match or a mismatch between the possible responses, and a value and confidence each of the first, second, third, and fourth diagnosis engines expresses in its corresponding response.
  • the ensemble arbitrator learns a weight to use for each possible response from each of the first, second, third, and fourth diagnosis engines based upon history.
  • the eliciting step includes invoking natural language understanding engine.
  • the information from the user is in the form of at least one of text, speech, imagery, sound, and medical test results.
  • the information from the user includes at least one of user's medical history and the user's symptoms.
  • the conversation engine is one of a diagnosis conversational engine, an information conversational engine, a referral conversational engine, and a treatment conversational engine.
  • the method further comprises, prior to presenting the one or more medical recommendations to the user, seeking approval or revision of the one or more medical recommendations by a medical expert.
  • the techniques may be realized as a system for responding to a healthcare inquiry from a user, the system comprising: memory for storing instructions; and a processor configured to execute the instructions to: classify an intent of the user based on the healthcare inquiry; instantiate a conversational engine based on the intent; elicit, by the conversational engine, information from the user; and present one or more medical recommendations to the user based at least in part on the information.
  • FIG. 1 illustrates a medical decision support system in accordance with an embodiment of the present disclosure.
  • FIG. 2 illustrates interactions between a medical decision support system and patients and medical experts in accordance with an embodiment of the present disclosure.
  • FIG. 3 illustrates interactions between a medical decision support system and patients and medical experts in accordance with an embodiment of the present disclosure.
  • FIG. 4 illustrates an infrastructure of a medical decision support system in accordance with an embodiment of the present disclosure.
  • FIG. 5 illustrates an exemplary architecture a backend server for a medical decision support system in accordance with an embodiment of the present disclosure.
  • FIG. 6 is a flowchart illustrating a method for responding to a healthcare inquiry from a user using a medical decision support system in accordance with an embodiment of the present disclosure.
  • FIG. 7 illustrates the operation of an intent classification module within a conversational engine in accordance with an embodiment of the present disclosure.
  • FIG. 8 illustrates a structure of a specialized conversational engine in accordance with an embodiment of the present disclosure.
  • FIG. 9 is a flowchart for an exemplary entity recognition algorithm that may be employed by a natural language understanding module in a specialized conversation engine in accordance with an embodiment of the present disclosure.
  • FIG. 10 shows an example of an entity extraction in accordance with an embodiment of the present disclosure.
  • FIG. 11 shows an example of an autocomplete operation in accordance with an embodiment of the present disclosure.
  • FIG. 12 is a flowchart illustrating a method of training a machine-learned model for a diagnosis engine in accordance with an embodiment of the present disclosure.
  • FIG. 13 shows a possible diagnosis output from a medical decision system resulting from symptoms entered by a user in accordance with an embodiment of the present disclosure.
  • FIG. 14 is a flowchart illustrating a method for responding to a healthcare inquiry from a user using a medical decision support system in accordance with an embodiment of the present disclosure.
  • FIG. 15 illustrates an architecture of a medical decision support system in accordance with an embodiment of the present disclosure.
  • FIG. 1 illustrates a medical decision support system 100 in accordance with an embodiment of the present disclosure.
  • the system 100 may interface with a user (e.g., a patient, a medical expert, etc.) via the user interface 102 .
  • the user may access the user interface 102 through at least one of a plurality of user devices 101 , which may comprise a laptop, a smartphone, a tablet, smart speakers (e.g., Amazon Echo, Google Home, etc.), digital assistants, etc.
  • the user interface 102 may be in communication with the system 100 backend, which may comprise a user information module 108 , a conversational engine 114 , a differential diagnosis engine (DDx) 116 , a knowledge base (KB) 118 , and a data/knowledge input module 120 .
  • DDx differential diagnosis engine
  • KB knowledge base
  • the user interface 102 may be divided into a user input interface 104 and a user output interface 106 .
  • the user input interface 104 may allow input from the user in different modalities, including, but not limited to, natural language (of the user), speech data (e.g., from a smart speaker), image/video, and audio.
  • the user input interface 104 may integrate the different modalities into a multimodal representation to be processed by the conversational engine 114 , as described below.
  • the user output interface 106 may provide the user (i.e., a patient or a medical expert), on at least one of a plurality of user devices 101 , with one or more actionable recommendations, such as triaging, diagnosis, treatment, or explanations, as described below. Sometimes, the user output interface 106 may also need to present the user with questions in order to elicit additional and the right kind of information such that the system 100 is able provide a confident recommendation.
  • the user information module 108 may include two submodules—a user profile/modeling submodule 110 and a personalization submodule 112 .
  • the user profile/modeling submodule 110 may create and/or update a user's profile with all relevant information available about the user from the user input interface 104 .
  • the user profile/modeling submodule 110 may be built on the notion of creating a personalized health record.
  • the personalization submodule 112 may use the user's profile from the user profile/modeling submodule 110 in order to optimize the relevance of the recommendations and information that the system 100 is to present to the user.
  • the conversational engine 114 may be in charge of understanding user's input(s), reasoning about the user's input(s), and deciding what is the most appropriate output(s) after consulting with the DDx 116 and the KB 118 .
  • the DDx 116 may produce a ranked list of possible diagnoses given any number of findings, which may be symptoms expressed by the patient as well as their history, demographics, etc.
  • the DDx 116 may be based on rules in a knowledge based codifying probabilistic relationships between symptoms/findings and diseases.
  • the DDx 116 may be based on machine-learned models deriving relations between symptoms/findings and diseases from historical medical records.
  • the DDx 116 may be based on machine-learned models deriving both probabilities and relationships from historic medical records.
  • the DDx 116 may be based on machine-learned models learned from mixed data that includes at least one of synthetic data generated by a pre-existing expert system, electronic medical records, manual cases, labeled cases from the diagnosis engine, as will be described below.
  • the DDx 116 may comprise all these embodiments working in ensemble such that each embodiment may operate based on a current state of knowledge independently, and offer possible responses along with a confidence and a value estimate.
  • an ensemble arbitrator to choose a response out of the possible responses or a collection of responses from the different embodiments, that is best for the user or circumstance given a match or a mismatch between the possible responses, and the value and the confidence estimate each embodiment expresses in its corresponding response.
  • the ensemble arbitrator may learn a weight to use for each possible response from each of embodiment of the DDx 116 based upon history.
  • the KB 118 may be a graph representation of medical concepts and relations between the medical concepts, allowing for concepts, modifiers, relations, and hierarchies.
  • the initial set of concepts in the graph may be derived from the Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT) ontology standard, and represented by unified medical language system (UMLS) concept unique identifiers (CUIs).
  • Relations in the graph may be machine-learned from different sources.
  • the KB 118 may represent normalized medical concepts, which include at least one of: entities representing findings, symptoms, and conditions; modifiers representing anatomical location, severity, and temporal modifiers; weighted relations between the entities; and relations between the entities and the modifiers.
  • the normalized medical concepts may further include a representation of models and engines as an instance of a graph with the entities and the relations; a mapping of a medical text to a knowledge base representation by using an entity recognition module; and additional knowledge sources.
  • the entity recognition component module is configured to translate health-related text into medical entities and modifiers.
  • the KB 118 may grow and may incorporate future concepts as needed. The following is an example of some concepts and relationships that may be represented in the KB 118 :
  • the above exemplary graph illustrates a relationship between a finding 31 (“hip pain”) under the “findings” node and a disease 7 (“acute renal failure”) under the “diseases” node.
  • the relationship is captured under the “edges” node of the graph with a relative “strength” of 0.83 (i.e., it is closely related or “common”).
  • each node may capture additional information under the “expression” field.
  • the finding 31 (“hip pain”) is of type “pain” and is located in the “hip.”
  • “hip” is defined as a “joint”.
  • the KB 118 may also provide an application programming interface (API) with a set of operations that allow to navigate the graph.
  • API application programming interface
  • the “refine” operation allows to retrieve entities of a parent node type from the KB 118 .
  • the “refine” operation when applied to “joint,” the KB 118 returns different kinds of joints; when applied to “joint pain,” the KB 118 returns different kinds of joint pains; and when applied to “knee pains,” the KB 118 returns variations of knee pain.
  • the “expansion” operation does a traversal of the graph and returns any valid modification for any of the concepts in the expression. For example, when the “expansion” operation is applied to “knee swelling,” the KB 118 returns all possible types of knee conditions, as well as swellings.
  • Both the DDx 116 and the KB 118 may be informed by the data/knowledge input module 120 .
  • the DDx 116 may further be informed by the KB 118 .
  • the data/knowledge input module 120 may intake different data sources and models, such as datasets 130 and rules/protocols 132 .
  • Examples of the datasets 130 are electronic medical records (EMRs) (also known as electronic health records (EHRs)) and Wikidoc (www.wikidoc.org).
  • EMRs electronic medical records
  • the rules/protocols 132 may come from medical experts, from decision support systems such as DXplain®, for example.
  • the data/knowledge input module 120 may also intake information and data from the user input interface 104 .
  • the data/knowledge input module 120 may include a medical records understanding submodule 122 , an expert systems input submodule 124 , a medical text understanding submodule 126 , and a model/knowledge inference submodule 128 .
  • the medical records understanding submodule 122 may parse and understand medical data in EMRs by applying natural language processing (NLP) technologies to doctors' notes and encoding all other available information as features that may be used in later modeling stages.
  • the expert systems input submodule 124 may ingest arbitrary knowledge graphs and encode them in a way that may be understood and combined with other existing models.
  • the medical text understanding submodule 126 may apply NLP technologies to extract concepts and relations from existing medical text resources to construct knowledge base graphs and diagnosis models.
  • the model/knowledge inference submodule 128 may be a service model that includes functionalities that is needed by the other submodules.
  • One such example of the functionalities may be to allow the medical records understanding submodule 122 , the expert systems input submodule 124 , and/or the medical text understanding submodule 126 to interface with the model combination/reasoning module 134 , which informs the DDx 116 .
  • the model combination/reasoning module 134 may select the optimal model needed for making a particular diagnosis.
  • model combination/reasoning module 134 may respond to rules about which of the existing models respond better to a kind of diagnosis. For example, a model that has been trained on medical records from a pediatric hospital will be preferred when building a diagnosis for a child.
  • the model combination/reasoning module 134 may be an ensemble trained on data such that responses from different models are combined with different weighs corresponding to the models' abilities to respond to previous similar cases.
  • Another functionality of the model/knowledge inference submodule 128 may be to interface with the knowledge representation module 136 , which informs the KB 118 .
  • the knowledge representation module 136 may use the output of the data/knowledge input module 120 and organize the output in a representation that is usable by the other modules. For instance, the knowledge representation module 136 may provide processed knowledge and structures from the data/knowledge input module 120 to the KB 118 .
  • FIG. 2 illustrates interactions between a medical decision support system 200 and patients and medical experts in accordance with an embodiment of the present disclosure.
  • the system 200 may be a cloud service system.
  • the system 200 may encompass the system 100 and enable a plurality of client applications.
  • a patient-facing application may be deployed as a web-based application 202 , as well as a mobile device application 204 .
  • the system 200 may include a plurality of tools 206 , for example via a computer, providing functionalities for medical experts to access data generated by patients during their sessions with the system 200 .
  • FIG. 3 illustrates interactions between a medical decision support system 300 and patients and medical experts in accordance with an embodiment of the present disclosure.
  • the system 300 may be a cloud service system.
  • the system 300 may encompass the system 100 and enable a plurality of client applications. While a patient may interact directly with a web-based application 302 and/or a mobile device application 304 , the cloud service system 300 may also provide the ability to input data from medical and health devices 308 (e.g., a heart-rate monitor, an at-home blood pressure device, etc.), as well as results from laboratory tests 310 (e.g., from conventional laboratories, at-home or in-pharmacy instruments, etc.).
  • medical and health devices 308 e.g., a heart-rate monitor, an at-home blood pressure device, etc.
  • results from laboratory tests 310 e.g., from conventional laboratories, at-home or in-pharmacy instruments, etc.
  • the system 300 may include a plurality of tools 306 , for example via a computer, providing functionalities for medical experts to access data generated by patients during their sessions with the system 300 and/or to access patient data from the medical and health devices 308 and the laboratory tests 310 .
  • FIG. 4 illustrates an infrastructure 400 of a medical decision support system in accordance with an embodiment of the present disclosure.
  • the infrastructure 400 may be exposed to users via an application proxy (App Proxy) node 402 , which may route requests to an administrative application (Admin App) 404 and/or a client application (Client App) 406 , while restricting access to other internal resources.
  • the App Proxy node 402 may be a Hypertext Transfer Protocol (HTTP) server, which may handle both serving web resources and acting as a proxy between the public internet and the Admin App 404 and the Client App 406 .
  • HTTP Hypertext Transfer Protocol
  • An example of such an HTTP server is an nginx server.
  • An engine 408 may reside in a cloud service system.
  • the engine 408 may be deployed as a Python Flask server running on a cloud service system (e.g., Google Cloud), while the Admin App 404 and the Client App 406 may share a frontend stack, which may for example be built on React, a JavaScript library.
  • the infrastructure 400 may be built on top of a Docker container platform. Each of the major components in the infrastructure 400 may be packaged and deployed as a self-contained container, avoiding the need to keep development and production environments in synchronization in terms of tools, dependencies, etc.
  • the engine 408 may be the backend server for a medical decision support system (e.g., the system 100 , 200 , or 300 ), and may provide the following functionalities:
  • FIG. 5 illustrates an exemplary architecture of the engine 408 in accordance with an embodiment of the present disclosure.
  • the engine 408 may be a Python server built on a Flask server 510 . Flask aims to be a lightweight, but extensible framework for building Python services and servers. Flask libraries, such as authentication, logging, persistence, HTTP request/response, etc., are available for functions and functionalities needed to build the engine 408 .
  • the engine 408 may be deployed, locally and/or on a cloud service system, as a self-contained Docker image, which may be a composition of several smaller Docker images with specific functionalities.
  • the engine 408 may include an in-memory knowledge base (KB) 514 that may contain relevant datasets imported into internal data structures.
  • KB in-memory knowledge base
  • the Flask server 510 may perform only read operations on the KB 514 .
  • the KB 514 is served as a binary SQLLite file that gets loaded into memory on startup.
  • the engine 408 may further include a MySQL server 520 , which may host databases used by the Flask server 510 .
  • only user data e.g., patient profiles and their interaction sessions
  • the MySQL server 520 may store data
  • the KB 514 is served as a binary file.
  • a Celery 518 which may be included in the engine 408 , is an asynchronous task queue/job queue based on distributed message passing. The Celery 518 may be focused on real-time operation, and may also supports scheduling.
  • the Celery 518 may be used in the engine 4408 to schedule and execute non-blocking tasks, such as sending messages, thereby allowing the Flask server 510 to respond to users' requests quickly without being blocked by slow calls that may be executed asynchronous.
  • a Redis 516 which also may be include in engine 408 , is an in-memory data structure store that may be used as a database, cache, and message broker. The engine 408 may schedule asynchronous tasks by writing objects into the Redis 516 , to which the Celery 518 may be continuously listening for changes and handling/executing any tasks as they arrive.
  • FIG. 6 is a flowchart illustrating a method 600 for responding to a healthcare inquiry from a user (e.g., a patient) using a medical decision support system (e.g., the system 100 , 200 , or 300 ) in accordance with an embodiment of the present disclosure.
  • the method 600 begins at step 610 with the user making a healthcare inquiry.
  • the method 600 determines and classifies the intent of user's inquiry as one of a plurality of intent classifications (as described below), using a conversational engine (e.g., conversational engine 114 ).
  • the method 600 proceeds to step 630 , where a decision is made as to whether the user's intent is understood.
  • the user's inquiry is escalated to a doctor, at step 640 .
  • the method 600 moves to step 650 and instantiates one of a plurality of specialized conversational engines for the intent, based on the intent classification.
  • the method 600 elicits more information from the user (if needed).
  • the method 600 decides, at step 680 , whether a confident recommendation may be made. If a confident recommendation may not be made at step 680 , the user's inquiry is escalated to a doctor, at step 640 . Otherwise, a recommendation (e.g., diagnosis, referral, treatment, etc.) is made to the user at step 690 .
  • the goal of the method 600 therefore, is to provide at least one of a plurality of actionable recommendations to the user.
  • a medical decision support system typically includes a software system to manage a patient dialog system to perform one or more of the following steps:
  • the medical decision support systems described herein may be included in a broad category of dialog systems, with a specific focus on medical tasks.
  • Dialog systems may be generally categorized as task-oriented (e.g., buying a plane ticket) or chatbots.
  • Most approaches to dialog systems have used a “slot-filling” mechanism, where an initial frame with slots is defined and the goal of the dialog is to fill in those slots. See e.g., Jurafsky and Martin, “Speech and Language Processing,” 2 nd edition. Research into end-to-end dialog systems using Deep Learning techniques has recently emerged. See e.g., Bordes et al., “Learning End-to-End Goal-Oriented Dialog,” International Conference on Learning Representations, 2016. The systems described herein may use a combination of both approaches with a goal of offering not only an accurate, but also a natural medical conversation.
  • a patient dialog system may be comprised of a series of templates for the information required, and formulations for questions to pose to the patient to elicit that information.
  • a model may represent the system's current estimate of the patient objectives, and chooses next questions to be asked with a combination of the following objectives:
  • the system may implement a dialog engine (e.g., the conversational engine 114 ) that may include an intent classification module.
  • FIG. 7 illustrates the operation of an intent classification module 710 within a conversational engine 700 , in accordance with an embodiment of the present disclosure.
  • the conversational engine 700 may be an example of the conversational engine 114 .
  • the intent classification module 710 may classify a user's intent into a finite set of “intents.” Those intents include, but are not limited to, diagnosis, doctor referral, prescription or treatment recommendation, and information on a given disease.
  • the intent classification module 710 may be built by using a combination of encoded rules and machine-learned classifier models.
  • Encoded rules help the intent classification module 710 to identify patterns that are then treated as a single entity. For example, “right lower back” may be identified as an anatomical part plus two location modifiers. Entities and processed raw text (through, e.g., embeddings) may then be fed into a supervised classification component that has been trained using thousands of labeled examples. As shown in FIG. 7 , the intent classification module 710 may then route the user query to one of a plurality specialized conversational engines—information conversational engine 720 , referral conversational engine 730 , diagnosis conversational engine 740 , treatment conversational engine 750 , other conversational engine(s) 760 —, each configured to handle a classified intent.
  • a plurality specialized conversational engines information conversational engine 720 , referral conversational engine 730 , diagnosis conversational engine 740 , treatment conversational engine 750 , other conversational engine(s) 760 —, each configured to handle a classified intent.
  • Each specialized conversational engine may be built using a combination of a natural language understanding (NLU) module, an autocomplete module, and a next question module.
  • NLU natural language understanding
  • FIG. 8 illustrates the structure of the diagnosis conversational engine 740 in accordance with an embodiment of the present disclosure.
  • the diagnosis conversational engine 740 may include a natural language understanding module 742 , an autocomplete module 744 , and a next question module 746 .
  • an NLU module may operate with unstructured natural language inputted as text, which may include textual representations of other modalities such as voice transformed into text through a speech-to-text system.
  • an entity recognition algorithm may recognize words and extract relevant medical concepts including modifiers, quantifiers, and negations. Entities may be mapped to a knowledge base unique identifier. Therefore, the output of the NLU module is a structured representation of the user query in the form of a combination of identifiers, which may be interpreted to determine next steps according to rules in each of the specialized conversational engines.
  • FIG. 9 is a flowchart for an exemplary entity recognition algorithm 900 that may be employed by an NLU module in a specialized conversational engine in accordance with an embodiment of the present disclosure.
  • the entity recognition algorithm 900 starts with the text processing step 902 .
  • the text processing step 902 includes information retrieval (IR) processing, which involves removing, from text input by a user, various punctuations and standardization of certain tokens. Unlike in standard IR processing, the text processing step 902 does not remove stop words. Stop words tend to function as a bridge to connect entities. Stop words may be removed, if needed, in the pruner step 908 , described below.
  • IR information retrieval
  • the entity recognition algorithm 900 then moves to the text synonymizer step 904 .
  • the text synonymizer step 904 is used to generate various synonymous ways of representing the processed text from the text processing step 902 .
  • Each K-gram phrase may be synonymized by lookups on Wordnet, word-by-word. When there are multiple words in the phrase that have synonyms, all possible combinations may be maintained, by constructing the combinations recursively. Some phrases may not have synonyms, while others may have many.
  • the annotator step 906 is used to identify all entities in the free text. That is, all phrases and their synonyms are identified.
  • a dictionary data structure may be used for fast lookups. Maintaining K-grams of various lengths and their locations in the input text may facilitate the annotator step 906 .
  • the entity recognition algorithm 900 moves the pruner step 908 .
  • the pruner step 908 two operations are performed. First, entities that are sub-entities of another identified entity are pruned. For example, for the user text “knee pain,” the entity corresponding to “knee” and “pain” are removed, and only the entity corresponding to “knee pain” is kept. Second, entities that can correspond to stop words such as “in” are pruned.
  • UMLS is not always consistent in concept representation.
  • UMLS may have concept “knee pain,” but not “pain in the knee” as a synonymous.
  • rules or patterns may be captured that are effective in capturing the main concepts that are of interest. For example, these rules may include the following code:
  • ⁇ body system>
  • ⁇ findings> ⁇ ′′Finding′′, ′′Sign or System′′, ′′Body Substance′′, ′′Disease or Syndrome′′ ⁇ # ⁇ findings>
  • the Pruner step may be important to increase coverage.
  • the output of Pruner step may be used to identify entities that are in the vicinity of each other that satisfies one of these rules, and where stop is stop words. If so, the entities may be merged and annotated, using the annotator step 906 . During the entity merger step 910 , the chances for matches are also increased by using the UMLS synonyms of two entities, and taking their cross-product combinations. This may further improve identification. At the end of the entity merging step 910 , the pruner step 908 may be run again, for further clean up. It is worth noting that, along this process, the location of the merge in original free text is maintained.
  • the subsequent sundry step 912 when the text is large, redundant entities are removed, based on CUIs of the entities identified.
  • the returned list of entities may be restricted to be only those defined as findings.
  • the output from the sundry step 912 is matched to engine specific findings, in two ways: (1) exact match on the CUIs between the entities and findings in knowledge base; and (2) when (1) is not present, approximate match based on a number of matching, missing, and mismatching entities between the candidates (with mismatching penalized more than missing).
  • FIG. 10 shows an example of an entity extraction in accordance with an embodiment of the present disclosure.
  • an entity recognition algorithm within an NLU module of a specialized conversational engine may return “headache” and “general malaise” as possible symptoms to the user.
  • An autocomplete module may allow a specialized conversational engine to suggest medical concepts or entities coming from the knowledge base (e.g., KB 118 ) as a letters from a user are being inputted in the system.
  • the suggestions not only include words that include the letters entered by the user, but also words that include the letters among their synonyms.
  • FIG. 11 shows an example of an autocomplete operation in accordance with an embodiment of the present disclosure.
  • the next question module in a specialized conversational engine may be able to determine the optimal next question to ask a user given the current state of the dialog between the user and the specialized conversational engine.
  • the way that the optimal next question may be determined depends on the specifics of each specialized conversational engine, but it generally may be based on either predetermined frames or a entropy minimization algorithm.
  • the referral conversation engine may instantiate a frame with the necessary information (location, preferred time, preferred gender, specialty, etc.) and will determine a next question in order to fill in the blank slots in this frame.
  • the diagnosis conversational engine (see FIGS. 7 and 8 ) may initially formulate a differential diagnosis and determine a next question by minimizing the entropy difference of the existing hypothesis. In other words, the goal is to minimize the entropy of the differentials. Before asking a particular question, this is given by
  • t ,f) ( d ) 0.5*[ H p(d
  • t ,f 1) ( d )+ H p(d
  • t ,f 0) ( d )]
  • the system needs to have a representation of medical concepts. This involves having a knowledge base, and tools to convert or translate from any language representation to the internal knowledge graph representation.
  • the NLU module described above in a specialized conversational engine needs to convert natural language into a list of identifiers from the knowledge base. But, that is also true for natural language generated by doctors, or structured information coming from electronic medical records or an external knowledge base.
  • the language of medicine also includes tools to map text to knowledge base representation (as described in the NLU module above) and to navigate a knowledge base graph in different directions (e.g., “find related nodes” or “find parent of”).
  • a diagnosis engine e.g., DDx 116
  • a multi-class classifier on co-occurrence of medical concepts in EMRs may be extracted. This produces a graph of medical concepts and edges between them, where the edge weights model the probability of co-occurrence of, for example, a disease given a symptom. Given such a graph, a diagnosis algorithm may be devised. The probability of a finding being present given a disease may be estimated as follows:
  • p ( d,f ,age,gender) p ( f
  • Bayes rule may be invoked.
  • the compute probability may be used to rank the differential diagnosis given the currently available information.
  • a medical decision support system may incorporate several engines working in ensemble. These engines may include rule-based engines, wherein rules in a knowledge base specify the strength of relationship between a symptom/finding, and diseases that may cause such symptoms. Those rule-based systems are derived from existing medical literature by using a combination of manual curation from expert physicians and statistical techniques.
  • models learned from data may be included. Such models quantify the relation between symptoms and diseases observed in real-world data.
  • the models are trained using various medical data such as anonymized EMRs from research databases or university hospitals, and from introspecting on the data collected in feedback and follow-up from the engine's own interaction with patients. Sources of bias in the samples that drive the models may be quantified, and the models may be corrected for known and estimated bias.
  • the models may incorporate disease and symptom prevalence in the general population, and may include how those statistics vary in specific subpopulations by demographic, geography, calendar, social connections, etc.
  • the models are constrained to quantify relationships identified by the rule-based engines.
  • the models are allowed to infer knew relationships that have never been encoded as an explicit rule.
  • the method 1200 may start at step 1210 with a rule-based expert system.
  • the rule-based expert system is used as a generative model to create sample medical cases.
  • the sample medical cases created at step 1220 are used to train a machine-learned model.
  • the machine-learned model may then be applied to a medical decision support system at step 1250 . This becomes a novel and effective way to extend and generalize expert systems that can then be combined at the data level.
  • the medical cases generated by the expert system may be combined with other medical cases such as manually generated medical cases 1260 , medical cases 1270 gathered from sources such as EMRs, and/or labeled medical cases 1280 from the system usage itself.
  • the system 100 allows for multimodal input as part of the dialog/conversation.
  • the system 100 allows a user/patient to input voice, images, and video as part of the dialog flow. For example, if the patient mentions having a skin rash, the system 100 may invite the patient to upload a picture. The system 100 may then perform automatic image classification and will use the output as another symptom in the diagnosis. Beyond typical multimedia documents, the system 100 may be designed to accept information coming from any medical sensor or monitoring device (e.g., heart rate monitor, blood pressure monitor, etc.).
  • any medical sensor or monitoring device e.g., heart rate monitor, blood pressure monitor, etc.
  • PHR personal health record
  • EMRs electronic medical records
  • EHRs electronic medical records
  • a PHR may always be available to the patient to query, manage, examine, or update.
  • the PHR is not intended to replace the specific electronic records stored at the health institutions, but rather complement and offer a holistic view of a patient's health that includes not only the aggregate of the patient's interactions with their health providers, but also more nuanced interactions with the system 100 .
  • the PHR in the system 100 may include information about interactions where the patient complained or asked about a headache despite none of those might have resulted in any medical treatment.
  • the PHR that may be offered in the system 100 remains the property of the patient and enables the patient to not only visualize, but also modify and comment in order to add more detailed or nuanced information.
  • the goal to the system 100 is to answer patient questions by giving a set of actionable recommendations that may include not only a diagnostic and triaging, but also referral or treatment.
  • One important actionable recommendation that a patient that is looking for information about her health situation can get is whether she needs to visit a doctor and what is the relative urgency of the medical attention. This is accomplished by the ability of the system 100 to triage patients.
  • the triaging decision may be accomplished by formulating a diagnosis and classifying the diagnosis into the required attention and urgency.
  • the simple existence of a symptom can trigger a triage recommendation. For example, chest pain on an elderly patient or high fever in an infant will trigger an automatic recommendation to visit the emergency room regardless of the confidence on the diagnosis.
  • the system 100 may respond in the form of a differential diagnosis. This includes a ranked list of possible conditions with an indication on how likely they are and recommended next steps.
  • FIG. 13 shows a possible diagnosis output from the system 100 and the symptoms entered by a user that led to the output in accordance with an embodiment of the present disclosure.
  • system 100 may be designed to give up-to-date and accurate medical information, in many situations, the interaction with the patient will determine that the best path forward is for her to visit a doctor.
  • exemplary cases when the system 100 may decide to refer the patient to a doctor may include, but is not limited, to the following:
  • the system 100 may determine the best doctor to whom to refer the patient considering different variables that include everything that is known about the patient (e.g., location, healthcare provider) and their current situation (e.g., vitals, symptoms).
  • the system 100 may present a ranked list of recommendations.
  • the system 100 may include a database of doctors and facilities that can provide a given medical procedure or service.
  • the system 100 may also provide the possibility to refer directly to in-house physicians through text or video consultation.
  • a component of the system 100 may be designed to sort through alternative possible treatments, qualify them based upon relevance to the patient's symptoms and history, and propose possible treatments that the patient could pursue.
  • the system 100 may collect data on treatment paths chosen by the patient, and outcomes, and use these data to learn models for which treatments to propose to other patients in the future. Some of those treatments may involve prescriptions or recommendations for over-the-counter medications.
  • the system 100 may include referral to a healthcare professional, who can safely prescribe medications. This could be an offline prescription based on data collected, a real-time or scheduled chat or message interaction, a real-time or scheduled video consultation, or a scheduled visit for physical examination. In any of these cases, the system 100 may facilitate the paperwork of prescribing and forwarding the prescription onto a pharmacy.
  • the system 100 may include a database of pharmacy within which it can:
  • some situations may require the patient to undergo some form of laboratory testing.
  • the kind of testing might be available through some form of at-home procedure (e.g., blood pressure), but in many other cases the patient may need to visit a physician or pharmacist.
  • the system 100 may refer the patient to existing third-party applications and solutions.
  • the system 100 may refer the patient to nearby facilities considering all the information about the patient and the current situation (e.g., how urgent the test is). In any of these cases, the system 100 may facilitate the paperwork.
  • the system 100 may include a database of pharmacies and doctors that may provide a given test procedure.
  • a key to effective machine learning is generating data from which to learn.
  • the system 100 may be designed for comprehensive follow-up with each patient, which may take the form of:
  • the system 100 has the goal of offering actionable and personalized healthcare information to patients. All use cases outlined above have some flavor of understanding a patient informational need and presenting some information about the patient's situation or the next steps to take. In order to serve those informational needs, the system 100 may include comprehensive content that may be indexed and presented in response to any question. This includes:
  • FIG. 14 is a flowchart illustrating a method 1400 for responding to a healthcare inquiry from a user (e.g., a patient) using a medical decision support system (e.g., the system 100 , 200 , or 300 ) in accordance with an embodiment of the present disclosure.
  • the method 1400 begins at step 1410 with the user making a healthcare inquiry.
  • the method 1400 classifies the intent of user's inquiry (e.g., using intent classification module 710 ).
  • the method 1400 proceeds to step 1430 , where a decision is made as to whether the user's intent is related to either a diagnosis or a treatment.
  • the method 1400 invokes a converse at step 1440 .
  • the converse generates a reasonable response based on what the medical decision support system knows about its past few interactions with the user.
  • the method 1400 moves to step 1450 and determines whether the confidence on a given diagnosis or treatment is above a predetermined threshold. If the confidence is below or at the predetermined threshold, the method 1400 moves to step 1460 to generate a next question, as described above, to elicit more information from the user.
  • step 1470 to communicate either a diagnosis or a treatment (informed by a diagnosis engine (e.g., the DDx 116 ) and a knowledge base (e.g., the KB 118 )) to the user.
  • a diagnosis engine e.g., the DDx 116
  • a knowledge base e.g., the KB 118
  • Any of the response from step 1440 , the next question from step 1460 , and the diagnosis or treatment from step 1470 may require an optional approval/edit by a medical expert at step 1480 before being presented to the user.
  • the method 1400 may trigger step 1480 based on predefined settings (e.g., a physician may need to approve diagnosis recommendations that include diagnosis above a certain severity).
  • FIG. 15 illustrates an architecture 1500 of a medical decision support system in accordance with an embodiment of the present disclosure.
  • the architecture 1500 may be used to perform the method 1400 , for example.
  • a dialog manager 1506 may be a central component in the architecture 1500 .
  • the dialog manager 1506 may communicate with the patient via a patient user interface (UI) 1502 and with a medical expert via a medical expert UI 1504 .
  • UI patient user interface
  • a medical expert edit/approval module 1508 may be interposed between the dialog manager 1506 and the medical expert UI 1504 . Any automatically generated information might be either sent directly to the user or pass through the medical expert edit/approval module 1508 .
  • standby physicians or professionals with equivalent training may read auto-responses or questions, and decide whether their responses or questions are of good enough quality to be directly approved or whether they may require edits.
  • the dialog manager 1506 may be in charge of keeping a state of a conversation between the system and the patient, and routing requests to other components. For example, the dialog manager 1506 may invoke an end-to-end dialog model 1516 .
  • the end-to-end dialog model 1516 may be a Deep Learning language model that uses a model pre-trained on a general corpuses of text (e.g., Wikipedia), and may be then fine-tuned on medical conversations between doctors and patients. This technique may be referred to as transfer learning since patterns learned on a given task or training corpus are transferred to a similar task.
  • the model may be learned using a regularized long short-term memory network (LSTM).
  • LSTM long short-term memory network
  • UMLFIT Universal Language Model Fine-tuning for Text Classification
  • the model may be able to engage with patients in general conversation as well as accurately engage in conversations on topics that are common in the training set. However, it may lack an overlay of medical semantics and causal reasoning, and may have no knowledge safeguards. Such a model may be intended to respond to simple queries and determine which more detailed model in the system best suits the patient's goals and intent.
  • a next question module 1514 may be in charge of finding the optimal best question to ask the patient given the current state of the dialog, and, particularly, medically relevant information that is known at a given point of the dialog, as described above.
  • the dialog manager 1506 may call an intent classification module 1518 , and, depending on its response, it may decide which of the following modules described below to call up next.
  • the intent classification module 1518 may detect whether the patient is seeking a diagnosis, or a treatment, or other help.
  • the intent classification module 1518 may be based on performing medical entity recognition, using the medical entity recognition module 1524 , to recognize words related to either diagnosis or treatment (e.g. symptoms, or medications).
  • the intent classification module 1518 may be a supervised machine learning algorithm trained on interactions that have been labeled as “diagnosis”, “treatment”, or “other”.
  • a diagnosis recommendation module 1512 may provide a differential diagnosis given what is known about the patient at a given point during the dialog.
  • the diagnosis recommendation module 1512 may use a diagnosis algorithm 1522 , an example of which is described above.
  • a treatment recommendation module 1510 may provide a ranked list of potential treatments given what is known about the patient at a given point during the dialog.
  • the treatment recommendation module 1510 may use the diagnosis algorithms and then map the differential diagnosis to known treatments for a given diagnosis using a diagnosis to treatment mapping 1520 .
  • a medical decision support system in accordance with the present disclosure as described above may involve the processing of input data and the generation of output data to some extent.
  • This input data processing and output data generation may be implemented in hardware or software.
  • specific electronic components may be employed in a computer server or similar or related circuitry for implementing the functions associated with a medical decision support system in accordance with the present disclosure as described above.
  • one or more processors operating in accordance with instructions may implement the functions associated with a medical decision support system in accordance with the present disclosure as described above.
  • Such instructions may be stored on one or more non-transitory processor readable storage media (e.g., a magnetic disk or other storage medium), or transmitted to one or more processors via one or more signals embodied in one or more carrier waves.
  • processor readable storage media e.g., a magnetic disk or other storage medium

Abstract

Techniques for responding to a healthcare inquiry from a user are disclosed. In one particular embodiment, the techniques may be realized as a method for responding to a healthcare inquiry from a user, according to a set of instructions stored on a memory of a computing device and executed by a processor of the computing device, the method comprising the steps of: classifying an intent of the user based on the healthcare inquiry; instantiating a conversational engine based on the intent; eliciting, by the conversational engine, information from the user; and presenting one or more medical recommendations to the user based at least in part on the information.

Description

    CROSS-REFERENCE TO RELATED APPLICATIONS
  • This patent application claims priority to U.S. Provisional Patent Application No. 62/654,111, filed Apr. 6, 2018, which is hereby incorporated by reference herein in its entirety.
  • FIELD OF THE DISCLOSURE
  • The present disclosure relates generally to medical decision support systems and, more particularly, to systems and methods for responding to healthcare inquiries.
  • BACKGROUND OF THE DISCLOSURE
  • Medical decision support systems have been studied and developed over the years. See e.g., Miller, Randolph A., “Computer-Assisted Diagnostic Decision Support: History, Challenges, and Possible Paths Forward,” Advances in Health Sciences Education, vol. 14, no. S1 (September 2009): 89-106. While there have been different approaches to designing medical decision support systems, the most successful ones fall in a category known as expert rule-based systems. Expert rule-based systems involve domain experts (e.g., case physicians) encoding medical knowledge into rules that are then executed to come up with a decision or a recommendation for a given case. The decision is provided as a ranked list of diagnosis, also called “differential diagnosis.” Most expert rule-based systems encode knowledge as a bipartite graph with nodes representing findings and diagnoses, and the edges representing likelihoods of a finding given a diagnosis and/or a diagnosis given a finding.
  • Two well-known expert rule-based systems are Internist-1 (see e.g., Miller, R. A., H. E. Pople, and J. D. Myers, “Internist-1, an Experimental Computer-Based Diagnostic Consultant for General Internal Medicine,” The New England Journal of Medicine, vol. 307, no. 8 (Aug. 19, 1982): 468-76) and DXplain® (see e.g., Barnett, G. Octo, James J. Cimino, Jon A. Hupp, and Edward P. Hoffer, “DXplain: An Evolving Diagnostic Decision-Support System,” JAMA, vol. 258, no. 1 (Jul. 3, 1987): 67-74). Internist-1 and DXplain® are related to QMR (see e.g., Miller, R. A., H. E. Pople, and J. D. Myers, “Internist-1, an Experimental Computer-Based Diagnostic Consultant for General Internal Medicine,” The New England Journal of Medicine, vol. 307, no. 8 (Aug. 19, 1982): 468-76) since Internist-1 is a precursor and DXplain® is a recent evolution. In both of Internist-1 and DXplain®, the relation between a finding and a disease is encoded into two numbers: “Evoking Strength” encodes the likelihood of a disease given a finding while “Frequency” describes how likely a symptom is to occur given a disease manifestation.
  • Expert rule-based systems such as the ones described above have been used to support doctors in their medical practice. However, simplified versions have also been instantiated as patient-facing applications that are sometimes called “symptom checkers.” These patient systems have unreliable accuracy (see e.g., Semigran, Hannah L., David M. Levine, Shantanu Nundy, and Ateev Mehrotra, “Comparison of Physician and Computer Diagnostic Accuracy,” JAMA Internal Medicine, vol. 176, no. 12 (Dec. 1, 2016): 1860-61) and have limited features.
  • Furthermore, creating a new disease in an expert rule-based system is very costly, both in terms of time and robustness. It may take weeks of several expert diagnosticians working full-time, and the resulting system can be brittle and inflexible since it cannot react to newly available data or evidence (e.g., a new disease).
  • Because of some of these shortcomings, there have been through the years many attempts to develop machine-learned diagnosis models. These include early approaches using Bayesian Networks (see e.g., Shwe, Michael A., Blackford Middleton, D. E. Heckerman, Max Henrion, E. J. Horvitz, H. P. Lehmann, and G. F. Cooper, “Probabilistic Diagnosis Using a Reformulation of the INTERNIST-1/QMR Knowledge Base,” Methods of Information in Medicine, vol. 30, no. 4 (1991): 241-255) to the most recent ones using Deep Learning models (see e.g., Rajkomar, Alvin, Eyal Oren, Kai Chen, Andrew M. Dai, Nissan Hajaj, Peter J. Liu, Xiaobing Liu, et al., “Scalable and Accurate Deep Learning for Electronic Health Records,” ArXiv: 1801.07860 [Cs], Jan. 24, 2018.; Miotto, Riccardo, Li Li, Brian A. Kidd, and Joel T. Dudley, “Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records,” Scientific Reports, vol. 6 (May 17, 2016)). These machine-learned approaches have been mostly academic research exercises that have not been included in commercially-available systems. Also, one of the reasons for the lack of practical applicability of the machine-learned models is that a medical decision support system not only needs to have a model that predicts a diagnosis given a set of inputs, but also needs to provide an efficient way to navigate the path towards an optimal diagnosis by asking the right questions. That is easier to implement in rule-based systems, but gets more complicated for more complex diagnosis models.
  • More generally, these existing technologies have many shortcomings that make them less useful in a patient-facing application. Some relate to the user interface and experience or the lack of key features, while others relate more to the performance or accuracy of the system itself.
  • In view of the foregoing, it may be understood that there may be significant problems and shortcomings associated with current medical decision support systems. Accordingly, there is a need for addressing these shortcomings and introducing a whole suite of innovations that make it possible to implement a patient-facing medical decision support system.
  • SUMMARY OF THE DISCLOSURE
  • Techniques for responding to a healthcare inquiry from a user are disclosed. In one particular embodiment, the techniques may be realized as a method for responding to a healthcare inquiry from a user, according to a set of instructions stored on a memory of a computing device and executed by a processor of the computing device, the method comprising the steps of: classifying an intent of the user based on the healthcare inquiry; instantiating a conversational engine based on the intent; eliciting, by the conversational engine, information from the user; and presenting one or more medical recommendations to the user based at least in part on the information.
  • In accordance with other aspects of this particular embodiment, the eliciting step comprises using, by the conversation engine, an entropy minimization process to determine a next question to present to the user, such that subsequent information provided by the user in response to the next question minimizes the number of medical recommendations.
  • In accordance with further aspects of this particular embodiment, the entropy minimization process is weighted to optimize diagnosis that identifies worst outcomes for early medical intervention on the user.
  • In accordance with additional aspects of this particular embodiment, the entropy minimization process is weighted to optimize diagnosis for treating the user's symptoms or disease clusters that maximizes a diagnostic value of the user's response to the proposed treatments.
  • In accordance with other aspects of this particular embodiment, the presenting step comprises invoking a knowledge base and a diagnosis engine.
  • In accordance with further aspects of this particular embodiment, the knowledge base represents normalized medical concepts, which include at least one of: entities representing findings, symptoms, and conditions; modifiers representing anatomical location, severity, and temporal modifiers; weighted relations between the entities; relations between the entities and the modifiers; a representation of models and engines as an instance of a graph with the entities and the relations; a mapping of a medical text to a knowledge base representation by using an entity recognition module; and additional knowledge sources.
  • In accordance with additional aspects of this particular embodiment, the entity recognition component module is configured to translate health-related text into medical entities and modifiers.
  • In accordance with other aspects of this particular embodiment, the diagnosis engine is at least one of: a first diagnosis engine based on rules in a knowledge based codifying probabilistic relationships between symptoms/findings and diseases; a second diagnosis engine based on first machine-learned models deriving relations between symptoms/findings and diseases from historical medical records; a third diagnosis engine based on second machine-learned models deriving both probabilities and relationships from historic medical records; and a fourth diagnosis engine based on third machine-learned models learned from mixed data that includes at least one of synthetic data generated by a pre-existing expert system, electronic medical records, manual cases, labelled cases from the diagnosis engine.
  • In accordance with further aspects of this particular embodiment, the historical records are either obtained from anonymized databases or generated by interactions with the user in a recursive manner.
  • In accordance with additional aspects of this particular embodiment, the first, second, third, and fourth diagnosis engines act in ensemble; each of the first, second, third, and fourth diagnosis engines operates upon a current state of knowledge independently, and offers possible responses along with a confidence and value estimate; and an ensemble arbitrator chooses a response out of the possible responses or a collection of responses that is best for the user or circumstance given a match or a mismatch between the possible responses, and a value and confidence each of the first, second, third, and fourth diagnosis engines expresses in its corresponding response. The ensemble arbitrator learns a weight to use for each possible response from each of the first, second, third, and fourth diagnosis engines based upon history.
  • In accordance with other aspects of this particular embodiment, the eliciting step includes invoking natural language understanding engine.
  • In accordance with further aspects of this particular embodiment, the information from the user is in the form of at least one of text, speech, imagery, sound, and medical test results.
  • In accordance with additional aspects of this particular embodiment, the information from the user includes at least one of user's medical history and the user's symptoms.
  • In accordance with other aspects of this particular embodiment, the conversation engine is one of a diagnosis conversational engine, an information conversational engine, a referral conversational engine, and a treatment conversational engine.
  • In accordance with other aspects of this particular embodiment, the method further comprises, prior to presenting the one or more medical recommendations to the user, seeking approval or revision of the one or more medical recommendations by a medical expert.
  • In another particular embodiment, the techniques may be realized as a system for responding to a healthcare inquiry from a user, the system comprising: memory for storing instructions; and a processor configured to execute the instructions to: classify an intent of the user based on the healthcare inquiry; instantiate a conversational engine based on the intent; elicit, by the conversational engine, information from the user; and present one or more medical recommendations to the user based at least in part on the information.
  • The present disclosure will now be described in more detail with reference to particular embodiments thereof as shown in the accompanying drawings. While the present disclosure is described below with reference to particular embodiments, it should be understood that the present disclosure is not limited thereto. Those of ordinary skill in the art having access to the teachings herein will recognize additional implementations, modifications, and embodiments, as well as other fields of use, which are within the scope of the present disclosure as described herein, and with respect to which the present disclosure may be of significant utility.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • In order to facilitate a fuller understanding of the present disclosure, reference is now made to the accompanying drawings, in which like elements are referenced with like numerals. These drawings should not be construed as limiting the present disclosure, but are intended to be illustrative only.
  • FIG. 1 illustrates a medical decision support system in accordance with an embodiment of the present disclosure.
  • FIG. 2 illustrates interactions between a medical decision support system and patients and medical experts in accordance with an embodiment of the present disclosure.
  • FIG. 3 illustrates interactions between a medical decision support system and patients and medical experts in accordance with an embodiment of the present disclosure.
  • FIG. 4 illustrates an infrastructure of a medical decision support system in accordance with an embodiment of the present disclosure.
  • FIG. 5 illustrates an exemplary architecture a backend server for a medical decision support system in accordance with an embodiment of the present disclosure.
  • FIG. 6 is a flowchart illustrating a method for responding to a healthcare inquiry from a user using a medical decision support system in accordance with an embodiment of the present disclosure.
  • FIG. 7 illustrates the operation of an intent classification module within a conversational engine in accordance with an embodiment of the present disclosure.
  • FIG. 8 illustrates a structure of a specialized conversational engine in accordance with an embodiment of the present disclosure.
  • FIG. 9 is a flowchart for an exemplary entity recognition algorithm that may be employed by a natural language understanding module in a specialized conversation engine in accordance with an embodiment of the present disclosure.
  • FIG. 10 shows an example of an entity extraction in accordance with an embodiment of the present disclosure.
  • FIG. 11 shows an example of an autocomplete operation in accordance with an embodiment of the present disclosure.
  • FIG. 12 is a flowchart illustrating a method of training a machine-learned model for a diagnosis engine in accordance with an embodiment of the present disclosure.
  • FIG. 13 shows a possible diagnosis output from a medical decision system resulting from symptoms entered by a user in accordance with an embodiment of the present disclosure.
  • FIG. 14 is a flowchart illustrating a method for responding to a healthcare inquiry from a user using a medical decision support system in accordance with an embodiment of the present disclosure.
  • FIG. 15 illustrates an architecture of a medical decision support system in accordance with an embodiment of the present disclosure.
  • DETAILED DESCRIPTION OF EMBODIMENTS
  • FIG. 1 illustrates a medical decision support system 100 in accordance with an embodiment of the present disclosure. The system 100 may interface with a user (e.g., a patient, a medical expert, etc.) via the user interface 102. The user may access the user interface 102 through at least one of a plurality of user devices 101, which may comprise a laptop, a smartphone, a tablet, smart speakers (e.g., Amazon Echo, Google Home, etc.), digital assistants, etc. The user interface 102 may be in communication with the system 100 backend, which may comprise a user information module 108, a conversational engine 114, a differential diagnosis engine (DDx) 116, a knowledge base (KB) 118, and a data/knowledge input module 120.
  • The user interface 102 may be divided into a user input interface 104 and a user output interface 106. The user input interface 104 may allow input from the user in different modalities, including, but not limited to, natural language (of the user), speech data (e.g., from a smart speaker), image/video, and audio. The user input interface 104 may integrate the different modalities into a multimodal representation to be processed by the conversational engine 114, as described below. The user output interface 106 may provide the user (i.e., a patient or a medical expert), on at least one of a plurality of user devices 101, with one or more actionable recommendations, such as triaging, diagnosis, treatment, or explanations, as described below. Sometimes, the user output interface 106 may also need to present the user with questions in order to elicit additional and the right kind of information such that the system 100 is able provide a confident recommendation.
  • The user information module 108 may include two submodules—a user profile/modeling submodule 110 and a personalization submodule 112. The user profile/modeling submodule 110 may create and/or update a user's profile with all relevant information available about the user from the user input interface 104. The user profile/modeling submodule 110 may be built on the notion of creating a personalized health record. The personalization submodule 112 may use the user's profile from the user profile/modeling submodule 110 in order to optimize the relevance of the recommendations and information that the system 100 is to present to the user.
  • The conversational engine 114 may be in charge of understanding user's input(s), reasoning about the user's input(s), and deciding what is the most appropriate output(s) after consulting with the DDx 116 and the KB 118. The DDx 116 may produce a ranked list of possible diagnoses given any number of findings, which may be symptoms expressed by the patient as well as their history, demographics, etc. In an embodiment, the DDx 116 may be based on rules in a knowledge based codifying probabilistic relationships between symptoms/findings and diseases. In another embodiment, the DDx 116 may be based on machine-learned models deriving relations between symptoms/findings and diseases from historical medical records. In yet another embodiment, the DDx 116 may be based on machine-learned models deriving both probabilities and relationships from historic medical records. In another embodiment, the DDx 116 may be based on machine-learned models learned from mixed data that includes at least one of synthetic data generated by a pre-existing expert system, electronic medical records, manual cases, labeled cases from the diagnosis engine, as will be described below. Alternatively, the DDx 116 may comprise all these embodiments working in ensemble such that each embodiment may operate based on a current state of knowledge independently, and offer possible responses along with a confidence and a value estimate. Additionally, there may be an ensemble arbitrator to choose a response out of the possible responses or a collection of responses from the different embodiments, that is best for the user or circumstance given a match or a mismatch between the possible responses, and the value and the confidence estimate each embodiment expresses in its corresponding response. The ensemble arbitrator may learn a weight to use for each possible response from each of embodiment of the DDx 116 based upon history.
  • The KB 118 may be a graph representation of medical concepts and relations between the medical concepts, allowing for concepts, modifiers, relations, and hierarchies. The initial set of concepts in the graph may be derived from the Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT) ontology standard, and represented by unified medical language system (UMLS) concept unique identifiers (CUIs). Relations in the graph may be machine-learned from different sources. The KB 118 may represent normalized medical concepts, which include at least one of: entities representing findings, symptoms, and conditions; modifiers representing anatomical location, severity, and temporal modifiers; weighted relations between the entities; and relations between the entities and the modifiers. The normalized medical concepts may further include a representation of models and engines as an instance of a graph with the entities and the relations; a mapping of a medical text to a knowledge base representation by using an entity recognition module; and additional knowledge sources. As will be described below, the entity recognition component module is configured to translate health-related text into medical entities and modifiers. The KB 118 may grow and may incorporate future concepts as needed. The following is an example of some concepts and relationships that may be represented in the KB 118:
  • findings:
     {
       ″id″: 31,
       ″name″: ″hip pain″,
       ″expression″; ″22253000|pain:363698007|Finding site=
       304907005|hip″,
     }
    diseases:
      {
        ″id″: 7,
        ″name″: ″acute renal failure″,
        ″expression″: ″14669001|acute renal failure″,
      }
    relationships:
     ″304907005|hip″: [
      {
        ″is_a″: ″39352004|joint″
      }
     ]
    edges:.
     {
       ″disease″:7,
       ″finding″: 31,
       ″metadata″: {
        ″strength″: 0.8284268201196736
       }
     }
  • The above exemplary graph illustrates a relationship between a finding 31 (“hip pain”) under the “findings” node and a disease 7 (“acute renal failure”) under the “diseases” node. The relationship is captured under the “edges” node of the graph with a relative “strength” of 0.83 (i.e., it is closely related or “common”). Moreover, each node may capture additional information under the “expression” field. For example, the finding 31 (“hip pain”) is of type “pain” and is located in the “hip.” Also, under the “relationships” node, “hip” is defined as a “joint”. The KB 118 may also provide an application programming interface (API) with a set of operations that allow to navigate the graph. A sequence of exemplary operations may be:
      • get_children: joint→[knee, elbow, shoulder, ankle]
      • refine
      • “knee pain”→“severe knee pain”
      • “joint pain”→[“knee pain”, “elbow pain”, “ankle pain”]
      • expansion: “knee swelling”→[“knee pain”, “knee inflammation”, “knee . . . ”, “hand swelling”, “joint swelling”, “ . . . swelling”]
  • In this example, the “refine” operation allows to retrieve entities of a parent node type from the KB 118. For example, when the “refine” operation is applied to “joint,” the KB 118 returns different kinds of joints; when applied to “joint pain,” the KB 118 returns different kinds of joint pains; and when applied to “knee pains,” the KB 118 returns variations of knee pain. The “expansion” operation does a traversal of the graph and returns any valid modification for any of the concepts in the expression. For example, when the “expansion” operation is applied to “knee swelling,” the KB 118 returns all possible types of knee conditions, as well as swellings.
  • Both the DDx 116 and the KB 118 may be informed by the data/knowledge input module 120. The DDx 116 may further be informed by the KB 118. The data/knowledge input module 120 may intake different data sources and models, such as datasets 130 and rules/protocols 132. Examples of the datasets 130 are electronic medical records (EMRs) (also known as electronic health records (EHRs)) and Wikidoc (www.wikidoc.org). The rules/protocols 132 may come from medical experts, from decision support systems such as DXplain®, for example. In addition to the datasets 130 and the rules/protocols 132, the data/knowledge input module 120 may also intake information and data from the user input interface 104.
  • The data/knowledge input module 120 may include a medical records understanding submodule 122, an expert systems input submodule 124, a medical text understanding submodule 126, and a model/knowledge inference submodule 128. The medical records understanding submodule 122 may parse and understand medical data in EMRs by applying natural language processing (NLP) technologies to doctors' notes and encoding all other available information as features that may be used in later modeling stages. The expert systems input submodule 124 may ingest arbitrary knowledge graphs and encode them in a way that may be understood and combined with other existing models. The medical text understanding submodule 126 may apply NLP technologies to extract concepts and relations from existing medical text resources to construct knowledge base graphs and diagnosis models.
  • The model/knowledge inference submodule 128 may be a service model that includes functionalities that is needed by the other submodules. One such example of the functionalities may be to allow the medical records understanding submodule 122, the expert systems input submodule 124, and/or the medical text understanding submodule 126 to interface with the model combination/reasoning module 134, which informs the DDx 116. The model combination/reasoning module 134 may select the optimal model needed for making a particular diagnosis. In an embodiment, model combination/reasoning module 134 may respond to rules about which of the existing models respond better to a kind of diagnosis. For example, a model that has been trained on medical records from a pediatric hospital will be preferred when building a diagnosis for a child. In another embodiment, the model combination/reasoning module 134 may be an ensemble trained on data such that responses from different models are combined with different weighs corresponding to the models' abilities to respond to previous similar cases. Another functionality of the model/knowledge inference submodule 128 may be to interface with the knowledge representation module 136, which informs the KB 118. The knowledge representation module 136 may use the output of the data/knowledge input module 120 and organize the output in a representation that is usable by the other modules. For instance, the knowledge representation module 136 may provide processed knowledge and structures from the data/knowledge input module 120 to the KB 118.
  • FIG. 2 illustrates interactions between a medical decision support system 200 and patients and medical experts in accordance with an embodiment of the present disclosure. The system 200 may be a cloud service system. The system 200 may encompass the system 100 and enable a plurality of client applications. For example, a patient-facing application may be deployed as a web-based application 202, as well as a mobile device application 204. The system 200 may include a plurality of tools 206, for example via a computer, providing functionalities for medical experts to access data generated by patients during their sessions with the system 200.
  • FIG. 3 illustrates interactions between a medical decision support system 300 and patients and medical experts in accordance with an embodiment of the present disclosure. The system 300 may be a cloud service system. The system 300 may encompass the system 100 and enable a plurality of client applications. While a patient may interact directly with a web-based application 302 and/or a mobile device application 304, the cloud service system 300 may also provide the ability to input data from medical and health devices 308 (e.g., a heart-rate monitor, an at-home blood pressure device, etc.), as well as results from laboratory tests 310 (e.g., from conventional laboratories, at-home or in-pharmacy instruments, etc.). The system 300 may include a plurality of tools 306, for example via a computer, providing functionalities for medical experts to access data generated by patients during their sessions with the system 300 and/or to access patient data from the medical and health devices 308 and the laboratory tests 310.
  • FIG. 4 illustrates an infrastructure 400 of a medical decision support system in accordance with an embodiment of the present disclosure. The infrastructure 400 may be exposed to users via an application proxy (App Proxy) node 402, which may route requests to an administrative application (Admin App) 404 and/or a client application (Client App) 406, while restricting access to other internal resources. The App Proxy node 402 may be a Hypertext Transfer Protocol (HTTP) server, which may handle both serving web resources and acting as a proxy between the public internet and the Admin App 404 and the Client App 406. An example of such an HTTP server is an nginx server. An engine 408 may reside in a cloud service system. For example, the engine 408 may be deployed as a Python Flask server running on a cloud service system (e.g., Google Cloud), while the Admin App 404 and the Client App 406 may share a frontend stack, which may for example be built on React, a JavaScript library. The infrastructure 400 may be built on top of a Docker container platform. Each of the major components in the infrastructure 400 may be packaged and deployed as a self-contained container, avoiding the need to keep development and production environments in synchronization in terms of tools, dependencies, etc.
  • The engine 408 may be the backend server for a medical decision support system (e.g., the system 100, 200, or 300), and may provide the following functionalities:
      • Functionalities available on the Admin App 404 and the Client App 406
      • Packaging all supported knowledge bases (e.g., Sontag, VDDX, Wiki) in the medical decision support system
      • Implementing algorithms that operate on the knowledge bases to provide functionalities such as next question to be posed to a patient and diagnosis to be presented to the patient
      • Logging user activity and managing user session history
      • Providing messaging support between patients and medical experts
        As shown in FIG. 4, the engine 408 may be in communication with a database 410, which in turn may be in communication with a backup database 412. The database 410 may store all data transactions including user and doctor information. The backup database 412 may be a secondary storage where a daily snapshot of the database 410 may be stored, for example, for security purposes.
  • FIG. 5 illustrates an exemplary architecture of the engine 408 in accordance with an embodiment of the present disclosure. The engine 408 may be a Python server built on a Flask server 510. Flask aims to be a lightweight, but extensible framework for building Python services and servers. Flask libraries, such as authentication, logging, persistence, HTTP request/response, etc., are available for functions and functionalities needed to build the engine 408. The engine 408 may be deployed, locally and/or on a cloud service system, as a self-contained Docker image, which may be a composition of several smaller Docker images with specific functionalities. The engine 408 may include an in-memory knowledge base (KB) 514 that may contain relevant datasets imported into internal data structures. At runtime, the Flask server 510 may perform only read operations on the KB 514. The KB 514 is served as a binary SQLLite file that gets loaded into memory on startup. The engine 408 may further include a MySQL server 520, which may host databases used by the Flask server 510. In an embodiment, only user data (e.g., patient profiles and their interaction sessions) may be stored in the MySQL server 520, while the KB 514 is served as a binary file. A Celery 518, which may be included in the engine 408, is an asynchronous task queue/job queue based on distributed message passing. The Celery 518 may be focused on real-time operation, and may also supports scheduling. The Celery 518 may be used in the engine 4408 to schedule and execute non-blocking tasks, such as sending messages, thereby allowing the Flask server 510 to respond to users' requests quickly without being blocked by slow calls that may be executed asynchronous. A Redis 516, which also may be include in engine 408, is an in-memory data structure store that may be used as a database, cache, and message broker. The engine 408 may schedule asynchronous tasks by writing objects into the Redis 516, to which the Celery 518 may be continuously listening for changes and handling/executing any tasks as they arrive.
  • FIG. 6 is a flowchart illustrating a method 600 for responding to a healthcare inquiry from a user (e.g., a patient) using a medical decision support system (e.g., the system 100, 200, or 300) in accordance with an embodiment of the present disclosure. The method 600 begins at step 610 with the user making a healthcare inquiry. At step 620, the method 600 determines and classifies the intent of user's inquiry as one of a plurality of intent classifications (as described below), using a conversational engine (e.g., conversational engine 114). The method 600 proceeds to step 630, where a decision is made as to whether the user's intent is understood. If the user's intent is not understood (in which case, no trustworthy actionable recommendation may be provided with some degree of confidence), the user's inquiry is escalated to a doctor, at step 640. However, if the user's intent is understood at step 630, the method 600 moves to step 650 and instantiates one of a plurality of specialized conversational engines for the intent, based on the intent classification. Then, at step 660, based on interfacing with a diagnosis engine (e.g., the DDx 116) and a knowledge base (e.g., the KB 118), the method 600 elicits more information from the user (if needed). Once all needed information is obtained from the user, the method 600 decides, at step 680, whether a confident recommendation may be made. If a confident recommendation may not be made at step 680, the user's inquiry is escalated to a doctor, at step 640. Otherwise, a recommendation (e.g., diagnosis, referral, treatment, etc.) is made to the user at step 690. The goal of the method 600, therefore, is to provide at least one of a plurality of actionable recommendations to the user.
  • A medical decision support system (e.g., the system 100, 200, or 300) typically includes a software system to manage a patient dialog system to perform one or more of the following steps:
      • clarify question that the patient asked
      • gather additional information, including presence or absence of additional symptoms, patient history, demographic data, etc.
      • establish the patient intent
      • gather such additional details as calendar appointment details, insurance details, emergency contact information, etc.
  • The medical decision support systems described herein may be included in a broad category of dialog systems, with a specific focus on medical tasks. Dialog systems may be generally categorized as task-oriented (e.g., buying a plane ticket) or chatbots. Most approaches to dialog systems have used a “slot-filling” mechanism, where an initial frame with slots is defined and the goal of the dialog is to fill in those slots. See e.g., Jurafsky and Martin, “Speech and Language Processing,” 2nd edition. Research into end-to-end dialog systems using Deep Learning techniques has recently emerged. See e.g., Bordes et al., “Learning End-to-End Goal-Oriented Dialog,” International Conference on Learning Representations, 2016. The systems described herein may use a combination of both approaches with a goal of offering not only an accurate, but also a natural medical conversation.
  • A patient dialog system may be comprised of a series of templates for the information required, and formulations for questions to pose to the patient to elicit that information. A model may represent the system's current estimate of the patient objectives, and chooses next questions to be asked with a combination of the following objectives:
      • clarifying, quantifying, or modifying a previous response
      • prompting for additional inputs that the patient may consider important or relevant
      • identifying symptoms that may discriminate between different possible diagnoses or patient states, and asking whether the patient is experiencing any such symptoms, from general to specific (e.g., pain, joint pain, elbow pain, shooting elbow pain, etc.)
      • asking questions about symptoms that may serve to strengthen or weaken the hypotheses about specific diseases that may be present
      • asking for personal health history, including but not limited to gender, age, ethnicity, previous diseases, vaccinations, current and past prescriptions, ancestry, and ancestors' health status, etc.
      • prompting for input of, or connection to existing health or test records to integrate into the model
  • In order to offer the above functionalities, the system may implement a dialog engine (e.g., the conversational engine 114) that may include an intent classification module. FIG. 7 illustrates the operation of an intent classification module 710 within a conversational engine 700, in accordance with an embodiment of the present disclosure. The conversational engine 700 may be an example of the conversational engine 114. The intent classification module 710 may classify a user's intent into a finite set of “intents.” Those intents include, but are not limited to, diagnosis, doctor referral, prescription or treatment recommendation, and information on a given disease. The intent classification module 710 may be built by using a combination of encoded rules and machine-learned classifier models. Encoded rules help the intent classification module 710 to identify patterns that are then treated as a single entity. For example, “right lower back” may be identified as an anatomical part plus two location modifiers. Entities and processed raw text (through, e.g., embeddings) may then be fed into a supervised classification component that has been trained using thousands of labeled examples. As shown in FIG. 7, the intent classification module 710 may then route the user query to one of a plurality specialized conversational engines—information conversational engine 720, referral conversational engine 730, diagnosis conversational engine 740, treatment conversational engine 750, other conversational engine(s) 760—, each configured to handle a classified intent.
  • Each specialized conversational engine may be built using a combination of a natural language understanding (NLU) module, an autocomplete module, and a next question module. For instance, FIG. 8 illustrates the structure of the diagnosis conversational engine 740 in accordance with an embodiment of the present disclosure. As can be seen in FIG. 8, the diagnosis conversational engine 740 may include a natural language understanding module 742, an autocomplete module 744, and a next question module 746.
  • Generally, for each specialized conversational engine, an NLU module may operate with unstructured natural language inputted as text, which may include textual representations of other modalities such as voice transformed into text through a speech-to-text system. In order for the NLU module to understand and decide on next steps, it may apply an entity recognition algorithm that may recognize words and extract relevant medical concepts including modifiers, quantifiers, and negations. Entities may be mapped to a knowledge base unique identifier. Therefore, the output of the NLU module is a structured representation of the user query in the form of a combination of identifiers, which may be interpreted to determine next steps according to rules in each of the specialized conversational engines.
  • FIG. 9 is a flowchart for an exemplary entity recognition algorithm 900 that may be employed by an NLU module in a specialized conversational engine in accordance with an embodiment of the present disclosure.
  • The entity recognition algorithm 900 starts with the text processing step 902. The text processing step 902 includes information retrieval (IR) processing, which involves removing, from text input by a user, various punctuations and standardization of certain tokens. Unlike in standard IR processing, the text processing step 902 does not remove stop words. Stop words tend to function as a bridge to connect entities. Stop words may be removed, if needed, in the pruner step 908, described below.
  • The entity recognition algorithm 900 then moves to the text synonymizer step 904. The text synonymizer step 904 is used to generate various synonymous ways of representing the processed text from the text processing step 902. For example, the text may be split into a collection of overlapping K-grams (K=1 to 10) using a sliding window. The location of each K-gram in the text may be preserved. Each K-gram phrase may be synonymized by lookups on Wordnet, word-by-word. When there are multiple words in the phrase that have synonyms, all possible combinations may be maintained, by constructing the combinations recursively. Some phrases may not have synonyms, while others may have many.
  • Following the text synonymizer step 904 is the annotator step 906, which is used to identify all entities in the free text. That is, all phrases and their synonyms are identified. A dictionary data structure may be used for fast lookups. Maintaining K-grams of various lengths and their locations in the input text may facilitate the annotator step 906.
  • Once the annotator step 906 completed, the entity recognition algorithm 900 moves the pruner step 908. At the pruner step 908, two operations are performed. First, entities that are sub-entities of another identified entity are pruned. For example, for the user text “knee pain,” the entity corresponding to “knee” and “pain” are removed, and only the entity corresponding to “knee pain” is kept. Second, entities that can correspond to stop words such as “in” are pruned.
  • Next comes the entity merger step 910. The UMLS is not always consistent in concept representation. For example, UMLS may have concept “knee pain,” but not “pain in the knee” as a synonymous. Through analysis on a large number of extractions, a small number of rules or patterns may be captured that are effective in capturing the main concepts that are of interest. For example, these rules may include the following code:
  •  <body system> = |″Body Part, Organ, or Organ Component″, ″Tissue″,
    ″Body Location or Region″,
       ″Body Space or Junction″, ″Body System″, ″Anatomical Structure″|
     <findings> = {″Finding″, ″Sign or System″, ″Body Substance″, ″Disease
    or Syndrome″}
     # <findings>|stop|<body system>
     # <body system>|stop|<findings>
     # <body system>|stop|Qualitative Concept
     # Sign or Symptom|stop|Organism Function
      # Example: pain|with|urination, pain|with|periods
     # Body System|stop|Pathologic Function
      # Example skin||bleeding|
     # Pathologic Function|stop|Body System
      # Example |bleeding|into|skin or |bleeding||skin|

    The Pruner step may be important to increase coverage. The output of Pruner step may be used to identify entities that are in the vicinity of each other that satisfies one of these rules, and where stop is stop words. If so, the entities may be merged and annotated, using the annotator step 906. During the entity merger step 910, the chances for matches are also increased by using the UMLS synonyms of two entities, and taking their cross-product combinations. This may further improve identification. At the end of the entity merging step 910, the pruner step 908 may be run again, for further clean up. It is worth noting that, along this process, the location of the merge in original free text is maintained.
  • In the subsequent sundry step 912, when the text is large, redundant entities are removed, based on CUIs of the entities identified. The returned list of entities may be restricted to be only those defined as findings.
  • Lastly, in the matcher step 914, the output from the sundry step 912 is matched to engine specific findings, in two ways: (1) exact match on the CUIs between the entities and findings in knowledge base; and (2) when (1) is not present, approximate match based on a number of matching, missing, and mismatching entities between the candidates (with mismatching penalized more than missing).
  • FIG. 10 shows an example of an entity extraction in accordance with an embodiment of the present disclosure. As can be seen, if the phrase “I am not feeling great today and my head hurts” is input by a user, an entity recognition algorithm within an NLU module of a specialized conversational engine may return “headache” and “general malaise” as possible symptoms to the user.
  • An autocomplete module may allow a specialized conversational engine to suggest medical concepts or entities coming from the knowledge base (e.g., KB 118) as a letters from a user are being inputted in the system. The suggestions not only include words that include the letters entered by the user, but also words that include the letters among their synonyms. FIG. 11 shows an example of an autocomplete operation in accordance with an embodiment of the present disclosure.
  • The next question module in a specialized conversational engine may be able to determine the optimal next question to ask a user given the current state of the dialog between the user and the specialized conversational engine. The way that the optimal next question may be determined depends on the specifics of each specialized conversational engine, but it generally may be based on either predetermined frames or a entropy minimization algorithm. As an example of the predetermine frames, the referral conversation engine (see FIG. 7) may instantiate a frame with the necessary information (location, preferred time, preferred gender, specialty, etc.) and will determine a next question in order to fill in the blank slots in this frame. As an example of an entropy minimization next question approach, the diagnosis conversational engine (see FIGS. 7 and 8) may initially formulate a differential diagnosis and determine a next question by minimizing the entropy difference of the existing hypothesis. In other words, the goal is to minimize the entropy of the differentials. Before asking a particular question, this is given by
  • H p ( d t ) ( d ) = - k = 1 K p ( d = k t ) log p ( d = k t )
  • where Ft is the collection of the current findings and p(d|Ft) is the posterior probability of disease d given the current findings. Once a given question is asked the entropy can be written as:

  • H p(d|
    Figure US20190311807A1-20191010-P00001
    t ,f)(d)=0.5*[H p(d|
    Figure US20190311807A1-20191010-P00001
    t ,f=1)(d)+H p(d|
    Figure US20190311807A1-20191010-P00001
    t ,f=0)(d)]
  • where 0.5 assumes an equal probability of a patient answering positively or negatively to a question, but could in fact be a probability distribution. Given this, the goal of the next question algorithm is to find f* by maximizing the following expression:
  • f * = arg max f H p ( d t ) ( d ) - p ~ ( f = 1 ) H p ( d t , f ) ( d )
  • In order to understand and reason about medicine, and have meaningful dialogs/conversations with patients, the system needs to have a representation of medical concepts. This involves having a knowledge base, and tools to convert or translate from any language representation to the internal knowledge graph representation. For example, the NLU module described above in a specialized conversational engine needs to convert natural language into a list of identifiers from the knowledge base. But, that is also true for natural language generated by doctors, or structured information coming from electronic medical records or an external knowledge base.
  • The language of medicine also includes tools to map text to knowledge base representation (as described in the NLU module above) and to navigate a knowledge base graph in different directions (e.g., “find related nodes” or “find parent of”).
  • An important feature of a medical decision support system (e.g., system 100, 200, or 300) is a diagnosis engine (e.g., DDx 116). According to an embodiment, a multi-class classifier on co-occurrence of medical concepts in EMRs may be extracted. This produces a graph of medical concepts and edges between them, where the edge weights model the probability of co-occurrence of, for example, a disease given a symptom. Given such a graph, a diagnosis algorithm may be devised. The probability of a finding being present given a disease may be estimated as follows:
  • p ( f j = 1 d = k ) = { α j , k , if in graph β , otherwise , β = .0001 .01 × min j , k α j , k
  • where αj,k are the learned weights and β is a very small number.
  • Other information, but symptoms, such as patient demographics or disease prevalence, may be incorporated, as follows:

  • p(d,f,age,gender)=p(f|d,age,gender)p(age|d)p(gender|d)p(d)
  • In order to calculate the posterior probability of the diseases, Bayes rule may be invoked. The compute probability may be used to rank the differential diagnosis given the currently available information.
  • A medical decision support system (e.g., system 100, 200, or 300) may incorporate several engines working in ensemble. These engines may include rule-based engines, wherein rules in a knowledge base specify the strength of relationship between a symptom/finding, and diseases that may cause such symptoms. Those rule-based systems are derived from existing medical literature by using a combination of manual curation from expert physicians and statistical techniques.
  • Additionally, engines that are based on models learned from data may be included. Such models quantify the relation between symptoms and diseases observed in real-world data. The models are trained using various medical data such as anonymized EMRs from research databases or university hospitals, and from introspecting on the data collected in feedback and follow-up from the engine's own interaction with patients. Sources of bias in the samples that drive the models may be quantified, and the models may be corrected for known and estimated bias. The models may incorporate disease and symptom prevalence in the general population, and may include how those statistics vary in specific subpopulations by demographic, geography, calendar, social connections, etc. In one embodiment, the models are constrained to quantify relationships identified by the rule-based engines. In another embodiment, the models are allowed to infer knew relationships that have never been encoded as an explicit rule. FIG. 12 illustrates a method 1200 of training a machine-learned model for a diagnosis engine in accordance with an embodiment of the present disclosure. The method 1200 may start at step 1210 with a rule-based expert system. At step 1220, the rule-based expert system is used as a generative model to create sample medical cases. At step 1240, the sample medical cases created at step 1220 are used to train a machine-learned model. The machine-learned model may then be applied to a medical decision support system at step 1250. This becomes a novel and effective way to extend and generalize expert systems that can then be combined at the data level. At step 1230, the medical cases generated by the expert system may be combined with other medical cases such as manually generated medical cases 1260, medical cases 1270 gathered from sources such as EMRs, and/or labeled medical cases 1280 from the system usage itself.
  • Referring back to FIG. 1, one important aspect of the system 100 is that it allows for multimodal input as part of the dialog/conversation. The system 100 allows a user/patient to input voice, images, and video as part of the dialog flow. For example, if the patient mentions having a skin rash, the system 100 may invite the patient to upload a picture. The system 100 may then perform automatic image classification and will use the output as another symptom in the diagnosis. Beyond typical multimedia documents, the system 100 may be designed to accept information coming from any medical sensor or monitoring device (e.g., heart rate monitor, blood pressure monitor, etc.).
  • Another important feature that the system 100 may provide is a personal health record (PHR), which is a longitudinal record of all relevant health data from history questions and from previous sessions by an individual patient, integrated with conventional medical records (EMRs, EHRs) from one or multiple healthcare providers, including both structured and unstructured doctors' notes, prescription records, etc. Unlike an EMR, a PHR may always be available to the patient to query, manage, examine, or update. The PHR is not intended to replace the specific electronic records stored at the health institutions, but rather complement and offer a holistic view of a patient's health that includes not only the aggregate of the patient's interactions with their health providers, but also more nuanced interactions with the system 100. For example, the PHR in the system 100 may include information about interactions where the patient complained or asked about a headache despite none of those might have resulted in any medical treatment. The PHR that may be offered in the system 100 remains the property of the patient and enables the patient to not only visualize, but also modify and comment in order to add more detailed or nuanced information.
  • As described before, the goal to the system 100 is to answer patient questions by giving a set of actionable recommendations that may include not only a diagnostic and triaging, but also referral or treatment. One important actionable recommendation that a patient that is looking for information about her health situation can get is whether she needs to visit a doctor and what is the relative urgency of the medical attention. This is accomplished by the ability of the system 100 to triage patients. In most situations, the triaging decision may be accomplished by formulating a diagnosis and classifying the diagnosis into the required attention and urgency. However, in many other situations, the simple existence of a symptom can trigger a triage recommendation. For example, chest pain on an elderly patient or high fever in an infant will trigger an automatic recommendation to visit the emergency room regardless of the confidence on the diagnosis.
  • Many questions from patients may be directly related to them wanting to know “what they have”. The system 100 may respond in the form of a differential diagnosis. This includes a ranked list of possible conditions with an indication on how likely they are and recommended next steps. FIG. 13 shows a possible diagnosis output from the system 100 and the symptoms entered by a user that led to the output in accordance with an embodiment of the present disclosure.
  • While the system 100 may be designed to give up-to-date and accurate medical information, in many situations, the interaction with the patient will determine that the best path forward is for her to visit a doctor. Exemplary cases when the system 100 may decide to refer the patient to a doctor may include, but is not limited, to the following:
      • Probability of complications are above a certain threshold
      • Patient requires physical examination or lab tests
      • The internal algorithms are not certain of what is the issue with the patient
      • The patient has described a critical symptom (e.g., “chest pain”)
  • In all these cases, the system 100 may determine the best doctor to whom to refer the patient considering different variables that include everything that is known about the patient (e.g., location, healthcare provider) and their current situation (e.g., vitals, symptoms). The system 100 may present a ranked list of recommendations. The system 100 may include a database of doctors and facilities that can provide a given medical procedure or service. On the other hand, the system 100 may also provide the possibility to refer directly to in-house physicians through text or video consultation.
  • Given a disease, or cluster of diseases that may afflict a patient, a component of the system 100 may be designed to sort through alternative possible treatments, qualify them based upon relevance to the patient's symptoms and history, and propose possible treatments that the patient could pursue.
  • In follow-ups, the system 100 may collect data on treatment paths chosen by the patient, and outcomes, and use these data to learn models for which treatments to propose to other patients in the future. Some of those treatments may involve prescriptions or recommendations for over-the-counter medications. For the former, the system 100 may include referral to a healthcare professional, who can safely prescribe medications. This could be an offline prescription based on data collected, a real-time or scheduled chat or message interaction, a real-time or scheduled video consultation, or a scheduled visit for physical examination. In any of these cases, the system 100 may facilitate the paperwork of prescribing and forwarding the prescription onto a pharmacy.
  • The system 100 may include a database of pharmacy within which it can:
      • Cross-check against other medications the patient may be taking
      • Validate dosage for the demographics, weight, gender of the patient
      • Warn about possible side-effects, and incorporate follow-up on such side-effects in future interactions with the patient.
      • Evaluate responses to prescribed medications to confirm or reduce weight on previous conclusions about the patient's disease or condition
      • Guide or advise on substitutions with alternatives or generic medications that may be differentially available, covered by insurance, and have differing efficacies.
  • Similarly, some situations may require the patient to undergo some form of laboratory testing. In some cases, the kind of testing might be available through some form of at-home procedure (e.g., blood pressure), but in many other cases the patient may need to visit a physician or pharmacist. For the former, the system 100 may refer the patient to existing third-party applications and solutions. For the latter, the system 100 may refer the patient to nearby facilities considering all the information about the patient and the current situation (e.g., how urgent the test is). In any of these cases, the system 100 may facilitate the paperwork. The system 100 may include a database of pharmacies and doctors that may provide a given test procedure.
  • A key to effective machine learning is generating data from which to learn. The system 100 may be designed for comprehensive follow-up with each patient, which may take the form of:
      • Follow-up questions to the patient asking whether their situation followed the trajectory that the system projected.
      • Follow-up questions to the patient asking about new, changing, or past symptoms
      • Follow-up questions to the patient asking whether they followed up with their doctor, and what the doctor might have confirmed or changed.
      • Follow-up questions to the any doctor or professional to add their corrected labels to the record.
      • Checking with other sources of EMR to learn results from additional testing or procedures prescribed.
      • Independent third-party review of interactions by expert doctors trained in classifying open-loop data and attaching a correct label to the outcome.
  • The system 100 has the goal of offering actionable and personalized healthcare information to patients. All use cases outlined above have some flavor of understanding a patient informational need and presenting some information about the patient's situation or the next steps to take. In order to serve those informational needs, the system 100 may include comprehensive content that may be indexed and presented in response to any question. This includes:
      • Information about diseases, symptoms, and treatments
      • Information about medications
      • Information about laboratory tests and other medical procedures
      • Information about doctors and medical facilities, including their locations and services provided by them
        Such content may be structured, tagged, and categorized in the system 100 so that it may be served in response to concrete informational needs and personalized in different ways.
  • FIG. 14 is a flowchart illustrating a method 1400 for responding to a healthcare inquiry from a user (e.g., a patient) using a medical decision support system (e.g., the system 100, 200, or 300) in accordance with an embodiment of the present disclosure. The method 1400 begins at step 1410 with the user making a healthcare inquiry. At step 1420, the method 1400 classifies the intent of user's inquiry (e.g., using intent classification module 710). The method 1400 proceeds to step 1430, where a decision is made as to whether the user's intent is related to either a diagnosis or a treatment. If the user's intent is not related to either a diagnosis or a treatment, the method 1400 invokes a converse at step 1440. The converse generates a reasonable response based on what the medical decision support system knows about its past few interactions with the user. However, if the user's intent is related to either a diagnosis or a treatment at step 1430, the method 1400 moves to step 1450 and determines whether the confidence on a given diagnosis or treatment is above a predetermined threshold. If the confidence is below or at the predetermined threshold, the method 1400 moves to step 1460 to generate a next question, as described above, to elicit more information from the user. If the confidence is above the predetermined threshold, the method 1400 moves to step 1470 to communicate either a diagnosis or a treatment (informed by a diagnosis engine (e.g., the DDx 116) and a knowledge base (e.g., the KB 118)) to the user. Any of the response from step 1440, the next question from step 1460, and the diagnosis or treatment from step 1470 may require an optional approval/edit by a medical expert at step 1480 before being presented to the user. For example, the method 1400 may trigger step 1480 based on predefined settings (e.g., a physician may need to approve diagnosis recommendations that include diagnosis above a certain severity).
  • FIG. 15 illustrates an architecture 1500 of a medical decision support system in accordance with an embodiment of the present disclosure. The architecture 1500 may be used to perform the method 1400, for example. As can be seen in FIG. 15, a dialog manager 1506 may be a central component in the architecture 1500. The dialog manager 1506 may communicate with the patient via a patient user interface (UI) 1502 and with a medical expert via a medical expert UI 1504.
  • A medical expert edit/approval module 1508 may be interposed between the dialog manager 1506 and the medical expert UI 1504. Any automatically generated information might be either sent directly to the user or pass through the medical expert edit/approval module 1508. In this module, standby physicians or professionals with equivalent training may read auto-responses or questions, and decide whether their responses or questions are of good enough quality to be directly approved or whether they may require edits.
  • The dialog manager 1506 may be in charge of keeping a state of a conversation between the system and the patient, and routing requests to other components. For example, the dialog manager 1506 may invoke an end-to-end dialog model 1516. The end-to-end dialog model 1516 may be a Deep Learning language model that uses a model pre-trained on a general corpuses of text (e.g., Wikipedia), and may be then fine-tuned on medical conversations between doctors and patients. This technique may be referred to as transfer learning since patterns learned on a given task or training corpus are transferred to a similar task. The model may be learned using a regularized long short-term memory network (LSTM). See e.g., Stephen Merity et al., “Regularizing and Optimizing LSTM Language Models,” International Conference on Learning Representations, 2018. In particular, the Universal Language Model Fine-tuning for Text Classification (UMLFIT) implementation may be used. See e.g., Howard and Ruder, “Universal Language Model Fine-tuning for Text Classification,” Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, vol. 1 (July 2018). The model may be able to engage with patients in general conversation as well as accurately engage in conversations on topics that are common in the training set. However, it may lack an overlay of medical semantics and causal reasoning, and may have no knowledge safeguards. Such a model may be intended to respond to simple queries and determine which more detailed model in the system best suits the patient's goals and intent.
  • A next question module 1514 may be in charge of finding the optimal best question to ask the patient given the current state of the dialog, and, particularly, medically relevant information that is known at a given point of the dialog, as described above.
  • With input from the patient, the dialog manager 1506 may call an intent classification module 1518, and, depending on its response, it may decide which of the following modules described below to call up next. The intent classification module 1518 may detect whether the patient is seeking a diagnosis, or a treatment, or other help. In an embodiment, the intent classification module 1518 may be based on performing medical entity recognition, using the medical entity recognition module 1524, to recognize words related to either diagnosis or treatment (e.g. symptoms, or medications). In another embodiment, the intent classification module 1518 may be a supervised machine learning algorithm trained on interactions that have been labeled as “diagnosis”, “treatment”, or “other”.
  • A diagnosis recommendation module 1512 may provide a differential diagnosis given what is known about the patient at a given point during the dialog. The diagnosis recommendation module 1512 may use a diagnosis algorithm 1522, an example of which is described above.
  • A treatment recommendation module 1510 may provide a ranked list of potential treatments given what is known about the patient at a given point during the dialog. The treatment recommendation module 1510 may use the diagnosis algorithms and then map the differential diagnosis to known treatments for a given diagnosis using a diagnosis to treatment mapping 1520.
  • At this point it should be noted that a medical decision support system in accordance with the present disclosure as described above may involve the processing of input data and the generation of output data to some extent. This input data processing and output data generation may be implemented in hardware or software. For example, specific electronic components may be employed in a computer server or similar or related circuitry for implementing the functions associated with a medical decision support system in accordance with the present disclosure as described above. Alternatively, one or more processors operating in accordance with instructions may implement the functions associated with a medical decision support system in accordance with the present disclosure as described above. If such is the case, it is within the scope of the present disclosure that such instructions may be stored on one or more non-transitory processor readable storage media (e.g., a magnetic disk or other storage medium), or transmitted to one or more processors via one or more signals embodied in one or more carrier waves.
  • The present disclosure is not to be limited in scope by the specific embodiments described herein. Indeed, other various embodiments of and modifications to the present disclosure, in addition to those described herein, will be apparent to those of ordinary skill in the art from the foregoing description and accompanying drawings. Thus, such other embodiments and modifications are intended to fall within the scope of the present disclosure. Further, although the present disclosure has been described herein in the context of at least one particular implementation in at least one particular environment for at least one particular purpose, those of ordinary skill in the art will recognize that its usefulness is not limited thereto and that the present disclosure may be beneficially implemented in any number of environments for any number of purposes. Accordingly, the claims set forth below should be construed in view of the full breadth and spirit of the present disclosure as described herein.

Claims (17)

1. A method for responding to a healthcare inquiry from a user, according to a set of instructions stored on a memory of a computing device and executed by a processor of the computing device, the method comprising the steps of:
classifying an intent of the user based on the healthcare inquiry;
instantiating a conversational engine based on the intent;
eliciting, by the conversational engine, information from the user; and
presenting one or more medical recommendations to the user based at least in part on the information.
2. The method of claim 1, wherein the eliciting step comprises:
using, by the conversational engine, an entropy minimization process to determine a next question to present to the user, such that subsequent information provided by the user in response to the next question minimizes the number of medical recommendations.
3. The method of claim 2, wherein the entropy minimization process is weighted to optimize diagnosis that identifies worst outcomes for early medical intervention on the user.
4. The method of claim 2, wherein the entropy minimization process is weighted to optimize diagnosis for treating the user's symptoms or disease clusters that maximizes a diagnostic value of the user's response to the proposed treatments.
5. The method of claim 1, wherein the presenting step comprises invoking a knowledge base and a diagnosis engine.
6. The method of claim 5, wherein the knowledge base represents normalized medical concepts, wherein the normalized medical concepts include at least one of:
entities representing findings, symptoms, and conditions;
modifiers representing anatomical location, severity, and temporal modifiers;
weighted relations between the entities;
relations between the entities and the modifiers;
a representation of models and engines as an instance of a graph with the entities and the relations;
a mapping of a medical text to a knowledge base representation by using an entity recognition module; and
additional knowledge sources.
7. The method of claim 6, wherein the entity recognition component module is configured to translate health-related text into medical entities and modifiers.
8. The method of claim 5, wherein the diagnosis engine is at least one of:
a first diagnosis engine based on rules in a knowledge based codifying probabilistic relationships between symptoms/findings and diseases;
a second diagnosis engine based on first machine-learned models deriving relations between symptoms/findings and diseases from historical medical records;
a third diagnosis engine based on second machine-learned models deriving both probabilities and relationships from historic medical records; and
a fourth diagnosis engine based on third machine-learned models learned from mixed data that includes at least one of synthetic data generated by a pre-existing expert system, electronic medical records, manual cases, labeled cases from the diagnosis engine.
9. The method of claim 8, wherein the historical records are either obtained from anonymized databases or generated by interactions with the user in a recursive manner.
10. The method of claim 8, wherein
the first, second, third, and fourth diagnosis engines act in ensemble;
each of the first, second, third, and fourth diagnosis engines operates upon a current state of knowledge independently, and offers possible responses along with a confidence and a value estimate; and
an ensemble arbitrator chooses a response out of the possible responses or a collection of responses that is best for the user or circumstance given a match or a mismatch between the possible responses, and the value and the confidence estimate each of the first, second, third, and fourth diagnosis engines expresses in its corresponding response, wherein
the ensemble arbitrator learns a weight to use for each possible response from each of the first, second, third, and fourth diagnosis engines based upon history.
11. The method of claim 1, wherein the eliciting step includes invoking a natural language understanding engine.
12. The method of claim 1, wherein the information is in the form of at least one of text, speech, imagery, sound, and medical test results.
13. The method of claim 1, wherein the information from the user includes at least one of user's medical history and the user's symptoms.
14. The method of claim 1, wherein the conversational engine is one of a diagnosis conversational engine, an information conversational engine, a referral conversational engine, and a treatment conversational engine.
15. The method of claim 1, further comprising, prior to presenting the one or more medical recommendations to the user, seeking approval or revision of the one or more medical recommendations by a medical expert.
16. At least one processor readable storage medium storing a computer program of instructions configured to be readable by at least one processor for instructing the at least one processor to execute a computer process for performing the method as recited in claim 1.
17. A system for responding to a healthcare inquiry from a user, the system comprising:
memory for storing instructions; and
a processor configured to execute the instructions to:
classify an intent of the user based on the healthcare inquiry;
instantiate a conversational engine based on the intent;
elicit, by the conversational engine, information from the user; and
present one or more medical recommendations to the user based at least in part on the information.
US16/265,799 2018-04-06 2019-02-01 Systems and methods for responding to healthcare inquiries Abandoned US20190311807A1 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
US16/265,799 US20190311807A1 (en) 2018-04-06 2019-02-01 Systems and methods for responding to healthcare inquiries
US16/266,784 US10847265B2 (en) 2018-04-06 2019-02-04 Systems and methods for responding to healthcare inquiries
PCT/US2019/023381 WO2019194980A1 (en) 2018-04-06 2019-03-21 Systems and methods for responding to healthcare inquiries
EP19782274.5A EP3776247A4 (en) 2018-04-06 2019-03-21 Systems and methods for responding to healthcare inquiries
US17/903,645 US20230105969A1 (en) 2018-04-06 2022-09-06 Computer-based system for applying machine learning models to healthcare related data

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201862654111P 2018-04-06 2018-04-06
US16/265,799 US20190311807A1 (en) 2018-04-06 2019-02-01 Systems and methods for responding to healthcare inquiries

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US16/266,784 Continuation-In-Part US10847265B2 (en) 2018-04-06 2019-02-04 Systems and methods for responding to healthcare inquiries
US17/903,645 Continuation US20230105969A1 (en) 2018-04-06 2022-09-06 Computer-based system for applying machine learning models to healthcare related data

Publications (1)

Publication Number Publication Date
US20190311807A1 true US20190311807A1 (en) 2019-10-10

Family

ID=68096555

Family Applications (2)

Application Number Title Priority Date Filing Date
US16/265,799 Abandoned US20190311807A1 (en) 2018-04-06 2019-02-01 Systems and methods for responding to healthcare inquiries
US17/903,645 Pending US20230105969A1 (en) 2018-04-06 2022-09-06 Computer-based system for applying machine learning models to healthcare related data

Family Applications After (1)

Application Number Title Priority Date Filing Date
US17/903,645 Pending US20230105969A1 (en) 2018-04-06 2022-09-06 Computer-based system for applying machine learning models to healthcare related data

Country Status (1)

Country Link
US (2) US20190311807A1 (en)

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110853760A (en) * 2019-11-12 2020-02-28 好人生(上海)健康科技有限公司 Interactive system for providing health guidance based on disease diagnosis
CN110993093A (en) * 2019-11-15 2020-04-10 北京邮电大学 Deep learning-based ophthalmic pre-interrogation method and device
CN111063429A (en) * 2019-10-25 2020-04-24 中国科学院自动化研究所 Medical consultation method, device, equipment and computer-readable storage medium
CN111145903A (en) * 2019-12-18 2020-05-12 东北大学 Method and device for acquiring vertigo inquiry text, electronic equipment and inquiry system
CN111159242A (en) * 2019-12-27 2020-05-15 杭州趣维科技有限公司 Client reordering method and system based on edge calculation
CN111539529A (en) * 2020-04-15 2020-08-14 东莞证券股份有限公司 Event reasoning method and device
US20200357515A1 (en) * 2019-05-10 2020-11-12 Tencent America LLC System and method for clinical decision support system with inquiry based on reinforcement learning
CN112233787A (en) * 2020-09-17 2021-01-15 北京挺好农牧科技有限公司 Animal disease diagnosis system based on chat robot
CN112259260A (en) * 2020-11-18 2021-01-22 中国科学院自动化研究所 Intelligent medical question and answer method, system and device based on intelligent wearable equipment
CN112613313A (en) * 2020-12-28 2021-04-06 北京百度网讯科技有限公司 Method, device, equipment, storage medium and program product for quality control of medical orders
CN113450905A (en) * 2020-03-24 2021-09-28 株式会社理光 Medical auxiliary diagnosis system, method and computer readable storage medium
US20210398671A1 (en) * 2018-10-10 2021-12-23 Healthpointe Solutions, Inc. System and method for recommending items in conversational streams
US11222166B2 (en) * 2019-11-19 2022-01-11 International Business Machines Corporation Iteratively expanding concepts
US20220130544A1 (en) * 2020-10-23 2022-04-28 Remmie, Inc Machine learning techniques to assist diagnosis of ear diseases
CN115083602A (en) * 2022-08-16 2022-09-20 阿里巴巴(中国)有限公司 Mental disease auxiliary diagnosis method, device, equipment, storage medium and system
CN115545960A (en) * 2022-12-01 2022-12-30 江苏联弘信科技发展有限公司 Electronic information data interaction system and method
US20230153539A1 (en) * 2020-03-17 2023-05-18 Decoded Health, Inc. Machine-assisted medical patient interaction, diagnosis, and treatment
US11798560B1 (en) 2018-12-21 2023-10-24 Cerner Innovation, Inc. Rapid event and trauma documentation using voice capture
US11862164B2 (en) 2018-12-21 2024-01-02 Cerner Innovation, Inc. Natural language understanding of conversational sources
US11875883B1 (en) 2018-12-21 2024-01-16 Cerner Innovation, Inc. De-duplication and contextually-intelligent recommendations based on natural language understanding of conversational sources
US11961605B2 (en) 2019-11-29 2024-04-16 Kpn Innovations, Llc Methods and systems for informing self-treatment remedy selection

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180075011A1 (en) * 2016-09-13 2018-03-15 International Business Machines Corporation Hybrid Approach to Handling Hypotheticals in Texts
US20180150607A1 (en) * 2016-11-30 2018-05-31 Welltok, Inc. Synthetic ground truth expansion

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180075011A1 (en) * 2016-09-13 2018-03-15 International Business Machines Corporation Hybrid Approach to Handling Hypotheticals in Texts
US20180150607A1 (en) * 2016-11-30 2018-05-31 Welltok, Inc. Synthetic ground truth expansion

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210398671A1 (en) * 2018-10-10 2021-12-23 Healthpointe Solutions, Inc. System and method for recommending items in conversational streams
US11798560B1 (en) 2018-12-21 2023-10-24 Cerner Innovation, Inc. Rapid event and trauma documentation using voice capture
US11862164B2 (en) 2018-12-21 2024-01-02 Cerner Innovation, Inc. Natural language understanding of conversational sources
US11875883B1 (en) 2018-12-21 2024-01-16 Cerner Innovation, Inc. De-duplication and contextually-intelligent recommendations based on natural language understanding of conversational sources
US11869509B1 (en) * 2018-12-21 2024-01-09 Cerner Innovation, Inc. Document generation from conversational sources
US11763944B2 (en) * 2019-05-10 2023-09-19 Tencent America LLC System and method for clinical decision support system with inquiry based on reinforcement learning
US20200357515A1 (en) * 2019-05-10 2020-11-12 Tencent America LLC System and method for clinical decision support system with inquiry based on reinforcement learning
CN111063429A (en) * 2019-10-25 2020-04-24 中国科学院自动化研究所 Medical consultation method, device, equipment and computer-readable storage medium
CN110853760A (en) * 2019-11-12 2020-02-28 好人生(上海)健康科技有限公司 Interactive system for providing health guidance based on disease diagnosis
CN110993093A (en) * 2019-11-15 2020-04-10 北京邮电大学 Deep learning-based ophthalmic pre-interrogation method and device
US11222166B2 (en) * 2019-11-19 2022-01-11 International Business Machines Corporation Iteratively expanding concepts
US11961605B2 (en) 2019-11-29 2024-04-16 Kpn Innovations, Llc Methods and systems for informing self-treatment remedy selection
CN111145903A (en) * 2019-12-18 2020-05-12 东北大学 Method and device for acquiring vertigo inquiry text, electronic equipment and inquiry system
CN111159242A (en) * 2019-12-27 2020-05-15 杭州趣维科技有限公司 Client reordering method and system based on edge calculation
US20230153539A1 (en) * 2020-03-17 2023-05-18 Decoded Health, Inc. Machine-assisted medical patient interaction, diagnosis, and treatment
CN113450905A (en) * 2020-03-24 2021-09-28 株式会社理光 Medical auxiliary diagnosis system, method and computer readable storage medium
CN111539529A (en) * 2020-04-15 2020-08-14 东莞证券股份有限公司 Event reasoning method and device
CN112233787A (en) * 2020-09-17 2021-01-15 北京挺好农牧科技有限公司 Animal disease diagnosis system based on chat robot
US20220130544A1 (en) * 2020-10-23 2022-04-28 Remmie, Inc Machine learning techniques to assist diagnosis of ear diseases
CN112259260A (en) * 2020-11-18 2021-01-22 中国科学院自动化研究所 Intelligent medical question and answer method, system and device based on intelligent wearable equipment
CN112613313A (en) * 2020-12-28 2021-04-06 北京百度网讯科技有限公司 Method, device, equipment, storage medium and program product for quality control of medical orders
CN115083602A (en) * 2022-08-16 2022-09-20 阿里巴巴(中国)有限公司 Mental disease auxiliary diagnosis method, device, equipment, storage medium and system
CN115545960A (en) * 2022-12-01 2022-12-30 江苏联弘信科技发展有限公司 Electronic information data interaction system and method

Also Published As

Publication number Publication date
US20230105969A1 (en) 2023-04-06

Similar Documents

Publication Publication Date Title
US20230105969A1 (en) Computer-based system for applying machine learning models to healthcare related data
US10847265B2 (en) Systems and methods for responding to healthcare inquiries
Wang et al. Deep learning in medicine—promise, progress, and challenges
US11810671B2 (en) System and method for providing health information
US10699215B2 (en) Self-training of question answering system using question profiles
US10331659B2 (en) Automatic detection and cleansing of erroneous concepts in an aggregated knowledge base
US11031103B2 (en) Personalized questionnaire for health risk assessment
US10380251B2 (en) Mining new negation triggers dynamically based on structured and unstructured knowledge
US10971254B2 (en) Medical condition independent engine for medical treatment recommendation system
US20180089383A1 (en) Container-Based Knowledge Graphs for Determining Entity Relations in Medical Text
US10818394B2 (en) Cognitive building of medical condition base cartridges for a medical system
US20180082030A1 (en) Automatic Adjustment of Treatment Recommendations Based on Economic Status of Patients
US20180096103A1 (en) Verification of Clinical Hypothetical Statements Based on Dynamic Cluster Analysis
US11379660B2 (en) Deep learning approach to computing spans
US11295080B2 (en) Automatic detection of context switch triggers
US20190198137A1 (en) Automatic Summarization of Patient Data Using Medically Relevant Summarization Templates
Cheerkoot-Jalim et al. A systematic review of text mining approaches applied to various application areas in the biomedical domain
US20230095258A1 (en) Detecting Discrepancies Between Clinical Notes and Administrative Records
Alekhya et al. An ensemble approach for healthcare application and diagnosis using natural language processing
US20190198138A1 (en) Automatic Expansion of Medically Relevant Summarization Templates Using Semantic Expansion
Muthuraju et al. Doctor patient assistance system using artificial intelligence
US20230043849A1 (en) Answer generation using machine reading comprehension and supported decision trees
US20240062858A1 (en) Electronic health records reader
Li et al. Improving patient self-description in Chinese online consultation using contextual prompts
Brundage Prevalence and evaluation of potential abbreviations in intensive care documentation

Legal Events

Date Code Title Description
AS Assignment

Owner name: CURAI, INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:KANNAN, ANITHA;RAVURI, MURALI;RODRIGUES, VITOR;AND OTHERS;SIGNING DATES FROM 20190103 TO 20190325;REEL/FRAME:049155/0527

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE AFTER FINAL ACTION FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: ADVISORY ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION