WO2016094450A1 - Systems and methods for rare disease prediction and treatment - Google Patents

Systems and methods for rare disease prediction and treatment Download PDF

Info

Publication number
WO2016094450A1
WO2016094450A1 PCT/US2015/064564 US2015064564W WO2016094450A1 WO 2016094450 A1 WO2016094450 A1 WO 2016094450A1 US 2015064564 W US2015064564 W US 2015064564W WO 2016094450 A1 WO2016094450 A1 WO 2016094450A1
Authority
WO
WIPO (PCT)
Prior art keywords
disease
candidate
symptoms
diseases
patient
Prior art date
Application number
PCT/US2015/064564
Other languages
French (fr)
Inventor
Hendrik Anton MEIJER
Original Assignee
Bioneur, Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from PCT/US2014/053018 external-priority patent/WO2015031542A1/en
Application filed by Bioneur, Llc filed Critical Bioneur, Llc
Priority to EP15868521.4A priority Critical patent/EP3230944A4/en
Publication of WO2016094450A1 publication Critical patent/WO2016094450A1/en

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/20ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems

Definitions

  • Various embodiments described herein relate generally to the field of health- related predictive analytics, and more particularly to correlating multiple symptoms with rare disease information and applying customized algorithms to generate confidence values for possible rare disease matches.
  • Rare diseases are often chronic, debilitating, life-threatening and with degenerative progression. They not only have an impact on the patients, but they also have an impact on their family, friends, their physicians and society. 80% of rare diseases are genetic in origin and thus are present throughout a person's life. 50% of the people affected by rare diseases are children. 30% of children with a rare disease will not live to see their 5th birthday. A recent survey showed that it takes 7.6 years in the US and 5.6 years in the UK for a patient with a rare disease to be properly diagnosed. [0004] One can only speculate about the time it takes to diagnose patients with a rare disease in countries outside of the US and Europe. Today, medical schools around the world still do not pay sufficient attention to rare diseases, resulting in a lack of awareness within the medical communities.
  • Systems and methods are provided herein for a rare disease matching and prediction portal where physicians can select from a list of patient symptoms which are then matched with a list of rare diseases to produce candidate diseases, after which the candidate diseases are evaluated based on weighted lists of symptoms for each candidate disease to produce a confidence indicating a likelihood that a patient suffers from one of the rare disease.
  • Information curated from peer reviewed medical and scientific publications related to rare diseases are utilized to determine the weighted list of symptoms for each disease, and a customized algorithm is applied to determine the confidences from a list of candidate diseases.
  • the portal may provide a disease profile for each disease which includes additional tests, specialized facilities and experts, and possible treatments for the candidate diseases. Additional disease specific testing may be conducted to confirm a diagnosis. The patient may then be treated for at least one of the candidate diseases based on the confidences.
  • a method of rare disease prediction comprises the steps of: receiving one or more patient symptoms pertaining to a patient; comparing the one or more patient symptoms with known symptoms of diseases to identify one or more candidate diseases; applying a predetermined weighted value to each of the one or more patient symptoms for each candidate disease, wherein the weighted value is determined based on a relevance of each symptom to each candidate disease; generating a confidence for each candidate disease based on the one or more patient symptoms and applied weighted values; identifying at least one treatment for each candidate disease; and treating the patient with the at least one treatment.
  • a system for rare disease prediction comprises a receiving unit which receives one or more patient symptoms pertaining to a patient; a disease matching unit which compares the one or more patient symptoms with known symptoms of diseases to identify one or more candidate diseases; a symptom weighting unit which applies a predetermined weighted value to each of the one or more patient symptoms for each candidate disease, wherein the weighted value is determined based on a relevance of each symptom to each candidate disease; a confidence generating unit which generates a confidence for each candidate disease based on the one or more patient symptoms and applied weighted values; and a treatment recommendation unit which identifies at least one treatment for one or more of the candidate diseases.
  • FIG. 1 is a block diagram illustrating an exemplary system for predicting likelihoods of rare diseases in patients, according to one embodiment.
  • FIG. 2 is a block diagram illustrating an exemplary prediction server configured to predict the likelihoods of rare diseases in patients, according to one embodiment.
  • FIG. 3 is a flow diagram illustrating an exemplary method of predicting the likelihoods of rare diseases in patients, according to one embodiment.
  • FIG. 4 is a flow diagram illustrating an exemplary method of predicting the likelihoods of rare diseases in patients and treating the patients for at least one rare disease, according to one embodiment.
  • FIG. 5 is a block diagram that illustrates an embodiment of a computer/server system upon which an embodiment of the inventive methodology may be implemented.
  • the systems and methods described herein provide a portal for predictive analytics relating to comprehensive lists of symptoms weighted for each disease in order to determine the likelihood that a patient may suffer from a rare disease.
  • the embodiments described herein enable physicians to query multiple symptoms that the patient presents with and receive a list of candidate diseases and confidences associated with each candidate disease. The symptoms of these rare diseases are ranked and/or weighted according to its specificity with a rare disease.
  • the portal will process the selected symptoms and their disease-specific weights with a customized algorithm to generate a confidence for each candidate disease. The confidences are then displayed to the physician along with disease profile information including confirmatory diagnostic tests, specialized facilities and possible treatments.
  • the embodiments do not provide a diagnostic indication, but can be regarded a "hypothetico-deductive method" in the sense that the suggested candidate diseases can be viewed as hypotheses that require further disease-specific testing to confirm a positive or negative diagnosis.
  • the beneficiaries of these systems include patients, physicians, policy makers and industry. First and foremost, thousands of currently undiagnosed and/or misdiagnosed patients suffering from a rare, chronic, debilitating, life-threatening but treatable disease, around the world. Physicians are provided with a tool that supports them in the process of diagnosing a patient with a rare treatable disease, faster than their current capability.
  • Biopharmaceutical companies that developed and market these orphan drugs will have the opportunity to serve many more patients around the world and meet their unmet medical needs faster and without having to make significant investments in local country specific infrastructure and the like. As a result, its revenue will grow faster. Policy makers benefit as they look for statistics around the number of patients with a disease in a region or worldwide.
  • the embodiments described herein will increase the number of and expedite access to subjects eligible to participate in clinical trials.
  • the objectives are to 1) increase the number of and access to naive (previously untreated) patients, 2) reduce the time of accruing these patients, 3) reduce the overall time of all phases of clinical trials, 4) reduce the overall costs of clinical trials, 5) reduce the overall "time to market" for a candidate drug and treatment of patients suffering from a rare disease, and 6) generate ROI faster.
  • the method can be extended and adapted to rare diseases for which no treatment with an orphan drug is available but for which clinical trials are on-going or scheduled to be conducted. A patient predicted to have an identified rare disease might meet the inclusion criteria for these clinical trials.
  • Finding patients suffering a specific rare disease and that can participate in a clinical trial is a significant challenge for CROs (Clinical Research Organizations).
  • a similar portal may be developed to be used by the general public for rare disease prediction that uses lay medical terms and descriptions.
  • the portal can be developed for common diseases without modification using the same principles for rare disease prediction, only with a larger data set of symptoms, diseases and weights,
  • FIG. 1 illustrates a block diagram of one embodiment of a system for predicting rare diseases.
  • a physician or health care provider will access the system from a remote device 102 at the physician's location, allowing the physician to take advantage of the system from anywhere.
  • the remote device 102 is connected with a prediction server 104, which is configured to receive a user selection of symptoms from a list of symptoms.
  • the doctor doesn't provide his or her own list of symptoms, but just clicks on the appropriate symptoms his/her patient presents with from a predetermined list of symptoms.
  • the list of patient symptoms may also include a preset weight (level of relevance) of each symptom to each disease. The higher the number of relevant symptoms for each disease, the higher a specificity is for those combined symptoms.
  • a prediction database 106 connected with the prediction server 104 may be configured to store different types of information needed to process the selected patient symptoms and match the patient symptoms with one or more of the rare disease stored in the portal.
  • the portal applies preset weights to each symptom of each rare disease to identify a list of candidate diseases, identifies disease profile information for the candidate diseases, and suggest treatments.
  • the system is designed to avoid transmitting or storing any information identifying the patient in order to avoid any risks of violating patient privacy rights. Therefore, the user only selects symptoms from a list and follows the steps to obtain predicted outcomes without creating a patient profile or entering any personally identifiable information about the patient.
  • a prediction database 106 which stores lists of diseases and symptoms and their associations with each other.
  • the symptoms may be associated with the diseases through various tables, such as a table of diseases with a list of symptoms known to exist for each disease.
  • Another separate database or database table may contain synonyms for symptoms and diseases in order to more accurately match the selected symptoms and diseases.
  • the prediction database 106 may store a list of preset weights that are applied to each symptom for each particular disease, although some symptoms may not have a weight. If a symptom lacks a weight, the applied weight value may be zero.
  • the weights reflect the relevance of a particular symptom to a particular disease.
  • the relevance may be determined by a number of different methods and based on a plurality of different information.
  • a symptom which is the primary symptom in a rare disease and which is also exclusive only to that rare disease may receive a weight value of 1 from a scale of 0 to 1, since the existence of that symptom is a definitive indication that the patient suffers from the associated rare disease.
  • the prediction database 106 may also be utilized to store complete descriptions and other information about each rare disease, such as detailed descriptions of the symptoms, how they are presented, the progression of the disease, diagnostic tests that can confirm its existence and specialists and facilities equipped to handle the disease.
  • the prediction database 106 may store a list of treatments that are applicable to certain rare diseases, such as lists of orphan drugs.
  • the prediction database 106 may also include information on the availability and approval of certain treatments in certain locations, the name and location of laboratories which can perform needed tests, the availability of clinical trials relating to the disease and information on the efficacy of the treatments, costs, etc. The system may therefore provide clinical trials with access to additional naive patients that would otherwise remain unidentified and undiagnosed.
  • information on new symptoms of certain rare diseases may be incorporated into the prediction database 106, while information on the relevance of a particular symptom to a particular rare disease may be incorporated into the prediction database 106 to adjust the weight of that particular symptom with respect to that particular rare disease. All of the information can be added to the prediction database 106 to improve the information available on each disease, and updates on new treatments or the results of new treatments may be added to the database 106.
  • the prediction server may be configured with one or more processing elements to receive, analyze and display information relating to the symptoms and confidences for rare diseases.
  • FIG. 2 illustrates a block diagram of the prediction server 104 with a receiving unit 116 configured to receive the list of selected patient symptoms from the input device 102. The received list of selected patient symptoms is then forwarded to a disease matching unit 118 which matches the list of selected patient symptoms with the symptoms and diseases in the prediction database 106 in order to produce a list of candidate diseases, as will be described further below. The list of candidate diseases is then forward to a confidence generating unit 120 which accesses the prediction database 108 to utilize the weights assigned to each symptom for each disease in order to determine a confidence for each candidate disease.
  • the prediction server 104 may store the results in an attached database or forward the results back to the input device 102 for viewing by the physician. In another embodiment, the confidences are then forwarded to a treatment recommendation unit 122 which accesses the prediction database 106 to determine one or more treatments for the candidate diseases. The various types of treatments are described in further detail below. The treatments may also be forwarded back to the input device 102 for viewing by the physician, who can then prescribe a particular treatment, such as medication, surgery, genetic modification, or even a referral to a specialist in the field of the rare diseases or a medical facility or clinical trial which specializes in the rare disease.
  • the portal will provide for a consistent, accurate real-time prediction of the supported rare diseases by utilizing a symptoms weighting database of predetermined weights for each symptom of each disease.
  • the diagnosis will rely on a curated relational database of symptoms and diseases. Rare diseases are then diagnosed by a unique combination of a set of weighted symptoms and varying levels of specificity.
  • a relevance of each symptom to each candidate disease improves the confidence of the prediction. It is possible that multiple diseases are diagnosed by a given set of symptoms, and it is also plausible that a given disease is diagnosed by multiple sets of symptoms. Hence, the relationship between symptoms and diseases is that of many-to-many.
  • a set could contain one or more symptoms with varying levels of relevance, which could also be incorporated into the method of predicting disease. Some symptoms may need to be described in terms of the relevance of the symptom to a disease. The higher the relevance of a combination of specific symptoms for a disease, the higher the overall specificity, or confidence, of that disease. For example, the individual symptoms might not be specific to one rare disease, but the combination of all of them might. Each symptom within a set may therefore contribute disproportionately to the prediction of a disease.
  • the relevance of a particular symptom may be utilized as a pre- weighting factor for a symptom.
  • FIG. 3 illustrates one embodiment of a method for predicting the likelihood of rare diseases.
  • the physician or user first selects a set of patient symptoms at the portal, which is then received at the prediction server.
  • the physician may be interacting with the portal via a web-interface or through an application running on the physician's input device, such as a desktop or portable computing device.
  • an initial matching process is performed by matching the list of selected patient symptoms with the list of symptoms stored in the associated diseases and symptoms database. Any disease with symptoms that match the list of patient symptoms is selected and then identified as a candidate disease.
  • the remaining process may skip the prediction algorithm (described below in step 308) and the identified disease may be immediately output as the predicted disease at step 310.
  • the confidence value would effectively be 100%, as there would be no other candidate diseases with the same set of symptoms. If the list of candidate diseases does not produce an exact match ("N" at step 306), the process continues to the prediction algorithm in step 308.
  • the candidate diseases identified based on the initial matching are input into a prediction algorithm.
  • the prediction algorithm is applied to each candidate disease using the selected symptoms that are each weighted for each candidate disease (using preset weighted values obtained from the prediction database 106) to determine a confidence for each candidate disease that indicates a likelihood that the patient suffers from that particular disease.
  • the confidence represents a specificity, or sum of the individual weights of each symptom for each disease, where the more relevant symptoms present, the higher the specificity.
  • the predictive algorithm sums the total weights for all of the identified symptoms, raises them by a value corresponding to the number of symptoms of the candidate disease not identified in the list of patient symptoms, and then subtracts them by a constant.
  • the predictive algorithm may therefore be understood by the following equation: Where d is the confidence value, w ; is the weight w for a symptom, i, N is the number of identified patient symptoms, and D is the number of symptoms not identified from a list of all symptoms for a candidate disease.
  • the weights for each symptom of each disease are determined based on a relevance of a symptom to a particular disease.
  • the relevance may be determined from a plurality of different knowledge sources and given a value based on a proprietary methodology indicative of the relevance of the symptom for a specific disease. This may include empirical observations and knowledge of a symptom's relationship with a particular disease.
  • the weights for each symptom may be determined by a weighting algorithm which uses a plurality of factors to determine the weighted value. Details of the weighting algorithm will be provided further below.
  • step 310 the predicted diseases and confidences are output to the physician.
  • the confidences may be output in order of the most likely disease to the least likely disease, and the number of candidate diseases displayed may be truncated to only provide the top five predictions or only the predictions over a certain threshold confidence value.
  • the confidences are represented herein as numerical values indicating a percentage from 0 - 100 for ease of understanding. However, the confidence may be represented as another numerical value on a different scale, or even converted to a descriptive term indicating whether it is "likely,” "very likely” or “unlikely” that the patient is suffering from the candidate disease.
  • the portal may display a user interface with a disease profile for each candidate disease, as has been described above.
  • the disease profile may provide detailed explanations of the disease, symptoms, timeline of onset and progression, etc. that is stored in the prediction database 106.
  • the user interface may then display, in step 314, the disease profile information, including possible tests that a physician may prescribe to make a definitive diagnosis of the rare disease, possible treatments for the disease (such as a list of orphan drugs that are not typically known to the medical community), clinical trials available for potential new treatments and treatment centers or specialized medical centers that specialize in the diagnosis and treatment of a rare disease.
  • a disease profile may be sponsored by one of the treatment providers, clinical trial sponsors, treatment locations or diagnostic test providers.
  • a weighting algorithm may be used to determine a weight for each symptom of each disease.
  • the weighting algorithm is a function of five factors:
  • Optional Symptoms The number of optional symptoms. Presence of optional symptoms in addition to the presence of all mandatory symptoms will enhance the confidence assigned to overall predicted diagnosis.
  • Combination of Symptoms Combination (subset) of symptoms collectively associated with the disease.
  • Likelihood Ratio Likelihood ratio represents the number of times a symptom was observed in cases divided by the total number of cases of a disease. For example, a certain symptom may or may not always be associated with a specific disease. This could be represented in terms of a ratio of the number of disease cases reporting this symptom (for e.g., in a database) to the total number of known cases with this disease. Since the numerator would be less than or equal to the denominator, the range of valid values for the likelihood ratio would be between 0.0 and 1.0.
  • w is the weight
  • N is the total number of identified patient symptoms
  • K D is a constant between 0.1 to 0.9
  • L is a likelihood ratio representing the number of times a symptom i was observed in cases divided by the total number of cases of a disease.
  • a unique weighted value is assigned to each symptom of each disease, such that the same symptom may have different weights for different diseases since that symptom's importance to each disease may vary. Scoring and Selection Algorithm
  • a scoring and selection algorithm is used to identify candidate diseases, generate a score for the candidate diseases and determine whether the candidate diseases exceed a threshold.
  • the score for a disease is the weighted count of the intersection between the mandatory symptoms for the disease and the provided symptoms, plus the weighted count of the intersection between the optional symptoms for the disease and the provided symptoms.
  • Score(D, ⁇ S ⁇ ) (Count( ⁇ S m (D) ⁇ ⁇ ⁇ S ⁇ )*W m ) + (Count( ⁇ S 0 (D) ⁇ ⁇ ⁇ S ⁇ )*W 0 ) (3), where:
  • W m Weighting factor for mandatory symptoms
  • W 0 Weighting factor for optional symptoms
  • the visibility of a disease in the result set is a Boolean operation based on whether the score for the disease is greater than a pre-computed threshold value.
  • the visibility is determined by Equation 4:
  • the visibility threshold (Threshold) is determined by a bi-state function which evaluates to a pre-set value, T, if any disease's score is greater than or equal to T for the set of all possible diseases, or evaluates to 0 if not.
  • the Threshold may be determined, in one embodiment, by Equation 5:
  • Threshold If(Max(Score( ⁇ D ⁇ , ⁇ S ⁇ )) ⁇ T) then 0 else T (5), where:
  • T Threshold score for disease inclusion
  • Count The number of items in a set.
  • T 20.
  • OMIM Online Mendelian Inheritance in Man (www.omim.org/)
  • Web-based software quality requirements that will be adhered to will include:
  • the portal will be developed using a strict Agile software development methodology at capability maturity model (CMM) Level 3 maturity; 2) 508 compliance: www.section508.gov/; and 3) W3C Web Accessibility Initiative to promote a high degree of usability for people with disabilities.
  • CMS capability maturity model
  • the portal will comply with all applicable and evolving regulatory requirements set-forth by FDA, EMA and other international and national agencies.
  • the system includes a scroll down menu with a disease prediction list. Clicking on the name of a rare disease a multi-page overview appears with detailed information about the disease, symptoms, diagnosis and treatment(s).
  • the system will include names of local, regional and international laboratories that do have testing experience for rare diseases; patient societies, advocacy groups and rare disease non-profit organization; rare disease experts and rare disease centers of excellence and much more.
  • This portal will include the names of national and international clinical rare disease experts and their centers of excellence. This portal will include information about local, regional and international patient societies and advocacy groups.
  • information about one or more treatments is included. Its up to the physicians to conduct further testing to confirm a diagnosis and determine how to treat his/her patient.
  • the method described herein includes treating the patient for one or more of the diseases based on the confidence values provided by the aforementioned method.
  • the patient is then treated for one or more of the candidate diseases in step 412, for example by administering a treatment such as an orphan drug, performing a test to confirm a diagnosis of the disease, enrolling the patient in a clinical trial for the disease, or forwarding the patient to an expert in the disease or a medical facility which specializes in the disease.
  • a treatment such as an orphan drug
  • performing a test to confirm a diagnosis of the disease enrolling the patient in a clinical trial for the disease, or forwarding the patient to an expert in the disease or a medical facility which specializes in the disease.
  • the system may also provide an interface and content for patients and related parties separate from the physician interface.
  • a similar portal would be available for use by patients and their families. This portal will describe the same 400 or more rare diseases, symptoms and treatment(s) but in lay terms only. Everything, including the query of multiple symptoms will be lay term based. This portal would be available in all major languages. With the knowledge obtained patient(s) can visit their physician and have him/her use the portal for further research.
  • the patient interface may also provide for interactions between patients suffering from similar rare disease conditions so they can discuss the treatments, symptoms and overall experiences.
  • the information received from the physicians regarding the symptoms may be utilized along with the predicted confidences as additional analytics helpful to further improve the accuracy of identifying rare diseases and their symptoms.
  • the system does not collect information that would identify a specific individual patient in order to protect privacy, the information collected would be limited to a selection of symptoms, specificities and predicted diseases associated with an anonymous patient.
  • information on the location of the physician inputting the patient data could be used in order to track the locations of cases of rare disease.
  • the system may also be configured to continuously receive third party information from publications, papers, hospitals or other healthcare facilities, etc. that collect patient data.
  • FIG. 5 is a block diagram that illustrates an embodiment of a computer/server system 500 upon which an embodiment of the inventive methodology may be implemented.
  • the system 500 includes a computer/server platform 501 including a processor 502 and memory 503 which operate to execute instructions, as known to one of skill in the art.
  • the term "computer-readable storage medium” as used herein refers to any tangible medium, such as a disk or semiconductor memory, that participates in providing instructions to processor 502 for execution.
  • the computer platform 501 receives input from a plurality of input devices 504, such as a keyboard, mouse, touch device or verbal command.
  • the computer platform 501 may additionally be connected to a removable storage device 505, such as a portable hard drive, optical media (CD or DVD), disk media or any other tangible medium from which a computer can read executable code, data or write results to.
  • the computer platform may further be connected to network resources 506 which connect to the Internet or other components of a local public or private network.
  • the network resources 506 may provide instructions and data to the computer platform from a remote location on a network 507.
  • the connections to the network resources 506 may be via wireless protocols, such as the 802.11 standards, Bluetooth® or cellular protocols, or via physical transmission media, such as cables or fiber optics.
  • the network resources may include storage devices for storing data and executable instructions at a location separate from the computer platform 501.
  • the computer interacts with a display 508 to output data and other information to a user, as well as to request additional instructions and input from the user.
  • the display 508 may therefore further act as an input device 504 for interacting with a user.

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Biomedical Technology (AREA)
  • Public Health (AREA)
  • Pathology (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Epidemiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Measuring And Recording Apparatus For Diagnosis (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

A rare disease matching and prediction portal is provided where a number of symptoms are matched with a list of rare diseases to produce candidate diseases, after which the candidate diseases are evaluated based on weighted lists of symptoms for each candidate disease to produce a confidence indicating a likelihood that a patient suffers from one of the rare diseases. Information curated from publications and other curated databases related to rare diseases are utilized to determine the weighted list of symptoms for each disease based on prevalence relevance of each symptom to each candidate disease, and a customized algorithm is applied to determine the confidences for the candidate diseases. Along with the confidences, the portal may provide a disease profile for each disease which includes possible treatments for the candidate diseases. The patient may then be treated for at least one of the candidate diseases based on the confidences.

Description

SYSTEMS AND METHODS FOR RARE DISEASE PREDICTION AND
TREATMENT Background
1. Field of the Invention
[0001] Various embodiments described herein relate generally to the field of health- related predictive analytics, and more particularly to correlating multiple symptoms with rare disease information and applying customized algorithms to generate confidence values for possible rare disease matches.
2. Related Art
[0002] Globally, approximately 7000 rare diseases have been identified. Rare diseases are notoriously difficult to diagnose and most of them go unnoticed and/or receive little attention because they affect only thousands or sometimes even smaller numbers of patients worldwide. Collectively there are about 350 million people living around the world with a rare disease.
[0003] Rare diseases are often chronic, debilitating, life-threatening and with degenerative progression. They not only have an impact on the patients, but they also have an impact on their family, friends, their physicians and society. 80% of rare diseases are genetic in origin and thus are present throughout a person's life. 50% of the people affected by rare diseases are children. 30% of children with a rare disease will not live to see their 5th birthday. A recent survey showed that it takes 7.6 years in the US and 5.6 years in the UK for a patient with a rare disease to be properly diagnosed. [0004] One can only speculate about the time it takes to diagnose patients with a rare disease in countries outside of the US and Europe. Today, medical schools around the world still do not pay sufficient attention to rare diseases, resulting in a lack of awareness within the medical communities.
[0005] Although great progress has been made with the development of treatments for rare diseases over the past three decades, for the vast majority of these diseases there is still no approved treatment yet.
[0006] The good news is that for about 480 of the 7000 rare diseases that have been identified worldwide an FDA and/or EMA approved treatment is developed and marketed. However, as long as one cannot make or get a proper diagnosis first that knowledge is irrelevant.
[0007] Leading and emerging biopharmaceutical companies have developed and market orphan drugs for about 480 rare diseases. All these diseases will be included in this portal. Thanks to these companies thousands of patients (mostly in Western countries) benefit from these, sometimes life changing treatments on a daily basis. The impact that some of these treatments have on patients and their families, is extraordinary. Tens of thousands of patients are still waiting to be diagnosed.
[0008] Because rare diseases are inherently difficult to diagnose, not in the least because of the significant lack of knowledge about these diseases within the medical community, physicians increasingly use search engines. However, search engines are not designed for rare diseases. This is because search engines consider pages important when they are linked to by other pages. Rare diseases by definition are unlikely to have a high profile on the web relative to more common diseases. So the difficulty of diagnosing a rare disease remains unchanged. Summary
[0009] Systems and methods are provided herein for a rare disease matching and prediction portal where physicians can select from a list of patient symptoms which are then matched with a list of rare diseases to produce candidate diseases, after which the candidate diseases are evaluated based on weighted lists of symptoms for each candidate disease to produce a confidence indicating a likelihood that a patient suffers from one of the rare disease. Information curated from peer reviewed medical and scientific publications related to rare diseases are utilized to determine the weighted list of symptoms for each disease, and a customized algorithm is applied to determine the confidences from a list of candidate diseases. Along with the confidences, the portal may provide a disease profile for each disease which includes additional tests, specialized facilities and experts, and possible treatments for the candidate diseases. Additional disease specific testing may be conducted to confirm a diagnosis. The patient may then be treated for at least one of the candidate diseases based on the confidences.
[0010] In a first exemplary aspect, a method of rare disease prediction comprises the steps of: receiving one or more patient symptoms pertaining to a patient; comparing the one or more patient symptoms with known symptoms of diseases to identify one or more candidate diseases; applying a predetermined weighted value to each of the one or more patient symptoms for each candidate disease, wherein the weighted value is determined based on a relevance of each symptom to each candidate disease; generating a confidence for each candidate disease based on the one or more patient symptoms and applied weighted values; identifying at least one treatment for each candidate disease; and treating the patient with the at least one treatment.
[0011] In a further aspect, a system for rare disease prediction comprises a receiving unit which receives one or more patient symptoms pertaining to a patient; a disease matching unit which compares the one or more patient symptoms with known symptoms of diseases to identify one or more candidate diseases; a symptom weighting unit which applies a predetermined weighted value to each of the one or more patient symptoms for each candidate disease, wherein the weighted value is determined based on a relevance of each symptom to each candidate disease; a confidence generating unit which generates a confidence for each candidate disease based on the one or more patient symptoms and applied weighted values; and a treatment recommendation unit which identifies at least one treatment for one or more of the candidate diseases.
[0012] Other features and advantages should become apparent from the following description of the preferred embodiments, taken in conjunction with the accompanying drawings.
Brief Description of the Drawings
[0013] Various embodiments disclosed herein are described in detail with reference to the following figures. The drawings are provided for purposes of illustration only and merely depict typical or exemplary embodiments. These drawings are provided to facilitate the reader's understanding and shall not be considered limiting of the breadth, scope, or applicability of the embodiments. It should be noted that for clarity and ease of illustration these drawings are not necessarily made to scale.
[0014] FIG. 1 is a block diagram illustrating an exemplary system for predicting likelihoods of rare diseases in patients, according to one embodiment.
[0015] FIG. 2 is a block diagram illustrating an exemplary prediction server configured to predict the likelihoods of rare diseases in patients, according to one embodiment.
[0016] FIG. 3 is a flow diagram illustrating an exemplary method of predicting the likelihoods of rare diseases in patients, according to one embodiment. [0017] FIG. 4 is a flow diagram illustrating an exemplary method of predicting the likelihoods of rare diseases in patients and treating the patients for at least one rare disease, according to one embodiment.
[0018] FIG. 5 is a block diagram that illustrates an embodiment of a computer/server system upon which an embodiment of the inventive methodology may be implemented.
[0019] The various embodiments mentioned above are described in further detail with reference to the aforementioned figures and the following detailed description of exemplary embodiments.
Detailed Description
[0020] The systems and methods described herein provide a portal for predictive analytics relating to comprehensive lists of symptoms weighted for each disease in order to determine the likelihood that a patient may suffer from a rare disease. The embodiments described herein enable physicians to query multiple symptoms that the patient presents with and receive a list of candidate diseases and confidences associated with each candidate disease. The symptoms of these rare diseases are ranked and/or weighted according to its specificity with a rare disease. The portal will process the selected symptoms and their disease-specific weights with a customized algorithm to generate a confidence for each candidate disease. The confidences are then displayed to the physician along with disease profile information including confirmatory diagnostic tests, specialized facilities and possible treatments.
[0021] The embodiments do not provide a diagnostic indication, but can be regarded a "hypothetico-deductive method" in the sense that the suggested candidate diseases can be viewed as hypotheses that require further disease-specific testing to confirm a positive or negative diagnosis. [0022] The beneficiaries of these systems include patients, physicians, policy makers and industry. First and foremost, thousands of currently undiagnosed and/or misdiagnosed patients suffering from a rare, chronic, debilitating, life-threatening but treatable disease, around the world. Physicians are provided with a tool that supports them in the process of diagnosing a patient with a rare treatable disease, faster than their current capability. Biopharmaceutical companies that developed and market these orphan drugs will have the opportunity to serve many more patients around the world and meet their unmet medical needs faster and without having to make significant investments in local country specific infrastructure and the like. As a result, its revenue will grow faster. Policy makers benefit as they look for statistics around the number of patients with a disease in a region or worldwide.
[0023] The embodiments described herein will increase the number of and expedite access to subjects eligible to participate in clinical trials. The objectives are to 1) increase the number of and access to naive (previously untreated) patients, 2) reduce the time of accruing these patients, 3) reduce the overall time of all phases of clinical trials, 4) reduce the overall costs of clinical trials, 5) reduce the overall "time to market" for a candidate drug and treatment of patients suffering from a rare disease, and 6) generate ROI faster. The method can be extended and adapted to rare diseases for which no treatment with an orphan drug is available but for which clinical trials are on-going or scheduled to be conducted. A patient predicted to have an identified rare disease might meet the inclusion criteria for these clinical trials. Finding patients suffering a specific rare disease and that can participate in a clinical trial is a significant challenge for CROs (Clinical Research Organizations). A similar portal may be developed to be used by the general public for rare disease prediction that uses lay medical terms and descriptions. Furthermore, the portal can be developed for common diseases without modification using the same principles for rare disease prediction, only with a larger data set of symptoms, diseases and weights,
[0024] FIG. 1 illustrates a block diagram of one embodiment of a system for predicting rare diseases. A physician or health care provider will access the system from a remote device 102 at the physician's location, allowing the physician to take advantage of the system from anywhere. The remote device 102 is connected with a prediction server 104, which is configured to receive a user selection of symptoms from a list of symptoms. In one embodiment, the doctor doesn't provide his or her own list of symptoms, but just clicks on the appropriate symptoms his/her patient presents with from a predetermined list of symptoms. In one embodiment, the list of patient symptoms may also include a preset weight (level of relevance) of each symptom to each disease. The higher the number of relevant symptoms for each disease, the higher a specificity is for those combined symptoms.
[0025] In one embodiment, a prediction database 106 connected with the prediction server 104 may be configured to store different types of information needed to process the selected patient symptoms and match the patient symptoms with one or more of the rare disease stored in the portal. The portal applies preset weights to each symptom of each rare disease to identify a list of candidate diseases, identifies disease profile information for the candidate diseases, and suggest treatments. The system is designed to avoid transmitting or storing any information identifying the patient in order to avoid any risks of violating patient privacy rights. Therefore, the user only selects symptoms from a list and follows the steps to obtain predicted outcomes without creating a patient profile or entering any personally identifiable information about the patient. Although only one database is illustrated herein, the system could altematively contain a plurality of databases to store various categories of information instead of a single database divided into normalized database tables. [0026] In this embodiment, a prediction database 106 is provided which stores lists of diseases and symptoms and their associations with each other. The symptoms may be associated with the diseases through various tables, such as a table of diseases with a list of symptoms known to exist for each disease. Another separate database or database table may contain synonyms for symptoms and diseases in order to more accurately match the selected symptoms and diseases. The prediction database 106 may store a list of preset weights that are applied to each symptom for each particular disease, although some symptoms may not have a weight. If a symptom lacks a weight, the applied weight value may be zero. The weights reflect the relevance of a particular symptom to a particular disease. The relevance may be determined by a number of different methods and based on a plurality of different information. For example, a symptom which is the primary symptom in a rare disease and which is also exclusive only to that rare disease may receive a weight value of 1 from a scale of 0 to 1, since the existence of that symptom is a definitive indication that the patient suffers from the associated rare disease.
[0027] The prediction database 106 may also be utilized to store complete descriptions and other information about each rare disease, such as detailed descriptions of the symptoms, how they are presented, the progression of the disease, diagnostic tests that can confirm its existence and specialists and facilities equipped to handle the disease. The prediction database 106 may store a list of treatments that are applicable to certain rare diseases, such as lists of orphan drugs. The prediction database 106 may also include information on the availability and approval of certain treatments in certain locations, the name and location of laboratories which can perform needed tests, the availability of clinical trials relating to the disease and information on the efficacy of the treatments, costs, etc. The system may therefore provide clinical trials with access to additional naive patients that would otherwise remain unidentified and undiagnosed. [0028] In one example, information on new symptoms of certain rare diseases may be incorporated into the prediction database 106, while information on the relevance of a particular symptom to a particular rare disease may be incorporated into the prediction database 106 to adjust the weight of that particular symptom with respect to that particular rare disease. All of the information can be added to the prediction database 106 to improve the information available on each disease, and updates on new treatments or the results of new treatments may be added to the database 106.
[0029] The prediction server may be configured with one or more processing elements to receive, analyze and display information relating to the symptoms and confidences for rare diseases. FIG. 2 illustrates a block diagram of the prediction server 104 with a receiving unit 116 configured to receive the list of selected patient symptoms from the input device 102. The received list of selected patient symptoms is then forwarded to a disease matching unit 118 which matches the list of selected patient symptoms with the symptoms and diseases in the prediction database 106 in order to produce a list of candidate diseases, as will be described further below. The list of candidate diseases is then forward to a confidence generating unit 120 which accesses the prediction database 108 to utilize the weights assigned to each symptom for each disease in order to determine a confidence for each candidate disease. Once the confidences are computed, the prediction server 104 may store the results in an attached database or forward the results back to the input device 102 for viewing by the physician. In another embodiment, the confidences are then forwarded to a treatment recommendation unit 122 which accesses the prediction database 106 to determine one or more treatments for the candidate diseases. The various types of treatments are described in further detail below. The treatments may also be forwarded back to the input device 102 for viewing by the physician, who can then prescribe a particular treatment, such as medication, surgery, genetic modification, or even a referral to a specialist in the field of the rare diseases or a medical facility or clinical trial which specializes in the rare disease. Methodology for Predicting Disease
[0030] The portal will provide for a consistent, accurate real-time prediction of the supported rare diseases by utilizing a symptoms weighting database of predetermined weights for each symptom of each disease. The diagnosis will rely on a curated relational database of symptoms and diseases. Rare diseases are then diagnosed by a unique combination of a set of weighted symptoms and varying levels of specificity.
[0031] Determining a relevance of each symptom to each candidate disease improves the confidence of the prediction. It is possible that multiple diseases are diagnosed by a given set of symptoms, and it is also plausible that a given disease is diagnosed by multiple sets of symptoms. Hence, the relationship between symptoms and diseases is that of many-to-many. In one embodiment, a set could contain one or more symptoms with varying levels of relevance, which could also be incorporated into the method of predicting disease. Some symptoms may need to be described in terms of the relevance of the symptom to a disease. The higher the relevance of a combination of specific symptoms for a disease, the higher the overall specificity, or confidence, of that disease. For example, the individual symptoms might not be specific to one rare disease, but the combination of all of them might. Each symptom within a set may therefore contribute disproportionately to the prediction of a disease. In one embodiment, the relevance of a particular symptom may be utilized as a pre- weighting factor for a symptom.
[0032] FIG. 3 illustrates one embodiment of a method for predicting the likelihood of rare diseases. In step 302, the physician or user first selects a set of patient symptoms at the portal, which is then received at the prediction server. The physician may be interacting with the portal via a web-interface or through an application running on the physician's input device, such as a desktop or portable computing device. In step 304, an initial matching process is performed by matching the list of selected patient symptoms with the list of symptoms stored in the associated diseases and symptoms database. Any disease with symptoms that match the list of patient symptoms is selected and then identified as a candidate disease. At decision step 306, if the list of patient symptoms is an exact match with a list of symptoms for only one disease ("Y" at step 306), the remaining process may skip the prediction algorithm (described below in step 308) and the identified disease may be immediately output as the predicted disease at step 310. The confidence value would effectively be 100%, as there would be no other candidate diseases with the same set of symptoms. If the list of candidate diseases does not produce an exact match ("N" at step 306), the process continues to the prediction algorithm in step 308.
[0033] In step 308, the candidate diseases identified based on the initial matching are input into a prediction algorithm. The prediction algorithm is applied to each candidate disease using the selected symptoms that are each weighted for each candidate disease (using preset weighted values obtained from the prediction database 106) to determine a confidence for each candidate disease that indicates a likelihood that the patient suffers from that particular disease. In one embodiment, the confidence represents a specificity, or sum of the individual weights of each symptom for each disease, where the more relevant symptoms present, the higher the specificity. In one embodiment, the predictive algorithm sums the total weights for all of the identified symptoms, raises them by a value corresponding to the number of symptoms of the candidate disease not identified in the list of patient symptoms, and then subtracts them by a constant. The predictive algorithm may therefore be understood by the following equation:
Figure imgf000013_0001
Where d is the confidence value, w; is the weight w for a symptom, i, N is the number of identified patient symptoms, and D is the number of symptoms not identified from a list of all symptoms for a candidate disease.
[0034] In one embodiment, the weights for each symptom of each disease are determined based on a relevance of a symptom to a particular disease. The relevance may be determined from a plurality of different knowledge sources and given a value based on a proprietary methodology indicative of the relevance of the symptom for a specific disease. This may include empirical observations and knowledge of a symptom's relationship with a particular disease. In an alternative embodiment, the weights for each symptom may be determined by a weighting algorithm which uses a plurality of factors to determine the weighted value. Details of the weighting algorithm will be provided further below.
[0035] In step 310, the predicted diseases and confidences are output to the physician.
The confidences may be output in order of the most likely disease to the least likely disease, and the number of candidate diseases displayed may be truncated to only provide the top five predictions or only the predictions over a certain threshold confidence value. The confidences are represented herein as numerical values indicating a percentage from 0 - 100 for ease of understanding. However, the confidence may be represented as another numerical value on a different scale, or even converted to a descriptive term indicating whether it is "likely," "very likely" or "unlikely" that the patient is suffering from the candidate disease.
[0036] Once the confidences and candidate diseases are output to the physician, in step 312, the portal may display a user interface with a disease profile for each candidate disease, as has been described above. The disease profile may provide detailed explanations of the disease, symptoms, timeline of onset and progression, etc. that is stored in the prediction database 106. The user interface may then display, in step 314, the disease profile information, including possible tests that a physician may prescribe to make a definitive diagnosis of the rare disease, possible treatments for the disease (such as a list of orphan drugs that are not typically known to the medical community), clinical trials available for potential new treatments and treatment centers or specialized medical centers that specialize in the diagnosis and treatment of a rare disease. In one embodiment a disease profile may be sponsored by one of the treatment providers, clinical trial sponsors, treatment locations or diagnostic test providers.
Weighting Algorithm
[0037] In one embodiment, a weighting algorithm may be used to determine a weight for each symptom of each disease. In this embodiment, the weighting algorithm is a function of five factors:
1. Mandatory Symptoms: The number mandatory symptoms in a set. In the absence of evidence to the contrary (in literature or curated databases), all mandatory symptoms will receive equal weightage by default.
2. Optional Symptoms: The number of optional symptoms. Presence of optional symptoms in addition to the presence of all mandatory symptoms will enhance the confidence assigned to overall predicted diagnosis.
3. Relevance of Symptoms: The relevance of a symptom pertains to its relationship to a particular disease. When a very common symptom like anemia occurs with a rare disease, we would certainly give this symptom a very low weight because of its low level of relevance to this specific disease. In contrast, an uncommon symptom like anhydrosis (inability to sweat) is very disease specific and highly relevant to a specific disease, and will therefore be weighted higher.
4. Combination of Symptoms: Combination (subset) of symptoms collectively associated with the disease. 5. Likelihood Ratio: Likelihood ratio represents the number of times a symptom was observed in cases divided by the total number of cases of a disease. For example, a certain symptom may or may not always be associated with a specific disease. This could be represented in terms of a ratio of the number of disease cases reporting this symptom (for e.g., in a database) to the total number of known cases with this disease. Since the numerator would be less than or equal to the denominator, the range of valid values for the likelihood ratio would be between 0.0 and 1.0.
[0038] In one embodiment, the weight for a particular symptom may be calculated by the following equation: w . =— * KD * L (2)
N
where w; is the weight, N is the total number of identified patient symptoms, KD is a constant between 0.1 to 0.9, and L, is a likelihood ratio representing the number of times a symptom i was observed in cases divided by the total number of cases of a disease. As previously described, a unique weighted value is assigned to each symptom of each disease, such that the same symptom may have different weights for different diseases since that symptom's importance to each disease may vary. Scoring and Selection Algorithm
[0039] In an alternative embodiment, a scoring and selection algorithm is used to identify candidate diseases, generate a score for the candidate diseases and determine whether the candidate diseases exceed a threshold. The score for a disease is a function of the disease in question and a set of symptoms: Score(D, {S}), where D = Disease and {S} = A set of symptoms. The score for a disease is the weighted count of the intersection between the mandatory symptoms for the disease and the provided symptoms, plus the weighted count of the intersection between the optional symptoms for the disease and the provided symptoms. Thus, the equation for determining the score of a diseases is as follows in Equation 3:
Score(D, {S}) = (Count({Sm(D)} Π {S})*Wm) + (Count({S0(D)} Π {S})*W0) (3), where:
{S} = A set of symptoms,
{Sm(D)} = Set of all mandatory symptoms for disease D,
(S0(D)} = Set of all optional symptoms for disease D,
Wm = Weighting factor for mandatory symptoms,
W0 = Weighting factor for optional symptoms,
In the current implementation, Wm = 5 and W0 = 1.
[0040] The visibility of a disease in the result set is a Boolean operation based on whether the score for the disease is greater than a pre-computed threshold value. In one embodiment, the visibility is determined by Equation 4:
Visibility(D,{S}) = If(Score(D, {S}) > Threshold) then true else false (4).
[0041] The visibility threshold (Threshold) is determined by a bi-state function which evaluates to a pre-set value, T, if any disease's score is greater than or equal to T for the set of all possible diseases, or evaluates to 0 if not. As such, the Threshold may be determined, in one embodiment, by Equation 5:
Threshold = If(Max(Score({D}, {S})) < T) then 0 else T (5), where:
{D} = Set of all diseases,
T = Threshold score for disease inclusion, and
Count = The number of items in a set.
In the current implementation, T = 20. [0042] The above diseases scoring and selection algorithm provides an additional step-wise process to determine candidate diseases.
Data Standards and Regulatory Compliance Requirements
[0043] Because the medical terminologies vary significantly within the practice (both within a country/region as well as globally), it is important to adhere to controlled vocabularies and data standards to enable consistent usage of naming conventions. A single symptom or disease may have multiple names. The following medical and clinical data standards will be adopted as appropriate:
[0044] MESH (Medical Subject Headings): www.nlm.nih.gov/mesh/
[0045] Disease Ontology: disease-ontology.org/
[0046] ICD: www.who.int/classifications/icd/en/
[0047] SNOMED: www.ihtsdo.org/snomed-ct/
[0048] OMIM: Online Mendelian Inheritance in Man (www.omim.org/)
[0049] Web-based software quality requirements that will be adhered to will include:
1) the portal will be developed using a strict Agile software development methodology at capability maturity model (CMM) Level 3 maturity; 2) 508 compliance: www.section508.gov/; and 3) W3C Web Accessibility Initiative to promote a high degree of usability for people with disabilities. The portal will comply with all applicable and evolving regulatory requirements set-forth by FDA, EMA and other international and national agencies.
Disease Profile Information
[0050] In one embodiment, the system includes a scroll down menu with a disease prediction list. Clicking on the name of a rare disease a multi-page overview appears with detailed information about the disease, symptoms, diagnosis and treatment(s). The system will include names of local, regional and international laboratories that do have testing experience for rare diseases; patient societies, advocacy groups and rare disease non-profit organization; rare disease experts and rare disease centers of excellence and much more.
[0051] Taking into account that not all countries have the necessary resources to perform the relevant tests, names of local, regional and international laboratories that do have testing experience for rare diseases will be included. This portal will include the names of national and international clinical rare disease experts and their centers of excellence. This portal will include information about local, regional and international patient societies and advocacy groups.
Treatments
[0052] In one embodiment, information about one or more treatments is included. Its up to the physicians to conduct further testing to confirm a diagnosis and determine how to treat his/her patient. In one embodiment, the method described herein includes treating the patient for one or more of the diseases based on the confidence values provided by the aforementioned method. FIG. 4 illustrates one embodiment of a method of predicting a rare disease and treating a patient for the rare disease, wherein the steps are similar to those described above, including: receiving a list of symptoms in step 402; matching the received symptoms with possible diseases to produce a list of candidate diseases in step 404; running the prediction algorithm in step 406 to apply the customized weights for each symptom of each disease to the candidate diseases, which produces disease confidences in step 408; and providing treatment recommendations in step 410 such as possible treatments, tests, clinical trials, locations and experts. Based on the treatment recommendations, the patient is then treated for one or more of the candidate diseases in step 412, for example by administering a treatment such as an orphan drug, performing a test to confirm a diagnosis of the disease, enrolling the patient in a clinical trial for the disease, or forwarding the patient to an expert in the disease or a medical facility which specializes in the disease. Patient Interface
[0053] The system may also provide an interface and content for patients and related parties separate from the physician interface. To increase the awareness about rare diseases in the general public, a similar portal would be available for use by patients and their families. This portal will describe the same 400 or more rare diseases, symptoms and treatment(s) but in lay terms only. Everything, including the query of multiple symptoms will be lay term based. This portal would be available in all major languages. With the knowledge obtained patient(s) can visit their physician and have him/her use the portal for further research.
[0054] The patient interface may also provide for interactions between patients suffering from similar rare disease conditions so they can discuss the treatments, symptoms and overall experiences.
Disease and Symptom Analytics
[0055] In one embodiment, the information received from the physicians regarding the symptoms may be utilized along with the predicted confidences as additional analytics helpful to further improve the accuracy of identifying rare diseases and their symptoms. As the system does not collect information that would identify a specific individual patient in order to protect privacy, the information collected would be limited to a selection of symptoms, specificities and predicted diseases associated with an anonymous patient. However, information on the location of the physician inputting the patient data could be used in order to track the locations of cases of rare disease. As previously described, the system may also be configured to continuously receive third party information from publications, papers, hospitals or other healthcare facilities, etc. that collect patient data. Computer-Implemented Embodiment [0056] FIG. 5 is a block diagram that illustrates an embodiment of a computer/server system 500 upon which an embodiment of the inventive methodology may be implemented. The system 500 includes a computer/server platform 501 including a processor 502 and memory 503 which operate to execute instructions, as known to one of skill in the art. The term "computer-readable storage medium" as used herein refers to any tangible medium, such as a disk or semiconductor memory, that participates in providing instructions to processor 502 for execution. Additionally, the computer platform 501 receives input from a plurality of input devices 504, such as a keyboard, mouse, touch device or verbal command. The computer platform 501 may additionally be connected to a removable storage device 505, such as a portable hard drive, optical media (CD or DVD), disk media or any other tangible medium from which a computer can read executable code, data or write results to. The computer platform may further be connected to network resources 506 which connect to the Internet or other components of a local public or private network. The network resources 506 may provide instructions and data to the computer platform from a remote location on a network 507. The connections to the network resources 506 may be via wireless protocols, such as the 802.11 standards, Bluetooth® or cellular protocols, or via physical transmission media, such as cables or fiber optics. The network resources may include storage devices for storing data and executable instructions at a location separate from the computer platform 501. The computer interacts with a display 508 to output data and other information to a user, as well as to request additional instructions and input from the user. The display 508 may therefore further act as an input device 504 for interacting with a user.
[0057] While various embodiments have been described above, it should be understood that they have been presented by way of example only, and not of limitation. The breadth and scope should not be limited by any of the above-described exemplary embodiments. Where this document refers to technologies that would be apparent or known to one of ordinary skill in the art, such technologies encompass those apparent or known to the skilled artisan now or at any time in the future. In addition, the described embodiments are not restricted to the illustrated example architectures or configurations, but the desired features can be implemented using a variety of alternative architectures and configurations. As will become apparent to one of ordinary skill in the art after reading this document, the illustrated embodiments and their various alternatives can be implemented without confinement to the illustrated example. One of ordinary skill in the art would also understand how alternative functional, logical or physical partitioning and configurations could be utilized to implement the desired features of the described embodiments.
[0058] Furthermore, although items, elements or components may be described or claimed in the singular, the plural is contemplated to be within the scope thereof unless limitation to the singular is explicitly stated. The presence of broadening words and phrases such as "one or more," "at least," "but not limited to" or other like phrases in some instances shall not be read to mean that the narrower case is intended or required in instances where such broadening phrases may be absent.

Claims

Claims claimed is:
1. A method of disease prediction, comprising the steps of: receiving one or more patient symptoms pertaining to a patient; comparing the one or more patient symptoms with known symptoms of diseases to identify one or more candidate diseases; applying a predetermined weighted value to each of the one or more patient symptoms for each candidate disease, wherein the weighted value is determined based on a relevance of each symptom to each candidate disease; generating a confidence for each candidate disease based on the one or more patient symptoms and applied weighted values; identifying at least one treatment for each candidate disease; and treating the patient with the at least one treatment.
2. The method of claim 1, wherein the confidence is generated by summing the individual weights of each symptom to determine a specificity of the symptoms for each candidate disease. .
3. The method of claim 1, further comprising identifying the one or more
candidate diseases by identifying a disease with at least one known symptom which matches at least one of the one or more patient symptoms.
4. The method of claim 1, further comprising displaying each candidate disease above a threshold confidence to a user.
5. The method of claim 4, further comprising identifying at least one treatment for at least one candidate disease above the threshold confidence.
6. The method of claim 1 , wherein the patient is treated for the candidate disease with the highest confidence value.
7. The method of claim 1 , wherein the at least one treatment is an orphan drug.
8. The method of claim 1 , further comprising performing at least one diagnostic test on the patient based on the confidence for each candidate disease to diagnose the at least one candidate disease.
9. The method of claim 1 , wherein the at least one treatment is one or more of: a clinical trial related to at least one candidate disease; a medical facility specializing in the treatment of the at least one candidate disease; and a medical expert specializing in the treatment of the at least one candidate disease.
10. A system for disease prediction, comprising:
a receiving unit which receives one or more patient symptoms pertaining to a patient;
a disease matching unit which compares the one or more patient symptoms with known symptoms of diseases to identify one or more candidate diseases;
a symptom weighting unit which applies a predetermined weighted value to each of the one or more patient symptoms for each candidate disease, wherein the weighted value is determined based on a relevance of each symptom to each candidate disease;
a confidence generating unit which generates a confidence for each candidate disease based on the one or more patient symptoms and applied weighted values; and a treatment recommendation unit which identifies at least one treatment for one or more of the candidate diseases.
11. The system of claim 10, wherein the confidence is generated by summing the individual weights of each symptom to determine a specificity of the symptoms for each candidate disease.
12. The system of claim 10, wherein the disease matching unit identifies the one or more candidate diseases by identifying a disease with at least one known symptom which matches at least one of the one or more patient.
13. The system of claim 10, further comprising a display unit which displays each candidate disease above a threshold confidence to a user.
14. The system of claim 10, wherein the at least one treatment is for the candidate disease with the highest confidence value.
15. The system of claim 10, wherein the at least one treatment is an orphan drug.
16. The system of claim 10, wherein the treatment recommendation unit performs at least one diagnostic test on the patient based on the confidence for each candidate disease to diagnose at least one candidate disease.
17. The system of claim 10, wherein the at least one treatment is one or more of: a clinical trial related to at least one candidate disease; a medical facility specializing in the treatment of the at least one candidate disease; and a medical expert specializing in the treatment of the at least one candidate disease.
PCT/US2015/064564 2014-08-27 2015-12-08 Systems and methods for rare disease prediction and treatment WO2016094450A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP15868521.4A EP3230944A4 (en) 2014-12-08 2015-12-08 Systems and methods for rare disease prediction and treatment

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
PCT/US2014/053018 WO2015031542A1 (en) 2013-08-27 2014-08-27 Systems and methods for rare disease prediction
US14/563,910 US20150112607A1 (en) 2014-12-08 2014-12-08 Systems and methods for rare disease prediction and treatment
US14/563,910 2014-12-08
USPCT/US2014/053018 2014-12-27

Publications (1)

Publication Number Publication Date
WO2016094450A1 true WO2016094450A1 (en) 2016-06-16

Family

ID=52826910

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2015/064564 WO2016094450A1 (en) 2014-08-27 2015-12-08 Systems and methods for rare disease prediction and treatment

Country Status (3)

Country Link
US (1) US20150112607A1 (en)
EP (1) EP3230944A4 (en)
WO (1) WO2016094450A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022241481A1 (en) * 2021-05-14 2022-11-17 Tmaccelerator Company, Llc Precision medicine systems and methods
WO2023217737A1 (en) 2022-05-11 2023-11-16 Symptoma Gmbh Health data enrichment for improved medical diagnostics

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190155993A1 (en) * 2017-11-20 2019-05-23 ThinkGenetic Inc. Method and System Supporting Disease Diagnosis
CN111326243B (en) * 2020-02-05 2023-04-07 讯飞医疗科技股份有限公司 Triage recommendation method and device, electronic equipment and storage medium
CA3166942A1 (en) * 2020-02-06 2021-08-12 Michael M. Segal Method and system for incorporating patient information
JP7452141B2 (en) * 2020-03-18 2024-03-19 富士通株式会社 Information presentation program, information presentation method, and information presentation device
US20220093252A1 (en) * 2020-09-23 2022-03-24 Sanofi Machine learning systems and methods to diagnose rare diseases
CN112948599A (en) * 2021-05-13 2021-06-11 北京富通东方科技有限公司 Disease diagnosis method based on path reasoning of medical knowledge graph
CN114334065B (en) * 2022-03-07 2022-06-28 阿里巴巴达摩院(杭州)科技有限公司 Medical record processing method, computer readable storage medium and computer device
CN116796046B (en) * 2023-08-29 2023-11-10 武汉大学人民医院(湖北省人民医院) Case retrieval method and device based on rare characteristics

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060135859A1 (en) * 2004-10-22 2006-06-22 Iliff Edwin C Matrix interface for medical diagnostic and treatment advice system and method
US20060173663A1 (en) * 2004-12-30 2006-08-03 Proventys, Inc. Methods, system, and computer program products for developing and using predictive models for predicting a plurality of medical outcomes, for evaluating intervention strategies, and for simultaneously validating biomarker causality
US20090192807A1 (en) * 2008-01-28 2009-07-30 Express Scripts, Inc. Identification of specialty drugs
US20130238359A1 (en) * 2012-03-07 2013-09-12 Jennifer L. Carter Bi-directional biomarker based clinical trial matcher
JP2014029644A (en) * 2012-07-31 2014-02-13 Panasonic Corp Similar case retrieval device and similar case retrieval method

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002073822A (en) * 2000-08-30 2002-03-12 Fujitsu Ltd Medical information system
US20040168951A1 (en) * 2003-02-27 2004-09-02 Mackie Robert W. Method and system for self administration of medications
US20140122109A1 (en) * 2012-10-29 2014-05-01 Consuli, Inc. Clinical diagnosis objects interaction

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060135859A1 (en) * 2004-10-22 2006-06-22 Iliff Edwin C Matrix interface for medical diagnostic and treatment advice system and method
US20060173663A1 (en) * 2004-12-30 2006-08-03 Proventys, Inc. Methods, system, and computer program products for developing and using predictive models for predicting a plurality of medical outcomes, for evaluating intervention strategies, and for simultaneously validating biomarker causality
US20090192807A1 (en) * 2008-01-28 2009-07-30 Express Scripts, Inc. Identification of specialty drugs
US20130238359A1 (en) * 2012-03-07 2013-09-12 Jennifer L. Carter Bi-directional biomarker based clinical trial matcher
JP2014029644A (en) * 2012-07-31 2014-02-13 Panasonic Corp Similar case retrieval device and similar case retrieval method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3230944A4 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022241481A1 (en) * 2021-05-14 2022-11-17 Tmaccelerator Company, Llc Precision medicine systems and methods
WO2023217737A1 (en) 2022-05-11 2023-11-16 Symptoma Gmbh Health data enrichment for improved medical diagnostics

Also Published As

Publication number Publication date
EP3230944A1 (en) 2017-10-18
US20150112607A1 (en) 2015-04-23
EP3230944A4 (en) 2018-06-27

Similar Documents

Publication Publication Date Title
Di Maio et al. The role of patient-reported outcome measures in the continuum of cancer clinical care: ESMO Clinical Practice Guideline
Hughto et al. Prevalence and co-occurrence of alcohol, nicotine, and other substance use disorder diagnoses among US transgender and cisgender adults
US20150112607A1 (en) Systems and methods for rare disease prediction and treatment
US11664097B2 (en) Healthcare information technology system for predicting or preventing readmissions
Banda et al. Finding missed cases of familial hypercholesterolemia in health systems using machine learning
Sadosky et al. Healthcare utilization and costs in diabetes relative to the clinical spectrum of painful diabetic peripheral neuropathy
Dossa et al. Sex differences in the pattern of patient referrals to male and female surgeons
Leong et al. The economic consequences of irritable bowel syndrome: a US employer perspective
Schilsky et al. Building a rapid learning health care system for oncology: the regulatory framework of CancerLinQ
Vaci et al. Natural language processing for structuring clinical text data on depression using UK-CRIS
US20210005318A1 (en) Artificial intelligence expert system
Okada et al. The standardization of uveitis nomenclature project: the future is here
US20140095201A1 (en) Leveraging Public Health Data for Prediction and Prevention of Adverse Events
US20100076786A1 (en) Computer System and Computer-Implemented Method for Providing Personalized Health Information for Multiple Patients and Caregivers
CN109427420B (en) Diagnostic validation tool
Lenert et al. A model to support shared decision making in electronic health records systems
US10586615B2 (en) Electronic health record quality enhancement
Idnay et al. A systematic review on natural language processing systems for eligibility prescreening in clinical research
Singh et al. Trends in hospitalizations for alcohol use disorder in the US from 1998 to 2016
Shek et al. Machine learning‐enabled multitrust audit of stroke comorbidities using natural language processing
Ruckdeschel et al. Unstructured data are superior to structured data for eliciting quantitative smoking history from the electronic health record
Li et al. Medical costs of substance use disorders in the US employer-sponsored insurance population
WO2015031542A1 (en) Systems and methods for rare disease prediction
Eggermont et al. Gender and age concordance between patient and GP: an observational study on associations with referral behaviour
Sun et al. Revealing the impact of social circumstances on the selection of cancer therapy through natural language processing of social work notes

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15868521

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

REEP Request for entry into the european phase

Ref document number: 2015868521

Country of ref document: EP