EP4176451A1 - Methods and systems for efficient sample pooling for diagnostic testing - Google Patents
Methods and systems for efficient sample pooling for diagnostic testingInfo
- Publication number
- EP4176451A1 EP4176451A1 EP21834522.1A EP21834522A EP4176451A1 EP 4176451 A1 EP4176451 A1 EP 4176451A1 EP 21834522 A EP21834522 A EP 21834522A EP 4176451 A1 EP4176451 A1 EP 4176451A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- subjects
- disease
- disorder
- pools
- absence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 187
- 238000002405 diagnostic procedure Methods 0.000 title claims description 68
- 238000011176 pooling Methods 0.000 title abstract description 70
- 238000004422 calculation algorithm Methods 0.000 claims abstract description 44
- 230000036541 health Effects 0.000 claims abstract description 37
- 238000012545 processing Methods 0.000 claims abstract description 16
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 294
- 201000010099 disease Diseases 0.000 claims description 152
- 239000000523 sample Substances 0.000 claims description 147
- 208000035475 disorder Diseases 0.000 claims description 142
- 150000007523 nucleic acids Chemical class 0.000 claims description 68
- 102000039446 nucleic acids Human genes 0.000 claims description 62
- 108020004707 nucleic acids Proteins 0.000 claims description 62
- 238000012360 testing method Methods 0.000 claims description 41
- 208000025721 COVID-19 Diseases 0.000 claims description 31
- 210000004369 blood Anatomy 0.000 claims description 21
- 239000008280 blood Substances 0.000 claims description 21
- 238000010801 machine learning Methods 0.000 claims description 17
- 230000035945 sensitivity Effects 0.000 claims description 16
- 208000024891 symptom Diseases 0.000 claims description 16
- 230000003321 amplification Effects 0.000 claims description 14
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 14
- 241000725303 Human immunodeficiency virus Species 0.000 claims description 12
- 210000001175 cerebrospinal fluid Anatomy 0.000 claims description 10
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 claims description 8
- 230000036760 body temperature Effects 0.000 claims description 8
- 238000004458 analytical method Methods 0.000 claims description 7
- 239000012530 fluid Substances 0.000 claims description 7
- 201000004792 malaria Diseases 0.000 claims description 7
- 238000007637 random forest analysis Methods 0.000 claims description 7
- 230000036387 respiratory rate Effects 0.000 claims description 7
- 238000005259 measurement Methods 0.000 claims description 6
- 230000008569 process Effects 0.000 claims description 6
- 206010036790 Productive cough Diseases 0.000 claims description 5
- 210000004381 amniotic fluid Anatomy 0.000 claims description 5
- 210000003567 ascitic fluid Anatomy 0.000 claims description 5
- 210000004027 cell Anatomy 0.000 claims description 5
- 230000029142 excretion Effects 0.000 claims description 5
- 210000004880 lymph fluid Anatomy 0.000 claims description 5
- 210000003097 mucus Anatomy 0.000 claims description 5
- 210000002381 plasma Anatomy 0.000 claims description 5
- 210000004910 pleural fluid Anatomy 0.000 claims description 5
- 210000003296 saliva Anatomy 0.000 claims description 5
- 210000000582 semen Anatomy 0.000 claims description 5
- 210000002966 serum Anatomy 0.000 claims description 5
- 210000003802 sputum Anatomy 0.000 claims description 5
- 208000024794 sputum Diseases 0.000 claims description 5
- 210000001138 tear Anatomy 0.000 claims description 5
- 210000001519 tissue Anatomy 0.000 claims description 5
- 210000002700 urine Anatomy 0.000 claims description 5
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 4
- 230000003044 adaptive effect Effects 0.000 claims description 4
- 238000013528 artificial neural network Methods 0.000 claims description 4
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 claims description 4
- 230000036772 blood pressure Effects 0.000 claims description 4
- 229910002092 carbon dioxide Inorganic materials 0.000 claims description 4
- 239000001569 carbon dioxide Substances 0.000 claims description 4
- 238000003745 diagnosis Methods 0.000 claims description 4
- 229940079593 drug Drugs 0.000 claims description 4
- 239000003814 drug Substances 0.000 claims description 4
- 238000002565 electrocardiography Methods 0.000 claims description 4
- 238000000537 electroencephalography Methods 0.000 claims description 4
- 238000002567 electromyography Methods 0.000 claims description 4
- 210000001508 eye Anatomy 0.000 claims description 4
- 239000007789 gas Substances 0.000 claims description 4
- 239000008103 glucose Substances 0.000 claims description 4
- 239000005556 hormone Substances 0.000 claims description 4
- 229940088597 hormone Drugs 0.000 claims description 4
- 238000012417 linear regression Methods 0.000 claims description 4
- 238000007477 logistic regression Methods 0.000 claims description 4
- 238000002483 medication Methods 0.000 claims description 4
- 230000000926 neurological effect Effects 0.000 claims description 4
- 229910052760 oxygen Inorganic materials 0.000 claims description 4
- 239000001301 oxygen Substances 0.000 claims description 4
- 238000004393 prognosis Methods 0.000 claims description 4
- 230000000241 respiratory effect Effects 0.000 claims description 4
- 231100000430 skin reaction Toxicity 0.000 claims description 4
- 230000000391 smoking effect Effects 0.000 claims description 4
- 238000012706 support-vector machine Methods 0.000 claims description 4
- 210000004243 sweat Anatomy 0.000 claims description 4
- 238000012163 sequencing technique Methods 0.000 description 25
- 230000015654 memory Effects 0.000 description 20
- 238000003860 storage Methods 0.000 description 18
- 125000003729 nucleotide group Chemical group 0.000 description 14
- 239000002773 nucleotide Substances 0.000 description 13
- 238000004891 communication Methods 0.000 description 9
- 102000040430 polynucleotide Human genes 0.000 description 9
- 108091033319 polynucleotide Proteins 0.000 description 9
- 239000002157 polynucleotide Substances 0.000 description 9
- 108020004414 DNA Proteins 0.000 description 8
- 102000053602 DNA Human genes 0.000 description 8
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 7
- 241000282414 Homo sapiens Species 0.000 description 6
- 206010028980 Neoplasm Diseases 0.000 description 6
- 208000035473 Communicable disease Diseases 0.000 description 5
- 230000003287 optical effect Effects 0.000 description 5
- 229920002477 rna polymer Polymers 0.000 description 5
- 208000010470 Ageusia Diseases 0.000 description 4
- 206010011224 Cough Diseases 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 206010037660 Pyrexia Diseases 0.000 description 4
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 4
- 235000019666 ageusia Nutrition 0.000 description 4
- 239000012472 biological sample Substances 0.000 description 4
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 4
- 238000001712 DNA sequencing Methods 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 3
- 238000013500 data storage Methods 0.000 description 3
- 210000004602 germ cell Anatomy 0.000 description 3
- 208000015181 infectious disease Diseases 0.000 description 3
- 230000000116 mitigating effect Effects 0.000 description 3
- 239000002777 nucleoside Substances 0.000 description 3
- 238000003757 reverse transcription PCR Methods 0.000 description 3
- 229940113082 thymine Drugs 0.000 description 3
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- 241001678559 COVID-19 virus Species 0.000 description 2
- PCDQPRRSZKQHHS-XVFCMESISA-N CTP Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 PCDQPRRSZKQHHS-XVFCMESISA-N 0.000 description 2
- 206010061818 Disease progression Diseases 0.000 description 2
- 206010021143 Hypoxia Diseases 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- 229920000388 Polyphosphate Polymers 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 2
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 2
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 2
- 210000001124 body fluid Anatomy 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 108091092259 cell-free RNA Proteins 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 229940104302 cytosine Drugs 0.000 description 2
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 2
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 2
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 2
- 239000005549 deoxyribonucleoside Substances 0.000 description 2
- 239000005547 deoxyribonucleotide Substances 0.000 description 2
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 2
- 230000005750 disease progression Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 238000012165 high-throughput sequencing Methods 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 208000018875 hypoxemia Diseases 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 238000007481 next generation sequencing Methods 0.000 description 2
- 150000003833 nucleoside derivatives Chemical class 0.000 description 2
- 230000002093 peripheral effect Effects 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- 239000001205 polyphosphate Substances 0.000 description 2
- 235000011176 polyphosphates Nutrition 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 108090000623 proteins and genes Proteins 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 239000004065 semiconductor Substances 0.000 description 2
- 238000012549 training Methods 0.000 description 2
- 229940035893 uracil Drugs 0.000 description 2
- 238000012070 whole genome sequencing analysis Methods 0.000 description 2
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 206010002653 Anosmia Diseases 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 208000003174 Brain Neoplasms Diseases 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 206010008342 Cervix carcinoma Diseases 0.000 description 1
- 206010009944 Colon cancer Diseases 0.000 description 1
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 1
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 208000000059 Dyspnea Diseases 0.000 description 1
- 206010013975 Dyspnoeas Diseases 0.000 description 1
- 206010014733 Endometrial cancer Diseases 0.000 description 1
- 206010014759 Endometrial neoplasm Diseases 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 208000000461 Esophageal Neoplasms Diseases 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 241000941423 Grom virus Species 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- 206010025323 Lymphomas Diseases 0.000 description 1
- 206010061309 Neoplasm progression Diseases 0.000 description 1
- 206010030155 Oesophageal carcinoma Diseases 0.000 description 1
- 206010068319 Oropharyngeal pain Diseases 0.000 description 1
- 206010033128 Ovarian cancer Diseases 0.000 description 1
- 206010061535 Ovarian neoplasm Diseases 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 241000282577 Pan troglodytes Species 0.000 description 1
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 1
- 241000009328 Perro Species 0.000 description 1
- 201000007100 Pharyngitis Diseases 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 208000000453 Skin Neoplasms Diseases 0.000 description 1
- 208000005718 Stomach Neoplasms Diseases 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- PGAVKCOVUIYSFO-XVFCMESISA-N UTP Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 PGAVKCOVUIYSFO-XVFCMESISA-N 0.000 description 1
- 208000006593 Urologic Neoplasms Diseases 0.000 description 1
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 1
- 206010047700 Vomiting Diseases 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 150000001413 amino acids Chemical class 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 201000010881 cervical cancer Diseases 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000008094 contradictory effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 201000004101 esophageal cancer Diseases 0.000 description 1
- 239000003925 fat Substances 0.000 description 1
- 230000001605 fetal effect Effects 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000007672 fourth generation sequencing Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 206010017758 gastric cancer Diseases 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 206010022000 influenza Diseases 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 208000032839 leukemia Diseases 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 201000007270 liver cancer Diseases 0.000 description 1
- 208000014018 liver neoplasm Diseases 0.000 description 1
- 201000005202 lung cancer Diseases 0.000 description 1
- 208000020816 lung neoplasm Diseases 0.000 description 1
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 201000009240 nasopharyngitis Diseases 0.000 description 1
- -1 nucleoside monophosphate Chemical class 0.000 description 1
- 201000002528 pancreatic cancer Diseases 0.000 description 1
- 208000008443 pancreatic carcinoma Diseases 0.000 description 1
- 244000045947 parasite Species 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 102000004169 proteins and genes Human genes 0.000 description 1
- 238000012175 pyrosequencing Methods 0.000 description 1
- 125000000548 ribosyl group Chemical group C1([C@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 238000007480 sanger sequencing Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 238000007841 sequencing by ligation Methods 0.000 description 1
- 208000013220 shortness of breath Diseases 0.000 description 1
- 201000000849 skin cancer Diseases 0.000 description 1
- 239000010454 slate Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 201000011549 stomach cancer Diseases 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000003867 tiredness Effects 0.000 description 1
- 208000016255 tiredness Diseases 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 1
- 230000005751 tumor progression Effects 0.000 description 1
- 229950010342 uridine triphosphate Drugs 0.000 description 1
- PGAVKCOVUIYSFO-UHFFFAOYSA-N uridine-triphosphate Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)OC1N1C(=O)NC(=O)C=C1 PGAVKCOVUIYSFO-UHFFFAOYSA-N 0.000 description 1
- 230000008673 vomiting Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/80—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for detecting, monitoring or modelling epidemics or pandemics, e.g. flu
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H10/00—ICT specially adapted for the handling or processing of patient-related medical or healthcare data
- G16H10/60—ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
- G06N20/10—Machine learning using kernel methods, e.g. support vector machines [SVM]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
- G06N20/20—Ensemble learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/01—Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Definitions
- Bodily samples, health data, contact tracing data, location data, and/or movement data may be collected from a plurality of subjects (e.g., patients), and trained computer algorithms may be used to efficiently perform or direct the pooling of the bodily samples into a plurality of sample pools for diagnostic testing.
- Such efficient sample pooling can be used to perform frequent and widespread diagnostic testing of an infectious disease across a population, which may be essential for containment and mitigation, especially in cases of pandemic outbreaks (e.g., COVID-19).
- the present disclosure provides a method comprising: (a) obtaining a plurality of: health data, contact tracing data, location data, movement data, or any combination thereof associated with a plurality of subjects; and (b) processing the plurality of: health data, contact tracing data, location data, movement data, or any combination thereof with a trained computer algorithm to at least some individual subjects of the plurality of subjects to a pool from among a plurality of pools, wherein a number of pools of the plurality of pools is less than a number of subjects of the plurality of subjects.
- the method further comprises outputting an electronic recommendation to create, for each of at least two given pools of the plurality of pools, a pooled sample by combining bodily samples or portions thereof obtained from subjects in the at least two given pools.
- the method further comprises creating, for each of at least two given pools of the plurality of pools, a pooled sample by combining bodily samples or portions thereof obtained from subjects in the at least two given pools.
- the method further comprises obtaining the bodily samples or portions thereof from the plurality of subjects.
- the bodily samples are individually selected from the group consisting of: nasopharyngeal swab, oropharyngeal swab, blood, serum, plasma, vitreous, sputum, urine, stool, tears, perspiration, saliva, semen, mucosal excretions, mucus, spinal fluid, cerebrospinal fluid (CSF), pleural fluid, peritoneal fluid, amniotic fluid, lymph fluid, eye swab, cheek swab, vaginal swab, cervical swab, rectal swab, cells, and tissue.
- CSF cerebrospinal fluid
- the method further comprises isolating nucleic acids from the bodily samples, and creating, for a given pool of the plurality of pools, the pooled sample by combining at least some of nucleic acids isolated from bodily samples obtained from the subjects in the given pool.
- the method further comprises enriching the nucleic acids for a plurality of genomic regions.
- the method further comprises amplifying at least some of the nucleic acids.
- the amplification comprises selective amplification.
- the amplification comprises universal amplification.
- enriching at least some of the nucleic acids for the plurality of genomic regions comprises contacting the nucleic acids with a plurality of probes, each of the plurality of probes having sequence complementarity with at least a portion of a genomic region of the plurality of genomic regions.
- the plurality of genomic regions comprises genomic regions associated with a disease or disorder.
- the disease or disorder comprises coronavirus disease 2019 (COVID-19), human immunodeficiency virus (HIV), or malaria.
- the disease or disorder comprises COVID-19.
- the method further comprises performing a plurality of diagnostic tests on the plurality of pooled samples to obtain a plurality of diagnostic results associated with the plurality of pooled samples.
- the plurality of diagnostic tests are configured to detect a presence or absence of a disease or disorder based on analyzing at least the plurality of pooled samples.
- the disease or disorder comprises coronavirus disease 2019 (COVID-19), human immunodeficiency virus (HIV), or malaria.
- the disease or disorder comprises COVID-19.
- the method further comprises, for a given pool among the plurality of pools, detecting the absence of the disease or disorder in each of the individual subjects of the given pool when the absence of the disease or disorder is detected based on analyzing the pooled sample corresponding to the given pool. In some embodiments, the method further comprises, for a given pool among the plurality of pools, testing each of the individual subjects of the given pool for the disease or disorder when the presence of the disease or disorder is detected based on analyzing the pooled sample corresponding to the given pool.
- the method further comprises, for a given pool among the plurality of pools, testing each of a plurality of sub-pools of the given pool for the disease or disorder when the presence of the disease or disorder is detected based on analyzing the pooled sample corresponding to the given pool.
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a clinical sensitivity of at least about 50%. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a clinical sensitivity of at least about 70%. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a clinical sensitivity of at least about 90%.
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a clinical sensitivity of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or 100%.
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a clinical specificity of at least about 50%. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a clinical specificity of at least about 70%. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a clinical specificity of at least about 90%.
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a clinical specificity of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or 100%.
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a positive predictive value (PPV) of at least about 50%. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a positive predictive value (PPV) of at least about 70%. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a positive predictive value (PPV) of at least about 90%.
- PPV positive predictive value
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a positive predictive value (PPV) of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or 100%.
- PSV positive predictive value
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a negative predictive value (NPV) of at least about 50%. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a negative predictive value (NPV) of at least about 70%. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a negative predictive value (NPV) of at least about 90%.
- NPV negative predictive value
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a negative predictive value (NPV) of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or 100%.
- NPV negative predictive value
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with an area under the curve (AUC) of at least about 0.60. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with an area under the curve (AUC) of at least about 0.70. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with an area under the curve (AUC) of at least about 0.80. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with an area under the curve (AUC) of at least about 0.90.
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with an area under the curve (AUC) of at least about 0.50, at least about 0.55, at least about 0.60, at least about 0.65, at least about 0.70, at least about 0.75, at least about 0.80, at least about 0.85, at least about 0.90, at least about 0.91, at least about 0.92, at least about 0.93, at least about 0.94, at least about 0.95, at least about 0.96, at least about 0.97, at least about 0.98, or at least about 0.99.
- AUC area under the curve
- the plurality of health data, contact tracing data, location data, movement data, or any combination thereof associated with the plurality of subjects comprises de-identified data.
- the plurality of health data comprises a diagnosis of a disease or disorder, a prognosis of a disease or disorder, a risk of having a disease or disorder, a treatment history of a disease or disorder, a history of previous treatment for a disease or disorder, a history of prescribed medications, a history of prescribed medical devices, age, height, weight, sex, smoking status, one or more symptoms, and one or more vital signs.
- the one or more vital signs comprise one or more of: heart rate, heart rate variability, blood pressure, respiratory rate, blood oxygen concentration (SpO?), carbon dioxide concentration in respiratory gases, a hormone level, sweat analysis, blood glucose, body temperature, impedance, conductivity, capacitance, resistivity, electromyography, galvanic skin response, neurological signals, electroencephalography, electrocardiography, immunology markers, and other physiological measurements.
- the trained computer algorithm comprises a trained machine learning classifier.
- the trained machine learning classifier comprises an algorithm selected from the group consisting of: a support vector machine, a neural network, a random forest, a linear regression, a logistic regression, a Bayesian classifier, a boosted classifier, a gradient boosting algorithm, an adaptive boosting (AdaBoost) algorithm, and an extreme gradient boosting (XGBoost) algorithm.
- the method further comprises processing health data, contact tracing data, location data, or movement data of the individual subject with the trained computer algorithm to determine an expected prevalence of a disease or disorder, and assigning the individual subject of the plurality of subjects to the pool from among the plurality of pools based at least in part on the determined expected prevalence of the disease or disorder.
- the method further comprises assigning the individual subject of the plurality of subjects to the pool from among the plurality of pools when the determined expected prevalence of the disease or disorder is less than a pre-determined prevalence threshold.
- the pre-determined prevalence threshold is about 5%, about 10%, about 15%, about 20%, about 25%, about 30%, about 35%, about 40%, about 45%, or about 50%.
- the method further comprises determining a maximum pool size based on the determined expected prevalence of the disease or disorder, and assigning the individual subject of the plurality of subjects to the pool from among the plurality of pools based on the maximum pool size. In some embodiments, the method further comprises determining an expected level of severity of symptoms of the plurality of subjects. In some embodiments, the expected level of severity of symptoms comprises a moderate level of symptoms, an intermediate level of symptoms, or a severe level of symptoms.
- the number of pools of the plurality of pools is reduced relative to the number of subjects of the plurality of subjects by at least 50%. In some embodiments, the number of pools of the plurality of pools is reduced relative to the number of subjects of the plurality of subjects by at least 100%. In some embodiments, the number of pools of the plurality of pools is reduced relative to the number of subjects of the plurality of subjects by at least 200%. In some embodiments, the number of pools of the plurality of pools is reduced relative to the number of subjects of the plurality of subjects by at least 300%.
- the number of pools of the plurality of pools is reduced relative to the number of subjects of the plurality of subjects by at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70, at least about 80%, at least about 90%, at least about 100%, at least about 110%, at least about 120%, at least about 130%, at least about 140%, at least about 150%, at least about 160%, at least about 170%, at least about 180%, at least about 190%, at least about 200%, at least about 210%, at least about 220%, at least about 230%, at least about 240%, at least about 250%, at least about 260%, at least about 270%, at least about 280%, at least about 290%, at least about 300%, at least about 310%, at least about 320%, at least about 330%, at least about 340%, at least about 350%, at least about 360%, at least about 370%, at least about 380%, at least about 390%,
- the method further comprises, based on the detected presence or absence of the disease or disorder in the plurality of subjects, administering a therapeutically effective dose of a treatment to treat the disease or disorder detected in at least a subset of the plurality of subjects.
- the present disclosure provides a computer system, comprising: a database that is configured to store a plurality of: health data, contact tracing data, location data, movement data, or any combination thereof associated with a plurality of subjects; and one or more computer processors operatively coupled to the database, wherein the one or more computer processors are individually or collectively programmed to: process the plurality of: health data, contact tracing data, location data, movement data, or any combination thereof with a trained computer algorithm to assign at least some individual subjects of the plurality of subjects to a pool from among a plurality of pools, wherein a number of pools of the plurality of pools is less than a number of subjects of the plurality of subjects.
- Another aspect of the present disclosure provides a non-transitory computer- readable medium comprising machine-executable code that, upon execution by one or more computer processors, implements any of the methods above or elsewhere herein.
- Another aspect of the present disclosure provides a system comprising one or more computer processors and computer memory coupled thereto.
- the computer memory comprises machine-executable code that, upon execution by the one or more computer processors, implements any of the methods above or elsewhere herein.
- FIG. 1A shows an example method for performing sample pooling of a plurality of clinical samples for diagnostic testing.
- FIG. IB shows a general workflow for performing sample pooling of a plurality of clinical samples for diagnostic testing, including data input, consuming and consolidating data, performing a method for sample pooling, recommending pooling protocols, and performing diagnostic testing on the sample pools.
- FIG. 1C shows a workflow for performing sample handling and testing.
- FIG. ID shows an example of a system procedure diagram of methods and systems of the present disclosure.
- FIG. IE shows an example of a user activity diagram of methods and systems of the present disclosure.
- FIG. 2A shows an example of sample pooling of a plurality of clinical samples for diagnostic testing. If a sample pool comprising a plurality of individual clinical samples receives a negative clinical test outcome, then all the individual clinical samples in the sample pool must also have a negative clinical test outcome. Conversely, if the sample pool comprising the plurality of individual clinical samples receives a positive clinical test outcome, then either the individual clinical samples can be individually tested, or the sample pool can be further sub-divided into smaller subsets and the sample pooling process can be repeated.
- FIG. 2B shows an example of how methods and systems of the present disclosure advantageously use sample pooling. For example, currently about 2 out of 10 test results return positive; patterns can be learned via machine learning about the 8 out of 10 negative patients. Using methods and systems of the present disclosure, predictive analytics and machine learning may be applied to identify patients with a lower prevalence rate. A plurality of predicted negative samples may be pooled together to create sample pool, and a single diagnostic test may be performed on the sample pool.
- FIG. 3 shows a computer system that is programmed or otherwise configured to implement methods provided herein.
- FIG. 4A shows the number of people who were able to be tested with 1,000 diagnostic tests, using Boston vs. Cambridge cohort information (left 3 columns), clinical symptom information (middle 3 columns), and contact tracing information (right 3 columns). Within each set of 3 columns, the number of people who were able to be tested with 1,000 diagnostic tests is indicated using no sample pooling (left), simple sample pooling (middle), and intelligent sample pooling based on methods and systems of the present disclosure (right).
- FIG. 4B shows that using intelligent sample pooling based on methods and systems of the present disclosure, the rules for diagnostic testing can be redefined.
- FIG. 5A shows the percentage of test savings that is achieved using simple sample pooling (gray), using intelligent sample pooling based on methods and systems of the present disclosure (purple), and an upper bound achieved using intelligent sample pooling (dashed).
- FIG. 5B shows the percentage increase in diagnostic testing capacity that is achieved using simple sample pooling (gray), using intelligent sample pooling based on methods and systems of the present disclosure (purple), and an upper bound achieved using intelligent sample pooling (dashed).
- FIG. 6 shows a relative number of diagnostic tests that is required versus the sample pool size (ranging from 1 sample to 20 samples per pool), at a prevalence rate (PR) of 1% (light blue), 5% (orange), 10% (gray), 20% (yellow), and 30% (dark blue).
- PR prevalence rate
- FIG. 7 shows that using methods and systems of the present disclosure, an optimal pool size of 4 samples per pool was selected, and a 40% reduction in diagnostic test kit utilization was achieved, along with a clinical sensitivity of 80% and a clinical specificity of 96%.
- nucleic acid generally refers to a molecule comprising one or more nucleic acid subunits, or nucleotides.
- a nucleic acid may include one or more nucleotides selected from adenosine (A), cytosine (C), guanine (G), thymine (T) and uracil (U), or variants thereof.
- a nucleotide generally includes a nucleoside and at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more phosphate (P03) groups.
- a nucleotide can include a nucleobase, a five-carbon sugar (either ribose or deoxyribose), and one or more phosphate groups, individually or in combination.
- Ribonucleotides are nucleotides in which the sugar is ribose.
- Deoxyribonucleotides are nucleotides in which the sugar is deoxyribose.
- a nucleotide can be a nucleoside monophosphate or a nucleoside polyphosphate.
- a nucleotide can be a deoxyribonucleoside polyphosphate, such as, e.g., a deoxyribonucleoside triphosphate (dNTP), which can be selected from deoxyadenosine triphosphate (dATP), deoxycytosine triphosphate (dCTP), deoxyguanosine triphosphate (dGTP), uridine triphosphate (dUTP) and deoxythymidine triphosphate (dTTP) dNTPs, that include detectable tags, such as luminescent tags or markers (e.g., fluorophores).
- dNTP deoxyribonucleoside polyphosphate
- dNTP deoxyribonucleoside triphosphate
- dNTP deoxyribonucleoside triphosphate
- dNTP deoxyribonucleoside triphosphate
- dNTP deoxyribonucleoside triphosphate
- dNTP deoxyribonucleoside triphosphate
- Such subunit can be an A, C, G, T, or U, or any other subunit that is specific to one or more complementary A, C, G, T or U, or complementary to a purine (i.e., A or G, or variant thereof) or a pyrimidine (i.e., C, T or U, or variant thereof).
- a nucleic acid is deoxyribonucleic acid (DNA), ribonucleic acid (RNA), or derivatives or variants thereof.
- a nucleic acid may be single- stranded or double stranded.
- a nucleic acid molecule may be linear, curved, or circular or any combination thereof.
- nucleic acid molecule generally refer to a polynucleotide that may have various lengths, such as either deoxyribonucleotides or ribonucleotides (RNA), or analogs thereof.
- a nucleic acid molecule can have a length of at least about 5 bases, 10 bases, 20 bases, 30 bases, 40 bases, 50 bases, 60 bases, 70 bases, 80 bases, 90, 100 bases, 110 bases, 120 bases, 130 bases, 140 bases, 150 bases, 160 bases, 170 bases, 180 bases, 190 bases, 200 bases, 300 bases, 400 bases, 500 bases, 1 kilobase (kb), 2 kb, 3, kb, 4 kb, 5 kb, 10 kb, or 50 kb or it may have any number of bases between any two of the aforementioned values.
- oligonucleotide is typically composed of a specific sequence of four nucleotide bases: adenine (A); cytosine (C); guanine (G); and thymine (T) (uracil (U) for thymine (T) when the polynucleotide is RNA).
- A adenine
- C cytosine
- G guanine
- T thymine
- U uracil
- T thymine
- the terms “nucleic acid molecule,” “nucleic acid sequence,” “nucleic acid fragment,” “oligonucleotide” and “polynucleotide” are at least in part intended to be the alphabetical representation of a polynucleotide molecule. Alternatively, the terms may be applied to the polynucleotide molecule itself.
- Oligonucleotides may include one or more nonstandard nucleotide(s), nucleotide analog(s) and/or modified nucleotides.
- sample generally refers to a biological sample.
- biological samples include nucleic acid molecules, amino acids, polypeptides, proteins, carbohydrates, fats, or viruses.
- a biological sample is a nucleic acid sample including one or more nucleic acid molecules.
- the nucleic acid molecules may be cell-free or cell-free nucleic acid molecules, such as cell-free DNA (cfDNA) or cell-free RNA (cfRNA).
- the nucleic acid molecules may be derived from a variety of sources including human, mammal, non-human mammal, ape, monkey, chimpanzee, reptilian, amphibian, or avian, sources.
- samples may be extracted from a variety of animal fluids containing cell-free sequences, including but not limited to bodily fluid samples such as blood, serum, plasma, vitreous, sputum, urine, tears, perspiration, saliva, semen, mucosal excretions, mucus, spinal fluid, cerebrospinal fluid (CSF), pleural fluid, peritoneal fluid, amniotic fluid, lymph fluid, and the like.
- bodily fluid samples such as blood, serum, plasma, vitreous, sputum, urine, tears, perspiration, saliva, semen, mucosal excretions, mucus, spinal fluid, cerebrospinal fluid (CSF), pleural fluid, peritoneal fluid, amniotic fluid, lymph fluid, and the like.
- Cell free polynucleotides e.g., cfDNA
- subject generally refers to an individual having a biological sample that is undergoing processing or analysis.
- a subject can be an animal or plant.
- the subject can be a mammal, such as a human, dog, cat, horse, pig or rodent.
- the subject can be a patient, e.g., have or be suspected of having a disease, such as one or more cancers (e.g., brain cancer, breast cancer, cervical cancer, colorectal cancer, endometrial cancer, esophageal cancer, gastric cancer, hepatobiliary tract cancer, leukemia, liver cancer, lung cancer, lymphoma, ovarian cancer, pancreatic cancer, skin cancer, urinary tract cancer), one or more infectious diseases, one or more genetic disorder, or one or more tumors, or any combination thereof.
- the tumors may be of one or more types.
- the term “whole blood,” as used herein, generally refers to a blood sample that has not been separated into sub-components (e.g., by centrifugation).
- the whole blood of a blood sample may contain cfDNA and/or germline DNA.
- Whole blood DNA (which may contain cfDNA and/or germline DNA) may be extracted from a blood sample.
- Whole blood DNA sequencing reads (which may contain cfDNA sequencing reads and/or germline DNA sequencing reads) may be extracted from whole blood DNA.
- Diagnostic testing of subjects may be limited or scarce. In some cases, diagnostic testing may be saved for symptomatic subjects or high-risk subjects; this can be less than ideal because asymptomatic subjects continue to be contagious and spread infectious diseases such as viruses (e.g., COVID-19 or HIV), bacteria, or parasites (e.g., malaria). Therefore, frequent and widespread diagnostic testing of an infectious disease across a population may be essential for containment and mitigation, especially in cases of pandemic outbreaks.
- viruses e.g., COVID-19 or HIV
- bacteria e.g., malaria
- Methods and systems are provided for pooling a plurality of bodily samples. Bodily samples, health data, contact tracing data, location data, and/or movement data may be collected from a plurality of subjects (e.g., patients), and trained computer algorithms may be used to efficiently pool the bodily samples into a plurality of sample pools for diagnostic testing. Such efficient sample pooling can be used to perform frequent and widespread diagnostic testing of an infectious disease across a population, which may be essential for
- HIV HIV, or malaria.
- the present disclosure provides a method comprising: (a) obtaining a plurality of: health data, contact tracing data, location data, movement data, or any combination thereof associated with a plurality of subjects; and (b) processing the plurality of: health data, contact tracing data, location data, movement data, or any combination thereof with a trained computer algorithm to at least some individual subjects of the plurality of subjects to a pool from among a plurality of pools, wherein a number of pools of the plurality of pools is less than a number of subjects of the plurality of subjects.
- the method further comprises outputting an electronic recommendation to create, for each of at least two given pools of the plurality of pools, a pooled sample by combining bodily samples or portions thereof obtained from subjects in the at least two given pools.
- the method further comprises creating, for each of at least two given pools of the plurality of pools, a pooled sample by combining bodily samples or portions thereof obtained from subjects in the at least two given pools.
- the method further comprises obtaining the bodily samples or portions thereof from the plurality of subjects.
- the bodily samples are individually selected from the group consisting of: nasopharyngeal swab, oropharyngeal swab, blood, serum, plasma, vitreous, sputum, urine, stool, tears, perspiration, saliva, semen, mucosal excretions, mucus, spinal fluid, cerebrospinal fluid (CSF), pleural fluid, peritoneal fluid, amniotic fluid, lymph fluid, eye swab, cheek swab, vaginal swab, cervical swab, rectal swab, cells, and tissue.
- CSF cerebrospinal fluid
- the method further comprises isolating nucleic acids from the bodily samples, and creating, for a given pool of the plurality of pools, the pooled sample by combining at least some of nucleic acids isolated from bodily samples obtained from the subjects in the given pool.
- the method further comprises enriching the nucleic acids for a plurality of genomic regions.
- the method further comprises amplifying at least some of the nucleic acids.
- the amplification comprises selective amplification.
- the amplification comprises universal amplification.
- enriching the nucleic acids for the plurality of genomic regions comprises contacting the nucleic acids with a plurality of probes, each of the plurality of probes having sequence complementarity with at least a portion of a genomic region of the plurality of genomic regions.
- the plurality of genomic regions comprises genomic regions associated with a disease or disorder.
- the disease or disorder comprises coronavirus disease 2019 (COVID-19), human immunodeficiency virus (HIV), or malaria.
- the disease or disorder comprises COVID-19.
- the method further comprises performing a plurality of diagnostic tests on the plurality of pooled samples to obtain a plurality of diagnostic results associated with the plurality of pooled samples.
- the plurality of diagnostic tests are configured to detect a presence or absence of a disease or disorder based on analyzing at least the plurality of pooled samples.
- the disease or disorder comprises coronavirus disease 2019 (COVID-19), human immunodeficiency virus (HIV), or malaria.
- the disease or disorder comprises COVID-19.
- the method further comprises, for a given pool among the plurality of pools, detecting the absence of the disease or disorder in each of the individual subjects of the given pool when the absence of the disease or disorder is detected based on analyzing the pooled sample corresponding to the given pool. In some embodiments, the method further comprises, for a given pool among the plurality of pools, testing each of the individual subjects of the given pool for the disease or disorder when the presence of the disease or disorder is detected based on analyzing the pooled sample corresponding to the given pool.
- the method further comprises, for a given pool among the plurality of pools, testing each of a plurality of sub-pools of the given pool for the disease or disorder when the presence of the disease or disorder is detected based on analyzing the pooled sample corresponding to the given pool.
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a clinical sensitivity of at least about 50%. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a clinical sensitivity of at least about 70%. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a clinical sensitivity of at least about 90%.
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a clinical sensitivity of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or 100%.
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a clinical specificity of at least about 50%.
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a clinical specificity of at least about 70%. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a clinical specificity of at least about 90%.
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a clinical specificity of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or 100%.
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a positive predictive value (PPV) of at least about 50%. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a positive predictive value (PPV) of at least about 70%. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a positive predictive value (PPV) of at least about 90%.
- PPV positive predictive value
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a positive predictive value (PPV) of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or 100%.
- PSV positive predictive value
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a negative predictive value (NPV) of at least about 50%. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a negative predictive value (NPV) of at least about 70%. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a negative predictive value (NPV) of at least about 90%.
- NPV negative predictive value
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with a negative predictive value (NPV) of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or 100%.
- NPV negative predictive value
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with an area under the curve (AUC) of at least about 0.60. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with an area under the curve (AUC) of at least about 0.70. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with an area under the curve (AUC) of at least about 0.80. In some embodiments, the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with an area under the curve (AUC) of at least about 0.90.
- the method further comprises detecting the presence or absence of the disease or disorder in the plurality of subjects with an area under the curve (AUC) of at least about 0.50, at least about 0.55, at least about 0.60, at least about 0.65, at least about 0.70, at least about 0.75, at least about 0.80, at least about 0.85, at least about 0.90, at least about 0.91, at least about 0.92, at least about 0.93, at least about 0.94, at least about 0.95, at least about 0.96, at least about 0.97, at least about 0.98, or at least about 0.99.
- AUC area under the curve
- the plurality of health data comprises a diagnosis of a disease or disorder, a prognosis of a disease or disorder, a risk of having a disease or disorder, a treatment history of a disease or disorder, a history of previous treatment for a disease or disorder, a history of prescribed medications, a history of prescribed medical devices, age, height, weight, sex, smoking status, one or more symptoms, and one or more vital signs.
- the one or more vital signs comprise one or more of: heart rate, heart rate variability, blood pressure, respiratory rate, blood oxygen concentration (SpO?), carbon dioxide concentration in respiratory gases, a hormone level, sweat analysis, blood glucose, body temperature, impedance, conductivity, capacitance, resistivity, electromyography, galvanic skin response, neurological signals, electroencephalography, electrocardiography, immunology markers, and other physiological measurements.
- the trained computer algorithm comprises a trained machine learning classifier.
- the trained machine learning classifier comprises an algorithm selected from the group consisting of: a support vector machine, a neural network, a random forest, a linear regression, a logistic regression, a Bayesian classifier, a boosted classifier, a gradient boosting algorithm, an adaptive boosting (AdaBoost) algorithm, and an extreme gradient boosting (XGBoost) algorithm.
- the method further comprises processing health data, contact tracing data, location data, or movement data of the individual subject with the trained computer algorithm to determine an expected prevalence of a disease or disorder, and assigning the individual subject of the plurality of subjects to the pool from among the plurality of pools based at least in part on the determined expected prevalence of the disease or disorder.
- the method further comprises assigning the individual subject of the plurality of subjects to the pool from among the plurality of pools when the determined expected prevalence of the disease or disorder is less than a pre-determined prevalence threshold.
- the pre-determined prevalence threshold is about 5%, about 10%, about 15%, about 20%, about 25%, about 30%, about 35%, about 40%, about 45%, or about 50%.
- the method further comprises determining a maximum pool size based on the determined expected prevalence of the disease or disorder, and assigning the individual subject of the plurality of subjects to the pool from among the plurality of pools based on the maximum pool size.
- the number of pools of the plurality of pools is reduced relative to the number of subjects of the plurality of subjects by at least 50%. In some embodiments, the number of pools of the plurality of pools is reduced relative to the number of subjects of the plurality of subjects by at least 100%. In some embodiments, the number of pools of the plurality of pools is reduced relative to the number of subjects of the plurality of subjects by at least 200%. In some embodiments, the number of pools of the plurality of pools is reduced relative to the number of subjects of the plurality of subjects by at least 300%.
- the number of pools of the plurality of pools is reduced relative to the number of subjects of the plurality of subjects by at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70, at least about 80%, at least about 90%, at least about 100%, at least about 110%, at least about 120%, at least about 130%, at least about 140%, at least about 150%, at least about 160%, at least about 170%, at least about 180%, at least about 190%, at least about 200%, at least about 210%, at least about 220%, at least about 230%, at least about 240%, at least about 250%, at least about 260%, at least about 270%, at least about 280%, at least about 290%, at least about 300%, at least about 310%, at least about 320%, at least about 330%, at least about 340%, at least about 350%, at least about 360%, at least about 370%, at least about 380%, at least about 390%,
- the method further comprises, based on the detected presence or absence of the disease or disorder in the plurality of subjects, administering a therapeutically effective dose of a treatment to treat the disease or disorder detected in at least a subset of the plurality of subjects.
- FIG. 1A shows an example method 100 for performing sample pooling of a plurality of clinical samples for diagnostic testing.
- the method 100 may comprise obtaining a plurality of bodily samples from a plurality of subjects (as in operation 102).
- the bodily samples comprises a nasopharyngeal swab, oropharyngeal swab, blood, serum, plasma, vitreous, sputum, urine, stool, tears, perspiration, saliva, semen, mucosal excretions, mucus, spinal fluid, cerebrospinal fluid (CSF), pleural fluid, peritoneal fluid, amniotic fluid, lymph fluid, eye swab, cheek swab, vaginal swab, cervical swab, rectal swab, cells, or tissue.
- CSF cerebrospinal fluid
- the method 100 may comprise obtaining a plurality of: health data, contact tracing data, location data, movement data, or any combination thereof associated with the plurality of subjects (as in operation 104).
- any of the data herein e.g., health data, contact tracing data, location data, or movement data associated with the plurality of subjects
- the plurality of health data comprises a diagnosis of a disease or disorder, a prognosis of a disease or disorder, a risk of having a disease or disorder, a treatment history of a disease or disorder, a history of previous treatment for a disease or disorder, a history of prescribed medications, a history of prescribed medical devices, age, height, weight, sex, smoking status, one or more symptoms, and one or more vital signs.
- the one or more vital signs comprise one or more of: heart rate, heart rate variability, blood pressure, respiratory rate, blood oxygen concentration (SpO?), carbon dioxide concentration in respiratory gases, a hormone level, sweat analysis, blood glucose, body temperature, impedance, conductivity, capacitance, resistivity, electromyography, galvanic skin response, neurological signals, electroencephalography, electrocardiography, immunology markers, and other physiological measurements.
- the plurality of contact tracing data, location data, or movement data comprises data associated with subjects’ environment, location, movement, or daily schedules. For example, for a delivery warehouse that is performing diagnostic testing on its employees, intelligent pooling may be performed based on the employees’ delivery routes, shift schedules, contact tracing information, etc.
- the method 100 is applied to genetic sequencing of samples, such that pooling of samples is performed where the mutation rate is low.
- the method 100 may comprise processing the plurality of: health data, contact tracing data, location data, movement data, or any combination thereof with a trained computer algorithm to at least some individual subjects of the plurality of subjects to a pool from among a plurality of pools (as in operation 106).
- a number of pools of the plurality of pools is less than a number of subjects of the plurality of subjects.
- the trained computer algorithm comprises a trained machine learning classifier.
- the trained machine learning classifier comprises an algorithm selected from the group consisting of: a support vector machine, a neural network, a random forest, a linear regression, a logistic regression, a Bayesian classifier, a boosted classifier, a gradient boosting algorithm, an adaptive boosting (AdaBoost) algorithm, and an extreme gradient boosting (XGBoost) algorithm.
- FIG. IB shows a general workflow for performing sample pooling of a plurality of clinical samples for diagnostic testing, including data input, consuming and consolidating data, performing a method for sample pooling, recommending pooling protocols, and performing diagnostic testing on the sample pools.
- performing the diagnostic testing on the sample pools comprises using an integrated robot and/or robotics application programming interface (API) to perform liquid handling of a plurality of bodily samples and/or sample pools.
- performing the diagnostic testing on the sample pools comprises detecting a plurality of diseases or disorders (e.g., common cold, influenza, COVID-19 infection, and/or COVID-19 immunity) using a same sample pool.
- diseases or disorders e.g., common cold, influenza, COVID-19 infection, and/or COVID-19 immunity
- performing the diagnostic testing on the sample pools comprises tagging a plurality of bodily samples and/or sample pools (e.g., using sample tags or sample barcodes) to increase a multiplexity of the diagnostic testing (e.g., via DNA sequencing,
- performing the diagnostic testing on the sample pools comprises performing genetic sequencing on a plurality of bodily samples and/or sample pools (e.g., in cases where a mutation rate is low).
- FIG. 1C shows a workflow for performing sample handling and testing.
- the workflow for performing sample handling and testing may comprise sample collection of a plurality of bodily samples from a plurality of subjects. Logistical synergies (or disadvantages) may be associated with the handling of the bodily samples. Seamless information flow may be required for the software (e.g., barcodes may be used to identify subject samples through a connected EHR system, if the bodily samples at the collection sites lack the required data).
- the workflow for performing sample handling and testing may comprise extracting nucleic acids (e.g., DNA or RNA) from the bodily fluid samples.
- the workflow for performing sample handling and testing may comprise pooling the samples, where the pools are selected using methods and systems of the present disclosure.
- the workflow for performing sample handling and testing may comprise performing a diagnostic test or panel on the pooled samples (e.g., an RT-PCR diagnostic panel).
- FIG. ID shows an example of a system procedure diagram of methods and systems of the present disclosure.
- FIG. IE shows an example of a user activity diagram of methods and systems of the present disclosure.
- FIG. 2A shows an example of sample pooling of a plurality of clinical samples for diagnostic testing. If a sample pool comprising a plurality of individual clinical samples receives a negative clinical test outcome, then all the individual clinical samples in the sample pool must also have a negative clinical test outcome. Conversely, if the sample pool comprising the plurality of individual clinical samples receives a positive clinical test outcome, then either the individual clinical samples can be individually tested, or the sample pool can be further sub-divided into smaller subsets and the sample pooling process can be repeated.
- FIG. 2B shows an example of how methods and systems of the present disclosure advantageously use sample pooling. For example, currently about 2 out of 10 test results return positive; patterns can be learned via machine learning about the 8 out of 10 negative patients. Using methods and systems of the present disclosure, predictive analytics and machine learning may be applied to identify patients with a lower prevalence rate. A plurality of predicted negative samples may be pooled together to create sample pool, and a single diagnostic test may be performed on the sample pool. As a result, the same number of subjects may be tested using a smaller number of diagnostic tests; the saved diagnostic tests may be used to test a broader population of subjects.
- nucleic acids may be extracted from bodily samples and analyzed for diagnostic testing.
- sequencing reads may be generated from the nuclei c acids using any suitable sequencing method.
- the sequencing method can be a first- generation sequencing method, such as Maxam-Gilbert or Sanger sequencing, or a high- throughput sequencing (e.g., next-generation sequencing or NGS) method.
- a high-throughput sequencing method may sequence simultaneously (or substantially simultaneously) at least 10,000, 100,000, 1 million, 10 million, 100 million, 1 billion, or more polynucleotide molecules.
- Sequencing methods may include, but are not limited to: pyrosequencing, sequencing-by-synthesis, single-molecule sequencing, nanopore sequencing, semiconductor sequencing, sequencing-by-ligation, sequencing-by-hybridization, Digital Gene Expression (Helicos), massively parallel sequencing, e.g., Helicos, Clonal Single Molecule Array (Solexa/Illumina), sequencing using PacBio, SOLiD, Ion Torrent, or Nanopore platforms.
- the sequencing comprises whole genome sequencing (WGS).
- the sequencing may be performed at a depth sufficient to assess tumor progression or tumor non-progression in a subject with a desired performance (e.g., accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), or the area under curve (AUC) of a receiver operator characteristic (ROC)).
- a desired performance e.g., accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), or the area under curve (AUC) of a receiver operator characteristic (ROC)).
- the sequencing reads may be aligned or mapped to a reference genome.
- the reference genome may comprise at least a portion of a genome (e.g., the human genome).
- the reference genome may comprise an entire genome (e.g., the entire human genome).
- the reference genome may comprise an entire genome with certain base conversions applied (e.g., the entire human genome with cytosines converted to thymines), as may be used for methylation data alignment.
- the reference genome may comprise a database comprising a plurality of genomic regions that correspond to coding and/or non-coding genomic regions of a genome.
- the database may comprise a plurality of genomic regions that correspond to disease-associated coding and/or non-coding genomic regions of a genome, such as single nucleotide variants (SNVs), copy number variants (CNVs), insertions or deletions (indels), and fusion genes.
- SNVs single nucleotide variants
- CNVs copy number variants
- indels insertions or deletions
- fusion genes fusion genes.
- the alignment may be performed using a Burrows- Wheeler algorithm (BWA), a sambamba algorithm, a samtools algorithm, or any other suitable alignment algorithm.
- a quantitative measure of the sequencing reads may be generated for each of a plurality of genomic regions. Quantitative measures of the sequencing reads may be generated, such as counts of sequencing reads that are aligned with a given genomic region. Sequencing reads having a portion or all of the sequencing read aligning with a given genomic region may be counted toward the quantitative measure for that genomic region.
- genomic regions may comprise disease markers. Patterns of specific and non-specific genomic regions may be indicative of disease progression or disease non-progression status. Changes over time in these patterns of genomic regions may be indicative of changes in disease progression or disease non-progression status.
- nucleic acids may be assayed by performing binding measurements of the nucleic acids at each of a plurality of genomic regions.
- performing the binding measurements comprises assaying the nucleic acids using probes that are selective for at least a portion of a plurality of genomic regions in the plurality of nucleic acids.
- the probes are nucleic acid molecules having sequence complementarity with nucleic acid sequences of the plurality of genomic regions.
- the nucleic acid molecules are primers or enrichment sequences.
- the assaying comprises use of array hybridization or polymerase chain reaction (PCR), or nucleic acid sequencing.
- the nucleic acids are enriched for at least a portion of the plurality of genomic regions.
- the enrichment comprises amplifying at least some of the nucleic acids.
- the nucleic acids may be amplified by selective amplification (e.g., by using a set of primers or probes comprising nucleic acid molecules having sequence complementarity with nucleic acid sequences of the plurality of genomic regions).
- the nucleic acids may be amplified by universal amplification (e.g., by using universal primers).
- the enrichment comprises selectively isolating at least a portion of the plurality of nucleic acids.
- the sequencing reads may be normalized or corrected.
- the sequencing reads may be de-deduplicated, normalized, and/or corrected to account for known biases in sequencing and library preparation and/or known biases in sequencing and library preparation.
- a subset of the quantitative measures e.g., statistical measures
- the plurality of genomic regions may comprise at least about 10 distinct genomic regions, at least about 50 distinct genomic regions, at least about 100 distinct genomic regions, at least about 500 distinct genomic regions, at least about 1 thousand distinct genomic regions, at least about 5 thousand distinct genomic regions, at least about 10 thousand distinct genomic regions, at least about 50 thousand distinct genomic regions, at least about 100 thousand distinct genomic regions, at least about 500 thousand distinct genomic regions, at least about 1 million distinct genomic regions, at least about 2 million distinct genomic regions, at least about 3 million distinct genomic regions, at least about 4 million distinct genomic regions, at least about 5 million distinct genomic regions, at least about 10 million distinct genomic regions, at least about 15 million distinct genomic regions, at least about 20 million distinct genomic regions, at least about 25 million distinct genomic regions, at least about 30 million distinct genomic regions, or more than 30 million distinct genomic regions.
- FIG. 3 shows a computer system 301 that is programmed or otherwise configured to, for example, direct the obtaining of bodily samples from subjects, obtain health data, contact tracing data, location data, or movement data associated with subjects, process health data, contact tracing data, location data, or movement data with a trained computer algorithm to assign individual subjects to a pool, output an electronic recommendation to create a pooled sample by combining bodily samples or portions thereof obtained from subjects in a pool, direct the creating of a pooled sample by combining bodily samples or portions thereof obtained from subjects in a pool, direct the isolating of nucleic acids from bodily samples, direct the enriching of nucleic acids, direct the amplifying of nucleic acids, direct the performing of diagnostic tests on pooled samples to obtain diagnostic results associated with the pooled samples, and detect the absence of the disease or disorder in individual subjects of a pool when an absence of a disease or disorder is detected based on analyzing a
- the computer system 301 can regulate various aspects of analysis, calculation, and generation of the present disclosure, such as, for example, directing the obtaining of bodily samples from subjects, obtaining health data, contact tracing data, location data, or movement data associated with subjects, processing health data, contact tracing data, location data, or movement data with a trained computer algorithm to assign individual subjects to a pool, outputting an electronic recommendation to create a pooled sample by combining bodily samples or portions thereof obtained from subjects in a pool, directing the creating of a pooled sample by combining bodily samples or portions thereof obtained from subjects in a pool, directing the isolating of nucleic acids from bodily samples, directing the enriching of nucleic acids, directing the amplifying of nucleic acids, directing the performing of diagnostic tests on pooled samples to obtain diagnostic results associated with the pooled samples, and detecting the absence of the disease or disorder in individual subjects of a pool when an absence of a disease or disorder is detected based on analyzing a pooled sample corresponding
- the computer system 301 can be an electronic device of a user or a computer system that is remotely located with respect to the electronic device.
- the electronic device can be a mobile electronic device.
- the computer system 301 includes a central processing unit (CPU, also “processor” and “computer processor” herein) 305, which can be a single core or multi core processor, or a plurality of processors for parallel processing.
- the computer system 301 also includes memory or memory location 310 (e.g., random-access memory, read-only memory, flash memory), electronic storage unit 315 (e.g., hard disk), communication interface 320 (e.g., network adapter) for communicating with one or more other systems, and peripheral devices 325, such as cache, other memory, data storage and/or electronic display adapters.
- the memory 310, storage unit 315, interface 320 and peripheral devices 325 are in communication with the CPU 305 through a communication bus (solid lines), such as a motherboard.
- the storage unit 315 can be a data storage unit (or data repository) for storing data.
- the computer system 301 can be operatively coupled to a computer network (“network”) 330 with the aid of the communication interface 320.
- the network 330 can be the Internet, an internet and/or extranet, or an intranet and/or extranet that is in communication with the Internet.
- the network 330 in some cases is a telecommunication and/or data network.
- the network 330 can include one or more computer servers, which can enable distributed computing, such as cloud computing.
- one or more computer servers may enable cloud computing over the network 330 (“the cloud”) to perform various aspects of analysis, calculation, and generation of the present disclosure, such as, for example, directing the obtaining of bodily samples from subjects, obtaining health data, contact tracing data, location data, or movement data associated with subjects, processing health data, contact tracing data, location data, or movement data with a trained computer algorithm to assign individual subjects to a pool, outputting an electronic recommendation to create a pooled sample by combining bodily samples or portions thereof obtained from subjects in a pool, directing the creating of a pooled sample by combining bodily samples or portions thereof obtained from subjects in a pool, directing the isolating of nucleic acids from bodily samples, directing the enriching of nucleic acids, directing the amplifying of nucleic acids, directing the performing of diagnostic tests on pooled samples to obtain diagnostic results associated with the pooled samples, and detecting the absence of the disease or disorder in individual subjects of a pool when an absence of a disease or disorder
- cloud computing may be provided by cloud computing platforms such as, for example, Amazon Web Services (AWS), Mi crosoft Azure, Google Cloud Platform, and IBM cloud.
- the network 330 in some cases with the aid of the computer system 301, can implement a peer-to-peer network, which may enable devices coupled to the computer system 301 to behave as a client or a server.
- the CPU 305 can execute a sequence of machine-readable instructions, which can be embodied in a program or software.
- the instructions may be stored in a memory location, such as the memory 310.
- the instructions can be directed to the CPU 305, which can subsequently program or otherwise configure the CPU 305 to implement methods of the present disclosure. Examples of operations performed by the CPU 305 can include fetch, decode, execute, and writeback.
- the CPU 305 can be part of a circuit, such as an integrated circuit.
- a circuit such as an integrated circuit.
- One or more other components of the system 301 can be included in the circuit.
- the circuit is an application specific integrated circuit (ASIC).
- ASIC application specific integrated circuit
- the storage unit 315 can store files, such as drivers, libraries and saved programs.
- the storage unit 315 can store user data, e.g., user preferences and user programs.
- the computer system 301 in some cases can include one or more additional data storage units that are external to the computer system 301, such as located on a remote server that is in communication with the computer system 301 through an intranet or the Internet.
- the computer system 301 can communicate with one or more remote computer systems through the network 330.
- the computer system 301 can communicate with a remote computer system of a user (e.g., a physician, a nurse, a caretaker, a patient, or a subject).
- remote computer systems include personal computers (e.g., portable PC), slate or tablet PC’s (e.g., Apple® iPad, Samsung® Galaxy Tab), telephones, Smart phones (e.g., Apple® iPhone, Android-enabled device, Blackberry®), or personal digital assistants.
- the user can access the computer system 301 via the network 330.
- Methods as described herein can be implemented by way of machine (e.g., computer processor) executable code stored on an electronic storage location of the computer system 301, such as, for example, on the memory 310 or electronic storage unit 315.
- the machine-executable or machine-readable code can be provided in the form of software.
- the code can be executed by the processor 305.
- the code can be retrieved from the storage unit 315 and stored on the memory 310 for ready access by the processor 305.
- the electronic storage unit 315 can be precluded, and machine-executable instructions are stored on memory 310.
- the code can be pre-compiled and configured for use with a machine having a processor adapted to execute the code, or can be compiled during runtime.
- the code can be supplied in a programming language that can be selected to enable the code to execute in a pre-compiled or as-compiled fashion.
- aspects of the systems and methods provided herein can be embodied in programming.
- Various aspects of the technology may be thought of as “products” or “articles of manufacture” typically in the form of machine (or processor) executable code and/or associated data that is carried on or embodied in a type of machine- readable medium.
- Machine-executable code can be stored on an electronic storage unit, such as memory (e.g., read-only memory, random-access memory, flash memory) or a hard disk.
- “Storage” type media can include any or all of the tangible memory of the computers, processors or the like, or associated modules thereof, such as various semiconductor memories, tape drives, disk drives and the like, which may provide non-transitory storage at any time for the software programming. All or portions of the software may at times be communicated through the Internet or various other telecommunication networks. Such communications, for example, may enable loading of the software from one computer or processor into another, for example, from a management server or host computer into the computer platform of an application server.
- another type of media that may bear the software elements includes optical, electrical and electromagnetic waves, such as used across physical interfaces between local devices, through wired and optical landline networks and over various air-links.
- a machine-readable medium such as computer-executable code
- a tangible storage medium such as computer-executable code
- Non-volatile storage media include, for example, optical or magnetic disks, such as any of the storage devices in any computer(s) or the like, such as may be used to implement the databases, etc. shown in the drawings.
- Volatile storage media include dynamic memory, such as main memory of such a computer platform.
- Tangible transmission media include coaxial cables; copper wire and fiber optics, including the wires that comprise a bus within a computer system.
- Carrier-wave transmission media may take the form of electric or electromagnetic signals, or acoustic or light waves such as those generated during radio frequency (RF) and infrared (IR) data communications.
- RF radio frequency
- IR infrared
- Common forms of computer-readable media therefore include for example: a floppy disk, a flexible disk, hard disk, magnetic tape, any other magnetic medium, a CD-ROM, DVD or DVD- ROM, any other optical medium, punch cards paper tape, any other physical storage medium with patterns of holes, a RAM, a ROM, a PROM and EPROM, a FLASH-EPROM, any other memory chip or cartridge, a carrier wave transporting data or instructions, cables or links transporting such a carrier wave, or any other medium from which a computer may read programming code and/or data.
- Many of these forms of computer-readable media may be involved in carrying one or more sequences of one or more instructions to a processor for execution.
- the computer system 301 can include or be in communication with an electronic display 335 that comprises a user interface (E ⁇ ) 340 for providing, for example, health data, contact tracing data, location data, or movement data, recommendations to create pooled samples, and diagnostic results of pooled samples.
- UIs include, without limitation, a graphical user interface (GET) and web-based user interface.
- Methods and systems of the present disclosure can be implemented by way of one or more algorithms.
- An algorithm can be implemented by way of software upon execution by the central processing unit 305.
- the algorithm can, for example, direct the obtaining of bodily samples from subjects, obtain health data, contact tracing data, location data, or movement data associated with subjects, process health data, contact tracing data, location data, or movement data with a trained computer algorithm to assign individual subjects to a pool, output an electronic recommendation to create a pooled sample by combining bodily samples or portions thereof obtained from subjects in a pool, direct the creating of a pooled sample by combining bodily samples or portions thereof obtained from subjects in a pool, direct the isolating of nucleic acids from bodily samples, direct the enriching of nucleic acids, direct the amplifying of nucleic acids, direct the performing of diagnostic tests on pooled samples to obtain diagnostic results associated with the pooled samples, and detect the absence of the disease or disorder in individual subjects of a pool when an absence of a disease or disorder is detected
- Example 1 Efficient sample pooling for increasing diagnostic testing capacity of COVID-19
- FIG. 4A shows the number of people who were able to be tested with 1,000 diagnostic tests, using Boston vs. Cambridge cohort information (left 3 columns), clinical symptom information (middle 3 columns), and contact tracing information (right 3 columns). Within each set of 3 columns, the number of people who were able to be tested with 1,000 diagnostic tests is indicated using no sample pooling (left), simple sample pooling (middle), and intelligent sample pooling based on methods and systems of the present disclosure (right).
- simple sample pooling outperformed no sample pooling for all three different types of information used; further, intelligent sample pooling based on methods and systems of the present disclosure outperformed simple sample pooling for all three different types of information used (and significantly so for the cases of clinical symptom information and contact tracing information. Therefore, using intelligent sample pooling based on methods and systems of the present disclosure, diagnostic testing capacity for COVID-19 can be increased by about 3X, with at least a 40% cost savings at 96% specificity.
- FIG. 4B shows that using intelligent sample pooling based on methods and systems of the present disclosure, the rules for diagnostic testing can be redefined. Based on the current state (left side), symptoms alone are the prevalence assessment. Some subjects who meet a set of certain criteria receive testing, while the others are denied testing (e.g., because of rationing of diagnostic testing due to a shortage of testing capacity). Using intelligent sample pooling based on methods and systems of the present disclosure (right side), the ratio of diagnostic tests available to individuals tested for COVID-19 is no longer 1:1.
- This ratio can be effectively increased, for example, to about 1.1, about 1.2, about 1.3, about 1.4, about 1.5, about 1.6, about 1.7, about 1.8, about 1.9, about 2.0, about 2.1, about 2.2, about 2.3, about 2.4, about 2.5, about 2.6, about 2.7, about 2.8, about 2.9, about 3.0, or more than about 3.0.
- patient details regarding those observations were obtained (e.g., clinical health data, contact tracing data, location data, movement data, and demographic data, including age).
- diagnostic test outcomes were obtained (represented in binary format, where values of 1 indicate positive test results and values of 0 indicates negative test results).
- gender data was obtained (represented in binary format, where values of 0 indicate male and values of 1 indicate female).
- race and ethnicity data was obtained (represented in one hot encoding via 6 binary values corresponding to each race/ethnicity category).
- the data were filtered to keep certain columns corresponding to pertinent data features, including marital status, race, ethnicity, gender, healthcare expenses, age, counts by county, positive cases by county, counts by city, positive cases by city, prevalence by county, prevalence by city.
- the data for each subject or patient may comprise one or more of: a diagnostic test result, an age, sex/gender, zip code (or another de- identified geolocator), reason for diagnostic testing, vital signs (if available and accessible), and symptoms (if available).
- symptoms may include fever, cough, sore throat, tiredness, body temperature, respiratory rate, loss of taste, loss of smell, vomiting, shortness of breath, chills, hypoxemia, etc.
- prevalence data was obtained from the Starschema COVID-19 epidemiological data set.
- the data was split into a training dataset and a testing dataset.
- a supervised machine learning classifier e.g., Random Forest classifier, extreme gradient boosting (XGBoost), and gradient boosting
- XGBoost extreme gradient boosting
- gradient boosting e.g., gradient boosting
- predicted outcomes for the testing dataset were obtained.
- a classification report and confusion matrix were generated to evaluate the classifier performance.
- the total numbers of positive cases and negative cases in the test dataset were obtained.
- a set of feature importance values e.g., weights
- Table 1 shows relative performance metrics obtained using three different supervised machine learning classifiers (Random Forest classifier, extreme gradient boosting (XGBoost), and gradient boosting).
- the Random Forest classifier had the following top 5 weighted features: fever, cough, body temperature, respiratory rate, and loss of taste.
- the XGBoost classifier had the following top 5 weighted features: fever, cough, hypoxemia, body temperature, and loss of taste.
- the gradient boosting classifier had the following top 5 weighted features: fever, cough, body temperature, respiratory rate, and loss of taste.
- FIG. 5A shows the percentage of test savings that is achieved using simple sample pooling (gray), using intelligent sample pooling based on methods and systems of the present disclosure (purple), and an upper bound achieved using intelligent sample pooling (dashed).
- the percentage of test savings achieved using simple pooling diminishes rapidly as the prevalence rate of the disease increases, and simple pooling is not viable above a prevalence rate of about 25%; further, a greater percentage of test savings is achieved using intelligent sample pooling based on methods and systems of the present disclosure, as compared to using simple sample pooling, across all prevalence rates of disease.
- FIG. 5B shows the percentage increase in diagnostic testing capacity that is achieved using simple sample pooling (gray), using intelligent sample pooling based on methods and systems of the present disclosure (purple), and an upper bound achieved using intelligent sample pooling (dashed).
- the percentage increase in diagnostic testing capacity achieved using simple pooling diminishes rapidly as the prevalence rate of the disease increases, and simple pooling is not viable above a prevalence rate of about 25%; further, a greater percentage increase in diagnostic testing capacity is achieved using intelligent sample pooling based on methods and systems of the present disclosure, as compared to using simple sample pooling, across all prevalence rates of disease. Therefore, using intelligent sample pooling based on methods and systems of the present disclosure, a consistent increase in testing capacity is achieved in all prevalence environments, but also a pool prevalence below 30% is strategically and actively maintained.
- Example 3 Number of diagnostic tests needed with intelligent sample pool testing
- FIG. 6 shows a relative number of diagnostic tests that is required versus the sample pool size (ranging from 1 sample to 20 samples per pool), at a prevalence rate (PR) of 1% (light blue), 5% (orange), 10% (gray), 20% (yellow), and 30% (dark blue).
- the relative number of tests is calculated by mimicking the utilization of resources. The relative number of tests is minimized based on two approaches. First, the best sample pool size or sample pooling strategy is selected. By referring to the resulting graph, the sample pool size that provides the lowest number of diagnostic tests is determined. Second, the prevalence rate (PR) is reduced. Lower prevalence rates within sample pools yield a lower number of diagnostic tests required, thereby improving upon the first strategy of selecting the best sample pool size or sample pooling strategy. Using methods and systems of the present disclosure, the prevalence rate of sample pools is reduced through intelligent segmentation using machine learning methodologies.
- Example 4 Increase in diagnostic testing capacity at high clinical sensitivity
- a population of 40,000 clinical test samples (e.g., which corresponds to Quest Diagnostic’s daily testing capacity) was simulated, assuming a disease prevalence rate of 10%, a clinical sensitivity of 90%, and a clinical specificity of 90%.
- an optimal pool size of 4 samples per pool was selected, and a 40% reduction in diagnostic test kit utilization was achieved, along with a clinical sensitivity of 80% and a clinical specificity of 96% (as shown in FIG. 7). Therefore, these results represent an increase in Quest Diagnostic’s testing capacity by 16,000 diagnostic tests, without changing their initial kit availability.
Landscapes
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Public Health (AREA)
- Medical Informatics (AREA)
- Epidemiology (AREA)
- Biomedical Technology (AREA)
- Primary Health Care (AREA)
- General Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Pathology (AREA)
- Databases & Information Systems (AREA)
- Investigating Or Analysing Biological Materials (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Automatic Analysis And Handling Materials Therefor (AREA)
- Analysing Materials By The Use Of Radiation (AREA)
Abstract
Description
Claims
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202063047630P | 2020-07-02 | 2020-07-02 | |
PCT/US2021/039849 WO2022006246A1 (en) | 2020-07-02 | 2021-06-30 | Methods and systems for efficient sample pooling for diagnostic testing |
Publications (2)
Publication Number | Publication Date |
---|---|
EP4176451A1 true EP4176451A1 (en) | 2023-05-10 |
EP4176451A4 EP4176451A4 (en) | 2024-07-17 |
Family
ID=79315538
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP21834522.1A Pending EP4176451A4 (en) | 2020-07-02 | 2021-06-30 | Methods and systems for efficient sample pooling for diagnostic testing |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP4176451A4 (en) |
CN (1) | CN116134546A (en) |
AU (1) | AU2021300319A1 (en) |
CA (1) | CA3187387A1 (en) |
WO (1) | WO2022006246A1 (en) |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20010039012A1 (en) * | 1999-06-14 | 2001-11-08 | Lapidus Stanley N. | Methods for diagnostic screening |
US7194301B2 (en) * | 2003-10-06 | 2007-03-20 | Transneuronic, Inc. | Method for screening and treating patients at risk of medical disorders |
US20120245460A1 (en) * | 2009-12-03 | 2012-09-27 | Cedars-Sinai Medical Center | Method of determining ischemia using paired stress and rest scans |
US20140108042A1 (en) * | 2012-10-11 | 2014-04-17 | Jayaram Reddy | Methods and systems for determining health risk score, risk of hospitalization and forecasting healthcare cost |
AU2019362855A1 (en) * | 2018-10-16 | 2021-05-20 | Twinstrand Biosciences, Inc. | Methods and reagents for efficient genotyping of large numbers of samples via pooling |
-
2021
- 2021-06-30 WO PCT/US2021/039849 patent/WO2022006246A1/en active Application Filing
- 2021-06-30 AU AU2021300319A patent/AU2021300319A1/en active Pending
- 2021-06-30 EP EP21834522.1A patent/EP4176451A4/en active Pending
- 2021-06-30 CA CA3187387A patent/CA3187387A1/en active Pending
- 2021-06-30 CN CN202180054373.5A patent/CN116134546A/en active Pending
Also Published As
Publication number | Publication date |
---|---|
CA3187387A1 (en) | 2022-01-06 |
EP4176451A4 (en) | 2024-07-17 |
AU2021300319A1 (en) | 2023-03-02 |
WO2022006246A1 (en) | 2022-01-06 |
CN116134546A (en) | 2023-05-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP7302081B2 (en) | Variant Classifier Based on Deep Neural Networks | |
US20210327534A1 (en) | Cancer classification using patch convolutional neural networks | |
US11164655B2 (en) | Systems and methods for predicting homologous recombination deficiency status of a specimen | |
JP2024016039A (en) | Integrated machine-learning framework to estimate homologous recombination deficiency | |
US20220389522A1 (en) | Methods of assessing and monitoring tumor load | |
CN112203648A (en) | Method, apparatus and system for deep learning based prenatal examination | |
US20230287516A1 (en) | Determination of a physiological condition with nucleic acid fragment endpoints | |
US20220228217A1 (en) | Epiaging ; novel ecosystem for managing healthy aging | |
US20210358569A1 (en) | Methods and systems for assessing microsatellite instability | |
JP2024056939A (en) | Methods for fingerprinting of biological samples | |
EP4176451A1 (en) | Methods and systems for efficient sample pooling for diagnostic testing | |
US20200135300A1 (en) | Applying low coverage whole genome sequencing for intelligent genomic routing | |
US11746385B2 (en) | Methods of detecting tumor progression via analysis of cell-free nucleic acids |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE |
|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE |
|
17P | Request for examination filed |
Effective date: 20230126 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
P01 | Opt-out of the competence of the unified patent court (upc) registered |
Effective date: 20230516 |
|
DAV | Request for validation of the european patent (deleted) | ||
DAX | Request for extension of the european patent (deleted) | ||
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R079 Free format text: PREVIOUS MAIN CLASS: G16H0050200000 Ipc: G16H0010600000 |
|
A4 | Supplementary search report drawn up and despatched |
Effective date: 20240618 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G06N 20/00 20190101ALI20240612BHEP Ipc: G06N 3/02 20060101ALI20240612BHEP Ipc: G16H 50/80 20180101ALI20240612BHEP Ipc: G16H 50/70 20180101ALI20240612BHEP Ipc: G16H 50/30 20180101ALI20240612BHEP Ipc: G16H 50/20 20180101ALI20240612BHEP Ipc: G16H 10/60 20180101AFI20240612BHEP |