CN113892150A - 计算跨度的深度学习方法 - Google Patents

计算跨度的深度学习方法 Download PDF

Info

Publication number
CN113892150A
CN113892150A CN202080038894.7A CN202080038894A CN113892150A CN 113892150 A CN113892150 A CN 113892150A CN 202080038894 A CN202080038894 A CN 202080038894A CN 113892150 A CN113892150 A CN 113892150A
Authority
CN
China
Prior art keywords
highlighted
training
text
node
nodes
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202080038894.7A
Other languages
English (en)
Chinese (zh)
Inventor
J·卡森
C·姆瓦拉布
T·H·罗杰斯
C·艾伦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Publication of CN113892150A publication Critical patent/CN113892150A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/01Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/022Knowledge engineering; Knowledge acquisition
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • G06N5/041Abduction
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data
    • G16H10/20ICT specially adapted for the handling or processing of patient-related medical or healthcare data for electronic clinical trials or questionnaires
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data
    • G16H10/60ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/20ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H70/00ICT specially adapted for the handling or processing of medical references
    • G16H70/20ICT specially adapted for the handling or processing of medical references relating to practices or guidelines
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H70/00ICT specially adapted for the handling or processing of medical references
    • G16H70/60ICT specially adapted for the handling or processing of medical references relating to pathologies

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Evolutionary Computation (AREA)
  • Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Public Health (AREA)
  • Biomedical Technology (AREA)
  • Primary Health Care (AREA)
  • Epidemiology (AREA)
  • Pathology (AREA)
  • Databases & Information Systems (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Medical Treatment And Welfare Office Work (AREA)
CN202080038894.7A 2019-06-27 2020-06-05 计算跨度的深度学习方法 Pending CN113892150A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16/454,311 2019-06-27
US16/454,311 US11379660B2 (en) 2019-06-27 2019-06-27 Deep learning approach to computing spans
PCT/IB2020/055332 WO2020261002A1 (en) 2019-06-27 2020-06-05 Deep learning approach to computing spans

Publications (1)

Publication Number Publication Date
CN113892150A true CN113892150A (zh) 2022-01-04

Family

ID=74044096

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202080038894.7A Pending CN113892150A (zh) 2019-06-27 2020-06-05 计算跨度的深度学习方法

Country Status (6)

Country Link
US (1) US11379660B2 (https=)
JP (1) JP7549417B2 (https=)
CN (1) CN113892150A (https=)
DE (1) DE112020002129T5 (https=)
GB (1) GB2598879A (https=)
WO (1) WO2020261002A1 (https=)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11087087B1 (en) * 2017-02-15 2021-08-10 Robert Mayer Comparative expression processing
US11379660B2 (en) * 2019-06-27 2022-07-05 International Business Machines Corporation Deep learning approach to computing spans
CN111259112B (zh) * 2020-01-14 2023-07-04 北京百度网讯科技有限公司 医疗事实的验证方法和装置
US11755822B2 (en) 2020-08-04 2023-09-12 International Business Machines Corporation Promised natural language processing annotations
US11520972B2 (en) * 2020-08-04 2022-12-06 International Business Machines Corporation Future potential natural language processing annotations
CN112509690B (zh) 2020-11-30 2023-08-04 北京百度网讯科技有限公司 用于控制质量的方法、装置、设备和存储介质
WO2023278980A1 (en) * 2021-06-28 2023-01-05 ACADEMIC MERIT LLC d/b/a FINETUNE LEARNING Interface to natural language generator for generation of knowledge assessment items
US11977836B1 (en) * 2021-11-26 2024-05-07 Amazon Technologies, Inc. Global explanations of machine learning model predictions for input containing text attributes
US20240220488A1 (en) * 2022-12-30 2024-07-04 International Business Machines Corporation Optimizing structured query language queries using candidate sets
JP7454090B1 (ja) 2023-07-12 2024-03-21 医療法人社団梅華会 医療の支援装置

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1618064A (zh) * 2002-01-29 2005-05-18 国际商业机器公司 翻译方法、已翻译句子的输入方法、记录介质、程序与计算机设备
US20140163962A1 (en) * 2012-12-10 2014-06-12 International Business Machines Corporation Deep analysis of natural language questions for question answering system
CN106484674A (zh) * 2016-09-20 2017-03-08 北京工业大学 一种基于深度学习的中文电子病历概念抽取方法
CN106997370A (zh) * 2015-08-07 2017-08-01 谷歌公司 基于作者的文本分类和转换
US20170300632A1 (en) * 2016-04-19 2017-10-19 Nec Laboratories America, Inc. Medical history extraction using string kernels and skip grams
US20200410050A1 (en) * 2019-06-27 2020-12-31 International Business Machines Corporation Deep learning approach to computing spans

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6574633B1 (en) 1999-11-01 2003-06-03 Honeywell International Inc. Method for dynamically grouping limited range physical entities in a topological space
US9244909B2 (en) * 2012-12-10 2016-01-26 General Electric Company System and method for extracting ontological information from a body of text
US9715662B2 (en) 2013-01-28 2017-07-25 International Business Machines Corporation Inconsistency detection between structured and non-structured data
CA2973138C (en) 2014-01-10 2020-06-16 Cluep Inc. Systems, devices, and methods for automatic detection of feelings in text
JP5807829B2 (ja) 2015-02-02 2015-11-10 洋彰 宮崎 自律型知識分析機
US10061714B2 (en) * 2016-03-18 2018-08-28 Oracle International Corporation Tuple encoding aware direct memory access engine for scratchpad enabled multicore processors
US10740678B2 (en) * 2016-03-31 2020-08-11 International Business Machines Corporation Concept hierarchies
KR102801724B1 (ko) * 2016-06-28 2025-04-30 삼성전자주식회사 언어 처리 방법 및 장치
US20180075011A1 (en) * 2016-09-13 2018-03-15 International Business Machines Corporation Hybrid Approach to Handling Hypotheticals in Texts
US10360301B2 (en) * 2016-10-10 2019-07-23 International Business Machines Corporation Personalized approach to handling hypotheticals in text
US10762992B2 (en) 2016-11-30 2020-09-01 Welltok, Inc. Synthetic ground truth expansion
US9715495B1 (en) * 2016-12-15 2017-07-25 Quid, Inc. Topic-influenced document relationship graphs
US20190006027A1 (en) 2017-06-30 2019-01-03 Accenture Global Solutions Limited Automatic identification and extraction of medical conditions and evidences from electronic health records
CN107526785B (zh) 2017-07-31 2020-07-17 广州市香港科大霍英东研究院 文本分类方法及装置
US10811125B2 (en) 2017-08-21 2020-10-20 International Business Machines Corporation Cognitive framework to identify medical case safety reports in free form text
CN108304387B (zh) * 2018-03-09 2021-06-15 联想(北京)有限公司 文本中噪音词的识别方法、装置、服务器组及存储介质
CN109062901B (zh) * 2018-08-14 2019-10-11 第四范式(北京)技术有限公司 神经网络训练方法和装置及命名实体识别方法和装置

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1618064A (zh) * 2002-01-29 2005-05-18 国际商业机器公司 翻译方法、已翻译句子的输入方法、记录介质、程序与计算机设备
US20140163962A1 (en) * 2012-12-10 2014-06-12 International Business Machines Corporation Deep analysis of natural language questions for question answering system
CN106997370A (zh) * 2015-08-07 2017-08-01 谷歌公司 基于作者的文本分类和转换
US20170300632A1 (en) * 2016-04-19 2017-10-19 Nec Laboratories America, Inc. Medical history extraction using string kernels and skip grams
CN106484674A (zh) * 2016-09-20 2017-03-08 北京工业大学 一种基于深度学习的中文电子病历概念抽取方法
US20200410050A1 (en) * 2019-06-27 2020-12-31 International Business Machines Corporation Deep learning approach to computing spans

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
MAREK REI: "Semi-supervised multitask learning for sequence labeling", 《ARXIV PREPRINT ARXIV》, 31 December 2017 (2017-12-31) *

Also Published As

Publication number Publication date
GB2598879A (en) 2022-03-16
JP2022537759A (ja) 2022-08-29
DE112020002129T5 (de) 2022-05-05
WO2020261002A1 (en) 2020-12-30
JP7549417B2 (ja) 2024-09-11
US20200410050A1 (en) 2020-12-31
US11379660B2 (en) 2022-07-05

Similar Documents

Publication Publication Date Title
US11942221B2 (en) Disambiguation of ambiguous portions of content for processing by automated systems
JP7549417B2 (ja) コンピューティング・スパンに対するディープ・ラーニング・アプローチ
US10360301B2 (en) Personalized approach to handling hypotheticals in text
US11823798B2 (en) Container-based knowledge graphs for determining entity relations in non-narrative text
JP7357630B2 (ja) 薬物副作用解析のための方法、コンピュータ・プログラム、および装置
US20180075011A1 (en) Hybrid Approach to Handling Hypotheticals in Texts
US11004550B2 (en) Treatment recommendations based on drug-to-drug interactions
US11275892B2 (en) Traversal-based sentence span judgements
US20180089383A1 (en) Container-Based Knowledge Graphs for Determining Entity Relations in Medical Text
US10380251B2 (en) Mining new negation triggers dynamically based on structured and unstructured knowledge
US11295080B2 (en) Automatic detection of context switch triggers
US20180096103A1 (en) Verification of Clinical Hypothetical Statements Based on Dynamic Cluster Analysis
US20180196921A1 (en) Abbreviation Expansion in Clinical Notes Using Frequency and Context
US20180121603A1 (en) Identification of Related Electronic Medical Record Documents in a Question and Answer System
US10839961B2 (en) Identifying drug-to-drug interactions in medical content and applying interactions to treatment recommendations
US20180089381A1 (en) Cognitive Building of Medical Condition Base Cartridges for a Medical System
US20180060503A1 (en) Targeted Adjustment of Previous Insights Based on Changes to Positional Statements
US20160098456A1 (en) Implicit Durations Calculation and Similarity Comparison in Question Answering Systems
US20190198137A1 (en) Automatic Summarization of Patient Data Using Medically Relevant Summarization Templates
US11334720B2 (en) Machine learned sentence span inclusion judgments
US20190198138A1 (en) Automatic Expansion of Medically Relevant Summarization Templates Using Semantic Expansion
Yan et al. Mˆ 2-meddialog: A dataset and benchmarks for multi-domain multi-service medical dialogues

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20220104