DE112020003311T5 - Verarbeitung natürlicher sprache unter verwendung eines ontologiegestützten modells zur begriffseinbettung - Google Patents

Verarbeitung natürlicher sprache unter verwendung eines ontologiegestützten modells zur begriffseinbettung Download PDF

Info

Publication number
DE112020003311T5
DE112020003311T5 DE112020003311.2T DE112020003311T DE112020003311T5 DE 112020003311 T5 DE112020003311 T5 DE 112020003311T5 DE 112020003311 T DE112020003311 T DE 112020003311T DE 112020003311 T5 DE112020003311 T5 DE 112020003311T5
Authority
DE
Germany
Prior art keywords
vectors
computer
term
vector
terms
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
DE112020003311.2T
Other languages
German (de)
English (en)
Inventor
Brendan Bull
Paul Lewis Felt
Andrew Hicks
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Merative US LP
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Publication of DE112020003311T5 publication Critical patent/DE112020003311T5/de
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3347Query execution using vector based model
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/211Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/247Thesauruses; Synonyms

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
DE112020003311.2T 2019-08-20 2020-08-13 Verarbeitung natürlicher sprache unter verwendung eines ontologiegestützten modells zur begriffseinbettung Ceased DE112020003311T5 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16/545,608 2019-08-20
US16/545,608 US11176323B2 (en) 2019-08-20 2019-08-20 Natural language processing using an ontology-based concept embedding model
PCT/IB2020/057621 WO2021033087A1 (en) 2019-08-20 2020-08-13 Natural language processing using an ontology-based concept embedding model

Publications (1)

Publication Number Publication Date
DE112020003311T5 true DE112020003311T5 (de) 2022-03-31

Family

ID=74646859

Family Applications (1)

Application Number Title Priority Date Filing Date
DE112020003311.2T Ceased DE112020003311T5 (de) 2019-08-20 2020-08-13 Verarbeitung natürlicher sprache unter verwendung eines ontologiegestützten modells zur begriffseinbettung

Country Status (6)

Country Link
US (1) US11176323B2 (https=)
JP (1) JP2022545062A (https=)
CN (1) CN114341862A (https=)
DE (1) DE112020003311T5 (https=)
GB (2) GB2616542A (https=)
WO (1) WO2021033087A1 (https=)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11217227B1 (en) 2019-11-08 2022-01-04 Suki AI, Inc. Systems and methods for generating disambiguated terms in automatically generated transcriptions including instructions within a particular knowledge domain
US11538465B1 (en) 2019-11-08 2022-12-27 Suki AI, Inc. Systems and methods to facilitate intent determination of a command by grouping terms based on context
US11354513B2 (en) * 2020-02-06 2022-06-07 Adobe Inc. Automated identification of concept labels for a text fragment
US11416684B2 (en) 2020-02-06 2022-08-16 Adobe Inc. Automated identification of concept labels for a set of documents
US11494562B2 (en) * 2020-05-14 2022-11-08 Optum Technology, Inc. Method, apparatus and computer program product for generating text strings
US11645526B2 (en) * 2020-06-25 2023-05-09 International Business Machines Corporation Learning neuro-symbolic multi-hop reasoning rules over text
US20220172040A1 (en) * 2020-11-30 2022-06-02 Microsoft Technology Licensing, Llc Training a machine-learned model based on feedback
US12562244B2 (en) * 2021-03-01 2026-02-24 International Business Machines Corporation Combining domain-specific ontologies for language processing
JP7761880B2 (ja) * 2021-03-16 2025-10-29 公立大学法人会津大学 モデル推論プログラム、情報処理装置及びモデル推論方法
US11868381B2 (en) * 2021-03-29 2024-01-09 Google Llc Systems and methods for training language models to reason over tables
CN113420117B (zh) * 2021-06-23 2023-10-20 北京交通大学 一种基于多元特征融合的突发事件分类方法
CN113779196B (zh) * 2021-09-07 2024-02-13 大连大学 一种融合多层次信息的海关同义词识别方法
CN114003688B (zh) * 2021-10-14 2025-07-29 咪咕文化科技有限公司 问答数据的查询方法、装置、设备以及存储介质
US11954619B1 (en) * 2022-01-12 2024-04-09 Trueblue, Inc. Analysis and processing of skills related data from a communications session with improved latency
CN114782722B (zh) * 2022-04-29 2023-02-03 北京百度网讯科技有限公司 图文相似度的确定方法、装置及电子设备
CN115033671A (zh) * 2022-06-13 2022-09-09 联想(北京)有限公司 一种信息处理方法、装置和可读存储介质
CN115062699B (zh) * 2022-06-13 2025-08-12 中孚安全技术有限公司 一种基于Word2vec-FL的社区发现方法及系统
US20240211796A1 (en) * 2022-12-22 2024-06-27 Microsoft Technology Licensing, Llc Explanation of emergent semantics in embedding spaces via analogy
CN117009541B (zh) * 2023-06-07 2026-02-17 中电通商数字技术(上海)有限公司 临床医学检验知识库构建与应用方法、装置、设备及介质

Family Cites Families (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1919771A4 (en) * 2005-08-31 2010-06-09 Intuview Itd EXPERT DECISION SUPPORT SYSTEM AND REAL-TIME METHODS OF OPERATING DOCUMENTS IN LANGUAGES OTHER THAN ENGLISH
US8027977B2 (en) * 2007-06-20 2011-09-27 Microsoft Corporation Recommending content using discriminatively trained document similarity
US8977537B2 (en) 2011-06-24 2015-03-10 Microsoft Technology Licensing, Llc Hierarchical models for language modeling
SG11201406534QA (en) * 2012-04-11 2014-11-27 Univ Singapore Methods, apparatuses and computer-readable mediums for organizing data relating to a product
US9519859B2 (en) * 2013-09-06 2016-12-13 Microsoft Technology Licensing, Llc Deep structured semantic model produced using click-through data
US9477752B1 (en) 2013-09-30 2016-10-25 Verint Systems Inc. Ontology administration and application to enhance communication data analytics
US20150127323A1 (en) * 2013-11-04 2015-05-07 Xerox Corporation Refining inference rules with temporal event clustering
US9672814B2 (en) 2015-05-08 2017-06-06 International Business Machines Corporation Semi-supervised learning of word embeddings
CN105260488B (zh) 2015-11-30 2018-10-02 哈尔滨工业大学 一种用于语义理解的文本序列迭代方法
JP6549500B2 (ja) * 2016-02-26 2019-07-24 トヨタ自動車株式会社 話題推定学習装置及び話題推定学習方法
US10740678B2 (en) * 2016-03-31 2020-08-11 International Business Machines Corporation Concept hierarchies
US10169454B2 (en) 2016-05-17 2019-01-01 Xerox Corporation Unsupervised ontology-based graph extraction from texts
US20180101773A1 (en) * 2016-10-07 2018-04-12 Futurewei Technologies, Inc. Apparatus and method for spatial processing of concepts
CN108268883B (zh) * 2016-12-31 2021-05-07 上海交通大学 基于开放数据的移动端信息模板自构建系统
US12411880B2 (en) * 2017-02-16 2025-09-09 Globality, Inc. Intelligent matching system with ontology-aided relation extraction
CN110352417B (zh) * 2017-03-06 2024-02-02 三菱电机株式会社 本体构建辅助装置
EP3385862A1 (en) * 2017-04-03 2018-10-10 Siemens Aktiengesellschaft A method and apparatus for performing hierarchical entity classification
US10963501B1 (en) * 2017-04-29 2021-03-30 Veritas Technologies Llc Systems and methods for generating a topic tree for digital information
JP6957967B2 (ja) * 2017-05-16 2021-11-02 富士通株式会社 生成プログラム、生成方法、生成装置、及びパラメータ生成方法
US11488713B2 (en) 2017-08-15 2022-11-01 Computer Technology Associates, Inc. Disease specific ontology-guided rule engine and machine learning for enhanced critical care decision support
KR102060176B1 (ko) * 2017-09-12 2019-12-27 네이버 주식회사 문서의 카테고리 분류를 위한 딥러닝 학습 방법 및 그 시스템
US11250331B2 (en) * 2017-10-31 2022-02-15 Microsoft Technology Licensing, Llc Distant supervision for entity linking with filtering of noise
US10817676B2 (en) * 2017-12-27 2020-10-27 Sdl Inc. Intelligent routing services and systems
WO2019132685A1 (ru) * 2017-12-29 2019-07-04 Общество С Ограниченной Ответственностью "Интеллоджик" Способ и система поддержки принятия врачебных решений
CN108717574B (zh) 2018-03-26 2021-09-21 浙江大学 一种基于连词标记和强化学习的自然语言推理方法
US10817657B2 (en) * 2018-12-26 2020-10-27 Nokia Solutions And Networks Oy Determination of field types in tabular data
CN110134943B (zh) 2019-04-03 2023-04-18 平安科技(深圳)有限公司 领域本体生成方法、装置、设备及介质
US10902203B2 (en) * 2019-04-23 2021-01-26 Oracle International Corporation Named entity disambiguation using entity distance in a knowledge graph
US11126647B2 (en) * 2019-12-13 2021-09-21 CS Disco, Inc. System and method for hierarchically organizing documents based on document portions

Also Published As

Publication number Publication date
US11176323B2 (en) 2021-11-16
JP2022545062A (ja) 2022-10-25
GB2601697A (en) 2022-06-08
US20210056168A1 (en) 2021-02-25
GB202308265D0 (en) 2023-07-19
GB202203106D0 (en) 2022-04-20
CN114341862A (zh) 2022-04-12
WO2021033087A1 (en) 2021-02-25
GB2616542A (en) 2023-09-13

Similar Documents

Publication Publication Date Title
DE112020003311T5 (de) Verarbeitung natürlicher sprache unter verwendung eines ontologiegestützten modells zur begriffseinbettung
DE112018004376T5 (de) Schützen kognitiver systeme vor auf gradienten beruhenden angriffen durch die verwendung irreführender gradienten
DE112020005268T5 (de) Automatisches erzeugen von schema-annotationsdateien zum umwandeln von abfragen in natürlicher sprache in eine strukturierte abfragesprache
DE112019001533T5 (de) Erweiterung von trainingsdaten für die klassifikation von natürlicher sprache
Salvatore et al. Automated method of content analysis: A device for psychotherapy process research
DE112020005095T5 (de) Automatische trennung und extraktion von tabellendaten unter verwendung von maschinellem lernen
DE112012005177B4 (de) Erzeugens eines Verarbeitungsmodells für natürliche Sprache für einen Informationsbereich
DE112017007530T5 (de) Entitätsmodell-erstellung
DE112018005459T5 (de) Datenanonymisierung
DE102019000294A1 (de) Erstellen unternehmensspezifischer Wissensgraphen
DE112021001986T5 (de) Verfahren und System zum Verarbeiten von Datenaufzeichnungen
US12008313B2 (en) Medical data verification method and electronic device
DE112018005167T5 (de) Aktualisieren von trainingsdaten
WO2021032824A1 (de) Verfahren und vorrichtung zur vorauswahl und ermittlung ähnlicher dokumente
DE102014113870A1 (de) Identifizieren und Anzeigen von Beziehungen zwischen Kandidatenantworten
DE112018006488T5 (de) Automatisierte extraktion echokardiografischer messwerte aus medizinischen bildern
DE112018006345T5 (de) Abrufen von unterstützenden belegen für komplexe antworten
DE112021004694T5 (de) Trainieren eines frage-antwort-dialogsystems zum vermeiden von gegnerischen angriffen
DE112018005418T5 (de) Kognitive dokumentbild-digitalisierung
DE102013202365A1 (de) Herausziehen von informationen aus krankenakten
DE112021003680T5 (de) Deterministisch lernende videoszenenerkennung
DE112020000227T5 (de) Maschinelles lernen eines computermodells auf grundlage von korrelationenvon trainingsdaten mit leistungstrends
DE112019002235T5 (de) Einbinden eines wörterbuch-bearbeitungssystems in ein text mining
DE112021003583T5 (de) Sprachenübergreifendes transferlernen ohne trainingsbeispiele
DE112018005272T5 (de) Suchen von mehrsprachigen dokumenten auf grundlage einer extraktion der dokumentenstruktur

Legal Events

Date Code Title Description
R012 Request for examination validly filed
R082 Change of representative

Representative=s name: MEISSNER BOLTE PATENTANWAELTE RECHTSANWAELTE P, DE

R081 Change of applicant/patentee

Owner name: MERATIVE US L.P. (N.D.GES.D.STAATES DELAWARE),, US

Free format text: FORMER OWNER: INTERNATIONAL BUSINESS MACHINES CORPORATION, ARMONK, NY, US

R002 Refusal decision in examination/registration proceedings
R003 Refusal decision now final