GB2616542A - Natural language processing using an ontology-based concept embedding model - Google Patents

Natural language processing using an ontology-based concept embedding model

Info

Publication number
GB2616542A
Authority
GB
United Kingdom
Prior art keywords
concept
vectors
concepts
vector
computer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
GB2308265.4A
Other languages
English (en)
Other versions
GB202308265D0 (en)
Inventor
Bull Brendan
Lewis Felt Paul
Hicks Andrew
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Merative US LP
Original Assignee
Merative US LP
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2019-08-20
Filing date
2020-08-13
Publication date
2023-09-13
Application filed by Merative US LP
Publication of GB202308265D0
Publication of GB2616542A
Legal status: Withdrawn (current)

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/284 Lexical analysis, e.g. tokenisation or collocates
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/334 Query execution
    • G06F16/3347 Query execution using vector based model
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/205 Parsing
    • G06F40/211 Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/237 Lexical tools
    • G06F40/247 Thesauruses; Synonyms

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
GB2308265.4A 2019-08-20 2020-08-13 Natural language processing using an ontology-based concept embedding model Withdrawn GB2616542A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US16/545,608 US11176323B2 (en) 2019-08-20 2019-08-20 Natural language processing using an ontology-based concept embedding model
GB2203106.6A GB2601697A (en) 2019-08-20 2020-08-13 Natural language processing using an ontology-based concept embedding model

Publications (2)

Publication Number Publication Date
GB202308265D0 (en) 2023-07-19
GB2616542A (en) 2023-09-13

Family

ID=74646859

Family Applications (2)

Application Number Title Priority Date Filing Date
GB2308265.4A Withdrawn GB2616542A (en) 2019-08-20 2020-08-13 Natural language processing using an ontology-based concept embedding model
GB2203106.6A Withdrawn GB2601697A (en) 2019-08-20 2020-08-13 Natural language processing using an ontology-based concept embedding model

Country Status (6)

Country Link
US (1) US11176323B2
JP (1) JP2022545062A
CN (1) CN114341862A
DE (1) DE112020003311T5
GB (2) GB2616542A
WO (1) WO2021033087A1

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11217227B1 (en) 2019-11-08 2022-01-04 Suki AI, Inc. Systems and methods for generating disambiguated terms in automatically generated transcriptions including instructions within a particular knowledge domain
US11538465B1 (en) 2019-11-08 2022-12-27 Suki AI, Inc. Systems and methods to facilitate intent determination of a command by grouping terms based on context
US11354513B2 (en) * 2020-02-06 2022-06-07 Adobe Inc. Automated identification of concept labels for a text fragment
US11416684B2 (en) 2020-02-06 2022-08-16 Adobe Inc. Automated identification of concept labels for a set of documents
US11494562B2 (en) * 2020-05-14 2022-11-08 Optum Technology, Inc. Method, apparatus and computer program product for generating text strings
US11645526B2 (en) * 2020-06-25 2023-05-09 International Business Machines Corporation Learning neuro-symbolic multi-hop reasoning rules over text
US20220172040A1 (en) * 2020-11-30 2022-06-02 Microsoft Technology Licensing, Llc Training a machine-learned model based on feedback
US12562244B2 (en) * 2021-03-01 2026-02-24 International Business Machines Corporation Combining domain-specific ontologies for language processing
JP7761880B2 * 2021-03-16 2025-10-29 公立大学法人会津大学 Model inference program, information processing device, and model inference method
US11868381B2 (en) * 2021-03-29 2024-01-09 Google Llc Systems and methods for training language models to reason over tables
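CN113420117B * 2021-06-23 2023-10-20 北京交通大学 An emergency event classification method based on multi-feature fusion
CN113779196B * 2021-09-07 2024-02-13 大连大学 A customs synonym identification method fusing multi-level information
CN114003688B * 2021-10-14 2025-07-29 咪咕文化科技有限公司 Query method, apparatus, device, and storage medium for question-and-answer data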
US11954619B1 (en) * 2022-01-12 2024-04-09 Trueblue, Inc. Analysis and processing of skills related data from a communications session with improved latency
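CN114782722B * 2022-04-29 2023-02-03 北京百度网讯科技有限公司 Method and apparatus for determining image-text similarity, and electronic device
CN115033671A * 2022-06-13 2022-09-09 联想(北京)有限公司 An information processing method, apparatus, and readable storage medium
CN115062699B * 2022-06-13 2025-08-12 中孚安全技术有限公司 A community detection method and system based on Word2vec-FL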
US20240211796A1 (en) * 2022-12-22 2024-06-27 Microsoft Technology Licensing, Llc Explanation of emergent semantics in embedding spaces via analogy
CN117009541B * 2023-06-07 2026-02-17 中电通商数字技术(上海)有限公司 Method, apparatus, device, and medium for constructing and applying a clinical laboratory medicine knowledge base

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018064969A1 (en) * 2016-10-07 2018-04-12 Huawei Technologies Co., Ltd. Apparatus and method for spatial processing of concepts

Family Cites Families (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1919771A4 * 2005-08-31 2010-06-09 Intuview Ltd EXPERT DECISION SUPPORT SYSTEM AND REAL-TIME METHODS OF OPERATING DOCUMENTS IN LANGUAGES OTHER THAN ENGLISH
US8027977B2 (en) * 2007-06-20 2011-09-27 Microsoft Corporation Recommending content using discriminatively trained document similarity
US8977537B2 (en) 2011-06-24 2015-03-10 Microsoft Technology Licensing, Llc Hierarchical models for language modeling
SG11201406534QA (en) * 2012-04-11 2014-11-27 Univ Singapore Methods, apparatuses and computer-readable mediums for organizing data relating to a product
US9519859B2 (en) * 2013-09-06 2016-12-13 Microsoft Technology Licensing, Llc Deep structured semantic model produced using click-through data
US9477752B1 (en) 2013-09-30 2016-10-25 Verint Systems Inc. Ontology administration and application to enhance communication data analytics
US20150127323A1 (en) * 2013-11-04 2015-05-07 Xerox Corporation Refining inference rules with temporal event clustering
US9672814B2 (en) 2015-05-08 2017-06-06 International Business Machines Corporation Semi-supervised learning of word embeddings
CN105260488B 2015-11-30 2018-10-02 哈尔滨工业大学 A text sequence iteration method for semantic understanding
JP6549500B2 * 2016-02-26 2019-07-24 トヨタ自動車株式会社 Topic estimation learning device and topic estimation learning method
US10740678B2 (en) * 2016-03-31 2020-08-11 International Business Machines Corporation Concept hierarchies
US10169454B2 (en) 2016-05-17 2019-01-01 Xerox Corporation Unsupervised ontology-based graph extraction from texts
CN108268883B * 2016-12-31 2021-05-07 上海交通大学 Open-data-based self-construction system for mobile information templates
US12411880B2 (en) * 2017-02-16 2025-09-09 Globality, Inc. Intelligent matching system with ontology-aided relation extraction
CN110352417B * 2017-03-06 2024-02-02 三菱电机株式会社 Ontology construction assistance device
EP3385862A1 (en) * 2017-04-03 2018-10-10 Siemens Aktiengesellschaft A method and apparatus for performing hierarchical entity classification
US10963501B1 (en) * 2017-04-29 2021-03-30 Veritas Technologies Llc Systems and methods for generating a topic tree for digital information
JP6957967B2 * 2017-05-16 2021-11-02 富士通株式会社 Generation program, generation method, generation device, and parameter generation method
US11488713B2 (en) 2017-08-15 2022-11-01 Computer Technology Associates, Inc. Disease specific ontology-guided rule engine and machine learning for enhanced critical care decision support
KR102060176B1 * 2017-09-12 2019-12-27 네이버 주식회사 Deep learning training method for document category classification and system therefor
US11250331B2 (en) * 2017-10-31 2022-02-15 Microsoft Technology Licensing, Llc Distant supervision for entity linking with filtering of noise
US10817676B2 (en) * 2017-12-27 2020-10-27 Sdl Inc. Intelligent routing services and systems
WO2019132685A1 * 2017-12-29 2019-07-04 Общество С Ограниченной Ответственностью "Интеллоджик" Method and system for supporting medical decision-making
CN108717574B 2018-03-26 2021-09-21 浙江大学 A natural language inference method based on conjunction marking and reinforcement learning
US10817657B2 (en) * 2018-12-26 2020-10-27 Nokia Solutions And Networks Oy Determination of field types in tabular data
CN110134943B 2019-04-03 2023-04-18 平安科技(深圳)有限公司 Domain ontology generation method, apparatus, device, and medium
US10902203B2 (en) * 2019-04-23 2021-01-26 Oracle International Corporation Named entity disambiguation using entity distance in a knowledge graph
US11126647B2 (en) * 2019-12-13 2021-09-21 CS Disco, Inc. System and method for hierarchically organizing documents based on document portions

Also Published As

Publication number Publication date
DE112020003311T5 (de) 2022-03-31
US11176323B2 (en) 2021-11-16
JP2022545062A (ja) 2022-10-25
GB2601697A (en) 2022-06-08
US20210056168A1 (en) 2021-02-25
GB202308265D0 (en) 2023-07-19
GB202203106D0 (en) 2022-04-20
CN114341862A (zh) 2022-04-12
WO2021033087A1 (en) 2021-02-25

Similar Documents

Publication Publication Date Title
US11176323B2 (en) Natural language processing using an ontology-based concept embedding model
US11514691B2 (en) Generating training sets to train machine learning models
US11892998B2 (en) Efficient embedding table storage and lookup
US10936635B2 (en) Context-based generation of semantically-similar phrases
US20190361977A1 (en) Training data expansion for natural language classification
US20200134422A1 (en) Relation extraction from text using machine learning
US10929383B2 (en) Method and system for improving training data understanding in natural language processing
US11636376B2 (en) Active learning for concept disambiguation
KR102636493B1 (en) Medical data verification method, apparatus, and electronic device
US20190303498A1 (en) Generation of knowledge graph responsive to query
US11003701B2 (en) Dynamic faceted search on a document corpus
US20210319054A1 (en) Encoding entity representations for cross-document coreference
US20200286596A1 (en) Generating and managing clinical studies using a knowledge base
US11687808B2 (en) Artificial intelligence explaining for natural language processing
US11222165B1 (en) Sliding window to detect entities in corpus using natural language processing
US11640430B2 (en) Custom semantic search experience driven by an ontology
US11475211B1 (en) Elucidated natural language artifact recombination with contextual awareness
US20220318523A1 (en) Clause extraction using machine translation and natural language processing
US12386610B2 (en) Code modification management using machine learning
US20170039293A1 (en) Question answering system with data mining capabilities
US20220188349A1 (en) Visualization resonance for collaborative discourse
US12423507B2 (en) Elucidated natural language artifact recombination with contextual awareness
US11275796B2 (en) Dynamic faceted search on a document corpus
US12027070B2 (en) Cognitive framework for identification of questions and answers
GB2612423A (en) Automated system and method for hyper parameter tuning and retrofitting formulation

Legal Events

Date Code Title Description
COOA Change in applicant's name or ownership of the application

Owner name: MERATIVE US L.P.

Free format text: FORMER OWNER: INTERNATIONAL BUSINESS MACHINES CORPORATION

WAP Application withdrawn, taken to be withdrawn or refused ** after publication under section 16(1)