CN114341862A - 使用基于本体的概念嵌入模型的自然语言处理 - Google Patents

使用基于本体的概念嵌入模型的自然语言处理 Download PDF

Info

Publication number
CN114341862A
CN114341862A CN202080058467.5A CN202080058467A CN114341862A CN 114341862 A CN114341862 A CN 114341862A CN 202080058467 A CN202080058467 A CN 202080058467A CN 114341862 A CN114341862 A CN 114341862A
Authority
CN
China
Prior art keywords
concept
vector
computer
concepts
vectors
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202080058467.5A
Other languages
English (en)
Chinese (zh)
Inventor
B·布尔
P·L·费尔特
A·希克斯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Maredif Usa
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Publication of CN114341862A publication Critical patent/CN114341862A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3347Query execution using vector based model
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/211Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/247Thesauruses; Synonyms

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
CN202080058467.5A 2019-08-20 2020-08-13 使用基于本体的概念嵌入模型的自然语言处理 Pending CN114341862A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16/545,608 2019-08-20
US16/545,608 US11176323B2 (en) 2019-08-20 2019-08-20 Natural language processing using an ontology-based concept embedding model
PCT/IB2020/057621 WO2021033087A1 (en) 2019-08-20 2020-08-13 Natural language processing using an ontology-based concept embedding model

Publications (1)

Publication Number Publication Date
CN114341862A true CN114341862A (zh) 2022-04-12

Family

ID=74646859

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202080058467.5A Pending CN114341862A (zh) 2019-08-20 2020-08-13 使用基于本体的概念嵌入模型的自然语言处理

Country Status (6)

Country Link
US (1) US11176323B2 (https=)
JP (1) JP2022545062A (https=)
CN (1) CN114341862A (https=)
DE (1) DE112020003311T5 (https=)
GB (2) GB2616542A (https=)
WO (1) WO2021033087A1 (https=)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117009541A (zh) * 2023-06-07 2023-11-07 中电通商数字技术(上海)有限公司 临床医学检验知识库构建与应用方法、装置、设备及介质

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11217227B1 (en) 2019-11-08 2022-01-04 Suki AI, Inc. Systems and methods for generating disambiguated terms in automatically generated transcriptions including instructions within a particular knowledge domain
US11538465B1 (en) 2019-11-08 2022-12-27 Suki AI, Inc. Systems and methods to facilitate intent determination of a command by grouping terms based on context
US11354513B2 (en) * 2020-02-06 2022-06-07 Adobe Inc. Automated identification of concept labels for a text fragment
US11416684B2 (en) 2020-02-06 2022-08-16 Adobe Inc. Automated identification of concept labels for a set of documents
US11494562B2 (en) * 2020-05-14 2022-11-08 Optum Technology, Inc. Method, apparatus and computer program product for generating text strings
US11645526B2 (en) * 2020-06-25 2023-05-09 International Business Machines Corporation Learning neuro-symbolic multi-hop reasoning rules over text
US20220172040A1 (en) * 2020-11-30 2022-06-02 Microsoft Technology Licensing, Llc Training a machine-learned model based on feedback
US12562244B2 (en) * 2021-03-01 2026-02-24 International Business Machines Corporation Combining domain-specific ontologies for language processing
JP7761880B2 (ja) * 2021-03-16 2025-10-29 公立大学法人会津大学 モデル推論プログラム、情報処理装置及びモデル推論方法
US11868381B2 (en) * 2021-03-29 2024-01-09 Google Llc Systems and methods for training language models to reason over tables
CN113420117B (zh) * 2021-06-23 2023-10-20 北京交通大学 一种基于多元特征融合的突发事件分类方法
CN113779196B (zh) * 2021-09-07 2024-02-13 大连大学 一种融合多层次信息的海关同义词识别方法
CN114003688B (zh) * 2021-10-14 2025-07-29 咪咕文化科技有限公司 问答数据的查询方法、装置、设备以及存储介质
US11954619B1 (en) * 2022-01-12 2024-04-09 Trueblue, Inc. Analysis and processing of skills related data from a communications session with improved latency
CN114782722B (zh) * 2022-04-29 2023-02-03 北京百度网讯科技有限公司 图文相似度的确定方法、装置及电子设备
CN115033671A (zh) * 2022-06-13 2022-09-09 联想(北京)有限公司 一种信息处理方法、装置和可读存储介质
CN115062699B (zh) * 2022-06-13 2025-08-12 中孚安全技术有限公司 一种基于Word2vec-FL的社区发现方法及系统
US20240211796A1 (en) * 2022-12-22 2024-06-27 Microsoft Technology Licensing, Llc Explanation of emergent semantics in embedding spaces via analogy

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090157382A1 (en) * 2005-08-31 2009-06-18 Shmuel Bar Decision-support expert system and methods for real-time exploitation of documents in non-english languages
US9477752B1 (en) * 2013-09-30 2016-10-25 Verint Systems Inc. Ontology administration and application to enhance communication data analytics
CN108268883A (zh) * 2016-12-31 2018-07-10 上海交通大学 基于开放数据的移动端信息模板自构建系统
US20190130282A1 (en) * 2017-10-31 2019-05-02 Microsoft Technology Licensing, Llc Distant Supervision for Entity Linking with Filtering of Noise

Family Cites Families (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8027977B2 (en) * 2007-06-20 2011-09-27 Microsoft Corporation Recommending content using discriminatively trained document similarity
US8977537B2 (en) 2011-06-24 2015-03-10 Microsoft Technology Licensing, Llc Hierarchical models for language modeling
SG11201406534QA (en) * 2012-04-11 2014-11-27 Univ Singapore Methods, apparatuses and computer-readable mediums for organizing data relating to a product
US9519859B2 (en) * 2013-09-06 2016-12-13 Microsoft Technology Licensing, Llc Deep structured semantic model produced using click-through data
US20150127323A1 (en) * 2013-11-04 2015-05-07 Xerox Corporation Refining inference rules with temporal event clustering
US9672814B2 (en) 2015-05-08 2017-06-06 International Business Machines Corporation Semi-supervised learning of word embeddings
CN105260488B (zh) 2015-11-30 2018-10-02 哈尔滨工业大学 一种用于语义理解的文本序列迭代方法
JP6549500B2 (ja) * 2016-02-26 2019-07-24 トヨタ自動車株式会社 話題推定学習装置及び話題推定学習方法
US10740678B2 (en) * 2016-03-31 2020-08-11 International Business Machines Corporation Concept hierarchies
US10169454B2 (en) 2016-05-17 2019-01-01 Xerox Corporation Unsupervised ontology-based graph extraction from texts
US20180101773A1 (en) * 2016-10-07 2018-04-12 Futurewei Technologies, Inc. Apparatus and method for spatial processing of concepts
US12411880B2 (en) * 2017-02-16 2025-09-09 Globality, Inc. Intelligent matching system with ontology-aided relation extraction
CN110352417B (zh) * 2017-03-06 2024-02-02 三菱电机株式会社 本体构建辅助装置
EP3385862A1 (en) * 2017-04-03 2018-10-10 Siemens Aktiengesellschaft A method and apparatus for performing hierarchical entity classification
US10963501B1 (en) * 2017-04-29 2021-03-30 Veritas Technologies Llc Systems and methods for generating a topic tree for digital information
JP6957967B2 (ja) * 2017-05-16 2021-11-02 富士通株式会社 生成プログラム、生成方法、生成装置、及びパラメータ生成方法
US11488713B2 (en) 2017-08-15 2022-11-01 Computer Technology Associates, Inc. Disease specific ontology-guided rule engine and machine learning for enhanced critical care decision support
KR102060176B1 (ko) * 2017-09-12 2019-12-27 네이버 주식회사 문서의 카테고리 분류를 위한 딥러닝 학습 방법 및 그 시스템
US10817676B2 (en) * 2017-12-27 2020-10-27 Sdl Inc. Intelligent routing services and systems
WO2019132685A1 (ru) * 2017-12-29 2019-07-04 Общество С Ограниченной Ответственностью "Интеллоджик" Способ и система поддержки принятия врачебных решений
CN108717574B (zh) 2018-03-26 2021-09-21 浙江大学 一种基于连词标记和强化学习的自然语言推理方法
US10817657B2 (en) * 2018-12-26 2020-10-27 Nokia Solutions And Networks Oy Determination of field types in tabular data
CN110134943B (zh) 2019-04-03 2023-04-18 平安科技(深圳)有限公司 领域本体生成方法、装置、设备及介质
US10902203B2 (en) * 2019-04-23 2021-01-26 Oracle International Corporation Named entity disambiguation using entity distance in a knowledge graph
US11126647B2 (en) * 2019-12-13 2021-09-21 CS Disco, Inc. System and method for hierarchically organizing documents based on document portions

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090157382A1 (en) * 2005-08-31 2009-06-18 Shmuel Bar Decision-support expert system and methods for real-time exploitation of documents in non-english languages
US9477752B1 (en) * 2013-09-30 2016-10-25 Verint Systems Inc. Ontology administration and application to enhance communication data analytics
CN108268883A (zh) * 2016-12-31 2018-07-10 上海交通大学 基于开放数据的移动端信息模板自构建系统
US20190130282A1 (en) * 2017-10-31 2019-05-02 Microsoft Technology Licensing, Llc Distant Supervision for Entity Linking with Filtering of Noise

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
PRODROMOS KOLYVAKIS ET AL.: "Biomedical ontology alignment: an approach based on representation learning", JOURNAL OF BIOMEDICAL SEMANTICS, vol. 9, 15 August 2018 (2018-08-15), pages 1 - 20 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117009541A (zh) * 2023-06-07 2023-11-07 中电通商数字技术(上海)有限公司 临床医学检验知识库构建与应用方法、装置、设备及介质
CN117009541B (zh) * 2023-06-07 2026-02-17 中电通商数字技术(上海)有限公司 临床医学检验知识库构建与应用方法、装置、设备及介质

Also Published As

Publication number Publication date
DE112020003311T5 (de) 2022-03-31
US11176323B2 (en) 2021-11-16
JP2022545062A (ja) 2022-10-25
GB2601697A (en) 2022-06-08
US20210056168A1 (en) 2021-02-25
GB202308265D0 (en) 2023-07-19
GB202203106D0 (en) 2022-04-20
WO2021033087A1 (en) 2021-02-25
GB2616542A (en) 2023-09-13

Similar Documents

Publication Publication Date Title
CN114341862A (zh) 使用基于本体的概念嵌入模型的自然语言处理
CN112136125B (zh) 用于自然语言分类的训练数据扩展
US11714840B2 (en) Method and apparatus for information query and storage medium
US11892998B2 (en) Efficient embedding table storage and lookup
US9886501B2 (en) Contextual content graph for automatic, unsupervised summarization of content
KR102636493B1 (ko) 의료 데이터 검증 방법, 장치 및 전자 기기
US9881082B2 (en) System and method for automatic, unsupervised contextualized content summarization of single and multiple documents
US10929383B2 (en) Method and system for improving training data understanding in natural language processing
CN112560479A (zh) 摘要抽取模型训练方法、摘要抽取方法、装置和电子设备
US20170011289A1 (en) Learning word embedding using morphological knowledge
WO2020042925A1 (zh) 人机对话方法、装置、电子设备及计算机可读介质
CN110457708B (zh) 基于人工智能的词汇挖掘方法、装置、服务器及存储介质
US20170132288A1 (en) Extracting and Denoising Concept Mentions Using Distributed Representations of Concepts
WO2023159758A1 (zh) 数据增强方法和装置、电子设备、存储介质
US20170091164A1 (en) Dynamic Context Aware Abbreviation Detection and Annotation
CN105760363B (zh) 文本文件的词义消歧方法及装置
US20130262083A1 (en) Method and Apparatus for Processing Text with Variations in Vocabulary Usage
WO2022269510A1 (en) Method and system for interactive searching based on semantic similarity of semantic representations of text objects
CN110705304B (zh) 一种属性词提取方法
CN114925185A (zh) 交互方法、模型的训练方法、装置、设备及介质
CN115552389A (zh) 用于自然语言处理的概念歧义消除
CN112800244A (zh) 一种中医药及民族医药知识图谱的构建方法
CN114722833A (zh) 一种语义分类方法及装置
CN112015989A (zh) 用于推送信息的方法和装置
Mühlenberg et al. Towards information extraction from ISR reports for decision support using a two-stage learning-based approach

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20230506

Address after: Michigan, USA

Applicant after: Maredif USA

Address before: USA New York

Applicant before: International Business Machines Corp.

TA01 Transfer of patent application right
AD01 Patent right deemed abandoned

Effective date of abandoning: 20250829