JP2022545062A - オントロジーベースの概念埋め込みモデルを使用した自然言語処理 - Google Patents

オントロジーベースの概念埋め込みモデルを使用した自然言語処理 Download PDF

Info

Publication number
JP2022545062A
JP2022545062A JP2022508970A JP2022508970A JP2022545062A JP 2022545062 A JP2022545062 A JP 2022545062A JP 2022508970 A JP2022508970 A JP 2022508970A JP 2022508970 A JP2022508970 A JP 2022508970A JP 2022545062 A JP2022545062 A JP 2022545062A
Authority
JP
Japan
Prior art keywords
concept
concepts
computer
vector
vectors
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2022508970A
Other languages
English (en)
Japanese (ja)
Other versions
JP2022545062A5 (https=
Inventor
ブル、ブレンダン
フェルト、ポール、ルイス
ヒックス、アンドリュー
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Publication of JP2022545062A publication Critical patent/JP2022545062A/ja
Publication of JP2022545062A5 publication Critical patent/JP2022545062A5/ja
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3347Query execution using vector based model
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/211Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/247Thesauruses; Synonyms

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
JP2022508970A 2019-08-20 2020-08-13 オントロジーベースの概念埋め込みモデルを使用した自然言語処理 Pending JP2022545062A (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16/545,608 2019-08-20
US16/545,608 US11176323B2 (en) 2019-08-20 2019-08-20 Natural language processing using an ontology-based concept embedding model
PCT/IB2020/057621 WO2021033087A1 (en) 2019-08-20 2020-08-13 Natural language processing using an ontology-based concept embedding model

Publications (2)

Publication Number Publication Date
JP2022545062A true JP2022545062A (ja) 2022-10-25
JP2022545062A5 JP2022545062A5 (https=) 2023-08-21

Family

ID=74646859

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2022508970A Pending JP2022545062A (ja) 2019-08-20 2020-08-13 オントロジーベースの概念埋め込みモデルを使用した自然言語処理

Country Status (6)

Country Link
US (1) US11176323B2 (https=)
JP (1) JP2022545062A (https=)
CN (1) CN114341862A (https=)
DE (1) DE112020003311T5 (https=)
GB (2) GB2616542A (https=)
WO (1) WO2021033087A1 (https=)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11217227B1 (en) 2019-11-08 2022-01-04 Suki AI, Inc. Systems and methods for generating disambiguated terms in automatically generated transcriptions including instructions within a particular knowledge domain
US11538465B1 (en) 2019-11-08 2022-12-27 Suki AI, Inc. Systems and methods to facilitate intent determination of a command by grouping terms based on context
US11354513B2 (en) * 2020-02-06 2022-06-07 Adobe Inc. Automated identification of concept labels for a text fragment
US11416684B2 (en) 2020-02-06 2022-08-16 Adobe Inc. Automated identification of concept labels for a set of documents
US11494562B2 (en) * 2020-05-14 2022-11-08 Optum Technology, Inc. Method, apparatus and computer program product for generating text strings
US11645526B2 (en) * 2020-06-25 2023-05-09 International Business Machines Corporation Learning neuro-symbolic multi-hop reasoning rules over text
US20220172040A1 (en) * 2020-11-30 2022-06-02 Microsoft Technology Licensing, Llc Training a machine-learned model based on feedback
US12562244B2 (en) * 2021-03-01 2026-02-24 International Business Machines Corporation Combining domain-specific ontologies for language processing
JP7761880B2 (ja) * 2021-03-16 2025-10-29 公立大学法人会津大学 モデル推論プログラム、情報処理装置及びモデル推論方法
US11868381B2 (en) * 2021-03-29 2024-01-09 Google Llc Systems and methods for training language models to reason over tables
CN113420117B (zh) * 2021-06-23 2023-10-20 北京交通大学 一种基于多元特征融合的突发事件分类方法
CN113779196B (zh) * 2021-09-07 2024-02-13 大连大学 一种融合多层次信息的海关同义词识别方法
CN114003688B (zh) * 2021-10-14 2025-07-29 咪咕文化科技有限公司 问答数据的查询方法、装置、设备以及存储介质
US11954619B1 (en) * 2022-01-12 2024-04-09 Trueblue, Inc. Analysis and processing of skills related data from a communications session with improved latency
CN114782722B (zh) * 2022-04-29 2023-02-03 北京百度网讯科技有限公司 图文相似度的确定方法、装置及电子设备
CN115033671A (zh) * 2022-06-13 2022-09-09 联想(北京)有限公司 一种信息处理方法、装置和可读存储介质
CN115062699B (zh) * 2022-06-13 2025-08-12 中孚安全技术有限公司 一种基于Word2vec-FL的社区发现方法及系统
US20240211796A1 (en) * 2022-12-22 2024-06-27 Microsoft Technology Licensing, Llc Explanation of emergent semantics in embedding spaces via analogy
CN117009541B (zh) * 2023-06-07 2026-02-17 中电通商数字技术(上海)有限公司 临床医学检验知识库构建与应用方法、装置、设备及介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090157382A1 (en) * 2005-08-31 2009-06-18 Shmuel Bar Decision-support expert system and methods for real-time exploitation of documents in non-english languages
JP2017151838A (ja) * 2016-02-26 2017-08-31 トヨタ自動車株式会社 話題推定学習装置及び話題推定学習方法
JP2018195012A (ja) * 2017-05-16 2018-12-06 富士通株式会社 学習プログラム、学習方法、学習装置、及び変換パラメータ製造方法
JP2019053730A (ja) * 2017-09-12 2019-04-04 ネイバー コーポレーションNAVER Corporation 文書のカテゴリ分類のためのディープラーニング学習方法およびそのシステム
US20190130282A1 (en) * 2017-10-31 2019-05-02 Microsoft Technology Licensing, Llc Distant Supervision for Entity Linking with Filtering of Noise

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8027977B2 (en) * 2007-06-20 2011-09-27 Microsoft Corporation Recommending content using discriminatively trained document similarity
US8977537B2 (en) 2011-06-24 2015-03-10 Microsoft Technology Licensing, Llc Hierarchical models for language modeling
SG11201406534QA (en) * 2012-04-11 2014-11-27 Univ Singapore Methods, apparatuses and computer-readable mediums for organizing data relating to a product
US9519859B2 (en) * 2013-09-06 2016-12-13 Microsoft Technology Licensing, Llc Deep structured semantic model produced using click-through data
US9477752B1 (en) 2013-09-30 2016-10-25 Verint Systems Inc. Ontology administration and application to enhance communication data analytics
US20150127323A1 (en) * 2013-11-04 2015-05-07 Xerox Corporation Refining inference rules with temporal event clustering
US9672814B2 (en) 2015-05-08 2017-06-06 International Business Machines Corporation Semi-supervised learning of word embeddings
CN105260488B (zh) 2015-11-30 2018-10-02 哈尔滨工业大学 一种用于语义理解的文本序列迭代方法
US10740678B2 (en) * 2016-03-31 2020-08-11 International Business Machines Corporation Concept hierarchies
US10169454B2 (en) 2016-05-17 2019-01-01 Xerox Corporation Unsupervised ontology-based graph extraction from texts
US20180101773A1 (en) * 2016-10-07 2018-04-12 Futurewei Technologies, Inc. Apparatus and method for spatial processing of concepts
CN108268883B (zh) * 2016-12-31 2021-05-07 上海交通大学 基于开放数据的移动端信息模板自构建系统
US12411880B2 (en) * 2017-02-16 2025-09-09 Globality, Inc. Intelligent matching system with ontology-aided relation extraction
CN110352417B (zh) * 2017-03-06 2024-02-02 三菱电机株式会社 本体构建辅助装置
EP3385862A1 (en) * 2017-04-03 2018-10-10 Siemens Aktiengesellschaft A method and apparatus for performing hierarchical entity classification
US10963501B1 (en) * 2017-04-29 2021-03-30 Veritas Technologies Llc Systems and methods for generating a topic tree for digital information
US11488713B2 (en) 2017-08-15 2022-11-01 Computer Technology Associates, Inc. Disease specific ontology-guided rule engine and machine learning for enhanced critical care decision support
US10817676B2 (en) * 2017-12-27 2020-10-27 Sdl Inc. Intelligent routing services and systems
WO2019132685A1 (ru) * 2017-12-29 2019-07-04 Общество С Ограниченной Ответственностью "Интеллоджик" Способ и система поддержки принятия врачебных решений
CN108717574B (zh) 2018-03-26 2021-09-21 浙江大学 一种基于连词标记和强化学习的自然语言推理方法
US10817657B2 (en) * 2018-12-26 2020-10-27 Nokia Solutions And Networks Oy Determination of field types in tabular data
CN110134943B (zh) 2019-04-03 2023-04-18 平安科技(深圳)有限公司 领域本体生成方法、装置、设备及介质
US10902203B2 (en) * 2019-04-23 2021-01-26 Oracle International Corporation Named entity disambiguation using entity distance in a knowledge graph
US11126647B2 (en) * 2019-12-13 2021-09-21 CS Disco, Inc. System and method for hierarchically organizing documents based on document portions

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090157382A1 (en) * 2005-08-31 2009-06-18 Shmuel Bar Decision-support expert system and methods for real-time exploitation of documents in non-english languages
JP2017151838A (ja) * 2016-02-26 2017-08-31 トヨタ自動車株式会社 話題推定学習装置及び話題推定学習方法
JP2018195012A (ja) * 2017-05-16 2018-12-06 富士通株式会社 学習プログラム、学習方法、学習装置、及び変換パラメータ製造方法
JP2019053730A (ja) * 2017-09-12 2019-04-04 ネイバー コーポレーションNAVER Corporation 文書のカテゴリ分類のためのディープラーニング学習方法およびそのシステム
US20190130282A1 (en) * 2017-10-31 2019-05-02 Microsoft Technology Licensing, Llc Distant Supervision for Entity Linking with Filtering of Noise

Also Published As

Publication number Publication date
DE112020003311T5 (de) 2022-03-31
US11176323B2 (en) 2021-11-16
GB2601697A (en) 2022-06-08
US20210056168A1 (en) 2021-02-25
GB202308265D0 (en) 2023-07-19
GB202203106D0 (en) 2022-04-20
CN114341862A (zh) 2022-04-12
WO2021033087A1 (en) 2021-02-25
GB2616542A (en) 2023-09-13

Similar Documents

Publication Publication Date Title
JP2022545062A (ja) オントロジーベースの概念埋め込みモデルを使用した自然言語処理
US11892998B2 (en) Efficient embedding table storage and lookup
US10679345B2 (en) Automatic contour annotation of medical images based on correlations with medical reports
US10936635B2 (en) Context-based generation of semantically-similar phrases
US11514691B2 (en) Generating training sets to train machine learning models
CN105760417B (zh) 基于个性化用户模型和情境的认知交互式搜索的方法和系统
KR102636493B1 (ko) 의료 데이터 검증 방법, 장치 및 전자 기기
US10929383B2 (en) Method and system for improving training data understanding in natural language processing
US11003701B2 (en) Dynamic faceted search on a document corpus
US10558756B2 (en) Unsupervised information extraction dictionary creation
US20210319054A1 (en) Encoding entity representations for cross-document coreference
US20170371955A1 (en) System and method for precise domain question and answer generation for use as ground truth
US20200286596A1 (en) Generating and managing clinical studies using a knowledge base
AU2015204283A1 (en) Text mining system and tool
US20170068726A1 (en) Context based passage retreival and scoring in a question answering system
US11222165B1 (en) Sliding window to detect entities in corpus using natural language processing
US10558747B2 (en) Unsupervised information extraction dictionary creation
US11544312B2 (en) Descriptor uniqueness for entity clustering
US20170371956A1 (en) System and method for precise domain question and answer generation for use as ground truth
CN110442877A (zh) 使用机器人规划作为平行语言语料库
US11422798B2 (en) Context-based word embedding for programming artifacts
US11275796B2 (en) Dynamic faceted search on a document corpus
US20180025274A1 (en) Dynamic threshold filtering for watched questions
US10282066B2 (en) Dynamic threshold filtering for watched questions
US12027070B2 (en) Cognitive framework for identification of questions and answers

Legal Events

Date Code Title Description
RD04 Notification of resignation of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7424

Effective date: 20220518

A711 Notification of change in applicant

Free format text: JAPANESE INTERMEDIATE CODE: A711

Effective date: 20230710

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20230809

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20230809

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20240911

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20241105

A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 20250430