JP2022545062A - オントロジーベースの概念埋め込みモデルを使用した自然言語処理 - Google Patents
オントロジーベースの概念埋め込みモデルを使用した自然言語処理 Download PDFInfo
- Publication number
- JP2022545062A JP2022545062A JP2022508970A JP2022508970A JP2022545062A JP 2022545062 A JP2022545062 A JP 2022545062A JP 2022508970 A JP2022508970 A JP 2022508970A JP 2022508970 A JP2022508970 A JP 2022508970A JP 2022545062 A JP2022545062 A JP 2022545062A
- Authority
- JP
- Japan
- Prior art keywords
- concept
- concepts
- computer
- vector
- vectors
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/284—Lexical analysis, e.g. tokenisation or collocates
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3347—Query execution using vector based model
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/211—Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/237—Lexical tools
- G06F40/247—Thesauruses; Synonyms
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Machine Translation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/545,608 | 2019-08-20 | ||
| US16/545,608 US11176323B2 (en) | 2019-08-20 | 2019-08-20 | Natural language processing using an ontology-based concept embedding model |
| PCT/IB2020/057621 WO2021033087A1 (en) | 2019-08-20 | 2020-08-13 | Natural language processing using an ontology-based concept embedding model |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JP2022545062A true JP2022545062A (ja) | 2022-10-25 |
| JP2022545062A5 JP2022545062A5 (https=) | 2023-08-21 |
Family
ID=74646859
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2022508970A Pending JP2022545062A (ja) | 2019-08-20 | 2020-08-13 | オントロジーベースの概念埋め込みモデルを使用した自然言語処理 |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US11176323B2 (https=) |
| JP (1) | JP2022545062A (https=) |
| CN (1) | CN114341862A (https=) |
| DE (1) | DE112020003311T5 (https=) |
| GB (2) | GB2616542A (https=) |
| WO (1) | WO2021033087A1 (https=) |
Families Citing this family (19)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11217227B1 (en) | 2019-11-08 | 2022-01-04 | Suki AI, Inc. | Systems and methods for generating disambiguated terms in automatically generated transcriptions including instructions within a particular knowledge domain |
| US11538465B1 (en) | 2019-11-08 | 2022-12-27 | Suki AI, Inc. | Systems and methods to facilitate intent determination of a command by grouping terms based on context |
| US11354513B2 (en) * | 2020-02-06 | 2022-06-07 | Adobe Inc. | Automated identification of concept labels for a text fragment |
| US11416684B2 (en) | 2020-02-06 | 2022-08-16 | Adobe Inc. | Automated identification of concept labels for a set of documents |
| US11494562B2 (en) * | 2020-05-14 | 2022-11-08 | Optum Technology, Inc. | Method, apparatus and computer program product for generating text strings |
| US11645526B2 (en) * | 2020-06-25 | 2023-05-09 | International Business Machines Corporation | Learning neuro-symbolic multi-hop reasoning rules over text |
| US20220172040A1 (en) * | 2020-11-30 | 2022-06-02 | Microsoft Technology Licensing, Llc | Training a machine-learned model based on feedback |
| US12562244B2 (en) * | 2021-03-01 | 2026-02-24 | International Business Machines Corporation | Combining domain-specific ontologies for language processing |
| JP7761880B2 (ja) * | 2021-03-16 | 2025-10-29 | 公立大学法人会津大学 | モデル推論プログラム、情報処理装置及びモデル推論方法 |
| US11868381B2 (en) * | 2021-03-29 | 2024-01-09 | Google Llc | Systems and methods for training language models to reason over tables |
| CN113420117B (zh) * | 2021-06-23 | 2023-10-20 | 北京交通大学 | 一种基于多元特征融合的突发事件分类方法 |
| CN113779196B (zh) * | 2021-09-07 | 2024-02-13 | 大连大学 | 一种融合多层次信息的海关同义词识别方法 |
| CN114003688B (zh) * | 2021-10-14 | 2025-07-29 | 咪咕文化科技有限公司 | 问答数据的查询方法、装置、设备以及存储介质 |
| US11954619B1 (en) * | 2022-01-12 | 2024-04-09 | Trueblue, Inc. | Analysis and processing of skills related data from a communications session with improved latency |
| CN114782722B (zh) * | 2022-04-29 | 2023-02-03 | 北京百度网讯科技有限公司 | 图文相似度的确定方法、装置及电子设备 |
| CN115033671A (zh) * | 2022-06-13 | 2022-09-09 | 联想(北京)有限公司 | 一种信息处理方法、装置和可读存储介质 |
| CN115062699B (zh) * | 2022-06-13 | 2025-08-12 | 中孚安全技术有限公司 | 一种基于Word2vec-FL的社区发现方法及系统 |
| US20240211796A1 (en) * | 2022-12-22 | 2024-06-27 | Microsoft Technology Licensing, Llc | Explanation of emergent semantics in embedding spaces via analogy |
| CN117009541B (zh) * | 2023-06-07 | 2026-02-17 | 中电通商数字技术(上海)有限公司 | 临床医学检验知识库构建与应用方法、装置、设备及介质 |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20090157382A1 (en) * | 2005-08-31 | 2009-06-18 | Shmuel Bar | Decision-support expert system and methods for real-time exploitation of documents in non-english languages |
| JP2017151838A (ja) * | 2016-02-26 | 2017-08-31 | トヨタ自動車株式会社 | 話題推定学習装置及び話題推定学習方法 |
| JP2018195012A (ja) * | 2017-05-16 | 2018-12-06 | 富士通株式会社 | 学習プログラム、学習方法、学習装置、及び変換パラメータ製造方法 |
| JP2019053730A (ja) * | 2017-09-12 | 2019-04-04 | ネイバー コーポレーションNAVER Corporation | 文書のカテゴリ分類のためのディープラーニング学習方法およびそのシステム |
| US20190130282A1 (en) * | 2017-10-31 | 2019-05-02 | Microsoft Technology Licensing, Llc | Distant Supervision for Entity Linking with Filtering of Noise |
Family Cites Families (24)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8027977B2 (en) * | 2007-06-20 | 2011-09-27 | Microsoft Corporation | Recommending content using discriminatively trained document similarity |
| US8977537B2 (en) | 2011-06-24 | 2015-03-10 | Microsoft Technology Licensing, Llc | Hierarchical models for language modeling |
| SG11201406534QA (en) * | 2012-04-11 | 2014-11-27 | Univ Singapore | Methods, apparatuses and computer-readable mediums for organizing data relating to a product |
| US9519859B2 (en) * | 2013-09-06 | 2016-12-13 | Microsoft Technology Licensing, Llc | Deep structured semantic model produced using click-through data |
| US9477752B1 (en) | 2013-09-30 | 2016-10-25 | Verint Systems Inc. | Ontology administration and application to enhance communication data analytics |
| US20150127323A1 (en) * | 2013-11-04 | 2015-05-07 | Xerox Corporation | Refining inference rules with temporal event clustering |
| US9672814B2 (en) | 2015-05-08 | 2017-06-06 | International Business Machines Corporation | Semi-supervised learning of word embeddings |
| CN105260488B (zh) | 2015-11-30 | 2018-10-02 | 哈尔滨工业大学 | 一种用于语义理解的文本序列迭代方法 |
| US10740678B2 (en) * | 2016-03-31 | 2020-08-11 | International Business Machines Corporation | Concept hierarchies |
| US10169454B2 (en) | 2016-05-17 | 2019-01-01 | Xerox Corporation | Unsupervised ontology-based graph extraction from texts |
| US20180101773A1 (en) * | 2016-10-07 | 2018-04-12 | Futurewei Technologies, Inc. | Apparatus and method for spatial processing of concepts |
| CN108268883B (zh) * | 2016-12-31 | 2021-05-07 | 上海交通大学 | 基于开放数据的移动端信息模板自构建系统 |
| US12411880B2 (en) * | 2017-02-16 | 2025-09-09 | Globality, Inc. | Intelligent matching system with ontology-aided relation extraction |
| CN110352417B (zh) * | 2017-03-06 | 2024-02-02 | 三菱电机株式会社 | 本体构建辅助装置 |
| EP3385862A1 (en) * | 2017-04-03 | 2018-10-10 | Siemens Aktiengesellschaft | A method and apparatus for performing hierarchical entity classification |
| US10963501B1 (en) * | 2017-04-29 | 2021-03-30 | Veritas Technologies Llc | Systems and methods for generating a topic tree for digital information |
| US11488713B2 (en) | 2017-08-15 | 2022-11-01 | Computer Technology Associates, Inc. | Disease specific ontology-guided rule engine and machine learning for enhanced critical care decision support |
| US10817676B2 (en) * | 2017-12-27 | 2020-10-27 | Sdl Inc. | Intelligent routing services and systems |
| WO2019132685A1 (ru) * | 2017-12-29 | 2019-07-04 | Общество С Ограниченной Ответственностью "Интеллоджик" | Способ и система поддержки принятия врачебных решений |
| CN108717574B (zh) | 2018-03-26 | 2021-09-21 | 浙江大学 | 一种基于连词标记和强化学习的自然语言推理方法 |
| US10817657B2 (en) * | 2018-12-26 | 2020-10-27 | Nokia Solutions And Networks Oy | Determination of field types in tabular data |
| CN110134943B (zh) | 2019-04-03 | 2023-04-18 | 平安科技(深圳)有限公司 | 领域本体生成方法、装置、设备及介质 |
| US10902203B2 (en) * | 2019-04-23 | 2021-01-26 | Oracle International Corporation | Named entity disambiguation using entity distance in a knowledge graph |
| US11126647B2 (en) * | 2019-12-13 | 2021-09-21 | CS Disco, Inc. | System and method for hierarchically organizing documents based on document portions |
-
2019
- 2019-08-20 US US16/545,608 patent/US11176323B2/en not_active Expired - Fee Related
-
2020
- 2020-08-13 CN CN202080058467.5A patent/CN114341862A/zh active Pending
- 2020-08-13 GB GB2308265.4A patent/GB2616542A/en not_active Withdrawn
- 2020-08-13 JP JP2022508970A patent/JP2022545062A/ja active Pending
- 2020-08-13 WO PCT/IB2020/057621 patent/WO2021033087A1/en not_active Ceased
- 2020-08-13 DE DE112020003311.2T patent/DE112020003311T5/de not_active Ceased
- 2020-08-13 GB GB2203106.6A patent/GB2601697A/en not_active Withdrawn
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20090157382A1 (en) * | 2005-08-31 | 2009-06-18 | Shmuel Bar | Decision-support expert system and methods for real-time exploitation of documents in non-english languages |
| JP2017151838A (ja) * | 2016-02-26 | 2017-08-31 | トヨタ自動車株式会社 | 話題推定学習装置及び話題推定学習方法 |
| JP2018195012A (ja) * | 2017-05-16 | 2018-12-06 | 富士通株式会社 | 学習プログラム、学習方法、学習装置、及び変換パラメータ製造方法 |
| JP2019053730A (ja) * | 2017-09-12 | 2019-04-04 | ネイバー コーポレーションNAVER Corporation | 文書のカテゴリ分類のためのディープラーニング学習方法およびそのシステム |
| US20190130282A1 (en) * | 2017-10-31 | 2019-05-02 | Microsoft Technology Licensing, Llc | Distant Supervision for Entity Linking with Filtering of Noise |
Also Published As
| Publication number | Publication date |
|---|---|
| DE112020003311T5 (de) | 2022-03-31 |
| US11176323B2 (en) | 2021-11-16 |
| GB2601697A (en) | 2022-06-08 |
| US20210056168A1 (en) | 2021-02-25 |
| GB202308265D0 (en) | 2023-07-19 |
| GB202203106D0 (en) | 2022-04-20 |
| CN114341862A (zh) | 2022-04-12 |
| WO2021033087A1 (en) | 2021-02-25 |
| GB2616542A (en) | 2023-09-13 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP2022545062A (ja) | オントロジーベースの概念埋め込みモデルを使用した自然言語処理 | |
| US11892998B2 (en) | Efficient embedding table storage and lookup | |
| US10679345B2 (en) | Automatic contour annotation of medical images based on correlations with medical reports | |
| US10936635B2 (en) | Context-based generation of semantically-similar phrases | |
| US11514691B2 (en) | Generating training sets to train machine learning models | |
| CN105760417B (zh) | 基于个性化用户模型和情境的认知交互式搜索的方法和系统 | |
| KR102636493B1 (ko) | 의료 데이터 검증 방법, 장치 및 전자 기기 | |
| US10929383B2 (en) | Method and system for improving training data understanding in natural language processing | |
| US11003701B2 (en) | Dynamic faceted search on a document corpus | |
| US10558756B2 (en) | Unsupervised information extraction dictionary creation | |
| US20210319054A1 (en) | Encoding entity representations for cross-document coreference | |
| US20170371955A1 (en) | System and method for precise domain question and answer generation for use as ground truth | |
| US20200286596A1 (en) | Generating and managing clinical studies using a knowledge base | |
| AU2015204283A1 (en) | Text mining system and tool | |
| US20170068726A1 (en) | Context based passage retreival and scoring in a question answering system | |
| US11222165B1 (en) | Sliding window to detect entities in corpus using natural language processing | |
| US10558747B2 (en) | Unsupervised information extraction dictionary creation | |
| US11544312B2 (en) | Descriptor uniqueness for entity clustering | |
| US20170371956A1 (en) | System and method for precise domain question and answer generation for use as ground truth | |
| CN110442877A (zh) | 使用机器人规划作为平行语言语料库 | |
| US11422798B2 (en) | Context-based word embedding for programming artifacts | |
| US11275796B2 (en) | Dynamic faceted search on a document corpus | |
| US20180025274A1 (en) | Dynamic threshold filtering for watched questions | |
| US10282066B2 (en) | Dynamic threshold filtering for watched questions | |
| US12027070B2 (en) | Cognitive framework for identification of questions and answers |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| RD04 | Notification of resignation of power of attorney |
Free format text: JAPANESE INTERMEDIATE CODE: A7424 Effective date: 20220518 |
|
| A711 | Notification of change in applicant |
Free format text: JAPANESE INTERMEDIATE CODE: A711 Effective date: 20230710 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20230809 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20230809 |
|
| A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20240911 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20241105 |
|
| A02 | Decision of refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A02 Effective date: 20250430 |