JP2024535733A - Generating similarity scores between different document schemas - Google Patents
Generating similarity scores between different document schemas
- Publication number
- JP2024535733A
- Authority
- JP
- Japan
- Prior art keywords
- document
- documents
- queries
- schema
- configuration
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
- G06F16/256—Integrating or interfacing systems involving database management systems in federated or virtual databases
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/316—Indexing structures
- G06F16/319—Inverted lists
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Computation (AREA)
- Evolutionary Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/464,534 | 2021-09-01 | | |
| US17/464,534 (published as US20230066143A1) | 2021-09-01 | 2021-09-01 | Generating similarity scores between different document schemas |
| PCT/US2022/042177 (published as WO2023034397A1) | 2021-09-01 | 2022-08-31 | Generating similarity scores between different document schemas |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JP2024535733A (ja) | 2024-10-02 |
| JP2024535733A5 | 2025-07-11 |
Family
ID=83508834
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2024513780A (pending), published as JP2024535733A (ja) | 2021-09-01 | 2022-08-31 | Generating similarity scores between different document schemas |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US20230066143A1 |
| EP (1) | EP4396694A1 |
| JP (1) | JP2024535733A |
| CN (1) | CN118103830A |
| WO (1) | WO2023034397A1 |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12248504B2 (en) | 2023-05-31 | 2025-03-11 | Docusign, Inc. | Document container with candidate documents |
| CN120994760B (zh) * | 2025-10-16 | 2026-02-10 | 深圳市蓝凌软件股份有限公司 | Document retrieval method based on multi-field information and outlier detection |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2002297605A (ja) * | 2001-03-30 | 2002-10-11 | Toshiba Corp | Structured document retrieval method, structured document retrieval apparatus, and program |
| US20080114740A1 (en) * | 2006-11-14 | 2008-05-15 | Xcential Group Llc | System and method for maintaining conformance of electronic document structure with multiple, variant document structure models |
| JP2009223781A (ja) * | 2008-03-18 | 2009-10-01 | Nec Corp | Information recommendation device, information recommendation system, information recommendation method, program, and recording medium |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7882122B2 (en) * | 2005-03-18 | 2011-02-01 | Capital Source Far East Limited | Remote access of heterogeneous data |
| US20060218158A1 (en) * | 2005-03-23 | 2006-09-28 | Gunther Stuhec | Translation of information between schemas |
| WO2008083504A1 (en) * | 2007-01-10 | 2008-07-17 | Nick Koudas | Method and system for information discovery and text analysis |
| US8954469B2 (en) * | 2007-03-14 | 2015-02-10 | Vcvciii Llc | Query templates and labeled search tip system, methods, and techniques |
| US11068657B2 (en) * | 2010-06-28 | 2021-07-20 | Skyscanner Limited | Natural language question answering system and method based on deep semantics |
| US8346792B1 (en) * | 2010-11-09 | 2013-01-01 | Google Inc. | Query generation using structural similarity between documents |
| US20140200879A1 (en) * | 2013-01-11 | 2014-07-17 | Brian Sakhai | Method and System for Rating Food Items |
| US20140208779A1 (en) * | 2013-01-30 | 2014-07-31 | Fresh Food Solutions Llc | Systems and methods for extending the fresh life of perishables in the retail and vending setting |
| US10956415B2 (en) * | 2016-09-26 | 2021-03-23 | Splunk Inc. | Generating a subquery for an external data system using a configuration file |
| US10489466B1 (en) * | 2016-09-29 | 2019-11-26 | EMC IP Holding Company LLC | Method and system for document similarity analysis based on weak transitive relation of similarity |
| US11182437B2 (en) * | 2017-10-26 | 2021-11-23 | International Business Machines Corporation | Hybrid processing of disjunctive and conjunctive conditions of a search query for a similarity search |
| US11416448B1 (en) * | 2019-08-14 | 2022-08-16 | Amazon Technologies, Inc. | Asynchronous searching of protected areas of a provider network |
| US11651156B2 (en) * | 2020-05-07 | 2023-05-16 | Optum Technology, Inc. | Contextual document summarization with semantic intelligence |
| US20220245155A1 (en) * | 2021-02-04 | 2022-08-04 | Yext, Inc. | Distributed multi-source data processing and publishing platform |
| US11620319B2 (en) * | 2021-05-13 | 2023-04-04 | Capital One Services, Llc | Search platform for unstructured interaction summaries |
- 2021
  - 2021-09-01: US application US17/464,534, published as US20230066143A1 (en); status: Pending (active)
- 2022
  - 2022-08-31: WO application PCT/US2022/042177, published as WO2023034397A1 (en); status: Ceased (not active)
  - 2022-08-31: JP application JP2024513780A, published as JP2024535733A (ja); status: Pending (active)
  - 2022-08-31: CN application CN202280068598.0A, published as CN118103830A (zh); status: Pending (active)
  - 2022-08-31: EP application EP22783095.7A, published as EP4396694A1 (en); status: Pending (active)
Also Published As
| Publication number | Publication date |
|---|---|
| US20230066143A1 (en) | 2023-03-02 |
| WO2023034397A1 (en) | 2023-03-09 |
| CN118103830A (zh) | 2024-05-28 |
| EP4396694A1 (en) | 2024-07-10 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11394769B2 (en) | | Framework for the deployment of event-based applications |
| US10140352B2 (en) | | Interfacing with a relational database for multi-dimensional analysis via a spreadsheet application |
| US10331463B2 (en) | | Dynamic role-based view definitions in a repository system |
| US10691299B2 (en) | | Display of hierarchical datasets using high-water mark scrolling |
| KR102313789B1 (ko) | | Partitioning of application distributions for heterogeneous electronic devices |
| US10942900B2 (en) | | Techniques for tenant controlled visualizations and management of files in cloud storage systems |
| US10614048B2 (en) | | Techniques for correlating data in a repository system |
| JP6439043B2 (ja) | | Automatic generation of contextual search string synonyms |
| US10855561B2 (en) | | Predictive service request system and methods |
| US10346632B2 (en) | | Entity security implied by an asset in a repository system |
| CN107077466A (zh) | | Lemma mapping to universal ontologies in computer natural language processing |
| US9665560B2 (en) | | Information retrieval system based on a unified language model |
| US10380124B2 (en) | | Searching data sets |
| US20170124181A1 (en) | | Automatic fuzzy matching of entities in context |
| JP2024535733A (ja) | | Generating similarity scores between different document schemas |
| US11392560B2 (en) | | Consolidating and transforming metadata changes |
| US20160070747A1 (en) | | Techniques to reduce contention windows |
| US10015120B2 (en) | | Providing message delivery services between requestors and providers |
| US10372488B2 (en) | | Parallel processing using memory mapping |
| US20150199625A1 (en) | | Logical and physical organization management |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 2025-07-03 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
| 2025-07-03 | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621 |
| 2026-02-10 | A977 | Report on retrieval | Free format text: JAPANESE INTERMEDIATE CODE: A971007 |
| 2026-02-17 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 |