JP2024535733A - 異なる文書スキーマ間の類似性スコアの生成 - Google Patents

異なる文書スキーマ間の類似性スコアの生成 Download PDF

Info

Publication number
JP2024535733A
JP2024535733A JP2024513780A JP2024513780A JP2024535733A JP 2024535733 A JP2024535733 A JP 2024535733A JP 2024513780 A JP2024513780 A JP 2024513780A JP 2024513780 A JP2024513780 A JP 2024513780A JP 2024535733 A JP2024535733 A JP 2024535733A
Authority
JP
Japan
Prior art keywords
document
documents
queries
schema
configuration
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2024513780A
Other languages
English (en)
Japanese (ja)
Other versions
JP2024535733A5 (enExample
Inventor
マテイ,リビウ・セバスチャン
トロヤン,フィリプ
ブロン,マーク・ミシェル
ハインド,アンドリュー・ケネス
ジョウ,インジャオ
ペトリカ,マリア-モニカ
シャー,ラジェシュ・アシュウィンバイ
Original Assignee
オラクル・インターナショナル・コーポレイション
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by オラクル・インターナショナル・コーポレイション filed Critical オラクル・インターナショナル・コーポレイション
Publication of JP2024535733A publication Critical patent/JP2024535733A/ja
Publication of JP2024535733A5 publication Critical patent/JP2024535733A5/ja
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/256Integrating or interfacing systems involving database management systems in federated or virtual databases
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/316Indexing structures
    • G06F16/319Inverted lists
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
JP2024513780A 2021-09-01 2022-08-31 異なる文書スキーマ間の類似性スコアの生成 Pending JP2024535733A (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US17/464,534 2021-09-01
US17/464,534 US20230066143A1 (en) 2021-09-01 2021-09-01 Generating similarity scores between different document schemas
PCT/US2022/042177 WO2023034397A1 (en) 2021-09-01 2022-08-31 Generating similarity scores between different document schemas

Publications (2)

Publication Number Publication Date
JP2024535733A true JP2024535733A (ja) 2024-10-02
JP2024535733A5 JP2024535733A5 (enExample) 2025-07-11

Family

ID=83508834

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2024513780A Pending JP2024535733A (ja) 2021-09-01 2022-08-31 異なる文書スキーマ間の類似性スコアの生成

Country Status (5)

Country Link
US (1) US20230066143A1 (enExample)
EP (1) EP4396694A1 (enExample)
JP (1) JP2024535733A (enExample)
CN (1) CN118103830A (enExample)
WO (1) WO2023034397A1 (enExample)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12248504B2 (en) 2023-05-31 2025-03-11 Docusign, Inc. Document container with candidate documents
CN120994760B (zh) * 2025-10-16 2026-02-10 深圳市蓝凌软件股份有限公司 基于多字段信息与离群值检测的文档检索方法

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002297605A (ja) * 2001-03-30 2002-10-11 Toshiba Corp 構造化文書検索方法および構造化文書検索装置およびプログラム
US20080114740A1 (en) * 2006-11-14 2008-05-15 Xcential Group Llc System and method for maintaining conformance of electronic document structure with multiple, variant document structure models
JP2009223781A (ja) * 2008-03-18 2009-10-01 Nec Corp 情報推薦装置、情報推薦システム、情報推薦方法、プログラム及び記録媒体

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7882122B2 (en) * 2005-03-18 2011-02-01 Capital Source Far East Limited Remote access of heterogeneous data
US20060218158A1 (en) * 2005-03-23 2006-09-28 Gunther Stuhec Translation of information between schemas
WO2008083504A1 (en) * 2007-01-10 2008-07-17 Nick Koudas Method and system for information discovery and text analysis
US8954469B2 (en) * 2007-03-14 2015-02-10 Vcvciii Llc Query templates and labeled search tip system, methods, and techniques
US11068657B2 (en) * 2010-06-28 2021-07-20 Skyscanner Limited Natural language question answering system and method based on deep semantics
US8346792B1 (en) * 2010-11-09 2013-01-01 Google Inc. Query generation using structural similarity between documents
US20140200879A1 (en) * 2013-01-11 2014-07-17 Brian Sakhai Method and System for Rating Food Items
US20140208779A1 (en) * 2013-01-30 2014-07-31 Fresh Food Solutions Llc Systems and methods for extending the fresh life of perishables in the retail and vending setting
US10956415B2 (en) * 2016-09-26 2021-03-23 Splunk Inc. Generating a subquery for an external data system using a configuration file
US10489466B1 (en) * 2016-09-29 2019-11-26 EMC IP Holding Company LLC Method and system for document similarity analysis based on weak transitive relation of similarity
US11182437B2 (en) * 2017-10-26 2021-11-23 International Business Machines Corporation Hybrid processing of disjunctive and conjunctive conditions of a search query for a similarity search
US11416448B1 (en) * 2019-08-14 2022-08-16 Amazon Technologies, Inc. Asynchronous searching of protected areas of a provider network
US11651156B2 (en) * 2020-05-07 2023-05-16 Optum Technology, Inc. Contextual document summarization with semantic intelligence
US20220245155A1 (en) * 2021-02-04 2022-08-04 Yext, Inc. Distributed multi-source data processing and publishing platform
US11620319B2 (en) * 2021-05-13 2023-04-04 Capital One Services, Llc Search platform for unstructured interaction summaries

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002297605A (ja) * 2001-03-30 2002-10-11 Toshiba Corp 構造化文書検索方法および構造化文書検索装置およびプログラム
US20080114740A1 (en) * 2006-11-14 2008-05-15 Xcential Group Llc System and method for maintaining conformance of electronic document structure with multiple, variant document structure models
JP2009223781A (ja) * 2008-03-18 2009-10-01 Nec Corp 情報推薦装置、情報推薦システム、情報推薦方法、プログラム及び記録媒体

Also Published As

Publication number Publication date
US20230066143A1 (en) 2023-03-02
WO2023034397A1 (en) 2023-03-09
CN118103830A (zh) 2024-05-28
EP4396694A1 (en) 2024-07-10

Similar Documents

Publication Publication Date Title
US11394769B2 (en) Framework for the deployment of event-based applications
US10140352B2 (en) Interfacing with a relational database for multi-dimensional analysis via a spreadsheet application
US10331463B2 (en) Dynamic role-based view definitions in a repository system
US10691299B2 (en) Display of hierarchical datasets using high-water mark scrolling
KR102313789B1 (ko) 이종 전자 디바이스들에 대한 애플리케이션 배포물의 구분
US10942900B2 (en) Techniques for tenant controlled visualizations and management of files in cloud storage systems
US10614048B2 (en) Techniques for correlating data in a repository system
JP6439043B2 (ja) 文脈検索文字列同義語の自動生成
US10855561B2 (en) Predictive service request system and methods
US10346632B2 (en) Entity security implied by an asset in a repository system
CN107077466A (zh) 计算机自然语言处理中通用本体的词元映射
US9665560B2 (en) Information retrieval system based on a unified language model
US10380124B2 (en) Searching data sets
US20170124181A1 (en) Automatic fuzzy matching of entities in context
JP2024535733A (ja) 異なる文書スキーマ間の類似性スコアの生成
US11392560B2 (en) Consolidating and transforming metadata changes
US20160070747A1 (en) Techniques to reduce contention windows
US10015120B2 (en) Providing message delivery services between requestors and providers
US10372488B2 (en) Parallel processing using memory mapping
US20150199625A1 (en) Logical and physical organization management

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20250703

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20250703

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20260210

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20260217