JP2017188137A5 - - Google Patents

Download PDF

Info

Publication number
JP2017188137A5
JP2017188137A5 JP2017115395A JP2017115395A JP2017188137A5 JP 2017188137 A5 JP2017188137 A5 JP 2017188137A5 JP 2017115395 A JP2017115395 A JP 2017115395A JP 2017115395 A JP2017115395 A JP 2017115395A JP 2017188137 A5 JP2017188137 A5 JP 2017188137A5
Authority
JP
Japan
Prior art keywords
similarity
instructions
fields
field
group
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2017115395A
Other languages
Japanese (ja)
Other versions
JP6964384B2 (en
JP2017188137A (en
Filing date
Publication date
Priority claimed from JP2017523549A external-priority patent/JP6159908B6/en
Application filed filed Critical
Publication of JP2017188137A publication Critical patent/JP2017188137A/en
Publication of JP2017188137A5 publication Critical patent/JP2017188137A5/ja
Application granted granted Critical
Publication of JP6964384B2 publication Critical patent/JP6964384B2/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Claims (6)

複数のデータストア内のテーブル内の複数のフィールド内の文字列の集合から重複を排除するステップと、
前記重複を排除した文字列を保存するステップと、
前記保存された文字列間の類似性を判定するステップと、
前記判定された文字列間の類似性に基づいて前記複数のフィールド間の類似性を判定するステップと、
前記複数のフィールド間の類似性が高いと判定されたフィールドを含むテーブル間の類似関係を表現したデータを生成するステップとを
含むコンピューターにより実行される方法。
Steps to eliminate duplication from a collection of strings in multiple fields in a table in multiple datastores,
The step of saving the duplicated character string and
The step of determining the similarity between the saved character strings and
A step of determining the similarity between the plurality of fields based on the similarity between the determined character strings, and
A method performed by a computer that includes a step of generating data that expresses the similarity between tables that include fields that are determined to have high similarity between the plurality of fields.
前記類似性を判定するステップは、さらに、
前記複数のフィールド内の文字列分割するステップと、
前記文字列間の類似度を求めるステップとを含む、
請求項1に記載の方法。
The step of determining the similarity further
The step of dividing the character strings in the plurality of fields and
Including the step of finding the similarity between the character strings .
The method according to claim 1.
請求項1、請求項2、請求項3、または、請求項4に記載の方法で作成された前記テーブル間の類似関係を表現したデータを使用した、コンピューターにより実行される方法であって、
第一のデータストア内の第一のテーブルの第一のフィールドに対するクエリーを受信するステップと、
前記テーブル間の類似関係を表現したデータに基づいて前記第一のフィールドに類似する第二のフィールドを識別するステップと、
前記第二のフィールドと前記第二のフィールドを含む第二のテーブルと前記第二のテーブルを含む第二のデータストアとのいずれかひとつ以上を表示するステップとを含む方法。
A method executed by a computer using data expressing similar relationships between the tables created by the method according to claim 1, claim 2, claim 3, or claim 4.
The step of receiving a query for the first field of the first table in the first datastore, and
A step of identifying a second field similar to the first field based on data expressing the similarity between the tables , and
A method comprising displaying any one or more of the second field, a second table containing the second field, and a second datastore containing the second table.
複数のデータストア内のテーブル内の複数のフィールド内の文字列の集合から重複を排除する命令群と、
前記重複を排除した文字列を保存する命令群と、
前記保存された文字列間の類似性を判定する命令群と、
前記判定された文字列間の類似性に基づいて前記複数のフィールド間の類似性を判定する命令群と、
前記複数のフィールド間の類似性が高いと判定されたフィールドを含むテーブル間の類似関係を表現したデータを生成する命令群とを
コンピューターに実行させるプログラム。
Instructions to eliminate duplication from a set of strings in multiple fields in a table in multiple datastores,
A group of instructions for saving the duplicated character string and
A group of instructions for determining the similarity between the saved character strings and
A group of instructions for determining the similarity between the plurality of fields based on the similarity between the determined character strings, and
A program that causes a computer to execute a group of instructions that generate data expressing the similarity between tables including fields determined to have high similarity between the plurality of fields.
前記類似性を判定する命令群は、さらに、
前記複数のフィールド内の文字列分割する命令群と、
前記文字列間の類似度を求める命令群とを含む
請求項9に記載のプログラム。
The instruction group for determining the similarity further
A group of instructions that divides the character strings in the plurality of fields,
Including a group of instructions for obtaining the similarity between the character strings.
The program according to claim 9.
請求項9、請求項10、請求項11、または、請求項12に記載のプログラムで作成された前記テーブル間の類似関係を表現したデータを使用したプログラムであって、
第一のデータストア内の第一のテーブルの第一のフィールドに対するクエリーを受信する命令群と、
前記テーブル間の類似関係を表現したデータに基づいて前記第一のフィールドに類似する第二のフィールドを識別する命令群と、
前記第二のフィールドと前記第二のフィールドを含む第二のテーブルと前記第二のテーブルを含む第二のデータストアとのいずれかひとつ以上を表示する命令群とを
コンピューターに実行させるプログラム。
A program using data expressing similar relationships between the tables created by the program according to claim 9, claim 10, claim 11, or claim 12.
A set of instructions that receive a query for the first field of the first table in the first datastore,
A group of instructions for identifying a second field similar to the first field based on data expressing the similarity between the tables , and
A program that causes a computer to execute a group of instructions for displaying any one or more of the second field, a second table including the second field, and a second data store including the second table.
JP2017115395A 2016-03-31 2017-06-12 Methods, programs, and systems for the automatic discovery of relationships between fields in a mixed heterogeneous data source environment. Active JP6964384B2 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201662315784P 2016-03-31 2016-03-31
US62/315,784 2016-03-31
JP2017523549A JP6159908B6 (en) 2016-03-31 2017-03-27 Method, program, and system for automatic discovery of relationships between fields in a heterogeneous data source mixed environment

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
JP2017523549A Division JP6159908B6 (en) 2016-03-31 2017-03-27 Method, program, and system for automatic discovery of relationships between fields in a heterogeneous data source mixed environment

Publications (3)

Publication Number Publication Date
JP2017188137A JP2017188137A (en) 2017-10-12
JP2017188137A5 true JP2017188137A5 (en) 2020-09-24
JP6964384B2 JP6964384B2 (en) 2021-11-10

Family

ID=59965634

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2017115395A Active JP6964384B2 (en) 2016-03-31 2017-06-12 Methods, programs, and systems for the automatic discovery of relationships between fields in a mixed heterogeneous data source environment.

Country Status (3)

Country Link
US (1) US20190317938A1 (en)
JP (1) JP6964384B2 (en)
WO (1) WO2017170459A1 (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11829391B2 (en) * 2019-01-14 2023-11-28 Salesforce, Inc. Systems, methods, and apparatuses for executing a graph query against a graph representing a plurality of data stores
KR20200094853A (en) 2019-01-25 2020-08-10 삼성전자주식회사 Electronic device and Method for controlling the electronic device thereof
CN110879901B (en) * 2019-11-22 2022-03-18 浙江大学 Data self-adaptive desensitization method and system based on relational graph
CN111767320B (en) * 2020-06-29 2023-08-18 中国银行股份有限公司 Data blood relationship determination method and device
WO2022049680A1 (en) * 2020-09-02 2022-03-10 日本電気株式会社 Coupling table specification system, coupling table search device, method, and program
KR102576146B1 (en) * 2020-11-20 2023-09-07 주식회사 와이즈넛 The method of coupling with heterogeneous data using relation of fields in data
CN113656372B (en) * 2021-08-13 2022-06-21 南方电网数字电网研究院有限公司 Standard index database data mart architecture device and method
US11636085B2 (en) 2021-09-01 2023-04-25 International Business Machines Corporation Detection and utilization of similarities among tables in different data systems
CN113760918A (en) * 2021-09-13 2021-12-07 上海航空工业(集团)有限公司 Method, device, computer equipment and medium for determining data blood relationship
CN116483840B (en) * 2023-06-19 2023-11-07 广东奥飞数据科技股份有限公司 Multi-source heterogeneous data integration system based on distributed computing

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000222430A (en) * 1999-02-03 2000-08-11 Osaka Gas Co Ltd Virtual database management system
JP2004227037A (en) * 2003-01-20 2004-08-12 Sangaku Renkei Kiko Kyushu:Kk Field matching device, program therefor, computer readable recording medium, and identical field determination method
JP4451624B2 (en) * 2003-08-19 2010-04-14 富士通株式会社 Information system associating device and associating method
JP4997856B2 (en) * 2006-07-19 2012-08-08 富士通株式会社 Database analysis program, database analysis apparatus, and database analysis method
JP5194818B2 (en) * 2008-01-16 2013-05-08 富士通株式会社 Data classification method and data processing apparatus
US9507824B2 (en) * 2014-08-22 2016-11-29 Attivio Inc. Automated creation of join graphs for unrelated data sets among relational databases

Similar Documents

Publication Publication Date Title
JP2017188137A5 (en)
WO2016029018A3 (en) Executing constant time relational queries against structured and semi-structured data
JP2016539427A5 (en)
JP2016524756A5 (en)
JP2017536601A5 (en)
JP2016519347A5 (en)
US20160055233A1 (en) Pre-join tags for entity-relationship modeling of databases
CO2017007032A2 (en) Updating language understanding classifier models for a personal digital assistant based on mass outsourcing
WO2015191731A8 (en) Systems and methods for software analytics
BR112019005422A2 (en) video keyframe display on online social networks
JP2017004555A5 (en)
JP2012226744A5 (en)
JP2017530469A5 (en)
JP2015109068A5 (en)
JP2014528134A5 (en)
US10002142B2 (en) Method and apparatus for generating schema of non-relational database
JP2013517574A5 (en)
JP2014534532A5 (en)
JP2019533245A5 (en)
JP2018133080A5 (en) Data management device, information processing device, content data output method, content data ID sharing method, program, and data structure
GB2547361A (en) System generator module for electronic document and electronic file
SG10201810036QA (en) Processing queries containing a union-type operation
WO2016106242A8 (en) Identifying join relationships based on transactional access patterns
IN2013CH04496A (en)
AU2017322114A8 (en) Real-time document filtering systems and methods