WO2014192116A1 - Data linkage support apparatus and data linkage support method - Google Patents
Data linkage support apparatus and data linkage support method
- Publication number
- WO2014192116A1 (PCT/JP2013/065026)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- column
- information
- unit
- tables
- primary key
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2228—Indexing structures
- G06F16/2272—Management thereof
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2282—Tablespace storage structures; Management thereof
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/23—Updating
Definitions
- the present invention relates to a data linkage support apparatus and a data linkage support method.
- a so-called schema matching technique exists as a technique for automatically extracting the correspondence between similar columns (for example, see Non-Patent Document 2).
- a data linkage support apparatus is a device that supports data linkage of a database having a plurality of tables, and includes: an information storage unit that stores primary key information indicating whether each of the plurality of columns included in each of the plurality of tables is a primary key, and attribute information indicating attributes of the data stored in each of those columns; a table selection unit that selects one of the plurality of tables as the linkage destination of data linkage; a column specifying unit that, for each table other than the table selected by the table selection unit, specifies, based on the primary key information stored in the information storage unit, a first column that is a primary key and a second column that is not a primary key from the plurality of columns included in that table; a table extraction unit that extracts, based on the attribute information stored in the information storage unit, from the tables other than the selected table, a table for which the selected table includes a third column similar to the first column and a fourth column similar to the second column; and an information output unit that outputs recommendation information recommending that the combination of the selected table and the extracted table be used for data linkage.
- FIG. 6 is a table showing an example of a schema matching result according to the first embodiment.
- FIG. 10 is a table showing an example of cooperation relationship information according to the first embodiment.
- FIG. 11 is a flowchart showing an example of the operation of the data linkage support apparatus according to the first embodiment.
- FIG. 12 is a table showing an example of an extraction result according to the first embodiment.
- FIG. 13 is a flowchart showing an example of the operation of the data linkage support apparatus according to the first embodiment.
- FIG. 14 is a table showing examples of input information and recommended information according to the first embodiment.
- FIG. 15 is a diagram illustrating an example of a hardware configuration of the data cooperation support apparatus according to the first embodiment.
- FIG. 1 is a block diagram showing a configuration of a data cooperation support apparatus 100 according to the present embodiment.
- the information storage unit 101 stores the cooperation source information 111 and the cooperation destination information 112 using a storage device.
- the cooperation destination information 112 is information that defines a plurality of tables (that is, cooperation destination tables) of a database (that is, a cooperation destination database) that is a cooperation destination of data cooperation.
- the cooperation destination information 112 includes primary key information and attribute information of the cooperation destination database.
- the primary key information of the cooperation destination database is information indicating whether each of a plurality of columns included in each of the plurality of cooperation destination tables is a primary key.
- the attribute information of the cooperation destination database is information indicating data attributes (for example, column names and data types) stored in each of a plurality of columns included in each of the plurality of cooperation destination tables.
- for each of the tables of the cooperation source database and the cooperation destination database other than the cooperation destination table selected by the table selection unit 102, the column specifying unit 103 identifies, based on the above-described primary key information, the first column that is a primary key and the second column that is not a primary key from the plurality of columns included in that table.
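The split into first and second columns can be sketched as follows. This is a minimal illustration only: the `Column` class, the `split_columns` helper, and the data types are assumptions made for the example, and the column names are taken from the "store" table mentioned later in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Column:
    # Hypothetical record combining attribute information (name, data type)
    # with primary key information (is_primary_key).
    name: str
    data_type: str
    is_primary_key: bool


def split_columns(columns):
    """Return (first_columns, second_columns): primary-key and non-primary-key columns."""
    first = [c for c in columns if c.is_primary_key]
    second = [c for c in columns if not c.is_primary_key]
    return first, second


# Example "store" table; the data types are assumed for illustration.
store = [
    Column("store ID", "int", True),
    Column("store name", "varchar", False),
    Column("address", "varchar", False),
    Column("store manager ID", "int", False),
]
first, second = split_columns(store)
print([c.name for c in first])
print([c.name for c in second])
```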
- from the tables of the cooperation source database and the cooperation destination database other than the cooperation destination table selected by the table selection unit 102, the table extraction unit 104 extracts the tables that satisfy the following conditions (1) and (2), and outputs an extraction result 141.
- (1) the cooperation destination table selected by the table selection unit 102 includes a third column similar to the first column specified by the column specification unit 103.
- (2) the cooperation destination table selected by the table selection unit 102 includes a fourth column similar to the second column specified by the column specification unit 103.
- for tables for which two or more second columns are specified by the column specification unit 103, the table extraction unit 104 includes in the extraction result 141 only the table for which the cooperation destination table selected by the table selection unit 102 has a fourth column for the largest number of second columns. For example, assume that a certain cooperation source table has five second columns, and that the cooperation destination table selected by the table selection unit 102 contains a fourth column similar to one of the five second columns and another fourth column similar to another of the five. If no other cooperation source table has more than two fourth columns, and no cooperation destination table other than the one selected by the table selection unit 102 has more than two fourth columns, then this cooperation source table satisfies condition (2). Note that not only a cooperation source table but also a cooperation destination table different from the one selected by the table selection unit 102 may satisfy condition (2).
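A rough sketch of conditions (1) and (2), together with the "largest number of second columns" rule, is given below. It rests on several assumptions: column similarity is reduced to exact name equality as a stand-in for schema matching, ties are all kept, and the `extract_tables` helper and the "product" table are hypothetical.

```python
def extract_tables(candidates, dest_columns):
    """candidates maps a table name to (first_columns, second_columns).

    Returns the tables satisfying conditions (1) and (2) for which the
    selected destination table matches the most second columns.
    """
    dest = set(dest_columns)
    counts = {}
    for table, (first_cols, second_cols) in candidates.items():
        # Condition (1): some first column has a similar third column.
        has_third = any(c in dest for c in first_cols)
        # Number of second columns that have a similar fourth column.
        matched = sum(1 for c in second_cols if c in dest)
        # Condition (2): at least one second column has a fourth column.
        if has_third and matched > 0:
            counts[table] = matched
    if not counts:
        return []
    best = max(counts.values())
    # Assumption: tables tied for the largest count are all returned.
    return sorted(t for t, n in counts.items() if n == best)


order_details = ["order details ID", "order ID", "store ID",
                 "number of orders", "store name"]
candidates = {
    "store": (["store ID"], ["store name", "address", "store manager ID"]),
    "product": (["product ID"], ["product name"]),  # hypothetical table
}
print(extract_tables(candidates, order_details))
```

Exact name equality is used only to keep the example short; any similarity measure with a threshold could be substituted for the membership tests.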
- the table extraction unit 104 can include three types of tables in the extraction result 141: a table belonging to the same schema as the cooperation destination table selected by the table selection unit 102, a table belonging to a different schema of the same instance as that cooperation destination table, and a table belonging to a schema of a different instance from that cooperation destination table.
- the reference relationship information 151 and the linkage relationship information 152 are created by the table extraction unit 104 and stored in the information storage unit 105.
- the reference relationship information 151 and the linkage relationship information 152 may be output by the table extraction unit 104 as the final extraction result 141, or may be temporarily stored in the information storage unit 105 and used to determine the final extraction result 141.
- the information output unit 106 generates recommendation information 161 recommending that the combination of the cooperation destination table selected by the table selection unit 102 and the table extracted by the table extraction unit 104 be used for data linkage. The information output unit 106 outputs the generated recommendation information 161 via an output device.
- for each table type, the information output unit 106 outputs, as the recommendation information 161, information recommending that the combination of the cooperation destination table selected by the table selection unit 102 and the table extracted by the table extraction unit 104 be used for data cooperation.
- the cooperation destination information 112 is also in the same format as the cooperation source information 111 shown in FIG.
- in step S11, the table selection unit 102 reads the cooperation destination information 112 from the information storage unit 101.
- the table selection unit 102 displays a list of cooperation destination tables defined by the cooperation destination information 112 on the screen via the output device.
- the table selection unit 102 receives, via the input device, an operation from the user who has viewed this list.
- the table selection unit 102 selects one cooperation destination table in accordance with a user operation.
- the table selection unit 102 generates reference source information 211 that defines the selected cooperation destination table in the same format as the table definition information 202.
- the table selection unit 102 generates reference source information 211 as shown in FIG. 4 for the “order details table” in “SVR001” of the “ordering system”.
- the “order details table” in “SVR001” of the “ordering system” includes the columns “order details ID”, “order ID”, “store ID”, “number of orders”, and “store name”, and “order detail ID” and “order ID” are defined as primary keys.
- the column specifying unit 103 generates primary key definition information 212 as shown in FIG. 4 for the “store” table in “SVR001” of the “ordering system”.
- in this primary key definition information 212, it is defined that the “store” table in “SVR001” of “ordering system” includes a “store ID” column as a primary key.
- the column identification unit 103 delivers the primary key definition information 212 to the table extraction unit 104 for each linkage source table.
- the column specifying unit 103 performs the same process for the cooperation destination information 112 and generates primary key definition information 212 for each cooperation destination table. However, the column specifying unit 103 does not generate primary key definition information 212 for the cooperation destination table selected in step S11.
- the column identification unit 103 delivers the primary key definition information 212 to the table extraction unit 104 for each linkage destination table.
- in step S13, the table extraction unit 104 forms, for each linkage source table, a pair consisting of the reference source information 211 delivered from the table selection unit 102 and the primary key definition information 212 delivered from the column identification unit 103.
- the table extraction unit 104 likewise forms, for each cooperation destination table, a pair consisting of the reference source information 211 delivered from the table selection unit 102 and the primary key definition information 212 delivered from the column specifying unit 103.
- in step S15, the table extraction unit 104 extracts, from the schema matching result 221 obtained in step S14, the pairs of columns whose similarity is higher than a predetermined threshold (that is, pairs of a first column and a third column).
- the table extraction unit 104 must extract a pair of columns for every column of the primary key definition information 212, in such a way that no column of the reference source information 211 is used more than once.
- the table extraction unit 104 selects the column pair having the highest similarity.
- the table extracting unit 104 proceeds to step S18 without extracting the column pair.
- the table extraction unit 104 therefore extracts the pair consisting of the “store ID” column of the primary key definition information 212 and the “store ID” column of the reference source information 211.
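The selection rule of step S15 can be sketched as below. The text does not say how competing candidates are resolved beyond preferring the highest similarity, so the greedy strategy here is an assumption, as are the `extract_key_pairs` helper, the threshold of 0.8, and every similarity value in the example.

```python
def extract_key_pairs(similarity, threshold):
    """similarity maps (primary_key_column, reference_source_column) to a score.

    Returns {primary_key_column: reference_source_column} covering every
    primary key column without reusing a reference source column, or None
    when no such assignment is found.
    """
    pk_cols = {pk for pk, _ in similarity}
    # Keep only pairs above the threshold, highest similarity first.
    candidates = sorted(
        ((score, pk, src) for (pk, src), score in similarity.items()
         if score > threshold),
        key=lambda item: -item[0],
    )
    pairs, used = {}, set()
    for _score, pk, src in candidates:
        if pk not in pairs and src not in used:
            pairs[pk] = src
            used.add(src)
    # Every primary key column must be paired; otherwise nothing is extracted.
    return pairs if len(pairs) == len(pk_cols) else None


# Hypothetical similarity values for the "store" primary key definition.
sim = {
    ("store ID", "order details ID"): 0.5,
    ("store ID", "order ID"): 0.5,
    ("store ID", "store ID"): 1.0,
    ("store ID", "number of orders"): 0.0,
    ("store ID", "store name"): 0.4,
}
print(extract_key_pairs(sim, 0.8))
```

Returning `None` corresponds to the case in which the process moves on without extracting a column pair.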
- in step S16, the table extraction unit 104 generates reference relationship information 151 indicating the extracted set of columns.
- the table extraction unit 104 stores the reference relationship information 151 in the information storage unit 105.
- step S17 the data linkage support apparatus 100 performs the operation shown in FIG.
- FIG. 7 is a flowchart showing an example of the operation of the data cooperation support apparatus 100 in step S17.
- FIG. 8 is a table showing examples of non-foreign key definition information 231 and non-primary key definition information 232 described later.
- FIG. 9 is a table showing an example of the result 241 of schema matching.
- FIG. 10 is a table showing an example of the cooperation relationship information 152.
- the column specifying unit 103 reads the reference relationship information 151 from the information storage unit 105.
- the column specifying unit 103 refers to the table definition information 202 included in the cooperation source information 111 or the cooperation destination information 112 read in step S12 for the reference destination table defined in the reference relationship information 151.
- the column specifying unit 103 specifies a column (that is, the second column) that is not the primary key of the reference destination table from the table definition information 202 of the reference destination table.
- the column specifying unit 103 generates non-primary key definition information 232 by narrowing down the table definition information 202 to only the specified column.
- in step S23, the table extraction unit 104 forms a pair consisting of the non-foreign key definition information 231 delivered from the table selection unit 102 and the non-primary key definition information 232 delivered from the column specifying unit 103.
- in step S24, the table extraction unit 104 executes schema matching between the non-foreign key definition information 231 and the non-primary key definition information 232 of the pair formed in step S23.
- the table extraction unit 104 obtains a schema matching result 241 as shown in FIG.
- the schema matching result 241 is a table that indicates the similarity, between 0 and 1.0, between each column of the non-primary key definition information 232 and each column of the non-foreign key definition information 231, as in the example of FIG.
- the similarity between the “store name” column of the non-primary key definition information 232 and the “order detail ID” column of the non-foreign key definition information 231 is zero.
- the similarity between the “store name” column of the non-primary key definition information 232 and the “store name” column of the non-foreign key definition information 231 is 1.0.
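A toy version of such a similarity table is sketched below. The text does not specify the schema matching algorithm, so the word-level Jaccard similarity of column names used here is purely a stand-in, and the `name_similarity` and `matching_table` helpers are hypothetical. With this stand-in, the two similarity values quoted above (0 and 1.0) happen to be reproduced.

```python
def name_similarity(a, b):
    """Jaccard similarity of the lowercased words in two column names (0 to 1.0)."""
    words_a, words_b = set(a.lower().split()), set(b.lower().split())
    if not words_a or not words_b:
        return 0.0
    return len(words_a & words_b) / len(words_a | words_b)


def matching_table(non_primary_key_cols, non_foreign_key_cols):
    """Build a similarity table keyed by (non-primary-key column, non-foreign-key column)."""
    return {
        (p, f): name_similarity(p, f)
        for p in non_primary_key_cols
        for f in non_foreign_key_cols
    }


non_primary = ["store name", "address", "store manager ID"]
non_foreign = ["order detail ID", "order ID", "number of orders", "store name"]
table = matching_table(non_primary, non_foreign)
print(table[("store name", "order detail ID")])
print(table[("store name", "store name")])
```

A real schema matcher would typically also compare data types and other attribute information; only names are compared here to keep the example small.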
- in step S25, the table extraction unit 104 extracts, from the schema matching result 241 obtained in step S24, the pairs of columns whose similarity is higher than a predetermined threshold (that is, pairs of a second column and a fourth column).
- the table extraction unit 104 must extract a set of columns so that the columns of the non-foreign key definition information 231 do not overlap.
- the table extraction unit 104 selects a column pair having the highest similarity.
- the table extracting unit 104 ends the process without extracting the column pair.
- the table extraction unit 104 therefore extracts the pair consisting of the “store name” column of the non-primary key definition information 232 and the “store name” column of the non-foreign key definition information 231.
- the table extraction unit 104 does not extract a set of columns for the “address” column and the “store manager ID” column of the non-primary key definition information 232.
- the table extraction unit 104 generates cooperation relationship information 152 as shown in FIG.
- the cooperation relationship information 152 includes information on the cooperation source column name and the cooperation destination column name.
- the cooperation relationship information 152 is combined with the reference relationship information 151 shown in FIG. That is, the information obtained by combining the reference relationship information 151 and the linkage relationship information 152 includes the reference destination instance name, reference destination schema name, reference destination table name, reference destination column name, linkage source column name, reference source instance name, reference source schema name, reference source table name, reference source column name, and linkage destination column name.
- FIG. 11 is a flowchart showing an example of the operation of the data cooperation support apparatus 100 after the operation of FIG.
- FIG. 12 is a table showing an example of the extraction result 141.
- the table extraction unit 104 reads the reference relationship information 151 and the cooperation relationship information 152 from the information storage unit 105.
- the table extraction unit 104 groups the pairs of reference destination column and reference source column indicated by the reference relationship information 151 and the linkage relationship information 152 (that is, the pairs of first column and third column) according to which schemas the reference destination table and the reference source table belong to. Specifically, the table extraction unit 104 classifies each pair into one of three groups: a group in which the reference destination table and the reference source table belong to the same schema, a group in which they belong to different schemas of the same instance, and a group in which they belong to schemas of different instances.
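The three-way classification can be sketched as follows. The `classify` helper, the group labels, and the instance and schema names in the example are all hypothetical; each table location is represented simply as an (instance, schema) tuple.

```python
def classify(reference_destination, reference_source):
    """Classify a pair of table locations, each given as (instance, schema)."""
    dest_instance, dest_schema = reference_destination
    src_instance, src_schema = reference_source
    if dest_instance != src_instance:
        return "different instance"
    if dest_schema != src_schema:
        return "same instance, different schema"
    return "same schema"


# Hypothetical instance and schema names.
print(classify(("inst1", "schemaA"), ("inst1", "schemaA")))
print(classify(("inst1", "schemaB"), ("inst1", "schemaA")))
print(classify(("inst2", "schemaA"), ("inst1", "schemaA")))
```

The instance is compared first because two schemas with the same name on different instances are still treated as belonging to different instances.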
- in step S32, the table extraction unit 104 selects, from the three groups described above, a group for which the processes of steps S32 and S33 have not yet been executed.
- from the sets classified into the selected group, the table extraction unit 104 then extracts the set having the largest number of corresponding pairs of cooperation source column and cooperation destination column.
- in step S33, the table extraction unit 104 outputs an extraction result 141 indicating the pair of reference destination column and reference source column extracted in step S32, in the same format as the information obtained by combining the reference relationship information 151 and the linkage relationship information 152.
- the table extraction unit 104 outputs an extraction result 141 as shown in FIG.
- This extraction result 141 indicates that there is a reference relationship between the “store” table in “SVR001” of “ordering system” and the “order details” table in “SVR001” of “ordering system”.
- in this reference relationship, the “store ID” column of the “store” table is the reference destination and its “store name” column is the linkage source, while the “store ID” column of the “order details” table is the reference source (that is, the foreign key) and its “store name” column is the linkage destination.
- in step S34, the table extraction unit 104 determines whether any of the three groups described above has not yet undergone the processes of steps S32 and S33. If there is such a group, the table extraction unit 104 returns to step S32. If there is no such group, the table extraction unit 104 ends the process.
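The loop over the three groups (steps S32 to S34) might look roughly like this. The record layout, the `extract_per_group` helper, and the "branch" and "shop" tables are assumptions for illustration; when several candidates tie, this sketch simply keeps the first one, which the text does not specify.

```python
GROUPS = ("same schema", "same instance, different schema", "different instance")


def extract_per_group(candidates):
    """For each group, keep the candidate with the most linked column pairs."""
    result = {}
    for group in GROUPS:  # each group is processed exactly once
        members = [c for c in candidates if c["group"] == group]
        if members:
            # Largest number of (linkage source, linkage destination) pairs wins.
            result[group] = max(members, key=lambda c: len(c["linked_pairs"]))
    return result


candidates = [
    {"group": "same schema", "table": "store",
     "linked_pairs": [("store name", "store name")]},
    {"group": "same schema", "table": "branch", "linked_pairs": []},
    {"group": "different instance", "table": "shop",
     "linked_pairs": [("shop name", "store name")]},
]
best = extract_per_group(candidates)
print({group: c["table"] for group, c in best.items()})
```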
- FIG. 13 is a flowchart showing an example of the operation of the data cooperation support apparatus 100 after the operation of FIG.
- FIG. 14 is a table showing examples of input information 251 and recommendation information 161 described later.
- in step S41, the information output unit 106 acquires, via the input device, the input information 251 entered by the user to search for a reference destination.
- the information output unit 106 acquires input information 251 as shown in FIG.
- This input information 251 indicates that the user wants to search for a reference destination in the “store ID” column of the “order details” table in “SVR001” of “ordering system”.
- the information output unit 106 may interpret that the user wants to search for a reference destination for all the columns in the “order details” table.
- the information output unit 106 refers to the extraction result 141 output from the table extraction unit 104 in step S33.
- the information output unit 106 extracts information that matches the input information 251 acquired in step S41 from the extraction result 141. Specifically, from the combinations of reference source instance, reference source schema, reference source table, and reference source column indicated by the extraction result 141, the information output unit 106 extracts those that match the combination of instance, schema, table, and column indicated by the input information 251.
- in step S43, the information output unit 106 determines whether there is information extracted in step S42. If there is extracted information, the information output unit 106 proceeds to step S44. If there is no extracted information, the information output unit 106 proceeds to step S45.
- in step S44, the information output unit 106 generates recommended information 161 that recommends a reference destination corresponding to the input information 251 for each of the three groups described above, based on the information extracted in step S42.
- the information output unit 106 outputs the recommended information 161 via the output device, and ends the process.
- in step S45, the information output unit 106 outputs, via the output device, information notifying that there is no reference destination corresponding to the input information 251, and ends the process.
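The lookup in steps S41 to S45 can be sketched as a simple filter over the extraction result. The dictionary keys, the `recommend` helper, the instance and schema values, and the wording of the "not found" message are all assumptions; the table and column names follow the example in the text.

```python
SOURCE_KEYS = ("src_instance", "src_schema", "src_table", "src_column")


def recommend(extraction_result, query):
    """Return recommended reference destinations for the query, or a notice."""
    # Match on reference source instance, schema, table, and column.
    hits = [r for r in extraction_result
            if all(r[k] == query[k] for k in SOURCE_KEYS)]
    if hits:
        # A match exists: recommend each reference destination.
        return [f'{r["dest_table"]}.{r["dest_column"]}' for r in hits]
    # No match: report that no reference destination exists.
    return "no reference destination found"


# Hypothetical instance/schema values; table and column names from the text.
extraction_result = [{
    "src_instance": "inst1", "src_schema": "schemaA",
    "src_table": "order details", "src_column": "store ID",
    "dest_table": "store", "dest_column": "store ID",
}]
query = {"src_instance": "inst1", "src_schema": "schemaA",
         "src_table": "order details", "src_column": "store ID"}
print(recommend(extraction_result, query))
```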
- according to this embodiment, the operation described above makes it possible to specify the reference relationships between tables, and the columns that are foreign keys, from the definition information of the database alone. Furthermore, it is possible to identify columns that are common to tables having a reference relationship, and to extract the table that is best suited to data linkage (that is, the table having the largest number of columns that can be used for data linkage). That is, according to the present embodiment, in data linkage, it is possible to efficiently detect which tables have a reference relationship and which table is a linkage destination. It is also possible to efficiently detect from which table column the data to be stored in a column of the table to be linked should be extracted.
- for tables for which two or more second columns are specified by the column specifying unit 103, the table extraction unit 104 includes in the extraction result 141 only the table for which the cooperation destination table selected by the table selection unit 102 has an individual fourth column for the largest number of second columns.
- alternatively, for tables for which two or more second columns are specified by the column specifying unit 103, the table extraction unit 104 can also include in the extraction result 141 tables for which the cooperation destination table selected by the table selection unit 102 has an individual fourth column for the second to N-th largest number of second columns.
- “N” can be set to any number greater than 2.
- FIG. 15 is a diagram illustrating an example of a hardware configuration of the data cooperation support apparatus 100 according to the first embodiment.
- the data linkage support apparatus 100 is a computer and includes hardware devices such as an LCD 901 (Liquid Crystal Display), a keyboard 902 (K/B), a mouse 903, an FDD 904 (Flexible Disk Drive), a CDD 905 (Compact Disc Drive), and a printer 906. These hardware devices are connected by cables and signal lines. Instead of the LCD 901, a CRT (Cathode Ray Tube) or other display device may be used. Instead of the mouse 903, a touch panel, a touch pad, a trackball, a pen tablet, or other pointing device may be used.
- the data linkage support apparatus 100 includes a CPU 911 (Central Processing Unit) that executes a program.
- the CPU 911 is an example of a processing device.
- the CPU 911 is connected via a bus 912 to a ROM 913 (Read Only Memory), a RAM 914 (Random Access Memory), a communication board 915, the LCD 901, the keyboard 902, the mouse 903, the FDD 904, the CDD 905, the printer 906, and an HDD 920 (Hard Disk Drive), and controls these hardware devices.
- instead of the HDD 920, an SSD (Solid State Drive), a flash memory, an optical disk device, a memory card reader/writer, or other recording medium may be used.
- the RAM 914 is an example of a volatile memory.
- the ROM 913, the FDD 904, the CDD 905, and the HDD 920 are examples of nonvolatile memories. These are examples of storage devices.
- the communication board 915, the keyboard 902, the mouse 903, the FDD 904, and the CDD 905 are examples of input devices.
- the communication board 915, the LCD 901, and the printer 906 are examples of output devices.
- the communication board 915 is connected to a LAN (Local Area Network) or the like.
- the communication board 915 is not limited to a LAN and may instead be connected to an IP-VPN (Internet Protocol Virtual Private Network), a wide area LAN, an ATM (Asynchronous Transfer Mode) network, a WAN (Wide Area Network), or the Internet.
- LAN, WAN, and the Internet are examples of networks.
- the HDD 920 stores an operating system 921 (OS), a window system 922, a program group 923, and a file group 924.
- the programs in the program group 923 are executed by the CPU 911, the operating system 921, and the window system 922.
- the program group 923 includes programs that execute the functions described as “unit” in the description of the embodiment of the present invention.
- the program is read and executed by the CPU 911.
- the file group 924 includes the data, information, signal values, variable values, and parameters described as “~ data”, “~ information”, “~ ID (identifier)”, “~ flag”, and “~ result” in the description of the embodiment of the present invention, as items of “~ file”, “~ database”, and “~ table”.
- “~ file”, “~ database”, and “~ table” are stored in a recording medium such as the RAM 914 and the HDD 920.
- Data, information, signal values, variable values, and parameters stored in a recording medium such as the RAM 914 and the HDD 920 are read out to the main memory and the cache memory by the CPU 911 via a read/write circuit, and are used for processing (operations) of the CPU 911 such as extraction, search, reference, comparison, computation, calculation, control, output, printing, and display.
- during processing of the CPU 911 such as extraction, search, reference, comparison, computation, calculation, control, output, printing, and display, the data, information, signal values, variable values, and parameters are temporarily stored in the main memory, cache memory, and buffer memory.
- the arrow portions in the block diagrams and flowcharts mainly indicate input / output of data and signals.
- Data and signals are recorded in a memory such as the RAM 914, a flexible disk (FD) of the FDD 904, a compact disc (CD) of the CDD 905, a magnetic disk of the HDD 920, an optical disk, a DVD (Digital Versatile Disc), or other recording medium.
- Data and signals are transmitted by a bus 912, a signal line, a cable, or other transmission media.
- what is described as a “~ unit” may be a “~ circuit” or “~ device”, and may also be a “~ step” or “~ process”.
- Firmware and software are stored as programs in a recording medium such as a flexible disk, a compact disk, a magnetic disk, an optical disk, and a DVD.
- the program is read by the CPU 911 and executed by the CPU 911.
- the program causes the computer to function as the “~ unit” described in the description of the embodiment of the present invention.
- the program causes a computer to execute the procedures and methods of “unit” described in the description of the embodiment of the present invention.
- 100 data linkage support device, 101 information storage unit, 102 table selection unit, 103 column identification unit, 104 table extraction unit, 105 information storage unit, 106 information output unit, 111 linkage source information, 112 linkage destination information, 141 extraction result, 151 reference relationship information, 152 linkage relationship information, 161 recommended information, 201 table location information, 202 table definition information, 211 reference source information, 212 primary key definition information, 221 result, 231 non-foreign key definition information, 232 non-primary key definition information, 241 result, 251 input information, 901 LCD, 902 keyboard, 903 mouse, 904 FDD, 905 CDD, 906 printer, 911 CPU, 912 bus, 913 ROM, 914 RAM, 915 communication board, 920 HDD, 921 operating system, 922 window system, 923 programs, 924 files.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
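A device that supports data linkage for a database having a plurality of tables, comprising:
an information storage unit that stores primary key information indicating whether each of a plurality of columns included in each of the plurality of tables is a primary key, and attribute information indicating an attribute of the data stored in each of the plurality of columns included in each of the plurality of tables;
a table selection unit that selects, from among the plurality of tables, one table to serve as the linkage destination of the data linkage;
a column identification unit that, for each of the plurality of tables other than the table selected by the table selection unit, identifies, based on the primary key information stored in the information storage unit, a first column that is a primary key and a second column that is not a primary key from among the plurality of columns included in that table;
a table extraction unit that extracts, from the plurality of tables other than the table selected by the table selection unit and based on the attribute information stored in the information storage unit, each table for which the table selected by the table selection unit includes a third column similar to the first column identified by the column identification unit and also includes a fourth column similar to the second column identified by the column identification unit; and
an information output unit that outputs recommended information recommending that the combination of the table selected by the table selection unit and the table extracted by the table extraction unit be used for data linkage.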
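FIG. 1 is a block diagram showing the configuration of the data linkage support device 100 according to the present embodiment.
(1) The linkage destination table selected by the table selection unit 102 includes a third column similar to the first column identified by the column identification unit 103.
(2) The linkage destination table selected by the table selection unit 102 includes a fourth column similar to the second column identified by the column identification unit 103.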
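Conditions (1) and (2) can be illustrated with a minimal Python sketch. It is not the implementation of the embodiment: the `Column` and `Table` classes, the function `extract_candidates`, the sample tables, and in particular the similarity rule (two columns are treated as similar when their data type and length are equal) are assumptions introduced here for illustration. The source states only that similarity is judged from attribute information.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Column:
    name: str
    data_type: str        # attribute information (assumed: type and length)
    length: int
    is_primary_key: bool  # primary key information


@dataclass(frozen=True)
class Table:
    name: str
    columns: tuple


def similar(a, b):
    """Assumed similarity rule: same data type and same length."""
    return a.data_type == b.data_type and a.length == b.length


def extract_candidates(destination, tables):
    """Return the tables that satisfy conditions (1) and (2) for `destination`."""
    extracted = []
    for table in tables:
        if table.name == destination.name:
            continue  # only tables other than the linkage destination are examined
        first = [c for c in table.columns if c.is_primary_key]
        second = [c for c in table.columns if not c.is_primary_key]
        # (1) the destination has a third column similar to a first column
        cond1 = any(similar(f, d) for f in first for d in destination.columns)
        # (2) the destination has a fourth column similar to a second column
        cond2 = any(similar(s, d) for s in second for d in destination.columns)
        if cond1 and cond2:
            extracted.append(table)
    return extracted


# Hypothetical sample data.
orders = Table("orders", (
    Column("order_id", "INT", 10, True),
    Column("customer_code", "CHAR", 8, False),
    Column("customer_name", "VARCHAR", 40, False),
))
customers = Table("customers", (
    Column("code", "CHAR", 8, True),
    Column("name", "VARCHAR", 40, False),
))
products = Table("products", (
    Column("product_id", "CHAR", 12, True),
    Column("price", "DECIMAL", 9, False),
))

candidates = extract_candidates(orders, [orders, customers, products])
print([t.name for t in candidates])
```

With this sample data only `customers` is extracted, because `orders` has no column resembling the primary key of `products`. The sketch does not check whether the third and fourth columns are distinct columns of the destination table.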
Claims (5)
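- A data linkage support device that supports data linkage for a database having a plurality of tables, the data linkage support device comprising:
an information storage unit that stores primary key information indicating whether each of a plurality of columns included in each of the plurality of tables is a primary key, and attribute information indicating an attribute of the data stored in each of the plurality of columns included in each of the plurality of tables;
a table selection unit that selects, from among the plurality of tables, one table to serve as the linkage destination of the data linkage;
a column identification unit that, for each of the plurality of tables other than the table selected by the table selection unit, identifies, based on the primary key information stored in the information storage unit, a first column that is a primary key and a second column that is not a primary key from among the plurality of columns included in that table;
a table extraction unit that extracts, from the plurality of tables other than the table selected by the table selection unit and based on the attribute information stored in the information storage unit, each table for which the table selected by the table selection unit includes a third column similar to the first column identified by the column identification unit and also includes a fourth column similar to the second column identified by the column identification unit; and
an information output unit that outputs recommended information recommending that the combination of the table selected by the table selection unit and the table extracted by the table extraction unit be used for data linkage.
- The data linkage support device according to claim 1, wherein, for a table in which two or more first columns have been identified by the column identification unit, the table extraction unit extracts the table when the table selected by the table selection unit contains a third column individually for every one of the first columns.
- The data linkage support device according to claim 1, wherein, for tables in which two or more second columns have been identified by the column identification unit, the table extraction unit extracts the table for which the table selected by the table selection unit contains a fourth column individually for the largest number of the second columns.
- The data linkage support device according to claim 1, wherein the table extraction unit extracts tables of at least two of the following three types: a table belonging to the same schema as the table selected by the table selection unit, a table belonging to a different schema in the same instance as the table selected by the table selection unit, and a table belonging to a schema in a different instance from the table selected by the table selection unit, and
the information output unit outputs, as the recommended information and for each type of table extracted by the table extraction unit, information recommending that the combination of the table selected by the table selection unit and the table extracted by the table extraction unit be used for data linkage.
- A data linkage support method for supporting data linkage for a database having a plurality of tables, the method comprising, using a computer provided with an information storage unit that stores primary key information indicating whether each of a plurality of columns included in each of the plurality of tables is a primary key, and attribute information indicating an attribute of the data stored in each of the plurality of columns included in each of the plurality of tables:
selecting, from among the plurality of tables, one table to serve as the linkage destination of the data linkage;
identifying, for each of the plurality of tables other than the table selected by the selecting and based on the primary key information stored in the information storage unit, a first column that is a primary key and a second column that is not a primary key from among the plurality of columns included in that table;
extracting, from the plurality of tables other than the table selected by the selecting and based on the attribute information stored in the information storage unit, each table for which the table selected by the selecting includes a third column similar to the first column identified by the identifying and also includes a fourth column similar to the second column identified by the identifying; and
outputting recommended information recommending that the combination of the table selected by the selecting and the table extracted by the extracting be used for data linkage.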
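The refinements in the dependent claims can be sketched in Python as follows. This is a sketch under stated assumptions, not the claimed implementation: the classes, the function names, the sample tables, and the similarity rule (identical declared data type) are hypothetical. Applying the "largest number of second columns" rule separately within each table type is also an assumption made here; the claims state those two refinements independently of each other.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Column:
    name: str
    data_type: str        # attribute information (assumed: declared type only)
    is_primary_key: bool  # primary key information


@dataclass(frozen=True)
class Table:
    instance: str
    schema: str
    name: str
    columns: tuple


def has_similar_column(column, destination):
    # Assumed similarity rule: identical declared data type.
    return any(column.data_type == d.data_type for d in destination.columns)


def location_type(table, destination):
    """Classify a table into one of the three types relative to the destination."""
    if table.instance != destination.instance:
        return "different instance"
    if table.schema != destination.schema:
        return "same instance, different schema"
    return "same schema"


def recommend(destination, tables):
    scored = defaultdict(list)
    for table in tables:
        if table == destination:
            continue
        first = [c for c in table.columns if c.is_primary_key]
        second = [c for c in table.columns if not c.is_primary_key]
        # Every first column (e.g. each part of a composite key) needs a match.
        if not first or not all(has_similar_column(c, destination) for c in first):
            continue
        # Count how many second columns have a counterpart in the destination.
        matched = sum(has_similar_column(c, destination) for c in second)
        if matched > 0:
            scored[location_type(table, destination)].append((matched, table.name))
    recommendations = {}
    for kind, items in scored.items():
        best = max(count for count, _ in items)
        recommendations[kind] = [name for count, name in items if count == best]
    return recommendations


# Hypothetical sample data.
orders = Table("db1", "sales", "orders", (
    Column("order_id", "INT", True),
    Column("cust_code", "CHAR(8)", False),
    Column("cust_name", "VARCHAR(40)", False),
    Column("region", "CHAR(2)", False),
))
tables = [
    Table("db1", "sales", "customers", (
        Column("code", "CHAR(8)", True),
        Column("name", "VARCHAR(40)", False),
        Column("region", "CHAR(2)", False),
    )),
    Table("db1", "sales", "regions", (
        Column("region", "CHAR(2)", True),
        Column("label", "VARCHAR(40)", False),
    )),
    Table("db1", "master", "clients", (
        Column("id", "CHAR(8)", True),
        Column("branch", "CHAR(2)", True),
        Column("name", "VARCHAR(40)", False),
    )),
    Table("db2", "crm", "accounts", (
        Column("id", "CHAR(8)", True),
        Column("seq", "DATE", True),
        Column("name", "VARCHAR(40)", False),
    )),
]

print(recommend(orders, tables))
```

In this sample, `accounts` is rejected because one part of its composite primary key has no counterpart in `orders`, and `regions` is dropped in favour of `customers`, which matches more non-key columns within the same schema. The sketch checks only that each first column has some counterpart; it does not require the counterparts to be distinct columns.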
Priority Applications (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/JP2013/065026 WO2014192116A1 (ja) | 2013-05-30 | 2013-05-30 | Data linkage support device and data linkage support method |
US14/783,384 US20160042022A1 (en) | 2013-05-30 | 2013-05-30 | Data coordination support apparatus and data coordination support method |
GB1518688.5A GB2529095A (en) | 2013-05-30 | 2013-05-30 | Data link assistance device and data link assistance method |
CN201380076749.8A CN105229618A (zh) | 2013-05-30 | 2013-05-30 | Data coordination support device and data coordination support method |
SG11201508822WA SG11201508822WA (en) | 2013-05-30 | 2013-05-30 | Data coordination support apparatus and data coordination support method |
JP2015519559A JP5972461B2 (ja) | 2013-05-30 | 2013-05-30 | Data linkage support device and data linkage support method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/JP2013/065026 WO2014192116A1 (ja) | 2013-05-30 | 2013-05-30 | Data linkage support device and data linkage support method |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014192116A1 (ja) | 2014-12-04 |
Family
ID=51988188
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2013/065026 WO2014192116A1 (ja) | Data linkage support device and data linkage support method | 2013-05-30 | 2013-05-30 |
Country Status (6)
Country | Link |
---|---|
US (1) | US20160042022A1 (ja) |
JP (1) | JP5972461B2 (ja) |
CN (1) | CN105229618A (ja) |
GB (1) | GB2529095A (ja) |
SG (1) | SG11201508822WA (ja) |
WO (1) | WO2014192116A1 (ja) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2016136354A (ja) * | 2015-01-23 | 2016-07-28 | Mitsubishi Electric Corp | Data linkage estimation device, data linkage estimation method, and program |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10650050B2 (en) * | 2016-12-06 | 2020-05-12 | Microsoft Technology Licensing, Llc | Synthesizing mapping relationships using table corpus |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2007188343A (ja) * | 2006-01-13 | 2007-07-26 | Mitsubishi Electric Corp | Schema integration support device, schema integration support method, and schema integration support program |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6721727B2 (en) * | 1999-12-02 | 2004-04-13 | International Business Machines Corporation | XML documents stored as column data |
US6957225B1 (en) * | 2002-05-07 | 2005-10-18 | Oracle International Corporation | Automatic discovery and use of column correlations in tables |
JP4893130B2 (ja) * | 2005-08-05 | 2012-03-07 | Seiko Epson Corporation | Information processing device, data processing method, and program |
CN100383881C (zh) * | 2005-12-08 | 2008-04-23 | Hangzhou Hikvision Digital Technology Co., Ltd. | Method for protecting the key information area of a hard disk in an embedded device |
US7860903B2 (en) * | 2005-12-23 | 2010-12-28 | Teradata Us, Inc. | Techniques for generic data extraction |
WO2007127956A2 (en) * | 2006-04-28 | 2007-11-08 | Business Objects, S.A. | Apparatus and method for merging metadata within a repository |
US20080294673A1 (en) * | 2007-05-25 | 2008-11-27 | Microsoft Corporation | Data transfer and storage based on meta-data |
JP5054065B2 (ja) * | 2009-05-14 | 2012-10-24 | NTT Docomo, Inc. | Database device, database matching system, and database matching method |
US8868621B2 (en) * | 2010-10-21 | 2014-10-21 | Rillip, Inc. | Data extraction from HTML documents into tables for user comparison |
JP2012185765A (ja) * | 2011-03-08 | 2012-09-27 | Nec Corp | Information processing device, management method for an information processing device, and management program for an information processing device |
2013
- 2013-05-30: US application US14/783,384, published as US20160042022A1 (not active, abandoned)
- 2013-05-30: SG application SG11201508822WA, published as SG11201508822WA (status unknown)
- 2013-05-30: WO application PCT/JP2013/065026, published as WO2014192116A1 (active, application filing)
- 2013-05-30: JP application JP2015519559A, published as JP5972461B2 (not active, expired for non-payment of fees)
- 2013-05-30: GB application GB1518688.5A, published as GB2529095A (not active, withdrawn)
- 2013-05-30: CN application CN201380076749.8A, published as CN105229618A (active, pending)
Non-Patent Citations (2)
Title |
---|
LING LING YAN: "Data-driven understanding and refinement of schema mappings, SIGMOD '01", PROCEEDINGS OF THE 2001 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, June 2001 (2001-06-01), pages 485 - 496 * |
RONALD FAGIN: "Clio: Schema Mapping Creation and Data Exchange", LECTURE NOTES IN COMPUTER SCIENCE, vol. 5600, 2009, pages 198 - 236 * |
Also Published As
Publication number | Publication date |
---|---|
GB2529095A (en) | 2016-02-10 |
JPWO2014192116A1 (ja) | 2017-02-23 |
CN105229618A (zh) | 2016-01-06 |
JP5972461B2 (ja) | 2016-08-17 |
GB201518688D0 (en) | 2015-12-02 |
SG11201508822WA (en) | 2015-12-30 |
US20160042022A1 (en) | 2016-02-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10229655B2 (en) | | Contextual zoom |
US20090319494A1 (en) | | Field mapping for data stream output |
US10733361B2 (en) | | Content reproducing apparatus |
JP4832952B2 (ja) | | Database analysis system, database analysis method, and program |
US20200334274A1 (en) | | Quick data structuring computing system and related methods |
US20240143634A1 (en) | | System and methods for categorizing captured data |
JP5972461B2 (ja) | | Data linkage support device and data linkage support method |
JP2018106556A (ja) | | Screen information generation device, screen information generation method, and program |
JP2009146084A (ja) | | Table management device |
JP6017055B2 (ja) | | Data linkage support device and data linkage support method |
WO2016199466A1 (ja) | | RDB system |
JP2013152580A (ja) | | Code generation device, code generation method, and program |
US20150066994A1 (en) | | Managing sharing relationship of tables |
JP5855989B2 (ja) | | Data processing device, data processing method, and data processing program |
US10255260B2 (en) | | System and framework for transforming domain data |
JP2015114896A (ja) | | Document processing system |
WO2023281605A1 (ja) | | Information processing method, information processing device, and program |
US20230031202A1 (en) | | Method and system for generating document field predictions |
JP2024088830A (ja) | | Information processing program, information processing method, and information processing device |
US9792092B2 (en) | | Method of unifying information and tool from a plurality of information sources and computer program product and matterizer using the same |
JP6455229B2 (ja) | | Storage device, read storage device determination method, read storage device determination program, and storage system |
US20140095984A1 (en) | | Information processing device, method and program recording medium |
JP2019040261A (ja) | | Information processing device and program |
US20120259848A1 (en) | | System and method for automatically building a search database for digital data of mediums |
JP2018073116A (ja) | | Search program, search device, and search method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | WWE | Wipo information: entry into national phase | Ref document number: 201380076749.8; Country of ref document: CN |
 | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 13885758; Country of ref document: EP; Kind code of ref document: A1 |
 | ENP | Entry into the national phase | Ref document number: 2015519559; Country of ref document: JP; Kind code of ref document: A |
 | WWE | Wipo information: entry into national phase | Ref document number: 14783384; Country of ref document: US |
 | ENP | Entry into the national phase | Ref document number: 1518688; Country of ref document: GB; Kind code of ref document: A; Free format text: PCT FILING DATE = 20130530 |
 | WWE | Wipo information: entry into national phase | Ref document number: 1518688.5; Country of ref document: GB |
 | NENP | Non-entry into the national phase | Ref country code: DE |
 | 122 | Ep: pct application non-entry in european phase | Ref document number: 13885758; Country of ref document: EP; Kind code of ref document: A1 |
 | ENPC | Correction to former announcement of entry into national phase, pct application did not enter into the national phase | Ref country code: GB |