WO2012048555A1 - Method and device for importing data into database - Google Patents

Method and device for importing data into database Download PDF

Info

Publication number
WO2012048555A1
WO2012048555A1 PCT/CN2011/072076 CN2011072076W WO2012048555A1 WO 2012048555 A1 WO2012048555 A1 WO 2012048555A1 CN 2011072076 W CN2011072076 W CN 2011072076W WO 2012048555 A1 WO2012048555 A1 WO 2012048555A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
record
database
data record
file
Prior art date
Application number
PCT/CN2011/072076
Other languages
French (fr)
Chinese (zh)
Inventor
胡丽蓉
刘永平
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2012048555A1 publication Critical patent/WO2012048555A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • G06F16/2365Ensuring data consistency and integrity

Definitions

  • the present invention relates to the field of databases, and in particular to a data storage method and apparatus.
  • BACKGROUND Currently, real-time software systems in many fields generate massive backup data for subsequent statistics and verification. For example, in the telecom industry, the traffic system, the billing system, and the authentication system all generate a large number of offline bills. These bills will be imported into the database on a daily or monthly basis, and reconciliation, statistics, and reports will be made for the user to check the bill. , or for various subsequent processing such as data mining, so the efficient, accurate and flexible storage of these data has important application value.
  • the inventors have found that the data warehousing method in the prior art has the following problems:
  • the ordinary one-by-one warehousing method is more efficient. For example, users in the telecommunications industry usually have billions of units, and their billing data is very large. The ordinary storage method will inevitably consume more time, but it takes a few days to enter the warehouse to meet the application requirements.
  • the main object of the present invention is to provide a data storage method and apparatus to solve at least one of the above problems.
  • a data storage method comprising: taking one or more data records from a data file; verifying the extracted data records; Successful data records are imported into the database.
  • the step of verifying the retrieved data record includes: determining whether the field in each of the extracted data records satisfies a preset format; if satisfied, the determined data record is successfully verified; if not, the The judged data record is saved in the error log file.
  • the step of importing the successfully verified data record into the database includes: assigning a serial number to each of the above-mentioned successfully verified data records, wherein each of the above-mentioned successful data records corresponding to the school-risk is unique in the above database The serial number of the above-mentioned school-risk data data assigned to the above-mentioned serial number is imported into the database.
  • the step of importing the data record of the school-risk success into the database includes: importing the data record of the above school-risk success into the above database in batch mode; if the current batch of the above-mentioned school-risk successful data record import fails, the above The current batch of the above-mentioned school-risk successful data records and the serial number corresponding to each of the data records of the current batch of the above-mentioned verification successes are saved in the inbound failure record file; the above-mentioned storage is performed in a single manner The data record saved in the failure log file is re-imported into the above database. If the import fails, the data record that failed to be imported is saved to the error file.
  • the step of importing the data record of the successful school-risk into the database includes: determining whether the data table currently used in the database satisfies a predetermined rule; if not, using the data table currently used to store the data record of the school-risk success If it is satisfied, another free data table in the above database is used to store the data record of the above-mentioned school-risk success.
  • the step of using the other idle data table in the database to store the data record of the successful school-risk includes: determining whether the currently used data table is in a preset plurality of data tables for storing data records. The last data table; if the currently used data table is the last one of the plurality of data tables for storing the data record, the preset plurality of data records for storing the data record are used.
  • the first data table in the data table is used to store the data record successfully verified; if the currently used data table is not the last data table in the plurality of data tables for storing the data record, The next data table of the currently used data table is used in the plurality of preset data tables for storing data records to store the data record of the above verification success.
  • the predetermined rule includes at least one of the following: the amount of data stored in the currently used data table exceeds a predetermined threshold; the currently used data table is used for more than a predetermined length of time.
  • a data storage device comprising: a reading unit configured to take one or more data records from a data file; a verification unit, set to be The extracted data record is verified; the import unit is set to import the data record with successful verification into the database.
  • the importing unit includes: an allocating module, configured to allocate a serial number for each of the above-mentioned successful data records, wherein the serial number corresponding to each of the successfully verified data records is a unique serial number in the database; Import module, set to the above school to be assigned the above serial number
  • the device further includes: a storage unit configured to: after verifying the extracted data record, save the data record of the verification failure in the error record file; and save the data record of the failed storage to the storage failure record In the file, the import unit is further configured to re-import the data record saved in the storage failure record file into the database in a single manner.
  • the importing unit further includes: a determining module, configured to determine whether the data table currently used in the database satisfies a predetermined rule, wherein the predetermined rule includes at least one of the following: the amount of data stored in the currently used data table exceeds a predetermined amount Threshold; the currently used data table is used for more than a predetermined length of time; the table change module is configured to use the currently used data table to store the data record of the above-mentioned succession when the predetermined rule is not satisfied; When the above predetermined rule is satisfied, another idle data table in the above database is used to store the above-mentioned successful data record of the school-risk.
  • a determining module configured to determine whether the data table currently used in the database satisfies a predetermined rule, wherein the predetermined rule includes at least one of the following: the amount of data stored in the currently used data table exceeds a predetermined amount Threshold; the currently used data table is used for more than a predetermined length of time; the table change module is
  • the data record is calibrated when the data is put into storage, and the batch warehousing mode and the single warehousing mode are combined to improve the accuracy of data warehousing.
  • the present invention also assigns a unique serial number to each data record, thereby avoiding repeated storage and omission of storage; further, when the data is stored in the database, the target data table can be automatically switched, preventing The efficiency of the query or secondary processing is affected by the excessive amount of data in a single table.
  • FIG. 1 is a schematic diagram of a location of a data warehousing system in an application according to an embodiment of the present invention
  • FIG. 2 is a preferred flowchart of a data warehousing method according to an embodiment of the present invention. Another preferred flowchart of the data storage method of the embodiment of the present invention;
  • FIG. 1 is a schematic diagram of a location of a data warehousing system in an application according to an embodiment of the present invention
  • FIG. 2 is a preferred flowchart of a data warehousing method according to an embodiment of the present invention
  • FIG. 1 is a schematic diagram of a location of a data warehousing system in an application according to an embodiment of the present invention
  • FIG. 2 is a preferred flowchart of a data warehousing method according to an embodiment of the present invention
  • FIG. 1 is
  • FIG. 4 is a schematic diagram of a preferred structure of the data storage device according to the embodiment of the present invention.
  • BEST MODE FOR CARRYING OUT THE INVENTION is a schematic diagram of a location of a data storage system in an application according to an embodiment of the present invention.
  • the utility model imports a large amount of offline data into a data table, and the application system can directly query data from the tables, and can also customize a database task.
  • the data is processed in the second table and imported into other tables or libraries for query.
  • Applications can migrate data from the inbound table to other tables or libraries as needed.
  • FIG. 2 is a preferred flowchart of a data storage method according to an embodiment of the present invention, which includes the following steps:
  • the step of verifying the retrieved data record comprises: determining whether a field in each of the retrieved data records satisfies a preset format; if satisfied, the determined data record is successfully verified; If it is not satisfied, the determined data record is saved in the error log file (Error file). For example, it is judged whether the format of the time information field in the data record conforms to a predetermined format, such as year-month-day.
  • the step of importing the data record of the successful school-risk into the database comprises: assigning a serial number to each of the data records of the school-risk success, wherein each of the school-risk successful data records corresponds to the pipeline The number is a unique serial number in the database; the data record of the school-risk success assigned with the serial number is imported into the database.
  • assigning a unique serial number to each data record duplicate warehousing and missing warehousing are avoided, and the acknowledgment of the warehousing is further improved.
  • the verification will be performed.
  • the step of importing the successful data record into the database includes: importing the data record of the school-risk success into the database in a batch manner; if the current batch of the school-risk successful data record import fails, the The current batch of the school-risk successful data record and the serial number corresponding to each of the current batch of the school-risk successful data records are saved in the inbound failure record file (Fail file) And re-importing the data record saved in the warehousing failure record file into the database in a single manner, and if the import fails, saving the data record that failed the import into the error record file.
  • the warehousing efficiency is ensured; the record of the failure of the 4 metrics into the database is further added to the database, thereby effectively avoiding The defect that the original data record is leaked due to a system abnormality or the like further improves the accuracy of the storage.
  • the step of importing the successfully verified data record into the database further includes: determining the database Whether the currently used data table satisfies a predetermined rule; if not, the currently used data table is used to store the successfully verified data record; if satisfied, another idle data in the database is used The table stores the data record of the verification success.
  • the data record is stored by using a plurality of data tables, thereby preventing the query or the secondary processing from being affected due to the excessive amount of data in the single table.
  • the step of using the other idle data table in the database to store the data record of the successful school-to-risk comprises: determining whether the currently used data table is preset for storing The last data table of the plurality of data tables stored in the data record; if the currently used data table is the last data table of the plurality of data tables for storing the data record, The preset first data table of the plurality of data tables for storing the data record to store the data record of the verification success; if the currently used data table is not the preset for storage The last data table of the plurality of data tables of the data record, the next data table of the currently used data table is used in the preset plurality of data tables for storing the data records to store A data record that verifies the success.
  • the storage space is effectively saved by recycling the data table.
  • the predetermined rule includes at least one of the following: the amount of data stored in the currently used data table exceeds a predetermined threshold; and the currently used data table is used for more than a predetermined length of time.
  • Embodiment 2 For convenience of description, in the embodiment of the present invention, the following identification is made: data file F (File), file record R (Record), data table T (Table), serial number SN (Serial Number), batch storage The failure record temporarily saves the directory Fail file, and saves the data in the same table with M table structures.
  • the data warehousing process includes the following steps: Step S1: fetching n records from the data file F, assigning a serial number to each record, performing validity check on the fields that need to be logged, and verifying failure The record is saved to the specified Error file for future reference, and the verification is successfully packaged into a bulk packet or data file.
  • Step S2 The data prepared in the previous step is batch-inputted into the database, and the target data table Tn.
  • Step S3 If the batch storage fails, save the batch record to the Fail file (the SN corresponding to each record needs to be saved at the same time).
  • Step S4 Save the position that F has already processed. If the file has not been processed, return to step S1 to continue the storage.
  • Step S5 F batch storage is completed.
  • Step S6 The data record of the failed inbound storage saved in the Fail file is replenished into the library one by one with an insert statement.
  • Step S7 If the Tn data amount reaches the set value, the target data table is changed to ⁇ +1. If ⁇ +1 reaches the last table, the entry starts from T1.
  • Step S8 Processing the next data file.
  • the preferred embodiment uses ⁇ ( ⁇ >1) data tables to recycle and share the massive data.
  • the embodiment of the present invention introduces a serial number field, which prevents the heavy order by controlling the serial number, and has an obvious advantage in comparing the inbound rate compared with the usual establishment of the primary key on the data table; using a combination of the two storage methods , not only ensure the efficiency of warehousing, but also take into account the accuracy of the warehousing, prevent leakage; maintain the progress of the current warehousing file, can restore the warehousing after the system is abnormal, no need to manually process the massive data files, the degree of automation High to reduce the burden on maintenance personnel.
  • FIG. 3 is another preferred flowchart of a data storage method according to an embodiment of the present invention, which includes the following steps:
  • Preprocessing the data record sorting the inbound field, verifying, and assigning the serial number. S308. If the preprocessing is successful, the records are assembled into a batch data block; if the preprocessing fails, the record is saved to an Error file.
  • the data warehousing method checks the original records in the data file, and eliminates the erroneous data rows to improve the warehousing success rate; assigns a unique serial number to each record (this field does not need to be Indexing ;), to ensure that the data is not stored in the warehouse; the correct records are stored in the batch mode, if the batch of data fails to be stored, the file is saved, and then the insert is re-submitted into the library by the insert method; Switch the target data table.
  • This method preprocesses the original records, can be used to store a variety of data files, and adapt to a variety of target data bases; batch mode and insert mode combination, complementary advantages, while meeting the requirements of fast warehousing, real-time query and data accuracy. Its advantages are:
  • the offline data files generated by the high-efficiency system generally have no errors, the batch storage method has high success rate, and the overall storage efficiency is high; the current file storage status is saved in time, and the file can be continuously stored when the storage system is abnormally restarted. It is not necessary to insert from the beginning, and the storage efficiency is maximized.
  • the inbound target data table can be automatically switched to prevent the data volume in a single table from being too large and affecting the efficiency of the query or secondary processing.
  • FIG. 4 is a schematic diagram of a preferred structure of a data storage device according to an embodiment of the present invention, comprising: a reading unit 402 configured to take one or more data records from a data file; and a verification unit 404 And connected to the reading unit 402, wherein the school-risk unit 404 is configured to perform the school-risk on the retrieved data record; the import unit 406 is connected to the school-risk unit 404, wherein the import unit 406 is set to check Successful data records are imported into the database.
  • the data record is verified when the data is stored in the library, thereby improving the accuracy of the data storage.
  • the step of verifying the extracted data record by the checking unit 404 includes: determining whether a field in each of the extracted data records satisfies a preset format; if satisfied, the determined data The record verification is successful; if not, the determined data record is saved to the error log file. For example, it is judged whether the format of the time information field in the data record conforms to a predetermined format, such as year-month-day.
  • the importing unit 406 includes: an allocating module, connected to the school-risk unit 404, wherein the allocating module is configured to allocate a serial number for each of the data records that are successfully verified, wherein each of the school-risks The serial number corresponding to the successful data record is the only serial number in the database; the import module is connected to the distribution module, wherein the import module is configured to successfully assign the school-risk to which the serial number is assigned The data record is imported into the database.
  • an allocating module connected to the school-risk unit 404, wherein the allocating module is configured to allocate a serial number for each of the data records that are successfully verified, wherein each of the school-risks The serial number corresponding to the successful data record is the only serial number in the database
  • the import module is connected to the distribution module, wherein the import module is configured to successfully assign the school-risk to which the serial number is assigned
  • the data record is imported into the database.
  • by assigning a unique serial number to each data record duplicate warehousing and missing warehous
  • the data storage device of the embodiment of the present invention further includes: a storage unit 408 connected to the school-risk unit 404, wherein the storage unit 408 is configured to, after the pair of the retrieved data records are verified, The data record that failed the verification is saved in the error log file; the data record that failed the inbound storage is saved in the inbound failure record file.
  • the import unit 406 is further configured to re-import the data records saved in the inbound failure log file into the database in a single manner.
  • the step of importing the successfully verified data record into the database comprises: importing the data record of the school-risk success into the database in a batch manner; if the current batch of the school-risk is successful If the data record import fails, the current batch of the school-risk successful data record and the serial number corresponding to each of the current batch of the successfully verified data records are saved to the storage.
  • the failure record file (Fail file); re-import the data record saved in the inbound failure record file into the database in a single way, and if the import fails, save the data record that failed to be imported into the error record file.
  • the warehousing efficiency is ensured by using the batch warehousing method; the record of the batch warehousing failure is further added to the database, thereby effectively avoiding the original reason due to system abnormality and the like.
  • the data record leaks into the defect, further improving the accuracy of the storage.
  • the correction - the storage unit 404 risk failure log The data record saved in the file is verified; if the verification is successful, the import unit 406 imports the data record successfully verified in the inbound failure record file into the database; if the verification fails, the The data record of the failure of the warehousing failure record file in the school-risk is saved in the error log file for subsequent reference.
  • the data record in the warehousing failure record file is further added to the database, thereby effectively avoiding the defect of deleting the original correct data record due to the system exception, further improving the warehousing.
  • the importing unit 406 further includes: a determining module, configured to determine whether the data table currently used in the database satisfies a predetermined rule, wherein the predetermined rule includes at least the following a: the amount of data stored in the currently used data table exceeds a predetermined threshold; the currently used data table is used for more than a predetermined length of time; the meter changing module is connected to the determining module, wherein the The table module is configured to use the currently used data table to store the data record of the verification success when the predetermined rule is not met; when the predetermined rule is met, another idle space in the database is used A data table is used to store the data record for which the verification was successful.
  • a determining module configured to determine whether the data table currently used in the database satisfies a predetermined rule, wherein the predetermined rule includes at least the following a: the amount of data stored in the currently used data table exceeds a predetermined threshold; the currently used data table is used for more than a predetermined length of time; the meter changing module
  • the saving module of the importing unit 406 uses another idle data table in the database to store the data record of the verification success: determining whether the currently used data table is preset for Storing a last data table of the plurality of data tables of the data record; if yes, using the first one of the plurality of data tables for storing the data record to store the school-risk success Data record; if not, at the preset multiple sheets for storing data records The next data table of the currently used data table is used in the data table to store the data record for which the verification is successful.
  • the storage space is effectively saved by recycling the data table.
  • steps shown in the flowchart of the accompanying drawings may be performed in a computer system such as a set of computer executable instructions, and, although the logical order is shown in the flowchart, in some cases, The steps shown or described may be performed in an order different than that herein.
  • the above modules or steps of the present invention can be implemented by a general-purpose computing device, which can be concentrated on a single computing device or distributed over a network composed of multiple computing devices.
  • the invention is not limited to any specific combination of hardware and software.
  • the above is only the preferred embodiment of the present invention, and is not intended to limit the present invention, and various modifications and changes can be made to the present invention. Any modifications, equivalent substitutions, improvements, etc. made within the scope of the present invention are intended to be included within the scope of the present invention.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A method and device for importing data into a database are disclosed. The method comprises the following steps of: taking out one or more data records from data files (S202); verifying the data records taken out (S204); importing the data records successfully verified into the database (S206). The solution resolves the problem of low accuracy in the method for importing the data into a database in prior art, and improves the accuracy of importing the data into the database.

Description

数据 方法和装置 技术领域 本发明涉及数据库领域, 具体而言, 涉及一种数据入库方法和装置。 背景技术 当前很多领域的实时软件系统都会产生海量的备份数据, 待后续统计、 核对。 如电信行业中话务系统、 计费系统、 鉴权系统均产生大量的离线话单, 这些话单将按天或按月统一导入数据库, 进行对账、 统计、 出 4艮表供用户查 询账单, 或用于数据挖掘等各种后续处理, 所以这些数据的高效、 精确、 灵 活入库有着重要的应用价值。 然而, 发明人发现现有技术中的数据入库方法存在以下几个问题:  TECHNICAL FIELD The present invention relates to the field of databases, and in particular to a data storage method and apparatus. BACKGROUND Currently, real-time software systems in many fields generate massive backup data for subsequent statistics and verification. For example, in the telecom industry, the traffic system, the billing system, and the authentication system all generate a large number of offline bills. These bills will be imported into the database on a daily or monthly basis, and reconciliation, statistics, and reports will be made for the user to check the bill. , or for various subsequent processing such as data mining, so the efficient, accurate and flexible storage of these data has important application value. However, the inventors have found that the data warehousing method in the prior art has the following problems:
1 ) 普通的逐条式入库方式效率较氏。 例如, 电信行业中的用户通常以 亿为单位, 其账单数据非常庞大, 普通入库方法不可避免将会消耗较多的时 间, 但动辄花费数天时间来入库是满足不了应用要求的。 1) The ordinary one-by-one warehousing method is more efficient. For example, users in the telecommunications industry usually have billions of units, and their billing data is very large. The ordinary storage method will inevitably consume more time, but it takes a few days to enter the warehouse to meet the application requirements.
2 ) 批量入库方式灵活性较差、 限制较大。 目前, 各种主流数据库提供 了批量导入数据的功能,不过直接使用这样的主流数据库会带来较大的问题: 文件格式必须满足数据库的限制; 文件的一条或几条记录错误可能导致整个 文件都无法入库。 2) The bulk storage method is less flexible and has a greater limit. At present, various mainstream databases provide the function of importing data in batches, but directly using such a mainstream database will bring about a big problem: The file format must meet the limitations of the database; one or several errors in the file may cause the entire file to be Unable to enter the library.
3 ) 逐条式入库方式或批量入库方式精确性较低。 目前, 在数据库异常、 入库系统异常、 数据某些记录异常等情况下, 无法保证入库的精确性, 即, 无法避免数据重复入库和遗漏入库, 而不精确的入库将大大降低入库数据的 可用性。 发明内容 3) The item-by-block warehousing method or batch warehousing method is less accurate. At present, in the case of database exceptions, warehousing system exceptions, some data logging exceptions, etc., the accuracy of warehousing cannot be guaranteed, that is, data duplication and omission are not avoided, and inaccurate warehousing will be greatly reduced. Availability of inbound data. Summary of the invention
本发明的主要目的在于提供一种数据入库方法和装置, 以解决上述问题至少 之一。 为了实现上述目的,才艮据本发明的一个方面,提供了一种数据入库方法, 包括: 从数据文件中取出一条或多条数据记录; 对所取出的数据记录进行校 验; 将校验成功的数据记录导入数据库。 上述对所取出的数据记录进行校验的步骤包括: 判断所取出的每条数据 记录中的字段是否满足预设的格式; 若满足, 则所判断的数据记录检验成功; 若不满足, 则将所判断的数据记录保存到错误记录文件中。 上述将校验成功的数据记录导入数据库的步骤包括: 为每一条上述校验 成功的数据记录分配流水号, 其中, 每一条上述校 -险成功的数据记录对应的 流水号在上述数据库中是唯一的流水号; 将分配有上述流水号的上述校 -险成 功的数据记录导入数据库中。 上述将校 -险成功的数据记录导入数据库的步骤包括: 以批量方式将上述 校-险成功的数据记录导入到上述数据库中; 若当前一批上述校-险成功的数据 记录导入失败, 则将上述当前一批上述校-险成功的数据记录以及与上述当前 一批上述校验成功的数据记录中的每一条数据记录对应的流水号保存到入库 失败记录文件中; 以单条方式将上述入库失败记录文件中保存的数据记录重 新导入到上述数据库中, 若导入失败, 则将导入失败的数据记录保存到错误 ΐ己录文件中。 上述将校-险成功的数据记录导入数据库的步骤包括: 判断上述数据库中 当前使用的数据表是否满足预定规则; 若不满足, 则使用上述当前使用的数 据表来存储上述校-险成功的数据记录; 若满足, 则使用上述数据库中另一张 空闲的数据表来存储上述校-险成功的数据记录。 上述使用上述数据库中另一张空闲的数据表来存储上述校-险成功的数据 记录的步骤包括: 判断上述当前使用的数据表是否为预设的用于存储数据记 录的多张数据表中的最后一张数据表; 若上述当前使用的数据表是上述预设 的用于存储数据记录的多张数据表中的最后一张数据表, 则使用上述预设的 用于存储数据记录的多张数据表中的第一张数据表来存储上述校验成功的数 据记录; 若上述当前使用的数据表不是上述预设的用于存储数据记录的多张 数据表中的最后一张数据表, 则在上述预设的用于存储数据记录的多张数据 表中使用上述当前使用的数据表的下一张数据表来存储上述校验成功的数据 记录。 上述预定规则包括以下至少之一: 上述当前使用的数据表所存储的数据 量超过预定的阈值; 上述当前使用的数据表被使用的时间超过预定的时间长 度。 为了实现上述目的,根据本发明的另一方面,提供了一种数据入库装置, 包括: 读取单元, 设置为从数据文件中取出一条或多条数据记录; 校验单元, 设置为对所取出的数据记录进行校验; 导入单元, 设置为将校验成功的数据 记录导入数据库。 上述导入单元包括: 分配模块, 设置为为每一条上述校-险成功的数据记 录分配流水号, 其中, 每一条上述校验成功的数据记录对应的流水号在上述 数据库中是唯一的流水号; 导入模块, 设置为将分配有上述流水号的上述校The main object of the present invention is to provide a data storage method and apparatus to solve at least one of the above problems. In order to achieve the above object, according to an aspect of the present invention, a data storage method is provided, comprising: taking one or more data records from a data file; verifying the extracted data records; Successful data records are imported into the database. The step of verifying the retrieved data record includes: determining whether the field in each of the extracted data records satisfies a preset format; if satisfied, the determined data record is successfully verified; if not, the The judged data record is saved in the error log file. The step of importing the successfully verified data record into the database includes: assigning a serial number to each of the above-mentioned successfully verified data records, wherein each of the above-mentioned successful data records corresponding to the school-risk is unique in the above database The serial number of the above-mentioned school-risk data data assigned to the above-mentioned serial number is imported into the database. The step of importing the data record of the school-risk success into the database includes: importing the data record of the above school-risk success into the above database in batch mode; if the current batch of the above-mentioned school-risk successful data record import fails, the above The current batch of the above-mentioned school-risk successful data records and the serial number corresponding to each of the data records of the current batch of the above-mentioned verification successes are saved in the inbound failure record file; the above-mentioned storage is performed in a single manner The data record saved in the failure log file is re-imported into the above database. If the import fails, the data record that failed to be imported is saved to the error file. The step of importing the data record of the successful school-risk into the database includes: determining whether the data table currently used in the database satisfies a predetermined rule; if not, using the data table currently used to store the data record of the school-risk success If it is satisfied, another free data table in the above database is used to store the data record of the above-mentioned school-risk success. The step of using the other idle data table in the database to store the data record of the successful school-risk includes: determining whether the currently used data table is in a preset plurality of data tables for storing data records. The last data table; if the currently used data table is the last one of the plurality of data tables for storing the data record, the preset plurality of data records for storing the data record are used. The first data table in the data table is used to store the data record successfully verified; if the currently used data table is not the last data table in the plurality of data tables for storing the data record, The next data table of the currently used data table is used in the plurality of preset data tables for storing data records to store the data record of the above verification success. The predetermined rule includes at least one of the following: the amount of data stored in the currently used data table exceeds a predetermined threshold; the currently used data table is used for more than a predetermined length of time. In order to achieve the above object, according to another aspect of the present invention, a data storage device is provided, comprising: a reading unit configured to take one or more data records from a data file; a verification unit, set to be The extracted data record is verified; the import unit is set to import the data record with successful verification into the database. The importing unit includes: an allocating module, configured to allocate a serial number for each of the above-mentioned successful data records, wherein the serial number corresponding to each of the successfully verified data records is a unique serial number in the database; Import module, set to the above school to be assigned the above serial number
-险成功的数据记录导入数据库中。 上述装置还包括: 存储单元, 设置为在上述对所取出的数据记录进行校 验之后, 将校验失败的数据记录保存在错误记录文件中; 将入库失败的数据 记录保存到入库失败记录文件中; 其中, 上述导入单元还设置为以单条方式 将上述入库失败记录文件中保存的数据记录重新导入到上述数据库中。 上述导入单元还包括: 判断模块, 设置为判断上述数据库中当前使用的 数据表是否满足预定规则, 其中, 上述预定规则包括以下至少之一: 上述当 前使用的数据表所存储的数据量超过预定的阈值; 上述当前使用的数据表被 使用的时间超过预定的时间长度; 换表模块, 设置为在不满足上述预定规则 时, 使用上述当前使用的数据表来存储上述校-险成功的数据记录; 在满足上 述预定规则时, 使用上述数据库中另一张空闲的数据表来存储上述校-险成功 的数据记录。 通过本发明, 在数据入库时对数据记录进行校 -险, 结合批量入库方式和 单条入库方式, 提高了数据入库的精确性。 此外, 在数据入库时, 本发明还 为每条数据记录分配了唯一的流水号, 从而避免了重复入库和遗漏入库; 进 一步, 数据入库时, 目标数据表可自动切换, 防止了由于单张表内数据量过 大而影响查询或二次处理的效率。 本发明的其它特征和优点将在随后的说明书中阐述, 并且, 部分地从说 明书中变得显而易见, 或者通过实施本发明而了解。 本发明的目的和其他优 点可通过在所写的说明书、 权利要求书、 以及附图中所特别指出的结构来实 现和获得。 附图说明 此处所说明的附图用来提供对本发明的进一步理解, 构成本申请的一部 分, 本发明的示意性实施例及其说明用于解释本发明, 并不构成对本发明的 不当限定。 在附图中: 图 1是根据本发明实施例的数据入库系统在应用中的位置示意图; 图 2是根据本发明实施例的数据入库方法的一种优选的流程图; 图 3是根据本发明实施例的数据入库方法的另一种优选的流程图; 图 4是才艮据本发明实施例的数据入库装置的一种优选的结构示意图。 具体实施方式 下文中将参考附图并结合实施例来详细说明本发明。 需要说明的是, 在 不冲突的情况下, 本申请中的实施例及实施例中的特征可以相互组合。 图 1是根据本发明实施例的数据入库系统在应用中的位置示意图, 其将 海量离线数据导入到数据表中, 应用系统可以直接从这些表中查询数据, 也 可定制数据库任务,对表中数据进行二次加工后导入到其他表或库中供查询。 应用程序可以根据需要,将数据从入库表迁移到其它表或库中里。再使用 DB ( Date Base, 数据库)任务删除入库表数据。 例如应用为了提供查询效率, 可 以将数据迁移到另一个位置。 实施例 1 图 2是根据本发明实施例的数据入库方法的一种优选的流程图, 其包括 如下步 4聚: - Risk successful data records are imported into the database. The device further includes: a storage unit configured to: after verifying the extracted data record, save the data record of the verification failure in the error record file; and save the data record of the failed storage to the storage failure record In the file, the import unit is further configured to re-import the data record saved in the storage failure record file into the database in a single manner. The importing unit further includes: a determining module, configured to determine whether the data table currently used in the database satisfies a predetermined rule, wherein the predetermined rule includes at least one of the following: the amount of data stored in the currently used data table exceeds a predetermined amount Threshold; the currently used data table is used for more than a predetermined length of time; the table change module is configured to use the currently used data table to store the data record of the above-mentioned succession when the predetermined rule is not satisfied; When the above predetermined rule is satisfied, another idle data table in the above database is used to store the above-mentioned successful data record of the school-risk. Through the invention, the data record is calibrated when the data is put into storage, and the batch warehousing mode and the single warehousing mode are combined to improve the accuracy of data warehousing. In addition, when the data is stored in the database, the present invention also assigns a unique serial number to each data record, thereby avoiding repeated storage and omission of storage; further, when the data is stored in the database, the target data table can be automatically switched, preventing The efficiency of the query or secondary processing is affected by the excessive amount of data in a single table. Other features and advantages of the invention will be set forth in the description which follows, and The object and other advantages of the present invention The points may be realized and obtained by the structures specified in the written description, the claims, and the drawings. BRIEF DESCRIPTION OF THE DRAWINGS The accompanying drawings, which are set to illustrate,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,, In the drawings: FIG. 1 is a schematic diagram of a location of a data warehousing system in an application according to an embodiment of the present invention; FIG. 2 is a preferred flowchart of a data warehousing method according to an embodiment of the present invention; Another preferred flowchart of the data storage method of the embodiment of the present invention; FIG. 4 is a schematic diagram of a preferred structure of the data storage device according to the embodiment of the present invention. BEST MODE FOR CARRYING OUT THE INVENTION Hereinafter, the present invention will be described in detail with reference to the accompanying drawings. It should be noted that the embodiments in the present application and the features in the embodiments may be combined with each other without conflict. 1 is a schematic diagram of a location of a data storage system in an application according to an embodiment of the present invention. The utility model imports a large amount of offline data into a data table, and the application system can directly query data from the tables, and can also customize a database task. The data is processed in the second table and imported into other tables or libraries for query. Applications can migrate data from the inbound table to other tables or libraries as needed. Then use the DB (Date Base) task to delete the inbound table data. For example, in order to provide query efficiency, an application can migrate data to another location. Embodiment 1 FIG. 2 is a preferred flowchart of a data storage method according to an embodiment of the present invention, which includes the following steps:
S202 , 从数据文件中取出一条或多条数据记录; S204, 对所取出的数据记录进行校验; S206, 将校验成功的数据记录导入数据库。 通过本发明, 在数据入库时对数据记录进行校验, 从而提高了数据入库 的精确性。 优选的, 所述对所取出的数据记录进行校验的步骤包括: 判断所取出的 每条数据记录中的字段是否满足预设的格式; 若满足, 则所判断的这条数据 记录检验成功; 若不满足, 则将所判断的这条数据记录保存到错误记录文件 ( Error文件)中。 例如, 判断数据记录中的时间信息字段的格式是否符合预 定的格式, 如, 年 -月-日等。 优选的, 所述将校-险成功的数据记录导入数据库的步骤包括: 为每一条 所述校 -险成功的数据记录分配流水号, 其中, 每一条所述校-险成功的数据记 录对应的流水号在所述数据库中是唯一的流水号; 将分配有所述流水号的所 述校-险成功的数据记录导入数据库中。 在本优选的实施例中, 通过为每条数 据记录分配了唯一的流水号, 避免了重复入库和遗漏入库, 进一步提高了入 库的^"确性。 优选的, 所述将校验成功的数据记录导入数据库的步骤包括: 以批量方 式将所述校-险成功的数据记录导入到所述数据库中; 若当前一批所述校 -险成 功的数据记录导入失败, 则将所述当前一批所述校-险成功的数据记录以及与 所述当前一批所述校-险成功的数据记录中的每一条数据记录对应的流水号保 存到入库失败记录文件 (Fail文件) 中; 以单条方式将所述入库失败记录文 件中保存的数据记录重新导入到所述数据库中, 若导入失败, 则将导入失败 的数据记录保存到错误记录文件中。 在本优选的实施例中, 通过使用批量入 库方式, 保证了入库效率; 通过对 4比量入库失败的记录进一步补入到数据库 中, 从而有效地避免了因为系统异常等原因而将原本正确的数据记录漏入的 缺陷, 进一步提高了入库的精确性。 优选的, 所述将校验成功的数据记录导入数据库的步骤, 还包括: 判断 所述数据库中当前使用的数据表是否满足预定规则; 若不满足, 则使用所述 当前使用的数据表来存储所述校验成功的数据记录; 若满足, 则使用所述数 据库中另一张空闲的数据表来存储所述校验成功的数据记录。 在本优选的实 施例中, 通过使用多张数据表来存储数据记录, 防止了由于单张表内数据量 过大而影响查询或二次处理的效率。 优选的, 所述使用所述数据库中另一张空闲的数据表来存储所述校 -险成 功的数据记录的步骤包括: 判断所述当前使用的数据表是否为预设的用于存 储数据记录的多张数据表中的最后一张数据表; 若所述当前使用的数据表是 所述预设的用于存储数据记录的多张数据表中的最后一张数据表, 则使用所 述预设的用于存储数据记录的多张数据表中的第一张数据表来存储所述校验 成功的数据记录; 若所述当前使用的数据表不是所述预设的用于存储数据记 录的多张数据表中的最后一张数据表, 则在所述预设的用于存储数据记录的 多张数据表中使用所述当前使用的数据表的下一张数据表来存储所述校验成 功的数据记录。 在本优选的实施例中, 通过循环使用数据表, 有效地节省了 存储空间。 优选的, 所述预定规则包括以下至少之一: 所述当前使用的数据表所存 储的数据量超过预定的阈值; 所述当前使用的数据表被使用的时间超过预定 的时间长度。 实施例 2 为方便描述, 在本发明实施例中, 做如下标识: 数据文件 F(File), 文件 记录 R(Record), 数据表 T(Table), 流水号 SN(Serial Number), 批量入库失败 记录临时保存目录 Fail文件中, 用 M张表结构相同的表保存数据。 在本实施例中, 数据入库过程包括如下步骤: 步骤 S 1 : 从数据文件 F取出 n条记录, 给每条记录分配流水号, 对需要 入库的字段做有效性校验, 校验失败的记录保存到指定 Error (错误)文件备 查, 校验成功的封装成批量数据包或数据文件。 步骤 S2: 将上一步准备的数据进行批量入库, 目标数据表 Tn。 步骤 S3: 若批量入库失败, 将该批记录保存到 Fail (失败) 文件中 (每 条记录对应的 SN需同时保存)。 步骤 S4: 保存 F当前已经处理的位置, 若文件还未处理完则返回步骤 S 1继续入库。 步骤 S5 : F批量入库完毕。 步骤 S6: Fail文件中保存的入库失败的数据记录用 insert (插入) 语句 逐条补录入库。 步骤 S7: 若 Tn数据量达到设定值, 目标数据表换为 Τη+1 , 若 η+1达到 最后一张表 Μ, 则从 T1开始入。 步骤 S8: 处理下一个数据文件。 优选的, 对于数据表处理, 由于一张数据表内数据量过大会影响应用查 询数据的效率, 因此, 本优选的实施例釆用 Μ ( Μ>1 ) 张数据表循环使用, 分担海量数据。 此外, 为了循环使用, 需要定期清理数据表, 保证由入库表 Tn转到入库表 Tn+ 1时, Τη+ 1表数据已经为空。表内容如何清理由具体应用 决定, 如: 将需要的信息同步到另外的查询表, 并在表上建立索引, 供应用 夬速查询。 入库系统绝大部分时间工作在批量方式正常入库流程, 在此流程进行原 始记录预处理, 剔除错误记录, 同时筛选出应用需要的字段进行批量入库; 在适当的时机(根据具体应用设置)转到 Fail文件进行补入数据记录的操作, 此时, Fail文件中批量入库失败的数据记录被逐条插入数据库。 本发明实施例引入了序列号字段, 通过对序列号的控制来防止重单, 与 通常在数据表上建立主键来排除重单相比入库速率上优势明显; 使用两种入 库方式的组合, 既保证入库效率, 又兼顾入库准确性, 防止漏单; 维护当前 入库文件的处理进度, 在系统异常后可自行恢复入库, 在处理海量数据文件 时无需人工千预, 自动化程度高减轻维护人员负担。 当然, 为了保证数据的精确, insert方式单条入库批量失败记录时, 需要 先判断记录的序列号在表中是否已经存在, 这在一定程度上会降低效率。 但 考虑到批量失败的几率很小, 插入(insert )方式入库次数相应很少, 所以对 总体入库性能影响不大。 实施例 3 图 3是根据本发明实施例的数据入库方法的另一种优选的流程图, 其包 括如下步骤: S202, taking one or more data records from the data file; S204, verifying the extracted data records; S206, importing the successfully verified data records into the database. Through the invention, the data record is verified when the data is stored in the library, thereby improving the accuracy of the data storage. Preferably, the step of verifying the retrieved data record comprises: determining whether a field in each of the retrieved data records satisfies a preset format; if satisfied, the determined data record is successfully verified; If it is not satisfied, the determined data record is saved in the error log file (Error file). For example, it is judged whether the format of the time information field in the data record conforms to a predetermined format, such as year-month-day. Preferably, the step of importing the data record of the successful school-risk into the database comprises: assigning a serial number to each of the data records of the school-risk success, wherein each of the school-risk successful data records corresponds to the pipeline The number is a unique serial number in the database; the data record of the school-risk success assigned with the serial number is imported into the database. In the preferred embodiment, by assigning a unique serial number to each data record, duplicate warehousing and missing warehousing are avoided, and the acknowledgment of the warehousing is further improved. Preferably, the verification will be performed. The step of importing the successful data record into the database includes: importing the data record of the school-risk success into the database in a batch manner; if the current batch of the school-risk successful data record import fails, the The current batch of the school-risk successful data record and the serial number corresponding to each of the current batch of the school-risk successful data records are saved in the inbound failure record file (Fail file) And re-importing the data record saved in the warehousing failure record file into the database in a single manner, and if the import fails, saving the data record that failed the import into the error record file. In the preferred embodiment By using the batch warehousing method, the warehousing efficiency is ensured; the record of the failure of the 4 metrics into the database is further added to the database, thereby effectively avoiding The defect that the original data record is leaked due to a system abnormality or the like further improves the accuracy of the storage. Preferably, the step of importing the successfully verified data record into the database further includes: determining the database Whether the currently used data table satisfies a predetermined rule; if not, the currently used data table is used to store the successfully verified data record; if satisfied, another idle data in the database is used The table stores the data record of the verification success. In the preferred embodiment, the data record is stored by using a plurality of data tables, thereby preventing the query or the secondary processing from being affected due to the excessive amount of data in the single table. Preferably, the step of using the other idle data table in the database to store the data record of the successful school-to-risk comprises: determining whether the currently used data table is preset for storing The last data table of the plurality of data tables stored in the data record; if the currently used data table is the last data table of the plurality of data tables for storing the data record, The preset first data table of the plurality of data tables for storing the data record to store the data record of the verification success; if the currently used data table is not the preset for storage The last data table of the plurality of data tables of the data record, the next data table of the currently used data table is used in the preset plurality of data tables for storing the data records to store A data record that verifies the success. In the preferred embodiment, the storage space is effectively saved by recycling the data table. Preferably, the predetermined rule includes at least one of the following: the amount of data stored in the currently used data table exceeds a predetermined threshold; and the currently used data table is used for more than a predetermined length of time. Embodiment 2 For convenience of description, in the embodiment of the present invention, the following identification is made: data file F (File), file record R (Record), data table T (Table), serial number SN (Serial Number), batch storage The failure record temporarily saves the directory Fail file, and saves the data in the same table with M table structures. In this embodiment, the data warehousing process includes the following steps: Step S1: fetching n records from the data file F, assigning a serial number to each record, performing validity check on the fields that need to be logged, and verifying failure The record is saved to the specified Error file for future reference, and the verification is successfully packaged into a bulk packet or data file. Step S2: The data prepared in the previous step is batch-inputted into the database, and the target data table Tn. Step S3: If the batch storage fails, save the batch record to the Fail file (the SN corresponding to each record needs to be saved at the same time). Step S4: Save the position that F has already processed. If the file has not been processed, return to step S1 to continue the storage. Step S5: F batch storage is completed. Step S6: The data record of the failed inbound storage saved in the Fail file is replenished into the library one by one with an insert statement. Step S7: If the Tn data amount reaches the set value, the target data table is changed to Τη+1. If η+1 reaches the last table, the entry starts from T1. Step S8: Processing the next data file. Preferably, for the data table processing, since the data amount in one data table exceeds the efficiency of the application query data, the preferred embodiment uses Μ (Μ>1) data tables to recycle and share the massive data. In addition, in order to recycle, it is necessary to periodically clean up the data table to ensure that the Τη+1 table data is already empty when the inbound table Tn is transferred to the inbound table Tn+1. How to clean up the contents of the table is determined by the specific application, such as: Synchronize the required information to another query table, and index the table, and supply the idle query. The warehousing system works most of the time in the batch mode normal warehousing process. In this process, the original record preprocessing is performed, the error records are eliminated, and the fields required by the application are filtered out for batch storage; at the appropriate time (according to the specific application setting) ) Go to the Fail file to add data records. At this time, the data records in the Fail file that failed to be bulk-inbound are inserted into the database one by one. The embodiment of the present invention introduces a serial number field, which prevents the heavy order by controlling the serial number, and has an obvious advantage in comparing the inbound rate compared with the usual establishment of the primary key on the data table; using a combination of the two storage methods , not only ensure the efficiency of warehousing, but also take into account the accuracy of the warehousing, prevent leakage; maintain the progress of the current warehousing file, can restore the warehousing after the system is abnormal, no need to manually process the massive data files, the degree of automation High to reduce the burden on maintenance personnel. Of course, in order to ensure the accuracy of the data, when the insert mode fails to record in a single batch, it is necessary to first determine whether the serial number of the record already exists in the table, which will reduce the efficiency to a certain extent. However, considering the probability of batch failure is small, the number of inserts into the insert mode is relatively small, so it has little effect on the overall storage performance. Embodiment 3 FIG. 3 is another preferred flowchart of a data storage method according to an embodiment of the present invention, which includes the following steps:
S302. 从入库目录获取数据文件 F。 S302. Obtain the data file F from the inbound directory.
S304. 从数据文件 F中读取数据记录。 S304. Read the data record from the data file F.
S306. 对数据记录进行预处理: 分拣入库字段、 校验、 分配流水号。 S308. 若预处理成功, 将记录组装成批量数据块; 若预处理失败, 将该 记录保存到 Error (错误) 文件。 S306. Preprocessing the data record: sorting the inbound field, verifying, and assigning the serial number. S308. If the preprocessing is successful, the records are assembled into a batch data block; if the preprocessing fails, the record is saved to an Error file.
S310. 将预处理成功的数据记录批量入库到表 Tn;优选的,以 BCP( Bulk Copy Program, 批量导入方式) 方式将上述数据记录批量入库。 S312. 若入库失败, 将该批数据记录保存到 Fail (失败) 文件。 S310. Batching the data records that have been successfully processed into the table Tn; preferably, the data records are batched into the library in a BCP (Bulk Copy Program) manner. S312. If the warehousing fails, save the batch data record to the Fail file.
S314. 更新入库状态文件。 S314. Update the inbound status file.
S316 若数据文件 F未处理完, 跳转 S304, 继续读取数据记录; 若数据 文件 F处理完, 则转至 S318。 S316 If the data file F has not been processed, the process jumps to S304 and continues to read the data record; if the data file F is processed, the process goes to S318.
S318. 若 Fail文件中存在入库失败的数据记录, 则使用 insert方式单条 将 Fail文件中保存的入库失败的数据记录补入到数据表 Tn。 S318. If there is a data record in the Fail file that fails to be inbound, use the insert method to add the data record of the failed storage in the Fail file to the data table Tn.
S320. 若数据表 Tn数据量达到设置值, 换下一张表进行数据入库。 S320. If the data amount of the data table Tn reaches the set value, replace the next table for data storage.
S322. F入库完毕、 更新入库^ I 态文件。 总体来说, 根据本发明实施例的数据入库方法对数据文件中的原始记录 进行校验, 剔除有明显错误数据行以提高入库成功率; 为每条记录分配唯一 流水号(此字段无需建立索引;), 保证数据入库不重复; 校验无误的记录以批 量方式入库, 若这批数据入库失败则保存之, 后续用 insert方式单条再次补 录入库; 入库过程中自动切换目标数据表。 此方法对原始记录预处理, 可入库多种数据文件, 并适应多种目标数据 库; 批量方式与 insert方式组合使用, 优势互补, 同时满足快速入库、 实时 查询和数据精确性需求。 其优势有: S322. F is completed and updated into the library. In general, the data warehousing method according to an embodiment of the present invention checks the original records in the data file, and eliminates the erroneous data rows to improve the warehousing success rate; assigns a unique serial number to each record (this field does not need to be Indexing ;), to ensure that the data is not stored in the warehouse; the correct records are stored in the batch mode, if the batch of data fails to be stored, the file is saved, and then the insert is re-submitted into the library by the insert method; Switch the target data table. This method preprocesses the original records, can be used to store a variety of data files, and adapt to a variety of target data bases; batch mode and insert mode combination, complementary advantages, while meeting the requirements of fast warehousing, real-time query and data accuracy. Its advantages are:
1 ) 高效性 系统产生的离线数据文件一般没有错误, 批量入库方式成功率高, 总体 入库效率高; 适时保存当前文件的入库状态, 入库系统异常重启时可继续入 库该文件, 无需从头单条插入, 尽可能提高入库效率; 入库目标数据表可自 动切换, 防止单张表内数据量过大而影响查询或二次处理的效率。 1) The offline data files generated by the high-efficiency system generally have no errors, the batch storage method has high success rate, and the overall storage efficiency is high; the current file storage status is saved in time, and the file can be continuously stored when the storage system is abnormally restarted. It is not necessary to insert from the beginning, and the storage efficiency is maximized. The inbound target data table can be automatically switched to prevent the data volume in a single table from being too large and affecting the efficiency of the query or secondary processing.
2 ) 准确性 每个记录分配唯一流水号, 保证记录不会重复入库; 批量入库失败的记 录以 insert方式二次入库, 正常 ΐ己录不会遗漏入库。 3 ) 通用性 支持多种格式的数据文件、 多种目标数据库。 实施例 4 图 4是根据本发明实施例的数据入库装置的一种优选的结构示意图, 其 包括: 读取单元 402 , 设置为从数据文件中取出一条或多条数据记录; 校验 单元 404 , 连接至读取单元 402 , 其中, 校-险单元 404设置为对所取出的数 据记录进行校 -险; 导入单元 406 , 连接至校-险单元 404 , 其中, 导入单元 406 设置为将校验成功的数据记录导入数据库。 通过本发明, 在数据入库时对数 据记录进行校验, 从而提高了数据入库的精确性。 优选的, 所述校验单元 404对所取出的数据记录进行校验的步骤包括: 判断所取出的每条数据记录中的字段是否满足预设的格式; 若满足, 则所判 断的这条数据记录检验成功; 若不满足, 则将所判断的数据记录保存到错误 记录文件中。 例如, 判断数据记录中的时间信息字段的格式是否符合预定的 格式, 如, 年 -月-日等。 优选的, 导入单元 406包括: 分配模块, 连接至校-险单元 404 , 其中, 该分配模块设置为为每一条所述校验成功的数据记录分配流水号, 其中, 每 一条所述校-险成功的数据记录对应的流水号在所述数据库中是唯一的流水 号; 导入模块, 连接至上述分配模块, 其中, 该导入模块设置为将分配有所 述流水号的所述校-险成功的数据记录导入数据库中。 在本优选的实施例中, 通过为每条数据记录分配了唯一的流水号, 避免了重复入库和遗漏入库, 进 一步提高了入库的精确性。 优选的, 居本发明实施例的数据入库装置还包括: 存储单元 408 , 连 接至校-险单元 404 , 其中, 存储单元 408设置为在所述对所取出的数据记录 进行校验之后, 将校验失败的数据记录保存在错误记录文件中; 将入库失败 的数据记录保存到入库失败记录文件中。 在这种场景下, 所述导入单元 406 还设置为以单条方式将所述入库失败记录文件中保存的数据记录重新导入到 所述数据库中。 优选的, 导入单元 406将校验成功的数据记录导入数据库的步骤包括: 以批量方式将所述校-险成功的数据记录导入到所述数据库中; 若当前一批所 述校-险成功的数据记录导入失败, 则将所述当前一批所述校-险成功的数据记 录以及与所述当前一批所述校验成功的数据记录中的每一条数据记录对应的 流水号保存到入库失败记录文件 (Fail文件) 中; 以单条方式将所述入库失 败记录文件中保存的数据记录重新导入到所述数据库中, 若导入失败, 则将 导入失败的数据记录保存到错误记录文件中。 在本优选的实施例中, 通过使 用批量入库方式, 保证了入库效率; 通过对批量入库失败的记录进一步补入 到数据库中, 从而有效地避免了因为系统异常等原因而将原本正确的数据记 录漏入的缺陷, 进一步提高了入库的精确性。 优选的, 在导入单元 406以单条方式^ 1所述入库失败 ΐ己录文件中保存的 数据记录重新导入到所述数据库的过程中, 所述校-险单元 404对所述入库失 败记录文件中保存的数据记录进行校验; 若校验成功, 则所述导入单元 406 将所述入库失败记录文件中校验成功的数据记录导入所述数据库; 若校验失 败, 则将所述入库失败记录文件中校 -险失败的数据记录保存在错误记录文件 中, 以便后续备查。 在本优选的实施例中, 通过对入库失败记录文件中数据 记录进一步补入到数据库中, 从而有效地避免了因为系统异常而将原本正确 的数据记录删除的缺陷, 进一步提高了入库的精确性。 优选的, 在上述各个优选的实施例中, 所述导入单元 406还包括: 判断 模块 ,设置为判断所述数据库中当前使用的数据表是否满足预定规则, 其中, 所述预定规则包括以下至少之一: 所述当前使用的数据表所存储的数据量超 过预定的阈值; 所述当前使用的数据表被使用的时间超过预定的时间长度; 换表模块, 连接至上述判断模块, 其中, 该换表模块设置为在不满足所述预 定规则时, 使用所述当前使用的数据表来存储所述校验成功的数据记录; 在 满足所述预定规则时, 使用所述数据库中另一张空闲的数据表来存储所述校 验成功的数据记录。 在本优选的实施例中, 通过使用多张数据表来存储数据 记录, 防止了由于单张表内数据量过大而影响查询或二次处理的效率。 优选的, 导入单元 406的保存模块使用所述数据库中另一张空闲的数据 表来存储所述校验成功的数据记录的步骤包括: 判断所述当前使用的数据表 是否为预设的用于存储数据记录的多张数据表中的最后一张数据表; 若是, 则使用所述预设的用于存储数据记录的多张数据表中的第一张数据表来存储 所述校-险成功的数据记录; 若否, 则在所述预设的用于存储数据记录的多张 数据表中使用所述当前使用的数据表的下一张数据表来存储所述校验成功的 数据记录。 在本优选的实施例中, 通过循环使用数据表, 有效地节省了存储 空间。 需要说明的是, 在附图的流程图示出的步骤可以在诸如一组计算机可执 行指令的计算机系统中执行, 并且, 虽然在流程图中示出了逻辑顺序, 但是 在某些情况下, 可以以不同于此处的顺序执行所示出或描述的步骤。 显然, 本领域的技术人员应该明白, 上述的本发明的各模块或各步骤可 以用通用的计算装置来实现, 它们可以集中在单个的计算装置上, 或者分布 在多个计算装置所组成的网络上, 可选地, 它们可以用计算装置可执行的程 序代码来实现, 从而, 可以将它们存储在存储装置中由计算装置来执行, 或 者将它们分别制作成各个集成电路模块, 或者将它们中的多个模块或步骤制 作成单个集成电路模块来实现。 这样, 本发明不限制于任何特定的硬件和软 件结合。 以上所述仅为本发明的优选实施例而已, 并不用于限制本发明, 对于本 领域的技术人员来说, 本发明可以有各种更改和变化。 凡在本发明的^"神和 原则之内, 所作的任何修改、 等同替换、 改进等, 均应包含在本发明的保护 范围之内。 2) Accuracy Each record is assigned a unique serial number to ensure that the records will not be re-stocked; the records that fail to be bulk-inbound are placed in the library in the insert mode, and the normal records will not be missed. 3) Universality supports data files in multiple formats and multiple target databases. Embodiment 4 FIG. 4 is a schematic diagram of a preferred structure of a data storage device according to an embodiment of the present invention, comprising: a reading unit 402 configured to take one or more data records from a data file; and a verification unit 404 And connected to the reading unit 402, wherein the school-risk unit 404 is configured to perform the school-risk on the retrieved data record; the import unit 406 is connected to the school-risk unit 404, wherein the import unit 406 is set to check Successful data records are imported into the database. Through the invention, the data record is verified when the data is stored in the library, thereby improving the accuracy of the data storage. Preferably, the step of verifying the extracted data record by the checking unit 404 includes: determining whether a field in each of the extracted data records satisfies a preset format; if satisfied, the determined data The record verification is successful; if not, the determined data record is saved to the error log file. For example, it is judged whether the format of the time information field in the data record conforms to a predetermined format, such as year-month-day. Preferably, the importing unit 406 includes: an allocating module, connected to the school-risk unit 404, wherein the allocating module is configured to allocate a serial number for each of the data records that are successfully verified, wherein each of the school-risks The serial number corresponding to the successful data record is the only serial number in the database; the import module is connected to the distribution module, wherein the import module is configured to successfully assign the school-risk to which the serial number is assigned The data record is imported into the database. In the preferred embodiment, by assigning a unique serial number to each data record, duplicate warehousing and missing warehousing are avoided, further improving the accuracy of warehousing. Preferably, the data storage device of the embodiment of the present invention further includes: a storage unit 408 connected to the school-risk unit 404, wherein the storage unit 408 is configured to, after the pair of the retrieved data records are verified, The data record that failed the verification is saved in the error log file; the data record that failed the inbound storage is saved in the inbound failure record file. In this scenario, the import unit 406 is further configured to re-import the data records saved in the inbound failure log file into the database in a single manner. Preferably, the importing unit 406, the step of importing the successfully verified data record into the database comprises: importing the data record of the school-risk success into the database in a batch manner; if the current batch of the school-risk is successful If the data record import fails, the current batch of the school-risk successful data record and the serial number corresponding to each of the current batch of the successfully verified data records are saved to the storage. In the failure record file (Fail file); re-import the data record saved in the inbound failure record file into the database in a single way, and if the import fails, save the data record that failed to be imported into the error record file. . In the preferred embodiment, the warehousing efficiency is ensured by using the batch warehousing method; the record of the batch warehousing failure is further added to the database, thereby effectively avoiding the original reason due to system abnormality and the like. The data record leaks into the defect, further improving the accuracy of the storage. Preferably, in the process of introducing a single unit 406 in the embodiment 1 ^ ΐ storage preservation have failed file recorded in the data recording reintroduced into the database, the correction - the storage unit 404 risk failure log The data record saved in the file is verified; if the verification is successful, the import unit 406 imports the data record successfully verified in the inbound failure record file into the database; if the verification fails, the The data record of the failure of the warehousing failure record file in the school-risk is saved in the error log file for subsequent reference. In the preferred embodiment, the data record in the warehousing failure record file is further added to the database, thereby effectively avoiding the defect of deleting the original correct data record due to the system exception, further improving the warehousing. Accuracy. Preferably, in each of the above preferred embodiments, the importing unit 406 further includes: a determining module, configured to determine whether the data table currently used in the database satisfies a predetermined rule, wherein the predetermined rule includes at least the following a: the amount of data stored in the currently used data table exceeds a predetermined threshold; the currently used data table is used for more than a predetermined length of time; the meter changing module is connected to the determining module, wherein the The table module is configured to use the currently used data table to store the data record of the verification success when the predetermined rule is not met; when the predetermined rule is met, another idle space in the database is used A data table is used to store the data record for which the verification was successful. In the preferred embodiment, by using a plurality of data tables to store data records, the efficiency of the query or secondary processing is prevented due to the excessive amount of data in a single table. Preferably, the saving module of the importing unit 406 uses another idle data table in the database to store the data record of the verification success: determining whether the currently used data table is preset for Storing a last data table of the plurality of data tables of the data record; if yes, using the first one of the plurality of data tables for storing the data record to store the school-risk success Data record; if not, at the preset multiple sheets for storing data records The next data table of the currently used data table is used in the data table to store the data record for which the verification is successful. In the preferred embodiment, the storage space is effectively saved by recycling the data table. It should be noted that the steps shown in the flowchart of the accompanying drawings may be performed in a computer system such as a set of computer executable instructions, and, although the logical order is shown in the flowchart, in some cases, The steps shown or described may be performed in an order different than that herein. Obviously, those skilled in the art should understand that the above modules or steps of the present invention can be implemented by a general-purpose computing device, which can be concentrated on a single computing device or distributed over a network composed of multiple computing devices. Alternatively, they may be implemented by program code executable by the computing device, such that they may be stored in the storage device by the computing device, or they may be separately fabricated into individual integrated circuit modules, or they may be Multiple modules or steps are made into a single integrated circuit module. Thus, the invention is not limited to any specific combination of hardware and software. The above is only the preferred embodiment of the present invention, and is not intended to limit the present invention, and various modifications and changes can be made to the present invention. Any modifications, equivalent substitutions, improvements, etc. made within the scope of the present invention are intended to be included within the scope of the present invention.

Claims

权 利 要 求 书 Claim
1. 一种数据入库方法, 包括: 1. A data warehousing method, including:
从数据文件中取出一条或多条数据记录;  Take one or more data records from the data file;
对所取出的数据记录进行校验;  Verify the data records taken;
将校验成功的数据记录导入数据库。  Import the data record with successful verification into the database.
2. 根据权利要求 1所述的方法, 其中, 所述对所取出的数据记录进行校 -险的步 4聚包括: 2. The method according to claim 1, wherein the step of performing the calibration of the extracted data records comprises:
判断所取出的每条数据记录中的字段是否满足预设的格式; 若满足, 则所判断的数据记录检验成功; 若不满足, 则将所判断 的数据记录保存到错误记录文件中。  It is judged whether the field in each data record taken out meets a preset format; if it is satisfied, the determined data record is successfully checked; if not, the determined data record is saved in the error record file.
3. 根据权利要求 1所述的方法, 其中, 所述将校验成功的数据记录导入 数据库的步骤包括: 3. The method according to claim 1, wherein the step of importing the successfully verified data record into the database comprises:
为每一条所述校验成功的数据记录分配流水号, 其中, 每一条所 述校验成功的数据记录对应的流水号在所述数据库中是唯一的流水 号;  And assigning a serial number to each of the data records that are successfully verified, wherein each of the data records corresponding to the successfully verified data record is a unique serial number in the database;
将分配有所述流水号的所述校-险成功的数据记录导入数据库中。  The school-risk successful data record to which the serial number is assigned is imported into the database.
4. 根据权利要求 1所述的方法, 其中, 所述将校验成功的数据记录导入 数据库的步骤包括: 4. The method according to claim 1, wherein the step of importing the successfully verified data record into the database comprises:
以批量方式将所述校-险成功的数据记录导入到所述数据库中; 若当前一批所述校-险成功的数据记录导入失败, 则将所述当前一 批所述校-险成功的数据记录以及与所述当前一批所述校-险成功的数据 记录中的每一条数据记录对应的流水号保存到入库失败记录文件中; 以单条方式将所述入库失败记录文件中保存的数据记录重新导入 到所述数据库中, 若导入失败, 则将导入失败的数据记录保存到错误 ΐ己录文件中。  Importing the school-risk successful data record into the database in batch mode; if the current batch of the school-risk successful data record import fails, the current batch of the school-risk success The data record and the serial number corresponding to each data record in the current batch of the school-risk successful data records are saved in the storage failure record file; and the storage failure record file is saved in a single manner. The data record is re-imported into the database. If the import fails, the data record that failed to be imported is saved to the error file.
5. 根据权利要求 1至 4中任一项所述的方法, 其中, 所述将校验成功的 数据记录导入数据库的步骤包括: 判断所述数据库中当前使用的数据表是否满足预定规则; 若不满足, 则使用所述当前使用的数据表来存储所述校验成功的 数据记录; The method according to any one of claims 1 to 4, wherein the step of importing the successfully verified data record into the database comprises: Determining whether the data table currently used in the database satisfies a predetermined rule; if not, using the currently used data table to store the data record with successful verification;
若满足, 则使用所述数据库中另一张空闲的数据表来存储所述校 -险成功的数据记录。  If so, another idle data table in the database is used to store the data record for the success of the school.
6. 根据权利要求 5所述的方法, 其中, 所述使用所述数据库中另一张空 闲的数据表来存储所述校验成功的数据记录的步骤包括: 6. The method according to claim 5, wherein the step of storing the successfully verified data record by using another idle data table in the database comprises:
判断所述当前使用的数据表是否为预设的用于存储数据记录的多 张数据表中的最后一张数据表;  Determining whether the currently used data table is the last one of the plurality of data tables for storing the data record;
若所述当前使用的数据表是所述预设的用于存储数据记录的多张 数据表中的最后一张数据表, 则使用所述预设的用于存储数据记录的 多张数据表中的第一张数据表来存储所述校验成功的数据记录;  If the currently used data table is the last one of the plurality of data tables for storing the data record, using the preset plurality of data tables for storing the data record The first data table to store the data record of the verification success;
若所述当前使用的数据表不是所述预设的用于存储数据记录的多 张数据表中的最后一张数据表, 则在所述预设的用于存储数据记录的 多张数据表中使用所述当前使用的数据表的下一张数据表来存储所述 校验成功的数据记录。  If the currently used data table is not the last one of the plurality of data tables for storing the data record, in the preset multiple data tables for storing the data record The data record of the verification success is stored using the next data table of the currently used data table.
7. 根据权利要求 5所述的方法, 其中, 所述预定规则包括以下至少之一: 7. The method according to claim 5, wherein the predetermined rule comprises at least one of the following:
所述当前使用的数据表所存储的数据量超过预定的阈值; 所述当前使用的数据表被使用的时间超过预定的时间长度。  The amount of data stored by the currently used data table exceeds a predetermined threshold; the currently used data table is used for more than a predetermined length of time.
8. —种数据入库装置, 包括: 8. A data storage device, including:
读取单元, 设置为从数据文件中取出一条或多条数据记录; 校验单元, 设置为对所取出的数据记录进行校验;  a reading unit, configured to take one or more data records from the data file; and a verification unit configured to verify the retrieved data records;
导入单元, 设置为将校验成功的数据记录导入数据库。  Import unit, set to import data records with successful verification into the database.
9. 根据权利要求 8所述的装置, 其中, 所述导入单元包括: 9. The device according to claim 8, wherein the importing unit comprises:
分配模块,设置为为每一条所述校验成功的数据记录分配流水号 , 其中, 每一条所述校-险成功的数据记录对应的流水号在所述数据库中 是唯一的流水号; 导入模块, 设置为将分配有所述流水号的所述校验成功的数据记 录导入数据库中。 An allocation module, configured to allocate a serial number for each of the data records that are successfully verified, wherein each of the data records corresponding to the successful data record of the school-risk is a unique serial number in the database; The import module is configured to import the data record of the verification successfully assigned the serial number into the database.
10. 根据权利要求 8所述的装置, 其中, 还包括: 10. The device according to claim 8, further comprising:
存储单元, 设置为在所述对所取出的数据记录进行校验之后, 将 校 -险失败的数据记录保存在错误记录文件中; 将入库失败的数据记录 保存到入库失败记录文件中;  a storage unit, configured to save the data record of the failed test data in the error record file after the data record of the extracted data is verified; and save the data record of the failed storage to the storage failure record file;
其中, 所述导入单元还设置为以单条方式^ 1所述入库失败 ΐ己录文 件中保存的数据记录重新导入到所述数据库中。 Wherein the unit is further arranged to import a single failure mode storage ΐ ^ 1 has the recorded file data records saved into the database again.
11. 根据权利要求 8至 10中任一项所述的装置, 其中, 所述导入单元还包 括: The device according to any one of claims 8 to 10, wherein the importing unit further comprises:
判断模块, 设置为判断所述数据库中当前使用的数据表是否满足 预定规则, 其中, 所述预定规则包括以下至少之一: 所述当前使用的 数据表所存储的数据量超过预定的阈值; 所述当前使用的数据表被使 用的时间超过预定的时间长度;  The determining module is configured to determine whether the data table currently used in the database satisfies a predetermined rule, where the predetermined rule includes at least one of the following: the amount of data stored in the currently used data table exceeds a predetermined threshold; The currently used data table is used for more than a predetermined length of time;
换表模块, 设置为在不满足所述预定规则时, 使用所述当前使用 的数据表来存储所述校-险成功的数据记录; 在满足所述预定规则时, 使用所述数据库中另一张空闲的数据表来存储所述校验成功的数据记 录。  a table changing module, configured to use the currently used data table to store the data record of the school-risk success when the predetermined rule is not satisfied; when the predetermined rule is met, use another one in the database An idle data table is stored to store the data record for which the verification was successful.
PCT/CN2011/072076 2010-10-13 2011-03-23 Method and device for importing data into database WO2012048555A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201010506274.X 2010-10-13
CN201010506274XA CN101980187A (en) 2010-10-13 2010-10-13 Data warehousing method and device

Publications (1)

Publication Number Publication Date
WO2012048555A1 true WO2012048555A1 (en) 2012-04-19

Family

ID=43600689

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2011/072076 WO2012048555A1 (en) 2010-10-13 2011-03-23 Method and device for importing data into database

Country Status (2)

Country Link
CN (1) CN101980187A (en)
WO (1) WO2012048555A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021051627A1 (en) * 2019-09-19 2021-03-25 平安科技(深圳)有限公司 Database-based batch importing method, apparatus and device, and storage medium

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101980187A (en) * 2010-10-13 2011-02-23 中兴通讯股份有限公司 Data warehousing method and device
CN102750377B (en) * 2012-06-21 2016-07-27 深圳市谷米科技有限公司 Mass data storage and search method
CN103365989B (en) * 2013-07-08 2016-10-26 中国中医科学院中医临床基础医学研究所 Electronic patient record clinical data check method and system
CN104166738A (en) * 2014-09-04 2014-11-26 北京国双科技有限公司 Database query processing method and device
CN105045839B (en) * 2015-06-30 2018-09-04 上海瀚之友信息技术服务有限公司 A kind of batch data operating method
CN106649414B (en) * 2015-11-04 2020-01-31 阿里巴巴集团控股有限公司 Method and equipment for pre-detecting data anomalies of data warehouses
CN106802898B (en) * 2015-11-26 2020-09-04 北大方正集团有限公司 Data entry method and device
CN109525863A (en) * 2017-09-20 2019-03-26 北京国双科技有限公司 Viewing-data methods of exhibiting and device
CN110019244A (en) 2018-05-14 2019-07-16 网联清算有限公司 Data processing method, device and computer readable storage medium
CN109710626B (en) * 2018-12-29 2021-08-24 北京金山安全软件有限公司 Data warehousing management method and device, electronic equipment and storage medium
CN112364040B (en) * 2020-12-01 2024-05-10 北京锐安科技有限公司 Data checking method, device, medium and electronic equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101515291A (en) * 2009-03-26 2009-08-26 北京泰合佳通信息技术有限公司 Method for leading data into database in a batch way and system thereof
CN101719168A (en) * 2010-01-25 2010-06-02 浪潮通信信息系统有限公司 Algorithm configurability-based universal data loading method
CN101751397A (en) * 2008-12-04 2010-06-23 中国移动通信集团安徽有限公司 Method and device for importing files into database
CN101799832A (en) * 2010-03-31 2010-08-11 山东高效能服务器和存储研究院 Method for rapidly writing mass data into database once
CN101980187A (en) * 2010-10-13 2011-02-23 中兴通讯股份有限公司 Data warehousing method and device

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7912820B2 (en) * 2003-06-06 2011-03-22 Microsoft Corporation Automatic task generator method and system
CN100552679C (en) * 2006-04-20 2009-10-21 上海宝信软件股份有限公司 Regular text data is imported the method for database
CN101094026A (en) * 2006-06-19 2007-12-26 上海全成通信技术有限公司 Method for synchronizing, transmitting mass data, and inputting mass data to database
CN100524306C (en) * 2006-06-21 2009-08-05 千乡万才科技(中国)有限公司 Method for leading-in and deriving data in browser/servicer system
CN101425124A (en) * 2007-11-02 2009-05-06 王志明 Automatic code-printing optical mark reader

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101751397A (en) * 2008-12-04 2010-06-23 中国移动通信集团安徽有限公司 Method and device for importing files into database
CN101515291A (en) * 2009-03-26 2009-08-26 北京泰合佳通信息技术有限公司 Method for leading data into database in a batch way and system thereof
CN101719168A (en) * 2010-01-25 2010-06-02 浪潮通信信息系统有限公司 Algorithm configurability-based universal data loading method
CN101799832A (en) * 2010-03-31 2010-08-11 山东高效能服务器和存储研究院 Method for rapidly writing mass data into database once
CN101980187A (en) * 2010-10-13 2011-02-23 中兴通讯股份有限公司 Data warehousing method and device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021051627A1 (en) * 2019-09-19 2021-03-25 平安科技(深圳)有限公司 Database-based batch importing method, apparatus and device, and storage medium

Also Published As

Publication number Publication date
CN101980187A (en) 2011-02-23

Similar Documents

Publication Publication Date Title
WO2012048555A1 (en) Method and device for importing data into database
RU2449358C1 (en) Distributed file system and data block consistency managing method thereof
US10776396B2 (en) Computer implemented method for dynamic sharding
US10222987B2 (en) Data deduplication with augmented cuckoo filters
CN109376160B (en) Data synchronization method, device, computer equipment and storage medium
CN107391758B (en) Database switching method, device and equipment
WO2018149271A1 (en) Data query method, device and calculating apparatus
CN111078662B (en) Block chain data storage method and device
CN106981024B (en) Transaction limit calculation processing system and processing method thereof
CN111026767B (en) Block chain data storage method and device and hardware equipment
CN110555770A (en) Block chain world state checking and recovering method based on incremental hash
CN113419823A (en) Alliance chain system suitable for high-concurrency affairs and design method thereof
CN114092252A (en) Block chain transaction execution method, device, equipment and readable storage medium
US20230376226A1 (en) Storage block management information synchronous recording method and system, and terminal and storage medium
CN114860745A (en) Database expansion method based on artificial intelligence and related equipment
CN114996240A (en) Data table verification method, device, equipment, storage medium and program
CN114564500A (en) Method and system for implementing structured data storage and query in block chain system
CN113326232A (en) Data updating method and device
CN113220777A (en) Service data processing method and device, computer equipment and storage medium
US11372838B2 (en) Parallel processing of changes in a distributed system
CN114090547A (en) Data migration method and device, computer equipment and storage medium
CN111241082B (en) Data correction method and device
US12019611B1 (en) Method and apparatus for comparing two database tables without regard to an original order of rows in either database table
CN114900531B (en) Data synchronization method, device and system
CN115756408A (en) Method for realizing one-key import of accounting scene based on java

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11831952

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11831952

Country of ref document: EP

Kind code of ref document: A1