CN116126873B - Data summarization method and device based on nonstandard data table and storage medium - Google Patents

Data summarization method and device based on nonstandard data table and storage medium Download PDF

Info

Publication number
CN116126873B
CN116126873B CN202310424092.5A CN202310424092A CN116126873B CN 116126873 B CN116126873 B CN 116126873B CN 202310424092 A CN202310424092 A CN 202310424092A CN 116126873 B CN116126873 B CN 116126873B
Authority
CN
China
Prior art keywords
data
nonstandard
summary table
matching
data table
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310424092.5A
Other languages
Chinese (zh)
Other versions
CN116126873A (en
Inventor
杜成
缪洪波
王鑫
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhongrui Fangsheng Human Resources Technology Beijing Co ltd
Original Assignee
Zhongrui Fangsheng Human Resources Technology Beijing Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhongrui Fangsheng Human Resources Technology Beijing Co ltd filed Critical Zhongrui Fangsheng Human Resources Technology Beijing Co ltd
Priority to CN202310424092.5A priority Critical patent/CN116126873B/en
Publication of CN116126873A publication Critical patent/CN116126873A/en
Application granted granted Critical
Publication of CN116126873B publication Critical patent/CN116126873B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2282Tablespace storage structures; Management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/254Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/283Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP

Abstract

The embodiment of the application discloses a data summarization method, a device and a storage medium based on a nonstandard data table, wherein the method comprises the following steps: acquiring a nonstandard summary table and at least one nonstandard data table, analyzing the nonstandard summary table and the nonstandard data table, and acquiring corresponding header and data; acquiring matching fields of a non-standard summary table and a non-standard data table, searching the identity of data corresponding to the matching fields in a system database based on the matching fields, and matching the identity to the data of the non-standard summary table and the non-standard data table based on a matching result; and acquiring mapping relations between the headers of the nonstandard summary table and/or between the nonstandard summary table and the headers of the nonstandard data table, and backfilling data in the nonstandard data table into the nonstandard summary table based on the mapping relations and the identification marks, the line numbers and/or the column numbers corresponding to the headers. Therefore, the method has the advantages of high flexibility, labor saving, efficiency improvement, and data summarization accuracy improvement through data matching of the identity marks in the database.

Description

Data summarization method and device based on nonstandard data table and storage medium
Technical Field
The application relates to the technical field of computer information processing, in particular to a data summarizing method, device and storage medium based on a nonstandard data table.
Background
With the rapid development of big data technology and the continuous enhancement of data-driven business awareness, more and more data are applied to the analysis of enterprise daily business. In the daily data processing work of enterprises, more data is collected and arranged manually besides the data stored in the database generated by the application system. The manually collected and processed data and other data are integrated, processed and summarized and then are applied to scenes such as business analysis, report forms, wind control and the like.
At present, the main tool for data summarization is Excel, which is taken as electronic form software in Microsoft office series office software, can be used for making electronic forms, completing a plurality of complex data operations, analyzing and predicting data, and becomes a great deal of convenience assistant for user management companies and personal finance and statistics data at home and abroad and drawing various specialized forms.
However, the amount of original data generated in large enterprise management is very large, the data is sourced from different units in departments, the organization mode, structure and form of the data are different, and the data summarization is to extract various source data from various local applications as an original data table and input the original data table into an Excel summarization table according to a unified specification. When the original system is a manual processing system, various data are scattered in various original materials, the formats of the extracted data tables are different, and the data collection is always a very time-consuming and labor-consuming work. At this time, how to ensure the efficiency and accuracy of data summarization is important.
Disclosure of Invention
An object of the embodiments of the present application is to provide a data summarizing method, apparatus and storage medium based on a nonstandard data table, so as to solve the problem of low data summarizing efficiency and accuracy when Excel is used for data summarizing in the prior art.
In order to achieve the above objective, an embodiment of the present application provides a data summarizing method based on a nonstandard data table, including: acquiring a nonstandard summary table and at least one nonstandard data table, analyzing the nonstandard summary table and the nonstandard data table, and acquiring corresponding header and data;
acquiring matching fields of the nonstandard summary table and the nonstandard data table, searching for the identity of data corresponding to the matching fields in a system database based on the matching fields, and matching the identity to the data of the nonstandard summary table and the nonstandard data table based on a matching result;
and acquiring mapping relations between the headers of the nonstandard summary table and/or between the nonstandard summary table and the headers of the nonstandard data table, and backfilling data in the nonstandard data table into the nonstandard summary table based on the mapping relations and the identity, the line number and/or the column number corresponding to the headers.
Optionally, the nonstandard summary table and the nonstandard data table are Excel tables;
the method for analyzing the nonstandard summary table and the nonstandard data table comprises the following steps: the nonstandard summary table and nonstandard data table are parsed using POI tools, stream processing algorithms, and/or win32 modules.
Optionally, the parsing the nonstandard summary table and the nonstandard data table to obtain corresponding header and data includes:
analyzing and displaying the row numbers of the head and the data of the nonstandard summary table and the nonstandard data table and the column numbers of the data ending column, modifying based on the obtained modification value input by the user, and obtaining the corresponding head and data according to the modified row numbers of the head and the data of the nonstandard summary table and the nonstandard data table, the column numbers of the data ending column and the column numbers of the neglected data.
Optionally, the obtaining the matching fields of the nonstandard summary table and the nonstandard data table, searching the identity of the data corresponding to the matching fields in the system database based on the matching fields, and includes:
after the matching field is obtained, comparing the matching field with a plurality of field names customized by a user in the system, finding out the field name which is the same as the matching field in the system, and finding out the identity of the data corresponding to the matching field based on the field name.
Optionally, after obtaining the nonstandard summary table and at least one nonstandard data table, the method further includes:
and carrying out accuracy verification on the header and the data of the nonstandard summary table and the nonstandard data table, wherein the verification comprises verification on whether the header or the data in the nonstandard summary table or the nonstandard data table contains a third-party table link, an empty row or an empty column, whether the header contains an empty header, whether the header is repeated or whether the data format is correct.
Optionally, obtaining the mapping relationship between the headers of the non-standard summary table and/or the non-standard summary table and the header of the non-standard data table includes:
obtaining a direct corresponding relation between the non-standard summary table and the header of the non-standard data table set by a user, or obtaining a calculation formula set by the user, calculating the header of the non-standard data table based on the calculation formula, and then establishing a corresponding relation with the header of the non-standard summary table, or obtaining a calculation formula set by the user, calculating the header of the non-standard summary table based on the calculation formula, and then establishing a corresponding relation between the headers of the non-standard summary table.
Optionally, after backfilling the data in the nonstandard data table into the nonstandard summary table, further comprising:
and displaying and outputting the nonstandard summary table after data backfilling.
In order to achieve the above object, the present application further provides a data summarizing device based on a nonstandard data table, including: a memory; and
a processor coupled to the memory, the processor configured to:
acquiring a nonstandard summary table and at least one nonstandard data table, analyzing the nonstandard summary table and the nonstandard data table, and acquiring corresponding header and data;
acquiring matching fields of the nonstandard summary table and the nonstandard data table, searching for the identity of data corresponding to the matching fields in a system database based on the matching fields, and matching the identity to the data of the nonstandard summary table and the nonstandard data table based on a matching result;
and acquiring mapping relations between the headers of the nonstandard summary table and/or between the nonstandard summary table and the headers of the nonstandard data table, and backfilling data in the nonstandard data table into the nonstandard summary table based on the mapping relations and the identity, the line number and/or the column number corresponding to the headers.
To achieve the above object, the present application also provides a computer storage medium having stored thereon a computer program which, when executed by a machine, implements the steps of the method as described above.
The embodiment of the application has the following advantages:
the embodiment of the application provides a data summarization method based on a nonstandard data table, which comprises the following steps: acquiring a nonstandard summary table and at least one nonstandard data table, analyzing the nonstandard summary table and the nonstandard data table, and acquiring corresponding header and data; acquiring matching fields of the nonstandard summary table and the nonstandard data table, searching for the identity of data corresponding to the matching fields in a system database based on the matching fields, and matching the identity to the data of the nonstandard summary table and the nonstandard data table based on a matching result; and acquiring mapping relations between the headers of the nonstandard summary table and/or between the nonstandard summary table and the headers of the nonstandard data table, and backfilling data in the nonstandard data table into the nonstandard summary table based on the mapping relations and the identity, the line number and/or the column number corresponding to the headers.
According to the method, data summarization can be performed based on the nonstandard data table and the nonstandard summary table, the flexibility is high, the data of a plurality of nonstandard data tables are automatically backfilled into the nonstandard summary table based on the analyzed table heads and data of the nonstandard data tables and the mapping relation set by a user, the labor for data input during data summarization is greatly saved, the data summarization efficiency is improved, the data mismatching rate of the table is reduced through data matching identity identification in a system database, and the data summarization accuracy is improved. In addition, compared with the input nonstandard summary table, the output nonstandard summary table keeps the fonts, colors and styles of the input table, and meanwhile, the cell formulas which are not covered by backfill data are reserved, so that data summary based on the nonstandard summary table is greatly facilitated.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below. It will be apparent to those skilled in the art from this disclosure that the drawings described below are merely exemplary and that other embodiments may be derived from the drawings provided without undue effort.
Fig. 1 is a flowchart of a data summarizing method based on a nonstandard data table according to an embodiment of the present application;
fig. 2 is a block diagram of a data summarizing device based on a nonstandard data table according to an embodiment of the present application.
Detailed Description
Other advantages and advantages of the present application will become apparent to those skilled in the art from the following description of specific embodiments, which is to be read in light of the present disclosure, wherein the present embodiments are described in some, but not all, of the several embodiments. All other embodiments, which can be made by one of ordinary skill in the art without undue burden from the present disclosure, are within the scope of the present disclosure.
In addition, the technical features described below in the different embodiments of the present application may be combined with each other as long as they do not collide with each other.
An embodiment of the present application provides a data summarizing method based on a non-standard data table, referring to fig. 1, fig. 1 is a flowchart of a data summarizing method based on a non-standard data table provided in an embodiment of the present application, and it should be understood that the method may further include additional blocks not shown and/or blocks shown may be omitted, and the scope of the present application is not limited in this respect. The method comprises the following steps:
at step 101, a non-standard summary table and at least one non-standard data table are obtained, the non-standard summary table and the non-standard data table are parsed, and corresponding header and data are obtained.
Specifically, a nonstandard summary table uploaded by a user and at least one nonstandard data table are obtained, wherein the nonstandard summary table can be a report of data statistics required by the user finally, the table form of the nonstandard summary table can be customized by the user, the nonstandard data table is a table for providing a data source for the nonstandard summary table, and the format of the nonstandard data table can be customized or determined based on the source of extracted data. In some embodiments, the nonstandard summary table and the nonstandard data table are Excel tables, and the POI tools, the stream processing algorithm and/or the win32 module are used for analyzing the nonstandard summary table and the nonstandard data table to obtain the header and the data of the nonstandard summary table and the nonstandard data table. The data is character or numerical data under the header. The nonstandard summary table can be an employee wage nonstandard summary table of an enterprise, data in the nonstandard summary table can have null values, the table head can comprise employee names, employee unique numbers, basic income, personal income tax and the like, and the data corresponding to the employee names can be Zhang three, lisi four and the like. The nonstandard data table may be social security details of enterprise staff, income change data, and the like.
In some embodiments, the parsing the nonstandard summary table and the nonstandard data table to obtain corresponding header and data includes:
analyzing and displaying the row numbers of the head and the data of the nonstandard summary table and the nonstandard data table and the column numbers of the data ending column, modifying based on the obtained modification value input by the user, and obtaining the corresponding head and data according to the modified row numbers of the head and the data of the nonstandard summary table and the nonstandard data table, the column numbers of the data ending column and the column numbers of the neglected data.
Specifically, after the uploaded nonstandard summary table and nonstandard data table are analyzed through the analysis tool, the obtained line numbers of the line where the table heads and the data of the nonstandard summary table and the nonstandard data table are located, the column numbers of the data ending column and the like are displayed for a user to check and modify, then the modified value input by the user and/or the line number of the data which can be ignored are obtained, and the corresponding table heads and data are obtained according to the modified line numbers and column numbers.
In some embodiments, after obtaining the nonstandard summary table and at least one of the nonstandard data tables, further comprising:
and carrying out accuracy verification on the header and the data of the nonstandard summary table and the nonstandard data table, wherein the verification comprises verification on whether the header or the data in the nonstandard summary table or the nonstandard data table contains a third-party table link, an empty row or an empty column, whether the header contains an empty header, whether the header is repeated or whether the data format is correct.
Specifically, after analyzing the nonstandard summary table and the nonstandard data table, performing accuracy verification on the analyzed header and data, and when the third party table link, the empty row, the empty column, the empty header, the header repetition or the data format error are included, performing error reporting to remind a user of modification.
At step 102, matching fields of the nonstandard summary table and the nonstandard data table are obtained, the identity of the data corresponding to the matching fields is searched in a system database based on the matching fields, and the identity is matched with the data of the nonstandard summary table and the nonstandard data table based on a matching result.
In some embodiments, the obtaining the matching fields of the nonstandard summary table and the nonstandard data table, searching the identity of the data corresponding to the matching fields in the system database based on the matching fields, includes:
after the matching field is obtained, comparing the matching field with a plurality of field names customized by a user in the system, finding out the field name which is the same as the matching field in the system, and finding out the identity of the data corresponding to the matching field based on the field name.
Specifically, when the matching fields of the nonstandard summary table and the nonstandard data table set by the user are employee names, searching a field name corresponding to the employee name customized by the user from a system database, wherein the field in the system can be provided with a plurality of corresponding field aliases, such as employee names, chinese names or employee names, through the user customization, so long as the field names are the same as one of the field aliases, after the field names which are the same as the matching fields in the system are searched, acquiring the identity identifiers of the corresponding data, such as the personal data of Zhang three and the personal data of Lifour, respectively corresponding to different identity identifiers, so as to distinguish whether the data are Zhang three or Lifour.
At step 103, a mapping relationship between headers of the nonstandard summary table and/or between headers of the nonstandard summary table and the nonstandard data table is obtained, and based on the mapping relationship and the identity, the line number and/or the column number corresponding to the headers, data in the nonstandard data table is backfilled into the nonstandard summary table.
In some embodiments, obtaining the mapping between the headers of the non-standard summary table and/or the non-standard summary table and the headers of the non-standard data table comprises:
obtaining a direct corresponding relation between the non-standard summary table and the header of the non-standard data table set by a user, or obtaining a calculation formula set by the user, calculating the header of the non-standard data table based on the calculation formula, and then establishing a corresponding relation with the header of the non-standard summary table, or obtaining a calculation formula set by the user, calculating the header of the non-standard summary table based on the calculation formula, and then establishing a corresponding relation between the headers of the non-standard summary table.
Specifically, displaying the header of the parsed nonstandard summary table, setting the header of the nonstandard data table corresponding to each nonstandard summary table header in a drop-down frame by a user to establish a direct corresponding relation between the headers, or acquiring a calculation formula set by the user, calculating the header of the nonstandard data table based on the calculation formula, establishing a corresponding relation with the header of the nonstandard summary table, inquiring each piece of data under the header and an identity mark corresponding to the data based on the list number of the nonstandard data table header and the line number of the corresponding data during backfilling, inquiring each piece of data under the header and the identity mark corresponding to the data based on the list number of the nonstandard summary table header, and then backfilling the data in the nonstandard data table to the data position of the same identity mark in the corresponding nonstandard summary table, wherein the data frame in the nonstandard summary table is generally empty, and the data at the position can be covered by the data in the nonstandard data table if the data frame is not empty; in addition, a corresponding relation can be established between the headers of the nonstandard summary table by setting a calculation formula, the line number and/or the column number of the selected header-based query is only carried out on the data under the header of the selected nonstandard summary table, the identity of the data under the header is queried, and the data calculated based on the calculation formula is backfilled under the selected header.
In some embodiments, after backfilling the data in the nonstandard data table into the nonstandard summary table, further comprising:
and displaying and outputting the nonstandard summary table after data backfilling. Compared with the input nonstandard summary table, the output nonstandard summary table keeps the fonts, colors and styles of the input table, and meanwhile keeps the cell formulas which are not covered by backfill data, so that data summary based on the nonstandard summary table is greatly facilitated.
According to the method, data summarization can be performed based on the nonstandard data table and the nonstandard summary table, the flexibility is high, the data of a plurality of nonstandard data tables are automatically backfilled into the nonstandard summary table based on the analyzed table heads and data of the nonstandard data tables and the mapping relation set by a user, the labor for data input during data summarization is greatly saved, the data summarization efficiency is improved, the data mismatching rate of the table is reduced through data matching identity identification in a system database, and the data summarization accuracy is improved.
Fig. 2 is a block diagram of a data summarizing device based on a nonstandard data table according to an embodiment of the present application. The device comprises:
a memory 201; and a processor 202 connected to the memory 201, the processor 202 configured to: acquiring a nonstandard summary table and at least one nonstandard data table, analyzing the nonstandard summary table and the nonstandard data table, and acquiring corresponding header and data;
acquiring matching fields of the nonstandard summary table and the nonstandard data table, searching for the identity of data corresponding to the matching fields in a system database based on the matching fields, and matching the identity to the data of the nonstandard summary table and the nonstandard data table based on a matching result;
and acquiring mapping relations between the headers of the nonstandard summary table and/or between the nonstandard summary table and the headers of the nonstandard data table, and backfilling data in the nonstandard data table into the nonstandard summary table based on the mapping relations and the identity, the line number and/or the column number corresponding to the headers.
In some embodiments, the processor 202 is further configured to: the nonstandard summary table and the nonstandard data table are Excel tables;
the method for analyzing the nonstandard summary table and the nonstandard data table comprises the following steps: the nonstandard summary table and nonstandard data table are parsed using POI tools, stream processing algorithms, and/or win32 modules.
In some embodiments, the processor 202 is further configured to: the analyzing the nonstandard summary table and the nonstandard data table to obtain corresponding header and data comprises the following steps:
analyzing and displaying the row numbers of the head and the data of the nonstandard summary table and the nonstandard data table and the column numbers of the data ending column, modifying based on the obtained modification value input by the user, and obtaining the corresponding head and data according to the modified row numbers of the head and the data of the nonstandard summary table and the nonstandard data table, the column numbers of the data ending column and the column numbers of the neglected data.
In some embodiments, the processor 202 is further configured to: the obtaining the matching fields of the nonstandard summary table and the nonstandard data table, searching the identity of the data corresponding to the matching fields in the system database based on the matching fields, and the method comprises the following steps:
after the matching field is obtained, comparing the matching field with a plurality of field names customized by a user in the system, finding out the field name which is the same as the matching field in the system, and finding out the identity of the data corresponding to the matching field based on the field name.
In some embodiments, the processor 202 is further configured to: after the non-standard summary table and at least one of the non-standard data tables are obtained, the method further comprises:
and carrying out accuracy verification on the header and the data of the nonstandard summary table and the nonstandard data table, wherein the verification comprises verification on whether the header or the data in the nonstandard summary table or the nonstandard data table contains a third-party table link, an empty row or an empty column, whether the header contains an empty header, whether the header is repeated or whether the data format is correct.
In some embodiments, the processor 202 is further configured to: the obtaining the mapping relation between the headers of the nonstandard summary table and/or between the nonstandard summary table and the header of the nonstandard data table comprises the following steps:
obtaining a direct corresponding relation between the non-standard summary table and the header of the non-standard data table set by a user, or obtaining a calculation formula set by the user, calculating the header of the non-standard data table based on the calculation formula, and then establishing a corresponding relation with the header of the non-standard summary table, or obtaining a calculation formula set by the user, calculating the header of the non-standard summary table based on the calculation formula, and then establishing a corresponding relation between the headers of the non-standard summary table.
In some embodiments, the processor 202 is further configured to: after backfilling the data in the nonstandard data table into the nonstandard summary table, further comprising:
and displaying and outputting the nonstandard summary table after data backfilling.
Reference is made to the foregoing method embodiments for specific implementation methods, and details are not repeated here.
The present application may be a method, apparatus, system, and/or computer program product. The computer program product may include a computer readable storage medium having computer readable program instructions embodied thereon for performing the various aspects of the present application.
The computer readable storage medium may be a tangible device that can hold and store instructions for use by an instruction execution device. The computer readable storage medium may be, for example, but not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer-readable storage medium would include the following: portable computer disks, hard disks, random Access Memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), static Random Access Memory (SRAM), portable compact disk read-only memory (CD-ROM), digital Versatile Disks (DVD), memory sticks, floppy disks, mechanical coding devices, punch cards or in-groove structures such as punch cards or grooves having instructions stored thereon, and any suitable combination of the foregoing. Computer-readable storage media, as used herein, are not to be construed as transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through waveguides or other transmission media (e.g., optical pulses through fiber optic cables), or electrical signals transmitted through wires.
The computer readable program instructions described herein may be downloaded from a computer readable storage medium to a respective computing/processing device or to an external computer or external storage device over a network, such as the internet, a local area network, a wide area network, and/or a wireless network. The network may include copper transmission cables, fiber optic transmissions, wireless transmissions, routers, firewalls, switches, gateway computers and/or edge servers. The network interface card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium in the respective computing/processing device.
Computer program instructions for performing the operations of the present application may be assembly instructions, instruction Set Architecture (ISA) instructions, machine-related instructions, microcode, firmware instructions, state setting data, or source or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, c++ or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The computer readable program instructions may be executed entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider). In some embodiments, aspects of the present application are implemented by personalizing electronic circuitry, such as programmable logic circuitry, field Programmable Gate Arrays (FPGAs), or Programmable Logic Arrays (PLAs), with state information for computer readable program instructions, which may execute the computer readable program instructions.
Various aspects of the present application are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the application. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer-readable program instructions.
These computer readable program instructions may be provided to a processing unit of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processing unit of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable medium having the instructions stored therein includes an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks.
The computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer, other programmable apparatus or other devices implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
The flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present application. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
Note that all features disclosed in this specification (including any accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise. Thus, unless expressly stated otherwise, each feature disclosed is one example only of a generic set of equivalent or similar features. Where used, further, preferably, still further and preferably, the brief description of the other embodiment is provided on the basis of the foregoing embodiment, and further, preferably, further or more preferably, the combination of the contents of the rear band with the foregoing embodiment is provided as a complete construct of the other embodiment. A further embodiment is composed of several further, preferably, still further or preferably arrangements of the strips after the same embodiment, which may be combined arbitrarily.
While the application has been described in detail with respect to the general description and specific embodiments thereof, it will be apparent to those skilled in the art that certain modifications and improvements may be made thereto based upon the application. Accordingly, such modifications or improvements may be made without departing from the spirit of the application and are intended to be within the scope of the invention as claimed.

Claims (9)

1. A method for summarizing data based on a nonstandard data table, comprising:
acquiring a nonstandard summary table and at least one nonstandard data table, analyzing the nonstandard summary table and the nonstandard data table, and acquiring corresponding header and data;
acquiring matching fields of the nonstandard summary table and the nonstandard data table, searching for the identity of data corresponding to the matching fields in a system database based on the matching fields, and matching the identity to the data of the nonstandard summary table and the nonstandard data table based on a matching result;
the obtaining the matching fields of the nonstandard summary table and the nonstandard data table, searching the identity of the data corresponding to the matching fields in the system database based on the matching fields, and the method comprises the following steps:
after the matching field is obtained, comparing the matching field with a plurality of field names customized by a user in a system, finding out the field name which is the same as the matching field in the system, and finding out the identity of the data corresponding to the matching field based on the field name;
and acquiring mapping relations between the headers of the nonstandard summary table and/or between the nonstandard summary table and the headers of the nonstandard data table, and backfilling data in the nonstandard data table into the nonstandard summary table based on the mapping relations and the identity, the line number and/or the column number corresponding to the headers.
2. The method for data summarization based on non-standard data tables according to claim 1, wherein,
the nonstandard summary table and the nonstandard data table are Excel tables;
the method for analyzing the nonstandard summary table and the nonstandard data table comprises the following steps: the nonstandard summary table and nonstandard data table are parsed using POI tools, stream processing algorithms, and/or win32 modules.
3. The method for summarizing data based on non-standard data tables according to claim 1, wherein said parsing the non-standard summary tables and the non-standard data tables to obtain corresponding headers and data comprises:
analyzing and displaying the row numbers of the head and the data of the nonstandard summary table and the nonstandard data table and the column numbers of the data ending column, modifying based on the obtained modification value input by the user, and obtaining the corresponding head and data according to the modified row numbers of the head and the data of the nonstandard summary table and the nonstandard data table, the column numbers of the data ending column and the column numbers of the neglected data.
4. The non-standard data table-based data summarization method of claim 1, further comprising, after obtaining the non-standard summary table and at least one of the non-standard data tables:
and carrying out accuracy verification on the header and the data of the nonstandard summary table and the nonstandard data table, wherein the verification comprises verification on whether the header or the data in the nonstandard summary table or the nonstandard data table contains a third-party table link, an empty row or an empty column, whether the header contains an empty header, whether the header is repeated or whether the data format is correct.
5. The method of claim 1, wherein obtaining a mapping between headers of the nonstandard data table and/or between the nonstandard data table and the nonstandard data table comprises:
obtaining a direct corresponding relation between the non-standard summary table and the header of the non-standard data table set by a user, or obtaining a calculation formula set by the user, calculating the header of the non-standard data table based on the calculation formula, and then establishing a corresponding relation with the header of the non-standard summary table, or obtaining a calculation formula set by the user, calculating the header of the non-standard summary table based on the calculation formula, and then establishing a corresponding relation between the headers of the non-standard summary table.
6. The non-standard data table-based data summarization method of claim 1, further comprising, after backfilling data in the non-standard data table into the non-standard summary table:
and displaying and outputting the nonstandard summary table after data backfilling.
7. A non-standard data table-based data summarizing apparatus, comprising:
a memory; and
a processor coupled to the memory, the processor configured to:
acquiring a nonstandard summary table and at least one nonstandard data table, analyzing the nonstandard summary table and the nonstandard data table, and acquiring corresponding header and data;
acquiring matching fields of the nonstandard summary table and the nonstandard data table, searching for the identity of data corresponding to the matching fields in a system database based on the matching fields, and matching the identity to the data of the nonstandard summary table and the nonstandard data table based on a matching result;
the obtaining the matching fields of the nonstandard summary table and the nonstandard data table, searching the identity of the data corresponding to the matching fields in the system database based on the matching fields, and the method comprises the following steps:
after the matching field is obtained, comparing the matching field with a plurality of field names customized by a user in a system, finding out the field name which is the same as the matching field in the system, and finding out the identity of the data corresponding to the matching field based on the field name;
and acquiring mapping relations between the headers of the nonstandard summary table and/or between the nonstandard summary table and the headers of the nonstandard data table, and backfilling data in the nonstandard data table into the nonstandard summary table based on the mapping relations and the identity, the line number and/or the column number corresponding to the headers.
8. The non-standard data table-based data summarization apparatus of claim 7, wherein the processor is further configured to:
the analyzing the nonstandard summary table and the nonstandard data table to obtain corresponding header and data comprises the following steps:
analyzing and displaying the row numbers of the head and the data of the nonstandard summary table and the nonstandard data table and the column numbers of the data ending column, modifying based on the obtained modification value input by the user, and obtaining the corresponding head and data according to the modified row numbers of the head and the data of the nonstandard summary table and the nonstandard data table, the column numbers of the data ending column and the column numbers of the neglected data.
9. A computer storage medium having stored thereon a computer program, which when executed by a machine performs the steps of the method according to any of claims 1 to 6.
CN202310424092.5A 2023-04-20 2023-04-20 Data summarization method and device based on nonstandard data table and storage medium Active CN116126873B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310424092.5A CN116126873B (en) 2023-04-20 2023-04-20 Data summarization method and device based on nonstandard data table and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310424092.5A CN116126873B (en) 2023-04-20 2023-04-20 Data summarization method and device based on nonstandard data table and storage medium

Publications (2)

Publication Number Publication Date
CN116126873A CN116126873A (en) 2023-05-16
CN116126873B true CN116126873B (en) 2023-07-04

Family

ID=86312204

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310424092.5A Active CN116126873B (en) 2023-04-20 2023-04-20 Data summarization method and device based on nonstandard data table and storage medium

Country Status (1)

Country Link
CN (1) CN116126873B (en)

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102436475B (en) * 2011-09-29 2013-12-25 用友软件股份有限公司 Data table summarizing device and data table summarizing method
CN105893526A (en) * 2016-03-30 2016-08-24 上海坤士合生信息科技有限公司 Multi-source data fusion system and method
CN108897874B (en) * 2018-07-03 2020-10-30 北京字节跳动网络技术有限公司 Method and apparatus for processing data
CN109558440B (en) * 2018-10-18 2023-12-15 平安科技(深圳)有限公司 Data batch processing method and device, computer equipment and storage medium
CN115630057A (en) * 2022-10-20 2023-01-20 广东电网有限责任公司 Method for realizing automatic integration of multi-source heterogeneous data

Also Published As

Publication number Publication date
CN116126873A (en) 2023-05-16

Similar Documents

Publication Publication Date Title
US20210271697A1 (en) Systems and methods for processing a natural language query in data tables
US20050183002A1 (en) Data and metadata linking form mechanism and method
US8321469B2 (en) Systems and methods of profiling data for integration
CN103605651A (en) Data processing showing method based on on-line analytical processing (OLAP) multi-dimensional analysis
CN112181949A (en) Online data modeling method and device
CN111143370B (en) Method, apparatus and computer-readable storage medium for analyzing relationships between a plurality of data tables
CN114253995A (en) Data tracing method, device, equipment and computer readable storage medium
CN112445775A (en) Fault analysis method, device, equipment and storage medium of photoetching machine
US10360208B2 (en) Method and system of process reconstruction
US20230376900A1 (en) Financial documents examination methods and systems
CN116484084B (en) Metadata blood-margin analysis method, medium and system based on application information mining
CN116126873B (en) Data summarization method and device based on nonstandard data table and storage medium
CN110704635B (en) Method and device for converting triplet data in knowledge graph
CN112582073A (en) Medical information acquisition method, device, electronic equipment and medium
CN112783482A (en) Visual form generation method, device, equipment and storage medium
CN113760891A (en) Data table generation method, device, equipment and storage medium
CN112634004A (en) Blood margin map analysis method and system for credit investigation data
US10664501B2 (en) Deriving and interpreting users collective data asset use across analytic software systems
CN115295109A (en) Medical data tracing method and device, storage medium and terminal equipment
CN114186554A (en) Script data processing method and device, computer equipment and storage medium
US11170164B2 (en) System and method for cell comparison between spreadsheets
CN112199335A (en) Generation method and generation system of operation log
CN111563178A (en) Rule logic diagram comparison method, device, medium and electronic equipment
WO2023185377A1 (en) Multi-granularity data pattern mining method and related device
CN116881512A (en) Cross-system metadata blood-edge automatic analysis method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant