CN115686455A - Application development method, device and equipment based on spreadsheet and storage medium - Google Patents

Application development method, device and equipment based on spreadsheet and storage medium Download PDF

Info

Publication number
CN115686455A
CN115686455A CN202211291388.6A CN202211291388A CN115686455A CN 115686455 A CN115686455 A CN 115686455A CN 202211291388 A CN202211291388 A CN 202211291388A CN 115686455 A CN115686455 A CN 115686455A
Authority
CN
China
Prior art keywords
area
row
line
data
header
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211291388.6A
Other languages
Chinese (zh)
Inventor
项志锋
陈璐
禚洪宇
谭若愚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dingtalk China Information Technology Co Ltd
Original Assignee
Dingtalk China Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dingtalk China Information Technology Co Ltd filed Critical Dingtalk China Information Technology Co Ltd
Publication of CN115686455A publication Critical patent/CN115686455A/en
Pending legal-status Critical Current

Links

Images

Abstract

The embodiment of the application provides an application development method, an application development device, application development equipment and a storage medium based on a spreadsheet. In the embodiment of the application, the operations of text extraction, preprocessing, header identification and the like are sequentially performed on the electronic form to obtain the field data of at least one header field, and the application development is performed based on the field data of at least one header field. Therefore, the spreadsheet application mode can effectively deal with the application of complex spreadsheets, greatly improves the development efficiency of spreadsheet application, and improves the data security and maintainability of the spreadsheets.

Description

Application development method, device and equipment based on spreadsheet and storage medium
Cross-referencing
The present application is incorporated by reference in its entirety into the present application entitled "method and system for building form-based applications" filed on 20/10/2021 in chinese patent application No. 2021112236157.
Technical Field
The present application relates to the field of computer development technologies, and in particular, to a method, an apparatus, a device, and a storage medium for application development based on a spreadsheet.
Background
Currently, enterprises generally employ spreadsheets for information collection and data statistics. In practical application, the electronic form is used for maintaining data, so that the problems of data coverage, information leakage, incapability of tracing a filling person, inconvenience in filling of a mobile terminal and the like easily occur.
Disclosure of Invention
Aspects of the present application provide a method, an apparatus, a device and a storage medium for spreadsheet-based application development, so as to implement spreadsheet application and improve data security and maintainability of a spreadsheet.
The embodiment of the application provides an application development method based on an electronic form, which comprises the following steps: performing text extraction on the spreadsheet to obtain text data; preprocessing the electronic form according to the text data to obtain a target form; identifying a header area of a target table; acquiring field data of at least one header field associated with the header area from the text data; an application is generated based on the field data of the at least one header field.
An embodiment of the present application further provides an application development apparatus based on an electronic form, including: the extraction module is used for extracting texts of the electronic forms to obtain text data; the preprocessing module is used for preprocessing the spreadsheet according to the text data to obtain a target spreadsheet; the identification module is used for identifying a header area of the target table; the acquisition module is used for acquiring field data of at least one header field associated with the header area from the text data; and the generating module is used for generating the application program according to the field data of at least one header field.
An embodiment of the present application further provides an electronic device, including: a memory and a processor; a memory for storing a computer program; a processor is coupled to the memory for executing the computer program for performing the steps in the spreadsheet-based application development method.
Embodiments of the present application also provide a computer readable storage medium storing a computer program, which when executed by a processor, causes the processor to implement the steps in the spreadsheet-based application development method.
In the embodiment of the application, the operations of text extraction, preprocessing, header identification and the like are sequentially performed on the electronic form to obtain the field data of at least one header field, and the application development is performed based on the field data of at least one header field. Therefore, the spreadsheet application mode can effectively deal with the application of complex spreadsheets, greatly improves the development efficiency of spreadsheet application, and improves the data security and maintainability of the spreadsheets.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the application and together with the description serve to explain the application and not to limit the application. In the drawings:
fig. 1 is a diagram of an exemplary application scenario provided in an embodiment of the present application;
FIG. 2 is a flowchart of a spreadsheet-based application development method according to an embodiment of the present application;
FIG. 3 is an exemplary spreadsheet;
FIG. 4 is an application interface in an application program corresponding to the spreadsheet shown in FIG. 3;
FIG. 5 is another application interface in an application corresponding to the spreadsheet shown in FIG. 3;
FIG. 6 is another application interface in an application corresponding to the spreadsheet shown in FIG. 3;
fig. 7 is a schematic structural diagram of an application development apparatus based on a spreadsheet according to an embodiment of the present application;
fig. 8 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the technical solutions of the present application will be described in detail and completely with reference to the following specific embodiments of the present application and the accompanying drawings. It should be apparent that the described embodiments are only some of the embodiments of the present application, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
In the embodiments of the present application, "at least one" means one or more, "a plurality" means two or more. "and/or" describes the access relationship of the associated object, meaning that there may be three relationships, e.g., A and/or B, which may mean: a exists alone, A and B exist simultaneously, and B exists alone, wherein A and B can be singular or plural. In the description of the text of the present application, the character "/" generally indicates that the former and latter associated objects are in an "or" relationship. In the embodiments of the present application, "first", "second", "third", "fourth", "fifth", and "sixth" are only used to distinguish the contents of different objects, and no other special meaning is given.
Currently, enterprises generally employ spreadsheets for information collection and data statistics. In practical application, the electronic form is used for maintaining data, so that the problems of data coverage, information leakage, incapability of tracing a filling person, inconvenience in filling of a mobile terminal and the like easily occur. Therefore, the embodiment of the application provides an application development method, an application development device, an application development equipment and a storage medium based on the electronic form. In the embodiment of the application, the operations of text extraction, preprocessing, header identification and the like are sequentially performed on the electronic form to obtain the field data of at least one header field, and the application development is performed based on the field data of at least one header field. Therefore, the spreadsheet application mode can effectively deal with the application of complex spreadsheets, greatly improves the development efficiency of spreadsheet application, and improves the data security and maintainability of the spreadsheets.
Fig. 1 is a diagram of an exemplary application scenario provided in an embodiment of the present application. In the Application scenario, spreadsheet Application can be realized, and a user performs data management and maintenance by using an Application program (App) converted from a spreadsheet, so that data security and data maintainability can be greatly improved. Specifically, for convenience of understanding and distinction, a user having a spreadsheet application requirement is referred to as a first user, and referring to (1) in fig. 1, the first user sends a spreadsheet to be applied to a server through a terminal device. Referring to (2) in fig. 1, the server performs a text extraction operation to extract text data in the spreadsheet. The text data comprises the text content of each cell in the spreadsheet and the position information of the cell corresponding to the extracted text content in the spreadsheet. After extracting the text data of the electronic form, the server performs a preprocessing operation as shown in (3) of fig. 1. Specifically, the server may segment the electronic form based on the extracted text data to segment the largest-area content block from the electronic form, and in fig. 1, through the content segmentation operation, may find a plurality of content blocks in the electronic form and find a largest-area content block 1 from the plurality of content blocks. Next, the content block 1, which is the largest in area, is clipped, and the table obtained by the clipping is referred to as a target table. Referring to (4) in fig. 1, after cutting out the target table from the electronic table, the server identifies a header area of the target table, the header area mainly serves as column data, and a column header of a column in which the cell is located is recorded in one cell in the header area. For example, for the employee information table, the following are recorded in sequence for each cell of the header area: column headings for serial number, gender, age, and position. For ease of understanding and distinction, the fields associated with the header regions are referred to as header fields. The field names of the header fields are located in the cells of the header area, and the field values of the header fields are located in the cells of the column in which the field names are located. After the header area of the target form is identified, the field name and the field value of at least one header field can be obtained from the text data of the electronic form according to the position information of the header area in the electronic form, and the data type of the field value corresponding to each header field is analyzed. Referring to (5) in fig. 1, the server provides field data of at least one header field to a Low-Code Development Platform (LCDP). Referring to (6) in fig. 1, the low-code development platform performs application development based on the maintained page component library. The low-code development platform is a development platform which can rapidly generate an application program without coding or with a small amount of code. For example, low code development platforms provide a page component library that includes a large number of page components, each encapsulating source code that implements a particular function. The low-code development platform can generate an application program by utilizing one or more page components, so that the development efficiency of the application program is greatly improved, and the programming capability requirement for developers is reduced.
As shown in (7) and (8) in fig. 1, the application developed by the low-code development platform is provided to the terminal device of the first user via the server. Taking an example that the application program is composed of a home page, a data management page and a form submission page, a first user opens the application program on the terminal device, the home page of the application program is displayed in a display screen of the terminal device, and an entry link of the data management page and an entry link of the form submission page are displayed on the home page. The first user triggers an entry link of the data management page, and the terminal device displays the data management page for the first user to view and manage data. The first user triggers an entry link of the form submission page, and the terminal device displays the form submission page so that the first user can fill in and submit data. In some application scenarios, referring to (9) in fig. 1, a first user may push an application to a plurality of second users, the second users being different from the first users. And each second user opens an application program on the terminal equipment, enters the form submission page for data filling, and triggers a submission control on the form submission page after the data filling is finished so as to submit the filled data to the terminal equipment of the first user. Referring to arrow r in fig. 1, data filled in by each second user may be synchronized to the data management page, so as to implement data aggregation, and facilitate maintenance and management of data.
In some application scenarios, for example, in a group chat scenario, a first user may set operation permissions of respective second users for an application, for example, some second users have permissions to view, share, or modify all pages in the application, some second users have permissions to view, share, or modify only some pages in the application, some second users have permissions to view only some pages in the application, and the like, and specific permission control may be set by the first user as needed.
In the embodiment of the present application, the terminal device includes, for example and without limitation, a mobile phone, a tablet computer, a notebook computer, a wearable device, and an in-vehicle device. The server includes, for example, but not limited to, a single server or a distributed server cluster of multiple servers. It should be understood that the number of servers and terminal devices in fig. 1 is merely illustrative. The application scenario shown in fig. 1 is merely an example, and the application scenario is not limited in the embodiments of the present application.
The technical solutions provided by the embodiments of the present application are described in detail below with reference to the accompanying drawings.
Fig. 2 is a flowchart of an application development method based on a spreadsheet according to an embodiment of the present application. The method may be performed by a spreadsheet-based application development apparatus, which may be comprised of software and/or hardware, and may be generally integrated into an electronic device. Referring to fig. 2, the method may include the steps of:
201. and performing text extraction on the electronic form to obtain text data.
202. And preprocessing the electronic form according to the text data to obtain a target form.
203. A header area of the target table is identified.
204. Field data of at least one header field associated with the header region is obtained from the text data.
205. An application is generated based on the field data of the at least one header field.
Specifically, the spreadsheet may be various types of spreadsheet documents. In practical applications, the way of text extraction for the electronic forms is not limited. For example, the spreadsheet may be text extracted using various text extraction functions such as LEFT (), RIGHT (), MID (), and the like. The LEFT () function is used to extract a specified number of characters from the LEFT side of the text included in the spreadsheet. RIGHT () is used to extract a specified number of characters from the RIGHT side of the text included in the spreadsheet. MID () is used to extract a specified number of characters from a specified position of text included in the electronic form. As another example, a spreadsheet is text extracted using an adapted regular expression. For another example, the electronic form is converted into a picture, and the picture is processed by using an OCR (Optical Character Recognition) technique to obtain text data in the electronic form. Of course, the way of text extraction for the spreadsheet is not limited to the above example, and more description can be found in the related art.
It is noted that if the spreadsheet comprises a plurality of sub-tables, steps 201 to 205 are performed for each sub-table. For example, where the spreadsheet has a plurality of form pages, and a form page may be considered a sub-form of the spreadsheet, steps 201 through 205 are performed for each form page. Of course, the applications of multiple sub-forms can be integrated together to form a complete application.
In the present embodiment, the text data to be extracted from the spreadsheet includes, for example, but is not limited to: at least one text content, and position information of a cell to which each text content belongs in the spreadsheet. Different users have different tabulation habits, so that the spreadsheet has diversified characteristics, and the complexity of the spreadsheet is different. For example, some spreadsheets are simple, where the first row is a header area and each row behind the first row is a detail area for recording detail data. For another example, some spreadsheets are complex, and there are a header line, a remark line, a header area, a detail area, and a non-detail area for recording non-detail data in the spreadsheet. For example, some spreadsheets are complicated, and a header area, a detail area, a non-detail area, and the like are subjected to cell consolidation, and the cell consolidation methods are different from each other.
In consideration of the complexity of the electronic forms, in order to reduce the development difficulty of the electronic forms for application, the electronic forms are preprocessed to obtain a target form for recording detail data, and the target form may also be referred to as a detail form. Specifically, after text data is extracted from the electronic form, the electronic form is cut and cropped by using the text data to obtain a target form. The table area with high probability of recording detail data can be found out from the electronic table through the segmentation operation, then the table area with high probability of recording detail data is further cut to obtain the table area only comprising the detail data, and the table area only comprising the detail data is used as a target table.
In practical applications, there may also be some header lines, remark lines, or lines with merged cells in the initial table, and the data recorded in these lines is not detail data, and needs to be cut out during the cutting process, that is, eliminated. Some initial tables record non-detail data besides detail data, and the rows where the non-detail data are located are removed in the clipping process. Therefore, further optionally, in order to obtain a more accurate target table, the initial table is cut, and an optional implementation manner of obtaining the target table is as follows: identifying an invalid area, a non-detail area and a detail area in an initial table; and clipping the invalid area and the non-detail area in the initial table, and taking the detail area which is not clipped as a target table.
Specifically, the invalid area refers to a table area composed of a plurality of invalid rows. In consideration of practical application, some invalid rows may exist at the head and/or tail of the initial table. Therefore, when identifying the invalid area in the initial table, whether each line in the initial table is an invalid line or not can be identified in sequence according to the sequence of line numbers from small to large and/or according to the sequence of line numbers from large to small until the valid line is identified for the first time; and determining the area formed by the at least one identified invalid line as an invalid area. Specifically, the invalid area of the initial table may be identified from the head, that is, whether the first row is an invalid row is determined from the first row of the initial table, if so, whether the second row is an invalid row is continuously determined until the first valid row is found, and the table area formed by rows before the first valid row is determined as the invalid area. And/or, the invalid region of the initial table may be identified from the end, that is, from the last line of the initial table, whether the last line is an invalid line is determined, if so, whether the penultimate line is an invalid line is continuously determined until the first valid line is found, and the table region formed by the lines after the first valid line is determined as the invalid region.
In practical applications, the definition of invalid lines can be set as required. Invalid lines include, for example, but are not limited to: the line in which all cells are merged, the header line in which header information is recorded, the memo line in which memo information is recorded, or the line in which summary information is recorded is not limited. Thus, as an example, identifying whether each row in the initial table is an invalid row in turn includes: judging whether the current line to be identified comprises at least one of remark information, title information or all cells combined; if the judgment result is yes, determining that the current line is an invalid line; and if the judgment result is negative, determining that the current line is the effective line.
In the present embodiment, after the invalid area in the initial table is identified, the non-detail area and the detail area in the initial table may be identified. The detail area is mainly a table area for recording the detail data, and the data recorded in the non-detail area is also the service data but not the detail data. Referring to the supplier statement shown in fig. 3, the statement includes, from top to bottom, a title line, a non-detail area, a detail area, and a non-detail area. The title line records a title, the detail area records various detail data, and the non-detail area records related service data except the detail data.
In practical applications, the non-detail area and the detail area in the initial table may be divided according to whether the designated symbol is included in the row. The designated symbol is flexibly set according to practical application conditions, and the designated symbol may be in the first type area but may not exist in the second type area in which the detail data is recorded. The designated symbol may be, for example, a colon: underlining _, split symbol | and the like, but not limited thereto.
In general, the cell merging patterns of the non-detail area and the detail area are not very same, and thus, the non-detail area and the detail area in the initial table may be divided by combining the cell merging patterns of two adjacent rows. The cell merge mode may indicate that there are several merge cells in the corresponding row, and which columns of cells the merge cells are merged into, and so on. The merging modes of the cells in different rows are the same, which indicates that the number of the merging cells in different rows is the same, and the merging cells are obtained by merging the cells in the same column. Different rows have different cell merging modes, which means that the number of merging cells in different rows is different and/or the merging cells are not obtained by merging cells in the same column.
Based on the above, in order to improve the recognition accuracy of the non-detail area and the detail area, an optional implementation manner for recognizing the non-detail area and the detail area in the initial table is as follows: according to a first effective area except for the invalid area in the initial table, sequentially traversing whether each line in the first effective area comprises a designated symbol or not according to the sequence of line numbers from small to large until a starting line of a detail area is determined from the first effective area; if the traversed current line comprises the designated symbol, determining that the current line belongs to a non-detail area; if the traversed current line does not comprise the designated symbol, determining that the current line belongs to the non-detail area under the condition that the cell merging mode of the current line is the same as that of the previous line; in a case where the cell merging mode of the current row is different from that of the previous row, it is determined that the current row belongs to a start row of the detail area, and each row from the start row of the detail area in the first effective area is determined as the detail area. And the line number of the previous line of the current line is smaller than that of the current line. It is noted that if the current row is the first row and the first row does not include the designated symbol, the first row may be defaulted to belong to the detail region.
Specifically, for a first effective area excluding the invalid area in the initial table, the non-detailed area and the detailed area of the first effective area are identified from the head. If the first line of the first effective area has the designated symbol, the first line belongs to the non-detail area, and whether the second line comprises the designated symbol is continuously judged. If the second line of the first effective area has a designated symbol, the second line belongs to the non-detail area.
And if the second row of the first effective area does not have the designated symbol, judging whether the cell merging modes of the first row and the second row are the same, and if so, indicating that the first row and the second row both belong to the non-detail area. If the two are different, the first line belongs to the non-detail area, the second line belongs to the detail area, and the second line is the initial line of the detail area. At this time, the first line is divided into a non-detail region, and the second line and its succeeding lines are divided into detail regions. And by analogy, continuously traversing whether each row in the first effective area comprises the specified symbol or not in turn until the starting row of the detail area is determined from the first effective area. Continuing with the example of fig. 3, first, the union-parallel line obtained by merging the first line and the second line is traversed, and the text content in the union-parallel line is determined as the heading line, that is, the invalid line, through semantic analysis. Then, go through the third to sixth rows, since the third to sixth rows have a colon: and determining the third to sixth behavior non-detail areas. Next, go through line 7, since line 7 has no colon: and the merging mode of the cells in the 7 th row is different from that of the cells in the 7 th row, the starting row of the 7 th row detail area is determined, and the 7 th row and the following rows are used as the clear area.
In practical applications, there may be non-detail areas at the end of the initial table, or there may be no non-detail areas. For example, in fig. 3, the detail region is followed by a non-detail region. In order to further improve the accuracy of identifying the non-detail region and the detail region, the method for determining each line from the start line of the detail region in the first effective region as the detail region includes: for a second effective area from the initial line of the detail area in the first effective area, sequentially traversing whether each line in the second effective area comprises a designated symbol according to the sequence of line numbers from large to small until the end line of the detail area is determined from the second effective area; if the traversed current row comprises the designated symbol, determining that the current row belongs to a non-detail area; if the traversed current row does not comprise the designated symbol, determining that the current row belongs to the non-detail area under the condition that the cell merging modes of the current row and the next row are the same; and under the condition that the cell merging modes of the current row and the next row are different, determining that the current row belongs to the ending row of the detail area, and determining each row from the starting row of the detail area to the ending row of the detail area in the second effective area as the detail area. Wherein the line number of the next line of the current line is greater than the current line. It is noted that if the current row is the first last row and the first last row does not include the designated symbol, the first last row may be defaulted to belong to the detail region.
Specifically, for a second effective area from the start line of the detailed area in the first effective area, the non-detailed area and the detailed area of the second effective area are identified from the end. If the last line of the second effective area has the designated symbol, the last line belongs to the non-detail area, and whether the last line comprises the designated symbol is continuously judged. And if the last row and the last row of the second effective area have no designated symbols, judging whether the cell merging modes of the last row and the last row are the same, and if so, indicating that the last row and the last row belong to the non-detail area. If the two lines are different, the last line belongs to the non-detail area, the last line belongs to the detail area, and the last line is the ending line of the detail area. At this time, the last line is divided into non-detail areas, and the penultimate line and its preceding lines are divided into detail areas. And analogically, sequentially traversing whether each row in the second effective area comprises the specified symbol or not until the ending row of the detail area is determined from the second effective area.
Continuing with fig. 3 as an example, first, traverse the merge line obtained by merging the 1 st line and the 2 nd line, and determine the text content in the merge line as a header line, that is, an invalid line, through semantic analysis. Then, go through rows 3 to 6, since rows three to sixth have a colon: and determining the non-detail areas of the 3 rd to 6 th behaviors. Next, go through line 7, since line 7 has no colon: and determining a starting row of the 7 th row detail area according to a different cell merging mode of the 7 th row and the 7 th row. This completes the task of traversing from the head, and then beginning traversing from the tail. Since rows 17 to 14 all have a colon: and determining the 17 th to 14 th behavior non-detail areas. When traversing to line 13, since line 13 has no colon: and the cell consolidation method of the 13 th row is different from the cell consolidation method of the 14 th row, and the 13 th row is determined to be the end row of the detail region, thereby determining that the 7 th to 13 th rows are the detail region.
In the present embodiment, after the target form is cut out from the electronic form, the header area of the target form may be identified in at least one of the following ways.
Mode 1: and identifying the header area of the target table by adopting a mode of taking the merging cells as identification targets.
Specifically, it may be determined whether the first row of the target table includes a merge cell; if the first row comprises the merging cells, traversing whether each row of the target table behind the first row comprises the merging cells or not until the row which does not comprise the merging cells is traversed for the first time; and determining at least one continuous row comprising the merging cells from the first row as a header area of the target table.
Mode 2: and identifying the header area of the target table by using the serial number as an identification target.
Specifically, it may be determined whether the target table includes a sequence number column, where column data of the sequence number column is a sequence number; and if the target table comprises the sequence number columns, determining the previous row of the row with the minimum sequence number as the header area of the target table.
Mode 3: and identifying the header area of the target table in a mode of taking the similarity between two adjacent rows as an identification target.
Specifically, whether two adjacent rows in the target table have similarity or not can be sequentially traversed until the two adjacent rows do not have similarity through first traversal; and determining the row with the small row number and at least one row before the row in the two adjacent rows without similarity traversed for the first time as a header area.
Further optionally, sequentially traversing whether two adjacent rows in the target table have similarity includes: determining a feature vector of each of two adjacent lines according to attribute information of at least one cell included in each of the two adjacent lines aiming at two adjacent lines traversed currently, wherein the attribute information includes data length or data type of data in the cell; calculating the similarity between two adjacent lines according to the respective feature vectors of the two adjacent lines; if the similarity is larger than or equal to a preset similarity threshold, determining that the two adjacent rows have similarity; and if the similarity is smaller than a preset similarity threshold, determining that the two adjacent rows have no similarity. The preset similarity threshold value can be flexibly set according to the actual application requirement.
Specifically, in determining the feature vector of each line, one vector of each cell is determined based on the attribute information of the cell, for example (0,0,1); and combining the vectors of all the cells to obtain the characteristic vector of the line. In calculating the similarity between two adjacent rows, any one of the euclidean distance, the cosine distance, and the manhattan distance may be calculated based on the feature vectors of the respective two adjacent rows, and the any one of the distances may be taken as the similarity between the two adjacent rows.
Mode 4: and identifying a header area of the target table in a default mode.
For example, the first row is taken as the header area of the target table by default.
It should be noted that, in practical applications, one or more of the manner 1, the manner 2, or the manner 3 may be preferentially adopted for header area identification, and if the header area is not identified by the manner 1, the manner 2, or the manner 3, the header area may be identified by the manner 4.
Further optionally, after the header area of the target table is identified, the header area may also be checked, for example, blank cells in the header area are removed, or repeated rows in the header area are removed, and the like.
In the embodiment, after the header area of the target form is identified, field data of at least one header field associated with the header area is acquired from the text data in the electronic form. Specifically, the field name of the header field refers to the text content recorded in the cell in the header area, and the field value of the header field refers to the text content in each cell in the column in which the cell in which the field name is recorded is located. Since the text data extracted from the spreadsheet includes at least one text content and the location information of the cell to which each text content belongs in the spreadsheet, for each cell in the header area, the text data of the spreadsheet can be queried according to the location information of the cell to obtain the text content in the cell, that is, the field name in the cell. Meanwhile, based on the position information of other cells in the column of the cell, the text data of the electronic form can be inquired to obtain the text content of other cells, namely the field value of the header field corresponding to the field name. To this end, the field name and the field value of at least one header field associated with the header region may be obtained from text data of the spreadsheet. And analyzes the data type of the field value of each header field, including, but not limited to: long text type, date type, number type, enumeration type, address type, and person name type, among others.
In practical application, the corresponding page component can be called from the page component library according to the field data of at least one header field, and the application program is generated based on at least one page component. For example, if the data type of the field value of the header field is a long text type, a text input component can be called from the page component library; if the data type of the field value of the header field is a date type, a date component can be called from the page component library; if the data type of the field value of the header field is a numeric type, such as an identification number, a mobile phone number, a landline number, etc., the text input component may be called from the page component library. If the data type of the field value of the header field is an enumeration type, a radio box component, a check box component or a check box component can be called from the page component library. If the data type of the field value of the header field is an address type, the text input component can be called from the page component library. If the data type of the field value of the header field is the person name type, a radio box element, a check box element or a check box element can be called from the page element library. There is flexibility in matching as to which data types to adapt which page components.
It is noted that, in practical applications, updating of the page component library is supported. The user can develop a new page component for realizing a specific function as required and add the new page component into the page component library, and certainly, the function of the existing page component in the page component library can be updated, and the like without limitation.
Further optionally, in order to facilitate the user to submit data and maintain management data, the implementation manner of generating the application program according to the field data of the at least one header field is as follows: calling at least one page component which is respectively matched with the data type of the field value of at least one header field from a page component library, respectively configuring the field name of at least one header field to the corresponding page component as a component name, and generating a form submission page according to the configured at least one page component; calling a list component from a page component library, and loading field data of at least one header field into the list component to obtain a data management page; and generating an application program at least according to the form submission page and the data management page.
Specifically, the form submission page is used for the user to fill in and submit data, and the user can operate each page component on the form submission page, complete corresponding data filling, and trigger submission of the filled-in data. The page name of each page component on the form submission page is the field name of the header field, the user triggers the page component to input the field value corresponding to the header field, and the data input mode of the page component is determined by the data type of the field value.
The data management page may load field data of each header field for data summarization, may also load field data of each field in a non-detailed area, and may also count field data of each header field, and load a statistical result, which is not limited herein. It is worth noting that when the data management page needs to load field data of each field of the non-detail area, a matched page component can be selected from the page component library according to the field value of each field of the non-detail area, the selected page component is loaded into the data management page, and the page component is configured according to the corresponding field data.
It should be noted that, in the case that the data management page has a data summarization function, a user submits new data of the page through a form and can summarize the new data into the data management page.
Further optionally, some operation type controls, such as an additional control, a deletion control, an import control, an export control, more controls, a search control, a filter control, a sort control, and the like, may be set in the form submission page or the data management page, so as to improve the interactive friendliness of the application program.
Further optionally, a home page may be generated for the application program, and the entry link of the data management page and the entry link of the form submission page are displayed on the home page, so as to improve the interaction friendliness of the application program.
For example, the application generated based on the vendor statement shown in fig. 3 includes a header page shown in fig. 4, a "vendor statement" page shown in fig. 5, and a "vendor statement data management page" shown in fig. 6. After the user opens the application program on the terminal device, a home page of the application program is displayed in a display screen of the terminal device, and a portal link of a data management page (i.e., the page shown in fig. 6) and a portal link of a form submission page (i.e., the page shown in fig. 5) are displayed on the home page. And the user triggers an entry link of the data management page, and the terminal equipment displays the data management page for the user to view and manage the data. The user triggers an entry link of the form submission page, and the terminal device displays the form submission page for the user to fill in and submit data. It is to be noted that the data management page shown in fig. 6 displays field data in the non-detailed area shown in fig. 3. Of course, the field data in the detailed area shown in fig. 3 may also be displayed.
According to the technical scheme, the electronic form is subjected to text extraction, preprocessing, header identification and other operations in sequence to obtain field data of at least one header field, and application development is carried out on the field data of at least one header field. Therefore, the spreadsheet application mode can effectively deal with the application of complex spreadsheets, greatly improves the development efficiency of spreadsheet application, and improves the data security and maintainability of the spreadsheets.
Fig. 7 is a schematic structural diagram of an application development apparatus based on a spreadsheet according to an embodiment of the present application. As shown in fig. 7, the apparatus may include: the system comprises an extraction module 71, a preprocessing module 72, an identification module 73, an acquisition module 74 and a generation module 75;
the extraction module 71 is configured to perform text extraction on the electronic form to obtain text data;
the preprocessing module 72 is configured to preprocess the spreadsheet according to the text data to obtain a target form;
an identifying module 73, configured to identify a header area of the target table;
an obtaining module 74, configured to obtain field data of at least one header field associated with the header region from the text data;
a generating module 75, configured to generate an application according to the field data of the at least one header field.
Further optionally, the text data includes position information of a cell to which each of the at least one text content belongs in the spreadsheet; correspondingly, the preprocessing module 72 preprocesses the electronic form according to the text data, and when the target form is obtained, the preprocessing module is specifically configured to: determining a plurality of content blocks in the spreadsheet and the position information of each content block in the spreadsheet according to the position information corresponding to the cell to which at least one text content belongs, wherein each content block is composed of at least one cell including the text content; determining the area of each content block according to the position information of each content block in the electronic table; cutting out the content block with the largest area from the electronic form, and taking the content block with the largest area as an initial form; and cutting the initial table to obtain a target table.
Further optionally, the preprocessing module 72 cuts the initial table to obtain a target table, which is specifically configured to: identifying an invalid area, a non-detail area and a detail area in an initial table; and clipping the invalid area and the non-detail area in the initial table, and taking the detail area which is not clipped as a target table.
Further optionally, when the preprocessing module 72 identifies an invalid area in the initial table, the preprocessing module is specifically configured to: sequentially identifying whether each line in the initial table is an invalid line or not according to the sequence of line numbers from small to large and/or the sequence of line numbers from large to small until the valid line is identified for the first time; and determining the area formed by the at least one identified invalid line as an invalid area.
Further optionally, when the preprocessing module 72 sequentially identifies whether each row in the initial table is an invalid row, the preprocessing module is specifically configured to: judging whether the current line to be identified comprises at least one of remark information, title information or all combined cells or not; if the judgment result is yes, determining that the current line is an invalid line; and if the judgment result is negative, determining that the current line is the effective line.
Further optionally, when the preprocessing module 72 identifies the non-detail area and the detail area in the initial table, the preprocessing module is specifically configured to: sequentially traversing whether each line in the first effective area comprises a designated symbol or not according to the sequence of line numbers from small to large aiming at the first effective area except the invalid area in the initial table until the initial line of the detail area is determined from the first effective area; if the traversed current row comprises the designated symbol, determining that the current row belongs to a non-detail area; if the traversed current line does not comprise the designated symbol, determining that the current line belongs to the non-detail area under the condition that the cell merging mode of the current line is the same as that of the previous line; and under the condition that the cell merging modes of the current line and the previous line are different, determining that the current line belongs to the starting line of the detail area, and determining each line in the first effective area from the starting line of the detail area as the detail area.
Further optionally, when determining, as the detail area, each line starting from the start line of the detail area in the first effective area, the preprocessing module 72 is specifically configured to: for a second effective area from the initial line of the detail area in the first effective area, sequentially traversing whether each line in the second effective area comprises a designated symbol according to the sequence of line numbers from large to small until the end line of the detail area is determined from the second effective area; if the traversed current row comprises the designated symbol, determining that the current row belongs to a non-detail area; if the traversed current row does not comprise the designated symbol, determining that the current row belongs to the non-detail area under the condition that the cell merging modes of the current row and the next row are the same; and under the condition that the cell merging modes of the current row and the next row are different, determining that the current row belongs to the ending row of the detail area, and determining each row from the starting row of the detail area to the ending row of the detail area in the second effective area as the detail area.
Further optionally, when the identifying module 73 identifies the header area of the target table, it is specifically configured to:
identifying a header area of the target table by adopting a mode of taking the merging cells as an identification target; and/or the presence of a gas in the gas,
identifying a header area of a target table by adopting a mode of taking a serial number as an identification target; and/or the presence of a gas in the gas,
identifying a header area of a target table by adopting a mode of taking the similarity between two adjacent lines as an identification target;
and identifying a header area of the target table in a default mode.
Further optionally, when the identification module 73 identifies the header area of the target table by using the merged cell as the identification target, the identification module is specifically configured to: judging whether a first row of the target table comprises a merging cell or not; if the first row comprises the merging cells, traversing whether each row of the target table behind the first row comprises the merging cells or not until the row which does not comprise the merging cells is traversed for the first time; determining at least one continuous row comprising the merging cells from the first row as a header area of the target table; alternatively, the first and second electrodes may be,
when the identifying module 73 identifies the header area of the target table by using the serial number as the identification target, it is specifically configured to:
judging whether a sequence number column is included in the target table or not, wherein the column data of the sequence number column is a sequence number; if the target table comprises a sequence number column, determining the previous row of the row with the minimum sequence number as a header area of the target table; alternatively, the first and second liquid crystal display panels may be,
when the identifying module 73 identifies the header area of the target table by using the similarity between two adjacent rows as the identification target, it is specifically configured to: sequentially traversing whether two adjacent rows in the target table have similarity or not until the two adjacent rows do not have similarity in the first traversal; and determining the row with the small row number and at least one row before the row in the two adjacent rows without similarity traversed for the first time as a header area.
Further optionally, when the identifying module 73 sequentially traverses whether there is a similarity between two adjacent rows in the target table, the identifying module is specifically configured to: determining a feature vector of each of two adjacent lines according to attribute information of at least one cell included in each of the two adjacent lines aiming at two adjacent lines traversed currently, wherein the attribute information includes data length or data type of data in the cell; calculating the similarity between two adjacent lines according to the respective feature vectors of the two adjacent lines; if the similarity is larger than or equal to a preset similarity threshold, determining that the two adjacent rows have similarity; and if the similarity is smaller than a preset similarity threshold, determining that the two adjacent rows have no similarity.
Further optionally, the field data includes: accordingly, when the generating module 75 generates the application program according to the field data of at least one header field, the field name, the field value, and the data type of the field value are specifically configured to: calling at least one page component which is matched with the data type of the field value of at least one table header field from a page component library, configuring the field name of the at least one table header field as a component name to the corresponding page component, and generating a form submission page according to the configured at least one page component; calling a list component from a page component library, and loading field data of at least one header field into the list component to obtain a data management page; and generating an application program at least according to the form submission page and the data management page.
The apparatus shown in fig. 7 may perform the method shown in the embodiment shown in fig. 2, and details of implementation principles and technical effects are not repeated. The specific manner in which each module and unit of the apparatus shown in fig. 7 in the above-described embodiment perform operations has been described in detail in the embodiment related to the method, and will not be described in detail herein.
It should be noted that the execution subjects of the steps of the methods provided in the above embodiments may be the same device, or different devices may be used as the execution subjects of the methods. For example, the execution subjects of step 201 to step 205 may be device a; for another example, the execution subject of steps 201 and 203 may be device a, and the execution subject of steps 204 to 205 may be device B; and so on.
In addition, in some of the flows described in the above embodiments and the drawings, a plurality of operations occurring in a specific order are included, but it should be clearly understood that these operations may be executed out of order or in parallel as they appear herein, and the sequence numbers of the operations, such as 201, 202, etc., are used merely to distinguish various operations, and the sequence numbers themselves do not represent any execution order. Additionally, the flows may include more or fewer operations, and the operations may be performed sequentially or in parallel. It should be noted that, the descriptions of "first", "second", etc. in this document are used for distinguishing different messages, devices, modules, etc., and do not represent a sequential order, nor do they limit the types of "first" and "second".
Fig. 8 is a schematic structural diagram of an electronic device according to an embodiment of the present application. As shown in fig. 8, the electronic apparatus includes: a memory 81 and a processor 82;
memory 81 is used to store computer programs and may be configured to store other various data to support operations on the computing platform. Examples of such data include instructions for any application or method operating on the computing platform, contact data, phonebook data, messages, pictures, videos, and so forth.
The Memory 81 may be implemented by any type of volatile or nonvolatile Memory device or combination thereof, such as Static Random-access Memory (SRAM), electrically Erasable Programmable Read-Only Memory (EEPROM), erasable Programmable Read-Only Memory (EPROM), programmable Read-Only Memory (PROM), read-Only Memory (ROM), magnetic Memory, flash Memory, magnetic disk or optical disk.
A processor 82 coupled to the memory 81 for executing the computer program in the memory 81 for: performing text extraction on the spreadsheet to obtain text data; preprocessing the electronic form according to the text data to obtain a target form; identifying a header area of a target table; acquiring field data of at least one header field associated with the header area from the text data; an application is generated based on the field data of the at least one header field.
Further, as shown in fig. 8, the electronic device further includes: communication components 83, display 84, power components 85, audio components 86, and the like. Only some of the components are schematically shown in fig. 8, and the electronic device is not meant to include only the components shown in fig. 8. In addition, the components within the dashed line frame in fig. 8 are optional components, not necessary components, and may be determined according to the product form of the electronic device. The electronic device of this embodiment may be implemented as a terminal device such as a desktop computer, a notebook computer, a smart phone, or an IOT (Internet of things) device, and may also be a server device such as a conventional server, a cloud server, or a server array. If the electronic device of this embodiment is implemented as a terminal device such as a desktop computer, a notebook computer, or a smart phone, the electronic device may include components within a dashed line frame in fig. 8; if the electronic device of this embodiment is implemented as a server device such as a conventional server, a cloud server, or a server array, the components in the dashed box in fig. 8 may not be included.
For details of the implementation process of each action performed by the processor, reference may be made to the foregoing method embodiment or the related description in the device embodiment, and details are not described herein again.
Accordingly, the present application further provides a computer-readable storage medium storing a computer program, where the computer program is capable of implementing the steps that can be executed by the electronic device in the foregoing method embodiments when executed.
Accordingly, the present application also provides a computer program product, which includes a computer program/instruction, when the computer program/instruction is executed by a processor, the processor is enabled to implement the steps that can be executed by an electronic device in the above method embodiments.
The communication component is configured to facilitate wired or wireless communication between the device in which the communication component is located and other devices. The device where the communication component is located can access a wireless network based on a communication standard, such as a WiFi, a 2G, 3G, 4G/LTE, 5G and other mobile communication networks, or a combination thereof. In an exemplary embodiment, the communication component receives a broadcast signal or broadcast related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the Communication component further includes a Near Field Communication (NFC) module to facilitate short-range Communication. For example, the NFC module may be implemented based on Radio Frequency Identification (RFID) technology, infrared Data Association (IrDA) technology, ultra Wide Band (UWB) technology, bluetooth (BT) technology, and other technologies.
The Display includes a screen, which may include a Liquid Crystal Display (LCD) and a Touch Panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive an input signal from a user. The touch panel includes one or more touch sensors to sense touch, slide, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure associated with the touch or slide operation.
The power supply assembly provides power for various components of the device in which the power supply assembly is located. The power components may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power for the device in which the power component is located.
The audio component may be configured to output and/or input an audio signal. For example, the audio component includes a Microphone (MIC) configured to receive an external audio signal when the device in which the audio component is located is in an operational mode, such as a call mode, a recording mode, and a voice recognition mode. The received audio signal may further be stored in a memory or transmitted via a communication component. In some embodiments, the audio assembly further comprises a speaker for outputting audio signals.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-readable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
In a typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
The Memory may include volatile Memory in a computer readable medium, random Access Memory (RAM), and/or nonvolatile Memory such as Read Only Memory (ROM) or flash Memory (flash RAM). Memory is an example of a computer-readable medium.
Computer-readable media, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase Change RAM (PRAM), static Random Access Memory (SRAM), dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), read Only Memory (ROM), electrically Erasable Programmable Read Only Memory (EEPROM), flash Memory or other Memory technology, compact disc Read Only Memory (CD-ROM), digital Versatile Disc (DVD) or other optical storage, magnetic cassettes, magnetic tape storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, a computer readable medium does not include a transitory computer readable medium such as a modulated data signal and a carrier wave.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a … …" does not exclude the presence of another identical element in a process, method, article, or apparatus that comprises the element.
The above are merely examples of the present application and are not intended to limit the present application. Various modifications and changes may occur to those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present application should be included in the scope of the claims of the present application.

Claims (14)

1. A spreadsheet-based application development method, comprising:
performing text extraction on the spreadsheet to obtain text data;
preprocessing the electronic form according to the text data to obtain a target form;
identifying a header area of the target table;
acquiring field data of at least one header field associated with the header area from the text data;
an application is generated based on the field data of the at least one header field.
2. The method according to claim 1, wherein the text data includes position information of a cell to which each of at least one text content belongs in the spreadsheet; correspondingly, the step of preprocessing the spreadsheet according to the text data to obtain a target form comprises:
determining a plurality of content blocks in the spreadsheet and the position information of each content block in the spreadsheet according to the position information corresponding to the cell to which at least one text content belongs, wherein each content block is composed of at least one cell including the text content;
determining the area of each content block according to the position information of each content block in the electronic table;
cutting the content block with the largest area from the electronic form, and taking the content block with the largest area as an initial form;
and cutting the initial table to obtain a target table.
3. The method of claim 2, wherein clipping the initial table to obtain a target table comprises:
identifying an invalid area, a non-detail area and a detail area in the initial table;
and clipping the invalid area and the non-detail area in the initial table, and taking the detail area which is not clipped as the target table.
4. The method of claim 3, wherein identifying invalid regions in the initial table comprises:
sequentially identifying whether each line in the initial table is an invalid line or not according to the sequence of line numbers from small to large and/or the sequence of line numbers from large to small until an effective line is identified for the first time;
and determining the area formed by the at least one identified invalid line as the invalid area.
5. The method of claim 3, wherein identifying whether each row in the initial table is an invalid row in turn comprises:
judging whether the current line to be identified comprises at least one of remark information, title information or all cells combined;
if the judgment result is yes, determining that the current line is an invalid line; and if the judgment result is negative, determining that the current line is the effective line.
6. The method of claim 3, wherein identifying non-detail regions and detail regions in the initial table comprises:
for a first effective area except the invalid area in the initial table, sequentially traversing whether each line in the first effective area comprises a designated symbol according to the sequence of line numbers from small to large until the initial line of the detail area is determined from the first effective area;
if the traversed current line comprises the designated symbol, determining that the current line belongs to a non-detail area;
if the traversed current row does not comprise the designated symbol, determining that the current row belongs to a non-detail area under the condition that the cell merging mode of the current row and the cell merging mode of the previous row are the same;
and under the condition that the cell merging modes of the current line and the previous line are different, determining that the current line belongs to the starting line of the detail area, and determining each line in the first effective area from the starting line of the detail area as the detail area.
7. The method of claim 6, wherein determining each row in the first active area from the starting row of the detail area as the detail area comprises:
for a second effective area from a starting row of the detail area in the first effective area, sequentially traversing whether each row in the second effective area comprises a designated symbol according to the sequence of row numbers from large to small until an ending row of the detail area is determined from the second effective area;
if the traversed current line comprises a designated symbol, determining that the current line belongs to a non-detail area;
if the traversed current row does not comprise the designated symbol, determining that the current row belongs to a non-detail area under the condition that the cell merging modes of the current row and the next row are the same;
and under the condition that the cell merging modes of the current row and the next row are different, determining that the current row belongs to an ending row of a detail area, and determining each row from a starting row of the detail area to the ending row of the detail area in the second effective area as the detail area.
8. The method of claim 1, wherein identifying the header area of the target table comprises:
identifying a header area of the target table by adopting a mode of taking the merging cells as an identification target; and/or the presence of a gas in the gas,
identifying a header area of the target table by adopting a mode of taking a serial number as an identification target; and/or the presence of a gas in the gas,
identifying a header area of the target table by adopting a mode of taking the similarity between two adjacent rows as an identification target;
and identifying a header area of the target table in a default mode.
9. The method of claim 8, wherein identifying the header area of the target table by using the merged cell as an identification target comprises:
judging whether a first row of the target table comprises a merging cell or not;
if the first row comprises the merging cells, traversing whether each row of the target table behind the first row comprises the merging cells or not until the row which does not comprise the merging cells is traversed for the first time;
determining at least one row including a merged cell that is continuous from the first row as a header area of the target table; alternatively, the first and second electrodes may be,
adopting a mode of taking the sequence number as an identification target to identify a header area of the target table, comprising the following steps:
judging whether the target table comprises a serial number column, wherein the column data of the serial number column is a serial number;
if the sequence number column is included in the target table, determining the last row of the row with the minimum sequence number as a table head area of the target table; alternatively, the first and second electrodes may be,
adopting a mode of taking the similarity between two adjacent rows as an identification target to identify a header area of the target table, wherein the method comprises the following steps:
sequentially traversing whether two adjacent rows in the target table have similarity or not until the two adjacent rows do not have similarity after the first traversal;
and determining the row with the small row number and at least one row before the row in the two adjacent rows without similarity traversed for the first time as the header area.
10. The method of claim 9, wherein traversing sequentially whether there is similarity between two adjacent rows in the target table comprises:
for two adjacent lines traversed currently, determining a feature vector of each line in the two adjacent lines according to attribute information of at least one cell included in each line in the two adjacent lines, wherein the attribute information includes data length or data type of data in the cell;
calculating the similarity between two adjacent lines according to the respective feature vectors of the two adjacent lines;
if the similarity is larger than or equal to a preset similarity threshold, determining that the two adjacent rows have similarity;
and if the similarity is smaller than a preset similarity threshold, determining that the two adjacent rows have no similarity.
11. The method of any one of claims 1 to 10, wherein the field data comprises: the field name, the field value and the data type of the field value, correspondingly, the application program is generated according to the field data of at least one header field, and the method comprises the following steps:
calling at least one page component which is respectively matched with the data type of the field value of at least one header field from a page component library, respectively configuring the field name of at least one header field to the corresponding page component as a component name, and generating a form submission page according to the configured at least one page component;
calling a list component from the page component library, and loading field data of at least one header field into the list component to obtain a data management page;
and generating the application program at least according to the form submission page and the data management page.
12. An apparatus for spreadsheet-based application development, comprising:
the extraction module is used for extracting texts of the electronic forms to obtain text data;
the preprocessing module is used for preprocessing the spreadsheet according to the text data to obtain a target form;
the identification module is used for identifying a header area of the target table;
the acquisition module is used for acquiring field data of at least one header field associated with the header area from the text data;
and the generating module is used for generating the application program according to the field data of at least one header field.
13. An electronic device, comprising: a memory and a processor; the memory for storing a computer program; the processor is coupled to the memory for executing the computer program for performing the steps of the method of any of claims 1-11.
14. A computer-readable storage medium having a computer program stored thereon, which, when executed by a processor, causes the processor to carry out the steps of the method of any one of claims 1 to 11.
CN202211291388.6A 2021-10-20 2022-10-19 Application development method, device and equipment based on spreadsheet and storage medium Pending CN115686455A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111223615 2021-10-20
CN2021112236157 2021-10-20

Publications (1)

Publication Number Publication Date
CN115686455A true CN115686455A (en) 2023-02-03

Family

ID=85066681

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211291388.6A Pending CN115686455A (en) 2021-10-20 2022-10-19 Application development method, device and equipment based on spreadsheet and storage medium

Country Status (1)

Country Link
CN (1) CN115686455A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116127928A (en) * 2023-04-17 2023-05-16 广东粤港澳大湾区国家纳米科技创新研究院 Table data identification method and device, storage medium and computer equipment

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116127928A (en) * 2023-04-17 2023-05-16 广东粤港澳大湾区国家纳米科技创新研究院 Table data identification method and device, storage medium and computer equipment
CN116127928B (en) * 2023-04-17 2023-07-07 广东粤港澳大湾区国家纳米科技创新研究院 Table data identification method and device, storage medium and computer equipment

Similar Documents

Publication Publication Date Title
CN109582909B (en) Webpage automatic generation method and device, electronic equipment and storage medium
US10991054B2 (en) Method and device for acquiring transaction record, and computer readable storage medium
CN109558479B (en) Rule matching method, device, equipment and storage medium
US10366154B2 (en) Information processing device, information processing method, and computer program product
CN106610774B (en) Webpage table editing method and device
CN106896932A (en) A kind of candidate word recommends method and device
CN110321470A (en) Document processing method, device, computer equipment and storage medium
US20220124421A1 (en) Method of generating bullet comment, device, and storage medium
US20150278248A1 (en) Personal Information Management Service System
US20230115459A1 (en) Method and device for processing document and method and device for processing page
CN113297269A (en) Data query method and device
CN115686455A (en) Application development method, device and equipment based on spreadsheet and storage medium
WO2018208412A1 (en) Detection of caption elements in documents
CN116401704A (en) Sensitive data identification method, electronic equipment and storage medium
CN109120509B (en) Information collection method and device
CN111552527A (en) Method, device and system for translating characters in user interface and storage medium
CN104240107A (en) Community data screening system and method thereof
CN110597765A (en) Large retail call center heterogeneous data source data processing method and device
CN112925939A (en) Picture searching method, description information generating method, device and storage medium
US20220301285A1 (en) Processing picture-text data
CN111723177B (en) Modeling method and device of information extraction model and electronic equipment
CN111768767B (en) User tag extraction method and device, server and computer readable storage medium
WO2022104998A1 (en) Handwriting content evaluation method and apparatus, and electronic device
WO2022105004A1 (en) Method and apparatus for electronizing written form, and electronic device
WO2022104999A1 (en) Handwriting graphics analysis method and apparatus, and electronic device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination