A system and method for recognizing traditional form information and creating a corresponding Web list
Technical field
The present invention relates to the design and development of Web lists, and in particular to a system and method for recognizing traditional form information and creating a corresponding Web list.
Background technology
The Web list is the basic building block of Web-based application programs, and it is also the most common and most important information carrier and interaction tool in workflow-based Web application systems. In earlier Web-based applications, a professional technician would first learn the user's requirements and then, on a specific development platform, either purely by hand or with the help of an intelligent designer tool, design a Web form interface reflecting those requirements, write background programs to establish the response and interaction rules, and compile the result before introducing it into the Web-based application system. This approach usually has the following shortcomings:
1. Excessive specialization, high cost, and low efficiency: designing a form is tedious and complicated. A large number of controls and data types must be added and configured in a design environment, usually supplemented by programming, and the design of a single Web data form is finished only after compilation. The process involves many computer terms and operations, such as "control" and "field" and their settings, so users unfamiliar with computer programming depend heavily on professionals for list design and cannot design lists themselves according to their actual working habits. If the design is handed to professionals, then, because a Web application usually contains a large number of data lists, the volume of design work significantly raises the implementation cost of the Web application and significantly lowers implementation efficiency;
2. Poor adaptability to change: the requirements of a workflow-based Web application system change frequently. Each change produces requests to add, delete, or modify data lists, and because meeting these continual requests depends on professionals, cost rises further and efficiency falls;
3. Failure to reflect reality accurately: earlier implementation methods also depend on how well the users who actually operate the Web application can describe their practical work to the technicians, and on how well the technicians can understand those practical requirements. The large gap between the two occupational fields creates a major obstacle to making the Web application system reflect the real process.
Chinese patent application No. 96106616.4, entitled "recognition system of Chinese and English list and recognition methods", discloses a system and method for optically scanning and recognizing lists on printed media. That invention concerns converting a printed-medium list into a digital document, but does not concern converting a digital document containing form information into a Web list;
Chinese patent application No. 200610200869.6, entitled "flow process form processing system and method", discloses a method for processing flow forms. Based on abstract models of flow and business information, it proposes a business-flow processing system and method whose carrier is a list created by manual design and compilation, and it does not concern the design process of the Web list;
Chinese patent application No. 200710074352.1, entitled "setting up Web list and the automatic method of setting up the corresponding data table in database fast by Microsoft Word", discloses a method that is in essence one of the widely used intelligent Web list designers. Its core technology relies on third-party software, on manually agreed text markers in the list, and on interaction through a specialized program, and it brings no innovation to the process of setting the attributes of form control fields during that interaction. It therefore cannot remedy the excessive specialization of the widely used Web list design methods, and in the development and implementation of practical Web application systems it is unlikely to deliver greater value than existing methods;
Chinese patent application No. 200810014332.X, entitled "Worksheet self-defining method", discloses a method that in essence is still a process in which a professional carries out specialized Web list design, and so it does not differ fundamentally from the many intelligent list designers currently in wide use;
Chinese patent application No. 03149846.9, entitled "list treating apparatus, list disposal route and storage medium and program", discloses a list processing apparatus and method which, when a data-sheet field associated with an existing list is covered, can process the form data without restriction on its data type or on the image it presents. In essence it performs data processing on an already existing list, and it does not concern traditional form recognition or the design process of the Web list;
Chinese patent application No. 200510099656.4, entitled "based on the data form of Web", discloses a method by which interaction with a Web-based list can be reflected dynamically in the bound data source. In essence it is a method for processing an already existing Web list, and it does not concern traditional form recognition or the design process of the Web list.
Summary of the invention
The primary object of the present invention is to overcome the shortcomings and deficiencies of the prior art by providing a low-cost, widely applicable, and easy-to-operate system for recognizing traditional form information and creating a corresponding Web list.
Another object of the present invention is to overcome the shortcomings and deficiencies of the prior art by providing a low-cost, widely applicable, and easy-to-operate method for recognizing traditional form information and creating a corresponding Web list.
The primary object of the present invention is achieved through the following technical solution: a system for recognizing traditional form information and creating a corresponding Web list, comprising:
Client computer: a computer device with network communication capability that can exchange information with other computers over a network. It reads traditional form information from a conventional information carrier and converts it into a formatted digital document, or directly reads a formatted digital document containing traditional form information from the conventional information carrier; recognizes and analyzes the document and extracts its information keywords; determines by default the control classes, layout, and field attributes required in the corresponding Web list template to be created; and, through interaction with the user, modifies and sets the associations between interactive controls and non-interactive controls, and among the interactive controls, in the Web list template. The complete Web list template data formed by recognition and analysis and by the modifications and settings made with the user is sent to the server in XML (Extensible Markup Language) format;
Server: a computer device with network communication capability that can provide Web-based network information services and exchange information with other computers over a network. It creates and modifies a first data table that stores the Web list template data sent by the client computer; parses the Web list template data; dynamically creates, at run time, a template instance and an associated second data table; binds the controls in the instance to the associated second data table; and sends the created instance to the client computer or to another client in HTML (HyperText Markup Language) format.
To better implement the present invention, the hardware of the client computer comprises:
CPU (central processing unit), system storage, input equipment, output device, movable storage device, non-moving memory device, data fetch equipment, image reading device and network communication unit;
The software of the client computer comprises:
Operating system: the underlying software system that controls and manages the client's hardware and software resources, organizes the work of the computer system rationally and effectively, and allows application software to run stably on top of it;
Common object repository: interface programs for reading and writing formatted digital documents of fixed formats;
Traditional form recognition unit: uses the interfaces provided by the common object repository to read traditional form information from the formatted digital document; computes on and analyzes the information read in to obtain the corresponding Web list layout and appearance information; recognizes and analyzes the cells in the information read in and, after the user's modifications and settings, determines the class of each control, the field attributes of each control, and the associations between controls, thereby forming the Web list template data; and stores the Web list layout and appearance information and the Web list template data in system storage and sends them to the server in XML format;
Image identification and converting unit: optically scans a printed medium containing traditional form information to obtain an image, and then converts the image into a formatted digital document that the traditional form recognition unit can recognize;
Web browser: used to browse the information returned by the server and to interact with the server;
The hardware of the server comprises: CPU (central processing unit), system storage, input equipment, output device, high-capacity non-moving memory device and network communication unit;
The software and data of the server comprise:
Operating system: the underlying software system that controls and manages the server's hardware and software resources, organizes the work of the computer system rationally and effectively, and allows application software to run stably on top of it;
Network information service program: an application program for publishing information and Web applications on the Internet and on local area networks (LANs);
Database server: an application program that provides services for creating, querying, modifying, and deleting databases and data tables;
Web application runtime environment: a software platform that manages and controls the client-side presentation, the state, and the server-side responses of a Web application;
Example list backstage responder: a program that responds to the information sent back when a client operates on an example list generated from a Web list template;
Information characteristics storehouse: a database of information characteristics used to determine, by default, controls and their field attributes;
First data table: a data table that stores the Web list template data sent by the client computer in XML format;
Second data table: a data table bound to the instance created from the Web list template data in the first data table;
Web list template resolution unit: in response to a request from a client computer, extracts the Web list template data from the first data table, creates an instance of the template, binds the value of each control in the instance to the corresponding field of the associated second data table, and sends the instance in HTML format to the client computer or another client in response to the interaction.
The traditional form recognition unit in the client computer specifically comprises:
Traditional form information read module: uses the interfaces provided by the common object repository to read traditional form information from the formatted digital document, and stores the information read in the program data area of system storage;
Computation analysis module: performs data-unit and measurement-system conversion on the traditional form information read in to obtain the corresponding Web list layout and appearance information, and stores that information as an XML character string in the program data area of system storage;
Identification module: traverses the cells in the traditional form information read in, extracts the information keywords of the non-blank cells, compares them with the information characteristics storehouse stored on the server, determines by default the associations between non-blank cells and adjacent blank cells, and determines by default the class, layout, and field attributes of each control in the Web list template to be created;
User modification and setting module: interacts with the user through the input equipment and output device of the client computer to modify any default values that do not meet the user's expectations, and to set, in the Web list template to be created, the associations between interactive controls and non-interactive controls and among the interactive controls; the resulting control classes, control field attributes, and associations between controls form the Web list template data, which is stored as an XML character string in the program data area of system storage;
Sending module: sends the XML character strings holding the Web list layout and appearance information and the Web list template data to the server.
The information characteristics storehouse stored on the server can be extended without limit by users in practical applications, so as to reflect the complexity of real-world information characteristics.
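As an illustration of what the read module and the computation analysis module might hold in memory, the following is a minimal Python sketch. The `Cell` and `TraditionalForm` structures, the XML element and attribute names, and the choice of points as the source unit are all assumptions made for illustration; the specification does not fix any of them. The only external fact relied on is the CSS convention that one inch equals 96 px and 72 pt.

```python
from dataclasses import dataclass, field
import xml.etree.ElementTree as ET


@dataclass
class Cell:
    row: int
    col: int
    text: str = ""           # an empty string marks a blank cell
    width_pt: float = 72.0   # assumed source unit: points
    height_pt: float = 15.0


@dataclass
class TraditionalForm:
    cells: list = field(default_factory=list)


def points_to_px(points: float) -> int:
    # CSS defines 1 inch as 96 px and as 72 pt, so 1 pt = 96/72 px.
    return round(points * 96 / 72)


def layout_to_xml(form: TraditionalForm) -> str:
    """Serialize layout and appearance information as an XML character string."""
    root = ET.Element("layout", cells=str(len(form.cells)))
    for cell in form.cells:
        node = ET.SubElement(
            root, "cell",
            row=str(cell.row), col=str(cell.col),
            width_px=str(points_to_px(cell.width_pt)),
            height_px=str(points_to_px(cell.height_pt)),
        )
        node.text = cell.text
    return ET.tostring(root, encoding="unicode")
```

A form with one label cell and one blank cell would be built as `TraditionalForm([Cell(0, 0, "Name"), Cell(0, 1)])` and passed to `layout_to_xml`.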
The other object of the present invention is achieved through the following technical solution: a method for recognizing traditional form information and creating a corresponding Web list, comprising the following steps:
(1) Read in traditional form information: the client computer reads traditional form information from a conventional information carrier and converts it into a formatted digital document containing the traditional form information, or directly reads such a formatted digital document from the conventional information carrier; it then reads the traditional form information from the formatted digital document and stores it as an XML character string in the program data area of system storage. The traditional form information includes text content, image and graphic objects, the number of cells, cell coordinates, cell attributes, line and border styles, text appearance attributes, and so on;
(2) Identify and analyze: the traditional form information read in is computed on and analyzed to obtain the corresponding Web list layout and appearance information; the cells in the traditional form information are traversed, the information keywords of the non-blank cells are extracted and compared with the information characteristics storehouse stored on the server, the associations between non-blank cells and adjacent blank cells are determined by default, and the class, layout, and field attributes of each control in the Web list template to be created are determined by default;
(3) Modify and set: through interaction with the user, the default values from step (2) that do not meet the user's expectations are modified; the associations between interactive controls and non-interactive controls, and among the interactive controls, in the Web list template to be created are set; the control classes, control field attributes, and associations between controls that result from identification and the user's modifications form the Web list template data, which is stored as an XML character string in the program data area of system storage;
(4) Send and store: the XML character string containing the Web list template data obtained in the preceding steps is sent to the server; on receiving the XML character string from the client computer, the server writes it into the first data table, which has already been created;
(5) Parse: when the server receives, over the network, a client's interaction request concerning a Web list, it reads that Web list's template data from the first data table and parses it; at run time it creates an instance of the Web list template from the parsed template data, creates a second data table associated with the instance, and binds the controls in the instance to the corresponding fields of the associated second data table; the created instance is sent in HTML format to the client computer or another client in response to the interaction.
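The server side of steps (4) and (5) can be sketched as follows. This is a minimal illustration using Python's built-in `sqlite3` module, not the database server of the invention. The table name `web_list_template`, its two columns, the one-row-per-template layout, and the `SQL_TYPES` mapping are assumptions; the network transport between client and server is omitted.

```python
import sqlite3

# Assumed mapping from field data types to SQL column types.
SQL_TYPES = {"string": "VARCHAR({n})", "integer": "INTEGER", "date": "DATE"}


def store_template(conn, template_id, template_xml):
    """Step (4): write the received XML string into the first data table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS web_list_template ("
        "template_id TEXT PRIMARY KEY, template_xml TEXT NOT NULL)"
    )
    conn.execute(
        "INSERT OR REPLACE INTO web_list_template VALUES (?, ?)",
        (template_id, template_xml),
    )
    conn.commit()


def load_template(conn, template_id):
    """Step (5): read the template data back when a request arrives."""
    row = conn.execute(
        "SELECT template_xml FROM web_list_template WHERE template_id = ?",
        (template_id,),
    ).fetchone()
    return row[0] if row else None


def create_second_table(conn, table, fields):
    """Step (5): create the second data table the instance is bound to."""
    names = [table] + [f["name"] for f in fields]
    if not all(name.isidentifier() for name in names):
        raise ValueError("table and field names must be plain identifiers")
    columns = ["id INTEGER PRIMARY KEY"]
    for f in fields:
        name = f["name"]
        sql_type = SQL_TYPES[f["type"]].format(n=f.get("max_length", 255))
        columns.append(f'"{name}" {sql_type}')
    column_sql = ", ".join(columns)
    conn.execute(f'CREATE TABLE IF NOT EXISTS "{table}" ({column_sql})')
    conn.commit()
```

Names are checked with `str.isidentifier()` before being placed in the `CREATE TABLE` statement because SQL identifiers cannot be passed as bound parameters.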
To better implement the present invention, step (2), identification and analysis, specifically comprises the following steps:
(2.1) the computation analysis module in the traditional form recognition unit performs data-unit and measurement-system conversion on the traditional form information read in to obtain the corresponding Web list layout and appearance information, which includes the split positions of list blocks, line and border style information, the number of cell rows and columns, the height and width of each cell, the total number of cells, and the size and storage location of images, and stores this information as an XML character string in the program data area of system storage;
(2.2) the information characteristics storehouse is loaded from the server;
(2.3) the identification module in the traditional form recognition unit reads the data of one cell in the traditional form information and determines by default the association between non-blank cells and adjacent blank cells: it first judges whether the cell is a blank cell; if so, the cell is associated with the adjacent non-blank cell to its left or above it, the content of that associated non-blank cell is read, and the process enters step (2.4); if the cell is not blank, it is set by default to a label control, and the process enters step (2.5);
(2.4) using the content just read, the identification module searches the information characteristics storehouse for a relevant information characteristic; if one is found, it is used to determine the control and field attributes of the current blank cell, which are recorded, and the process enters step (2.5); if none is found, default attributes are set and the process enters step (2.5);
(2.5) judge whether the cell is the last cell; if so, identification and analysis end; if not, the data of the next cell is read and the process returns to step (2.3).
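The loop formed by steps (2.3) to (2.5) can be sketched in Python as below. The grid-of-strings representation, the dictionary used in place of the information characteristics storehouse, the lower-cased keyword matching, and the default attributes in `DEFAULT_FEATURE` are assumptions for illustration only.

```python
# Assumed default used when no information characteristic is found (step 2.4).
DEFAULT_FEATURE = ("single_line_text", {"type": "string", "max_length": 255})


def label_for_blank(grid, r, c):
    """Step (2.3): a blank cell takes the non-blank cell to its left, else above."""
    if c > 0 and grid[r][c - 1].strip():
        return grid[r][c - 1]
    if r > 0 and c < len(grid[r - 1]) and grid[r - 1][c].strip():
        return grid[r - 1][c]
    return None


def identify(grid, features):
    """Traverse every cell and record a default control for each one."""
    controls = []
    for r, row in enumerate(grid):
        for c, text in enumerate(row):
            if text.strip():
                # Non-blank cell: label control by default, then on to (2.5).
                controls.append(
                    {"row": r, "col": c, "control": "label", "text": text})
                continue
            # Blank cell: look up the associated label's content (step 2.4).
            label = label_for_blank(grid, r, c)
            key = (label or "").strip().lower()
            control, attrs = features.get(key, DEFAULT_FEATURE)
            controls.append({"row": r, "col": c, "control": control,
                             "label": label, "field": dict(attrs)})
    return controls
```

Calling `identify([["Name", ""], ["Date", ""]], features)` yields four records in row-major order, two labels and two interactive controls whose class depends on what `features` holds for the keys `"name"` and `"date"`.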
In step (2), determining by default the association between non-blank cells and adjacent blank cells is specifically:
A non-blank cell is associated with the blank cell to its right; when there is no blank cell to its right, it is associated with the blank cell below it; when there is no blank cell either to its right or below it, the non-blank cell corresponds to an independent non-interactive control.
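Seen from the non-blank cell's side, the default pairing rule is a two-step check. The following Python sketch assumes the form is held as a list of rows of strings, which is an illustrative choice. It does not address the case where two non-blank cells would claim the same blank cell, since the text above does not say how that is resolved.

```python
def paired_blank(grid, r, c):
    """Return (row, col) of the blank cell paired with non-blank cell (r, c).

    Right neighbour first, then the cell below; None means the cell
    stands alone as an independent non-interactive control.
    """
    if c + 1 < len(grid[r]) and not grid[r][c + 1].strip():
        return (r, c + 1)
    if r + 1 < len(grid) and c < len(grid[r + 1]) and not grid[r + 1][c].strip():
        return (r + 1, c)
    return None
```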
In step (2), determining by default the class of each control in the Web list template to be created is specifically:
A non-blank cell corresponds to a non-interactive control, which is one of: a label control, a picture control, or a non-editable Input; the cell content is used as the keyword to look up the corresponding control class in the information characteristics storehouse;
A blank cell corresponds to an interactive control, which is one of: an editable single-line text box, an editable multi-line text box, a drop-down box (combobox), a list control, a radio-button control, a checkbox control, or a calendar control; the content of the adjacent non-blank cell associated with the blank cell is used as the keyword to look up the corresponding control class in the information characteristics storehouse.
In step (2), where the class, layout, and field attributes of each control in the Web list template to be created are determined by default, the field attributes of each control are determined by default as follows:
A non-blank cell corresponds to a fixed-length field of character type, or to a fixed picture storage address, or to a hypertext link;
For a blank cell, the content of the adjacent associated non-blank cell is used as the keyword to look up the corresponding field attributes in the information characteristics storehouse.
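One way to picture the lookup is a small keyword store that returns a control class together with its field attributes. In the Python sketch below, the class name `FeatureStore`, the keyword normalization (trimming, dropping a trailing colon, lower-casing), the fallback default, and the choice to treat every non-blank cell as a label whose field length equals its text length are all assumptions; the storehouse's actual schema and matching rule are not given in the text.

```python
class FeatureStore:
    """Hypothetical stand-in for the information characteristics storehouse."""

    DEFAULT = ("single_line_text", {"type": "string", "max_length": 255})

    def __init__(self):
        self._features = {}

    @staticmethod
    def _key(text):
        # Normalize a label such as " Date of birth: " to "date of birth".
        return text.strip().rstrip(":：").strip().lower()

    def add(self, keyword, control, data_type, max_length):
        """Extend the store with a new information characteristic."""
        self._features[self._key(keyword)] = (
            control, {"type": data_type, "max_length": max_length})

    def for_blank_cell(self, label_text):
        """Control class and field attributes for an interactive control."""
        control, attrs = self._features.get(self._key(label_text), self.DEFAULT)
        return control, dict(attrs)

    @staticmethod
    def for_non_blank_cell(text):
        """Assumed default: a label backed by a fixed-length character field."""
        return "label", {"type": "string", "max_length": len(text)}
```

The `add` method reflects the statement that users can extend the storehouse in practice; a deployment would persist these entries on the server rather than in memory.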
In step (3), modification and setting, the program interface uses descriptive text that follows the reading habits of people who are not computer professionals; each item of descriptive text corresponds to one kind of control or to one group of field attributes, where the field attributes include the data type and the maximum length of the field.
Setting the associations between interactive controls in step (3) specifically means: using the identification field of each control as an index, modifying the value of a character-string field in the first data table; this value expresses the controlling and controlled relationships between this control and other controls, together with the association rules.
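The text does not specify how the relationships are encoded in the character-string field. Purely as an illustration, the Python sketch below encodes them as a JSON string and uses a dictionary keyed by control identifier as a stand-in for the first data table; the key names `masters`, `controlled`, and `rule`, and the rule name used in the example, are hypothetical.

```python
import json


def encode_association(masters, controlled, rule):
    """Pack master/controlled relationships and the rule into one string."""
    return json.dumps(
        {"masters": masters, "controlled": controlled, "rule": rule},
        sort_keys=True,
    )


def set_association(rows, control_id, masters, controlled, rule):
    """Update the string field of the row indexed by the control's identifier."""
    rows[control_id]["association"] = encode_association(masters, controlled, rule)


def get_association(rows, control_id):
    """Decode the stored string; an empty field means no association is set."""
    raw = rows[control_id].get("association", "")
    return json.loads(raw) if raw else None
```

For example, a "city" drop-down controlled by a "province" drop-down could be recorded with `set_association(rows, "city", ["province"], [], "filter_by_master")`.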
Step (5) is specifically: when the server receives a network request concerning a Web list whose template information has been stored, the request contains a Web list template identification field. Using this field as an index, the server extracts all the control information from the first data table and parses it; at run time it creates the controls one by one at the positions corresponding to the traditional form cells, thereby creating an instance of the template; it binds the value of each control in the instance to the corresponding field of the associated second data table, sets default values, and at the same time loads the corresponding example list backstage responder; the instance is then sent in HTML format to the requesting client computer or other client in response to the interaction.
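A minimal Python sketch of the rendering part of this step follows. The template XML schema (a `control` element with `row`, `col`, `class`, `field`, and `max_length` attributes) is an assumption for illustration, only three control classes are handled, and binding to the second data table and loading of the backstage responder are left out.

```python
import xml.etree.ElementTree as ET
from html import escape


def render_instance(template_xml: str) -> str:
    """Create controls at the positions of the original cells, as an HTML table."""
    root = ET.fromstring(template_xml)
    grid = {}
    for ctl in root.iter("control"):
        row, col = int(ctl.get("row")), int(ctl.get("col"))
        kind = ctl.get("class")
        if kind == "label":
            cell = escape(ctl.text or "")
        else:
            name = escape(ctl.get("field", ""))
            if kind == "multi_line_text":
                cell = f'<textarea name="{name}"></textarea>'
            else:
                # Any other class falls back to a single-line text box here.
                length = int(ctl.get("max_length", "255"))
                cell = f'<input type="text" name="{name}" maxlength="{length}">'
        grid.setdefault(row, {})[col] = cell
    lines = ["<table>"]
    for row in sorted(grid):
        tds = "".join(f"<td>{grid[row][col]}</td>" for col in sorted(grid[row]))
        lines.append(f"<tr>{tds}</tr>")
    lines.append("</table>")
    return "\n".join(lines)
```

Laying the controls out in a table keeps each one in the row and column of the traditional form cell it came from, which is the positional correspondence the step describes.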
Working principle of the present invention: with as little reliance on computer professionals as possible, the user quickly and easily converts the traditional forms commonly used to convey information in real-world workflows into Web lists that can be used conveniently in a Web-based application system. The method begins by reading traditional form information from a formatted digital document that is either obtained directly from a conventional information carrier or obtained after conversion. Information keywords are extracted and compared with the information characteristics storehouse on the server to determine, by default, the class, layout, and field attributes of each control, in one-to-one correspondence with the cells of the traditional form. Starting from these defaults, interaction with the user sets the associations between controls, until a Web list has been created whose function and appearance correspond to the traditional form and which can be used by the Web application system. When the server of the Web-based application system receives an interaction request from a network user, it quickly creates, at run time, the Web list corresponding to the traditional form in response. By greatly simplifying the Web list design process, the implementation cost of the Web application system is significantly reduced.
The present invention compared with prior art has following advantage and beneficial effect:
First, it provides a new approach to developing and implementing Web application systems. Developing and implementing a traditional Web application system comes down to two processes: understanding the user's requirements and realizing them technically. With the method provided by the invention, a developer studying the user's requirements concentrates on the traditional lists the user works with every day, and thereby comes to understand how the workflow, information flow, logistics, and cash flow move. Understanding the business logic in a way that is close to the technology greatly simplifies and speeds up this process and improves the accuracy with which requirements are grasped. In the technical realization, the system provided by the invention is an efficient tool for the developer: it greatly simplifies and speeds up the design and development of Web lists, which account for a large share of development and implementation work, so that the developer can concentrate on the business logic itself and provide the user with better products and services;
Second, the system provided by the invention acts as a "black box" that takes a traditional form as input and produces a Web list as output, connecting non-specialists with Web applications and making "self-service" development and implementation of Web application systems possible. Using a simple import procedure that matches their working habits, people who are not computer professionals can quickly convert traditional forms in everyday use, such as paper lists or spreadsheets, into Web lists usable in a Web-based application system, without encountering large amounts of computer terminology or taking part in specialized procedures, and so free themselves from excessive dependence on technicians. Building on the method's intelligent recognition of traditional forms and on an interaction technique that follows everyday descriptive habits, non-specialists can use the described system, or a more complete system developed from the described method, to migrate real-world business to Web applications, thereby accelerating the pace of informatization;
Third, because the described system and method are practical, simple, and efficient, a substantial fall in the cost of developing and implementing Web application systems is to be expected; in a market where cost is decisive, the described system and method can therefore be widely adopted and have a far-reaching effect on the Web application market;
Fourth, designing traditional lists is a skill that most people engaged in economic activity already have, so the described system and method widen the pool of people from which a user can draw when forming a project team to implement a Web application. The earlier model, in which development and implementation teams were led by technicians and by personnel highly proficient in the profession and in management, gives way to an implementation team in which business managers form the core and ordinary business staff carry out the practical operation; the latter draws on a clearly broader pool of human resources;
Fifth, the adaptability of the Web application system to change is improved. The market environment in which economic activity is rooted changes sharply from moment to moment, so organizations that implement Web application systems must adapt to these changes, and the adaptation usually shows up in the workflow and in the information passed along it. This places demands on the adaptability of the Web list, which is an important carrier of the information passed in the workflow. Under the principle of the described system and method, when the workflow or the information passed changes, it is only necessary to redesign the traditional form, recognize and import it again, and re-establish the data associations. The cost is very low, so the adaptability of the Web application system is greatly improved;
Sixth, the service model of Web application system suppliers is extended, especially after-sales service. Previously, when providing pre-sales and after-sales service for a Web application system, technicians usually communicated with the user on site to understand the requirements and then returned to the development platform for the technical realization. With the described system and method, entirely off-site service, and after-sales service in particular, becomes possible: the user is able to design and modify the traditional form, while recognition and import of the traditional form can be carried out remotely by the supplier, or by the user under the supplier's remote guidance. This extension of the service model should also significantly reduce the supplier's service cost.
Description of drawings
Fig. 1 is a schematic diagram of the client computer in the system of the present invention for recognizing traditional form information and creating a corresponding Web list;
Fig. 2 is a schematic diagram of the server in the system of the present invention;
Fig. 3 is a schematic diagram of the traditional form recognition unit in the client computer of the system of the present invention;
Fig. 4 is a flowchart of reading traditional form information from a conventional information carrier in the method of the present invention for recognizing traditional form information and creating a corresponding Web list;
Fig. 5 is a flowchart of step (2), identification and analysis, in the method of the present invention;
Fig. 6 is an example scenario in which, in step (3) of the method of the present invention, a control class is determined through interaction with the user;
Fig. 7-A is an example scenario in step (3) of the method of the present invention, before the pairing between a non-blank cell and an adjacent blank cell is changed through interaction with the user;
Fig. 7-B is an example scenario in step (3) of the method of the present invention, after the pairing between a non-blank cell and an adjacent blank cell has been changed through interaction with the user;
Fig. 8 is an example scenario in which, in step (3) of the method of the present invention, the associations between form controls are set through interaction with the user;
Fig. 9 is a flowchart of the server creating an instance from Web list template data in the method of the present invention;
Fig. 10 is a schematic diagram of the overall working principle of the system and method of the present invention.
Embodiment
Below in conjunction with embodiment and accompanying drawing, the present invention is described in further detail, but embodiments of the present invention are not limited thereto.
Description of the system on which the embodiments rely:
The system for recognizing traditional form information and creating a corresponding Web list in the embodiments comprises:
Client computer 100, shown in Fig. 1, is a computer device with network communication capability that can exchange information with other computers over a network. It is used to read traditional form information from a routine information carrier and convert it into a formatted digital document, or to read directly from the routine information carrier a formatted digital document that contains traditional form information; to identify and analyze this information and extract its information keywords; to determine by default the control classes, layout and field attributes required in the corresponding Web list template to be created; and, through interaction with the user, to modify these defaults and to set the associations between interactive and non-interactive controls and among the interactive controls in the Web list template. The complete Web list template data formed through identification, analysis, user interaction, modification and setting are sent to server 200 in XML format;
Server 200, shown in Fig. 2, is a computer device with network communication capability that can provide Web-based network information services and exchange information with other computers over a network. It is used to create and modify first data table 242, which stores the Web list template data sent by client computer 100; to parse the Web list template data; to dynamically create, at run time, an instance of the template and the associated second data table 243; to bind the controls in this instance to the associated second data table 243; and to send the created instance in HTML format to client computer 100 or to another client.
As shown in Fig. 1, the hardware of client computer 100 comprises:
Central processing unit (CPU) 101, system memory 102, input device 105, output device 106, removable storage device 103, non-removable storage device 104, data reading device 107, image reading device 108 and network communication unit 109;
As shown in Fig. 1, the software of client computer 100 comprises:
Operating system 110: the underlying software system that controls and manages the hardware and software resources of client computer 100 and organizes the work of the computer system reasonably and effectively, so that application software runs stably on it;
Common object repository 114: interface programs for reading and writing formatted digital documents of a given format;
Traditional form recognition unit 113: uses the interfaces provided by common object repository 114 to read traditional form information from the formatted digital document; performs calculation and analysis on the traditional form information read in to obtain the corresponding Web list layout and appearance information; identifies the cells in the traditional form information read in and, after modification and setting by the user, determines the classes of the controls, the field attributes of the controls and the associations between the controls, which together form the Web list template data; and stores the Web list layout and appearance information and the Web list template data in system memory and sends them to the server in XML format;
Image identification and converting unit 116: performs optical scanning of a printed medium containing traditional form information to obtain an image, and then converts the image into a formatted digital document that traditional form recognition unit 113 can recognize;
Web browser 115: used to browse the information returned by server 200 and to interact with server 200;
As shown in Fig. 2, the hardware of server 200 comprises: central processing unit (CPU) 201, system memory 202, input device 204, output device 205, high-capacity non-removable storage device 203 and network communication unit 206;
As shown in Fig. 2, the software and data of server 200 comprise:
Operating system 211: the underlying software system that controls and manages the hardware and software resources of server 200 and organizes the work of the computer system reasonably and effectively, so that application software runs stably on it;
Network information service program 221: an application for publishing information and Web applications on the Internet and on local area networks;
Database server 223: an application that provides services such as creating, querying, modifying and deleting databases and data tables;
Web application runtime environment 222: a software platform that manages and controls the client-side presentation, state and server-side responses of Web applications;
Instance list background responder 232: a program that responds to the client user's operations on, and the information returned from, the instance list generated from the Web list template;
Information feature library 241: a database of the information features needed to determine controls and control field attributes by default;
First data table 242: a data table that stores the Web list template data sent by client computer 100 in XML format;
Second data table 243: a data table bound to the instance created from the Web list template data in first data table 242;
Web list template parsing unit 231: extracts Web list template data from first data table 242 at the request of client computer 100, creates an instance of the template, binds the value of each control in the instance to the corresponding field in the associated second data table 243, and sends the instance in HTML format to the requesting client computer 100 in response to the interaction.
As shown in Fig. 3, traditional form recognition unit 113 in client computer 100 specifically comprises:
Traditional form information read module 301: uses the interfaces provided by common object repository 114 to read traditional form information from the formatted digital document and stores the information read in program data area 112 of system memory;
Computation analysis module 302: performs data unit and system conversion on the traditional form information read in to obtain the corresponding Web list layout and appearance information, and stores this information as XML character string 117 in program data area 112 of system memory;
Identification module 303: traverses the cells in the traditional form information read in, extracts the information keywords of the non-empty cells, compares them with information feature library 241 stored on server 200 shown in Fig. 2, determines by default the associations between non-empty cells and adjacent empty cells, and determines by default the class, layout and field attributes of each control in the Web list template to be created;
User modification and setting module 304: interacts with the user through input device 105 and output device 106 of client computer 100 to modify default values that do not meet the user's expectations and to set, in the Web list template to be created, the associations between interactive and non-interactive controls and among the interactive controls; the control classes, control field attributes and inter-control associations determined after modification form the Web list template data, which are stored as an XML character string in program data area 112 of system memory;
Sending module 305: sends the XML character string that stores the Web list layout and appearance information and the Web list template data to server 200.
Information feature library 241 stored on server 200 can be extended without limit by users in practical applications, so as to reflect complex real-world information features.
Embodiment one:
This embodiment illustrates the process, according to the system and method of the present invention, of reading and identifying a printed-media document containing traditional form information and sending the Web list finally created to a network client in response to an interaction request. In this embodiment, the routine information carrier is a paper printed medium; image reading device 108 is a scanner; and the formatted digital document is a file with the extension .DOC, a format supported by Microsoft Word, developed by Microsoft Corporation of Redmond, Washington, USA. Files in this format and this software are widely used in practice. Input device 105 is a mouse and keyboard; output device 106 is a display; operating system 110 is Microsoft Windows; Web application runtime environment 222 is the .NET Framework; and Web browser 115 is Microsoft Internet Explorer, all likewise developed by Microsoft Corporation of Redmond, Washington, USA.
The process is as follows:
1) Read in traditional form information:
Referring to Fig. 1, Fig. 3 and Fig. 4, according to steps 402-403, client computer 100 provided by the invention reads an image containing traditional form information from the paper printed medium through scanner 108, and image identification and converting unit 116 converts the image read into a Word file with the extension .DOC;
Then, according to step 404, traditional form information read module 301 in traditional form recognition unit 113 uses the interfaces provided by common object repository 114 to read the traditional form information in the Word file, including: the number of forms in the document; the position of each form on the document page; the style of the form lines and borders; the number of form rows and columns; the height and width of each cell; the content of each cell; the font of text in the cells; and the size and position of images. According to step 405, this information is stored in program data area 112 of system memory 102.
2) Identification and analysis:
Referring to Fig. 1, Fig. 3 and Fig. 5, according to steps 501-503, computation analysis module 302 in traditional form recognition unit 113 performs data unit and system conversion on the traditional form information read in and obtains the one-to-one corresponding Web list layout and appearance information, such as the total number of rows and the number of cells in each row. This information is written to XML character string 117, which records the Web list template data, and is stored in program data area 112 of system memory. The layout and appearance information specifically includes: list block split position information; line and border style information; the number of cell rows and columns; the height and width of each cell; the total number of cells; and the size and storage location of images.
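By way of non-limiting illustration, the conversion and XML recording of steps 501-503 might be sketched as follows. The embodiment does not specify the source units, the target units or the XML schema; the point-to-pixel conversion at an assumed 96 pixels per inch, the function names pt_to_px and layout_to_xml, and the element and attribute names are all assumptions made for this sketch.

```python
import xml.etree.ElementTree as ET

POINTS_PER_INCH = 72
PIXELS_PER_INCH = 96  # assumed display resolution, not given by the embodiment


def pt_to_px(points: float) -> int:
    """Convert a length in typographic points to whole pixels."""
    return round(points * PIXELS_PER_INCH / POINTS_PER_INCH)


def layout_to_xml(rows) -> str:
    """Record layout information as an XML string.

    rows is a list of rows; each row is a list of (width_pt, height_pt)
    tuples, one per cell. Element and attribute names are hypothetical.
    """
    root = ET.Element("layout", rows=str(len(rows)))
    for r, row in enumerate(rows):
        row_el = ET.SubElement(root, "row", index=str(r), cells=str(len(row)))
        for c, (width_pt, height_pt) in enumerate(row):
            ET.SubElement(
                row_el,
                "cell",
                index=str(c),
                width=str(pt_to_px(width_pt)),
                height=str(pt_to_px(height_pt)),
            )
    return ET.tostring(root, encoding="unicode")
```

The resulting string plays the role of XML character string 117 only in outline; a real implementation would also carry the line, border and image information listed above.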
Then information feature library 241, which is used to determine controls and control field attributes by default, is loaded from server 200 shown in Fig. 2. Information feature library 241 contains the information features of controls and control field attributes; it is built up through interaction between the user and the system and can be extended as the user requires.
According to steps 505-506, after traversing the cells in the traditional form information, identification module 303 in traditional form recognition unit 113 determines by default the associations between non-empty cells and adjacent empty cells. The rule is: a non-empty cell is associated with the empty cell to its right; if there is no empty cell to the right, it is associated with the empty cell below; if there is no empty cell either to the right or below, then, according to step 511, the non-empty cell independently corresponds to a non-interactive control. The purpose of this association is that the text in the non-empty cell serves as the source of the information keyword used to determine, by default, the control class and field attributes of the adjacent empty cell. For example, a cell containing the text "date" corresponds to a label control that directly displays "date", while the adjacent empty cell to its right corresponds, on the basis of the word "date", to a text box with a calendar control, bound to a field of date type.
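The default pairing rule of steps 505-506 and 511 may be sketched, purely as an illustration, as follows. The function name pair_cells and the grid representation (a list of rows of strings, with an empty string standing for an empty cell) are assumptions. The rule that an empty cell already paired with one non-empty cell is not paired again is also an assumption of this sketch; the embodiment does not state how such conflicts are resolved.

```python
def pair_cells(grid):
    """Pair each non-empty cell with an adjacent empty cell.

    Preference order follows the text: the empty cell to the right,
    otherwise the empty cell below, otherwise no pairing (the cell then
    stands alone as a non-interactive control).
    Returns (pairs, standalone), where pairs maps the (row, col) of a
    non-empty cell to the (row, col) of its paired empty cell.
    """
    pairs = {}
    standalone = []
    taken = set()  # empty cells already paired (assumed not reusable)
    for r, row in enumerate(grid):
        for c, text in enumerate(row):
            if not text:
                continue  # only non-empty cells look for a partner
            for rr, cc in ((r, c + 1), (r + 1, c)):  # right first, then below
                in_range = rr < len(grid) and cc < len(grid[rr])
                if in_range and grid[rr][cc] == "" and (rr, cc) not in taken:
                    pairs[(r, c)] = (rr, cc)
                    taken.add((rr, cc))
                    break
            else:
                standalone.append((r, c))
    return pairs, standalone
```

For a grid such as [["Name", "", "Date"], ["Dept", "", ""]], "Name" and "Dept" pair with the empty cells to their right, and "Date", having no cell to its right, pairs with the empty cell below it.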
Then, according to steps 507-510, the control kind corresponding to each cell is determined by default, that is, whether the control created from it in the Web data form is an interactive control or a non-interactive control. The rule is as follows. A non-empty cell corresponds to a non-interactive control, such as a label control, a picture control or a non-editable text box; specifically, the cell content is used as the keyword to look up the corresponding control class in information feature library 241. An empty cell corresponds to an interactive control, such as an editable single-line or multi-line text box control, a combo box control, a list control, a radio button control, a check box control or a calendar control; specifically, the content of the associated adjacent non-empty cell (to its left or above) is used as the keyword to look up the corresponding control class in information feature library 241, and if nothing is found the control is set to the default attribute. For example, a cell containing the text "name" corresponds to a label control that directly displays the text "name"; for the empty cell to the right of or below this "name" cell, "name" is used as the keyword to look up the corresponding control class in information feature library 241, and the empty cell is finally determined to correspond either to a combo box control for selecting a name from a list of names or to a text box in which the user fills in a name. Both of these are interactive controls.
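A minimal sketch of the keyword lookup of steps 507-510 is given below for illustration only. The dictionary FEATURE_LIBRARY stands in for information feature library 241; its entries, the control class names and the default class are hypothetical. For brevity the sketch maps every non-empty text cell to a label control, whereas the embodiment also allows picture controls and non-editable text boxes.

```python
# Hypothetical stand-in for information feature library 241:
# keyword (label text) -> interactive control class.
FEATURE_LIBRARY = {
    "date": "calendar_text_box",
    "name": "combo_box",
}

# Assumed default used when the keyword is not found in the library.
DEFAULT_INTERACTIVE = "single_line_text_box"


def control_for_cell(text, paired_label=None, library=FEATURE_LIBRARY):
    """Return the default control class for one cell.

    text is the cell's own content ('' for an empty cell);
    paired_label is the text of the associated non-empty cell, if any.
    """
    if text:
        return "label"  # non-empty cell: non-interactive control
    keyword = (paired_label or "").strip().lower()
    return library.get(keyword, DEFAULT_INTERACTIVE)
```

Because the fallback is a plain default, an unknown keyword never blocks processing; it simply produces a value that the user can correct in the modification and setting step.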
At the same time, identification module 303 in traditional form recognition unit 113 also determines by default the control field attributes corresponding to each cell. The rule is: a non-empty cell corresponds to a field of fixed length and character type, or to a fixed picture URL, that is, the storage location of the picture; for an empty cell, the content of the associated adjacent non-empty cell is used as the keyword to look up the corresponding field attributes in information feature library 241. For example, for a cell whose content is the text "name", the field data type of the corresponding control is "character" with length 4, while the text box corresponding to its adjacent empty cell has field data type "character" with length 8, which allows a name up to four Chinese characters long to be entered.
3) Modification and setting:
The steps above determine by default the control corresponding to each cell, together with the preset data type and field length of the data source field associated with that control, but in general these default results still contain deviations. The deviations arise, first, from incorrect adjacent-cell associations caused by the wide variety of traditional form layouts, for example a cell containing text being incorrectly paired with a different adjacent empty cell. Second, they arise when no value identical to the text in a cell can be found in information feature library 241, so that the control corresponding to the adjacent empty cell and the data type and length of its associated field cannot be determined and can only be set to default values in the corresponding step; these default values do not necessarily match reality or the user's expectations.
To address this, user modification and setting module 304 in traditional form recognition unit 113 presents an interaction page on the display of client computer 100, through which the user can correct the above deviations interactively using the mouse and keyboard.
As shown in Fig. 6, in this interaction page each non-interactive control determined by default is displayed at the position of the corresponding traditional form cell. The default attribute of every interactive control is treated as open to question, so a combo box appears at each corresponding position; the default value it displays is the value determined by identification module 303 in traditional form recognition unit 113 in the identification and analysis step above. The user can click this combo box and select the desired value from the displayed list to determine the correct control type and the data type of the associated field. None of the values in this list is expressed in technical terminology; all use descriptive text that matches everyday usage. For example, for a cell containing the text "name", identification module 303 determines by default that the control corresponding to its adjacent empty cell is a text box, and the descriptive text displayed by default is "enter a name"; when using the Web list, the user can type a name directly into this text box control. The user can instead select "enter a name or select a name from a list of names" in the combo box list, thereby choosing another index value, which corresponds to a drop-down text box control. When using the Web list later, the user can either type a name directly into this drop-down text box or click the drop-down and select a name from the displayed list of names.
As shown in Fig. 7-A, in the same interaction page, controls for which identification module 303 in traditional form recognition unit 113 has established an adjacent association by default are connected by a thin line. After selecting "reassign input field" from the right-click menu of a non-interactive label control, the user selects the corresponding interactive control to change the adjacent association; see Fig. 7-B.
Afterwards, the associations between interactive controls need to be set in the Web data form. The purpose of setting these associations is that, when the user interacts with the dynamically created list, a change in one control's data causes other controls to set new ranges of selectable data, which improves the user experience and efficiency. For example, when the user selects "sales department" in one combo box, another combo box for selecting an employee name shows only the names of employees in the sales department; when a name is chosen in that combo box, a further combo box for displaying the customer list shows only the customers related to that name, and the values of several text boxes that display customer data change accordingly. In essence, using the identification field of each control as the index, a character string field in first data table 242 is modified; this string records the controlling and controlled relationships between this control and other controls, the change rules, and so on. One embodiment of this association setting method is shown in Fig. 8: in the same interaction page, the user selects "influence other input fields" from the right-click menu of an interactive control, then selects the interactive control to be influenced, and then, from the pop-up menu, selects the scope and rules of the influence, which cover the control's value, state and appearance attributes. User modification and setting module 304 in traditional form recognition unit 113 automatically detects, reports and excludes closed-loop associations that would cause system errors, that is, cases where control A influences control B, control B influences control C, and control C in turn influences control A.
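The embodiment does not say how closed-loop associations are detected. One possible approach, shown here only as a sketch, treats the associations as a directed graph and refuses any new association whose target can already reach its source. The function names creates_cycle and add_influence and the dictionary representation are assumptions of this sketch.

```python
def creates_cycle(influences, source, target):
    """Return True if adding source -> target would close a loop.

    influences maps a control identifier to the set of controls it
    influences. A loop would form exactly when source is already
    reachable from target (or when source and target are the same).
    """
    stack, seen = [target], set()
    while stack:
        node = stack.pop()
        if node == source:
            return True
        if node in seen:
            continue
        seen.add(node)
        stack.extend(influences.get(node, ()))
    return False


def add_influence(influences, source, target):
    """Record that source influences target unless that closes a loop.

    Returns True if the association was added, False if it was rejected.
    """
    if creates_cycle(influences, source, target):
        return False
    influences.setdefault(source, set()).add(target)
    return True
```

With the example from the text, adding A to B and then B to C succeeds, while the subsequent attempt to add C to A is rejected, which is the point at which the interaction page could prompt the user.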
As shown in Fig. 6, Fig. 7-A, Fig. 7-B and Fig. 8, whenever the program interface provided by the invention interacts with the user about control classes and field attributes, it uses descriptive text that matches the habits of users who are not computer professionals. Each item of descriptive text corresponds to one kind of control, or to one group of field attributes comprising a data type and a field length; this correspondence is stored in information feature library 241.
4) Sending and storage:
When the user confirms submission by clicking the "submit" or "finish" button, the identification of the list and the interaction process end, and the complete information data of the Web list template to be created from the traditional form information read in have been obtained. Referring to Fig. 1, Fig. 2 and Fig. 3, sending module 305 in traditional form recognition unit 113 stores the corresponding Web list layout and appearance information and the Web list template data as XML character string 117 in program data area 112 of system memory and sends XML character string 117 to server 200, which writes the data into first data table 242.
5) Parsing:
Referring to Fig. 2 and Fig. 9, when the user of a network client 100 clicks a hyperlink on a Web page, the click submits to server 200 a request to interact with a Web data form whose template information has been stored. According to step 901, Web list template parsing unit 231 extracts the list identification parameter from this request.
Using the list identification parameter extracted in step 901 as the index, according to step 902, Web list template parsing unit 231 extracts from first data table 242 all control fields associated with this identifier, including each control's identification field, class field, default value field, appearance attribute field and position field and the character string field describing its influence on other controls, as well as the fields giving the path and identifier of second data table 243 that is to be bound to the instance of this template to be created.
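The lookup of step 902 could, for example, be a parameterized query keyed on the list identifier. The sketch below uses Python's built-in sqlite3 module only for illustration; the embodiment does not name a database product here, and the table name form_template, its column names and the function load_template_controls are hypothetical.

```python
import sqlite3

# Hypothetical layout for first data table 242: one row per control.
SCHEMA = """
CREATE TABLE form_template (
    form_id       TEXT,     -- list identification parameter
    control_id    TEXT,     -- identification field of the control
    kind          TEXT,     -- control class
    default_value TEXT,
    position      INTEGER,
    influences    TEXT      -- string describing influence on other controls
)
"""


def load_template_controls(conn, form_id):
    """Return the control records of one template, ordered by position."""
    columns = ("control_id", "kind", "default_value", "position", "influences")
    rows = conn.execute(
        "SELECT control_id, kind, default_value, position, influences "
        "FROM form_template WHERE form_id = ? ORDER BY position",
        (form_id,),
    ).fetchall()
    return [dict(zip(columns, row)) for row in rows]
```

Passing the identifier as a bound parameter rather than concatenating it into the SQL text matters here because the value arrives in a request from a network client.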
According to step 903, on the basis of the field values extracted above, Web list template parsing unit 231 dynamically creates all the controls at run time in a new Web page, or in a DIV tag of an existing page, thereby creating an instance of the template. According to step 904, it binds all the controls in the instance to the corresponding fields in second data table 243. According to step 905, it loads instance list background responder 232, which handles the influence on other controls that is triggered when the data of a control change. Finally, according to step 906, the instance is sent in hypertext markup language (HTML) format to the requesting network client 100 in response to the interaction.
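As an illustration of steps 903, 904 and 906, the sketch below renders a list of control records into a DIV element. It is not the embodiment's .NET implementation; the record keys (id, kind, default, field, options), the control class names and the functions render_control and render_instance are assumptions. Binding to second data table 243 is represented here only by using the bound field name as the HTML name attribute.

```python
from html import escape


def render_control(ctrl):
    """Render one control record as an HTML fragment."""
    cid = escape(ctrl["id"])
    value = escape(ctrl.get("default", ""))
    if ctrl["kind"] == "label":
        # Non-interactive control: plain text, no bound field.
        return f'<span id="{cid}">{value}</span>'
    field = escape(ctrl["field"])  # field of the bound data table
    if ctrl["kind"] == "combo_box":
        options = "".join(
            f"<option>{escape(option)}</option>"
            for option in ctrl.get("options", [])
        )
        return f'<select id="{cid}" name="{field}">{options}</select>'
    # Any other interactive control falls back to a single-line text box.
    return f'<input type="text" id="{cid}" name="{field}" value="{value}">'


def render_instance(form_id, controls):
    """Create an instance of a template inside a single DIV element."""
    body = "".join(render_control(ctrl) for ctrl in controls)
    return f'<div id="{escape(form_id)}">{body}</div>'
```

All text taken from the template is escaped before it is placed in the markup, since cell contents read from a scanned or imported document should not be trusted as HTML.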
This completes the whole process shown in Fig. 10: reading traditional form information from a routine information carrier and, after identification and interaction, creating the corresponding Web data form.
Embodiment two
This embodiment illustrates the process, according to the system and method of the present invention, of reading and identifying a PDF file containing traditional form information and sending the Web list finally created to a network client in response to an interaction request. In this embodiment, the routine information carrier is an e-mail, and the formatted digital document is a file with the extension .PDF, a format supported by Acrobat Reader, developed by Adobe Systems Incorporated of the United States. Files in this format and this software are widely used in practice. The formatted digital document contains a traditional form presented as a table. Input device 105 is a mouse and keyboard; output device 106 is a display; operating system 110 is Red Hat Linux, developed by Red Hat of the United States; Web application runtime environment 222 is JBoss Enterprise Middleware, developed by Red Hat of the United States; and Web browser 115 is Firefox, developed by the open-source Mozilla Foundation.
The implementation of this embodiment is as follows; except for the following technical features, it is the same as embodiment one:
1) Read in traditional form information:
Referring to Fig. 1, Fig. 3 and Fig. 4, client computer 100 provided by the invention reads the .PDF file containing traditional form information from the e-mail through network communication unit 109;
Then, according to step 404, traditional form information read module 301 in traditional form recognition unit 113 uses the interfaces provided by common object repository 114 to read the traditional form information in the .PDF file, including: the number of forms in the document; the position of each form on the document page; the style of the form lines and borders; the number of form rows and columns; the height and width of each cell; the content of each cell; the font of text in the cells; and the size and position of images. According to step 405, this information is stored in program data area 112 of system memory 102.
Embodiment three
This embodiment illustrates the process, according to the system and method of the present invention, of reading and identifying an Excel file containing traditional form information and sending the Web list finally created to a network client in response to an interaction request. In this embodiment, the routine information carrier is an optical disc, and the formatted digital document is a file with the extension .XLS, a format supported by Microsoft Excel, developed by Microsoft Corporation of Redmond, Washington, USA. Files in this format and this software are widely used in practice. The formatted digital document contains a traditional form presented as a table. Input device 105 is a mouse and keyboard; output device 106 is a display; data reading device 107 is an optical disc drive; operating system 110 is Microsoft Windows; Web application runtime environment 222 is the .NET Framework; and Web browser 115 is Microsoft Internet Explorer, all likewise developed by Microsoft Corporation of Redmond, Washington, USA.
The implementation of this embodiment is as follows; except for the following technical features, it is the same as embodiment one:
1) Read in traditional form information:
Referring to Fig. 1, Fig. 3 and Fig. 4, client computer 100 provided by the invention reads the Excel file containing traditional form information from the optical disc through optical disc drive 107;
Then, according to step 404, traditional form information read module 301 in traditional form recognition unit 113 uses the interfaces provided by common object repository 114 to read the traditional form information in the Excel file, including: the style of the form lines and borders in the workspace; the number of form rows and columns; the height and width of the cells in the workspace; the content of the cells; the font of text in the cells; and the size and position of images. According to step 405, this information is stored in program data area 112 of system memory 102.
Embodiment four
This embodiment illustrates the process, according to the system and method of the present invention, of reading and identifying a Word file containing traditional form information and sending the Web list finally created to a network client in response to an interaction request. In this embodiment, the routine information carrier is a flash drive, and the formatted digital document is a file with the extension .DOC, a format supported by Microsoft Word, developed by Microsoft Corporation of Redmond, Washington, USA. Files in this format and this software are widely used in practice. The formatted digital document contains a traditional form presented as a table. Input device 105 is a mouse and keyboard; output device 106 is a display; data reading device 107 is a card reader; operating system 110 is Microsoft Windows; Web application runtime environment 222 is the .NET Framework; and Web browser 115 is Microsoft Internet Explorer 7.0, all likewise developed by Microsoft Corporation of Redmond, Washington, USA.
The implementation of this embodiment is as follows; except for the following technical features, it is the same as embodiment one:
1) Read in traditional form information:
Referring to Fig. 1, Fig. 3 and Fig. 4, client computer 100 provided by the invention reads the Word file containing traditional form information from the flash drive through card reader 107;
Then, according to step 404, traditional form information read module 301 in traditional form recognition unit 113 uses the interfaces provided by common object repository 114 to read the traditional form information in the Word file, including: the number of forms in the document; the position of each form on the document page; the style of the form lines and borders; the number of form rows and columns; the height and width of each cell; the content of each cell; the font of text in the cells; and the size and position of images. According to step 405, this information is stored in program data area 112 of system memory 102.
The four embodiments above describe in detail how, according to the system and method of the present invention, traditional form information contained in digital documents of different formats from different information carriers is identified, and how the Web list finally created from that information is sent to a network client in response to an interaction request. These four embodiments demonstrate the feasibility, practicality and generality of the system and method of the present invention in practical applications under different conditions.
The embodiments above are preferred embodiments of the present invention, but the embodiments of the invention are not limited by these examples. Any other change, modification, substitution, combination or simplification made without departing from the spirit and principle of the present invention shall be regarded as an equivalent replacement and falls within the protection scope of the present invention.