CN111368019A - Document data structured processing method - Google Patents
- Publication number
- CN111368019A (application CN201811487817.0A)
- Authority
- CN
- China
- Prior art keywords
- data
- class
- descriptions
- subclass
- supplier
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a document data structuring processing method comprising the following steps: A. establishing a material master data list, which comprises material codes, short material descriptions, long material descriptions, large class codes, large class descriptions, middle class codes, middle class descriptions, small class codes and small class descriptions, the material master data being stored on the material master data platform of a national network company; B. according to the material master data, every material is assigned to a class, and the material subclass (small class) is the lowest level of the classification; submitting purchasing demands, searching for suppliers and making purchasing plans are all carried out with the material subclass as the basic unit. The information in the "one paper certificate" is structured so that important information becomes structured data, which greatly improves office efficiency.
Description
Technical Field
The present invention relates to the field of data processing, and in particular, to a document data structuring processing method.
Background
At present, many text mining tools exist in China and abroad. They analyze text data, process unstructured data, and refine text into fixed labels, classification dimensions or structured fields. Some mature text mining tools expose an HTTP interface through which an application can analyze large volumes of text data efficiently. Their main text mining functions are the following four. First, automatic tag extraction: the more important keyword tags are extracted from the text data through natural language analysis. Second, text classification: an algorithm automatically determines the category of an article and gives a corresponding confidence, for example whether an article belongs to entertainment gossip, current political affairs, or digital technology. Third, automatic content review: the tool judges whether an article contains politically sensitive or pornographic content and gives the severity of the violation. Fourth, spam detection: the tool automatically judges whether a text is junk text and filters junk data automatically. However, the overall processing flow of these tools is too complicated and still requires manual sorting and discrimination, which reduces overall office efficiency.
Disclosure of Invention
In view of the above, the present invention is directed to a method for document data structuring processing.
To this end, the invention provides a document data structuring processing method, characterized by comprising the following steps:
A. establishing a material master data list, which comprises material codes, short material descriptions, long material descriptions, large class codes, large class descriptions, middle class codes, middle class descriptions, small class codes and small class descriptions, the material master data being stored on a material master data platform;
B. according to the material master data, assigning every material to a class, the material subclass (small class) being the lowest level of the classification; submitting purchasing requirements, searching for suppliers and making purchasing plans are all carried out with the material subclass as the basic unit;
C. for material subclasses that are purchased on a unified basis, completing the purchase by centralized bidding: the purchased materials are divided into bid packages, bidding and bid evaluation are carried out publicly, and finally the supplier of each bid package is determined;
D. for important material subclasses with a large purchase amount, establishing a unified template, having each bidding participant fill in its enterprise qualification and sales and supply performance information according to the template, and, after approval by the quality supervision department of the material department, issuing a "one paper certificate" that states the qualifications and performance of the bidding participant;
F. managing services such as material purchasing, contract signing, supply planning, waste material disposal and supplier management, and recording their data, on the ECP system, wherein the "one paper certificate" is stored in the ECP system in .doc format;
G. performing bid evaluation.
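The material master data list of step A and the subclass-based grouping of step B can be sketched as follows. This is a minimal illustration, not the platform's actual data model: the class name `MaterialRecord`, the function `group_by_subclass`, the English field names and all sample codes are assumptions made for this example.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class MaterialRecord:
    """One row of the material master data list (field names are hypothetical)."""
    material_code: str
    short_description: str
    long_description: str
    large_class_code: str
    large_class_description: str
    middle_class_code: str
    middle_class_description: str
    small_class_code: str
    small_class_description: str


def group_by_subclass(records):
    """Group material codes under their small class (subclass) code,
    the basic unit used for purchasing in step B."""
    groups = defaultdict(list)
    for record in records:
        groups[record.small_class_code].append(record.material_code)
    return dict(groups)


# Made-up sample data, for illustration only.
records = [
    MaterialRecord("M001", "Cable A", "10 kV power cable, type A",
                   "G01", "Primary equipment", "G0101", "Cables",
                   "G010101", "Power cables"),
    MaterialRecord("M002", "Cable B", "10 kV power cable, type B",
                   "G01", "Primary equipment", "G0101", "Cables",
                   "G010101", "Power cables"),
    MaterialRecord("M003", "Transformer A", "Distribution transformer, type A",
                   "G01", "Primary equipment", "G0102", "Transformers",
                   "G010201", "Distribution transformers"),
]

subclass_groups = group_by_subclass(records)
```

Keeping the large, middle and small class codes on every record means the subclass of a material can be read directly from its row without a separate lookup.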
Preferably, in step B, a two-dimensional data table structure is designed according to the "one paper certificate" associated with the material subclass: the name of each two-dimensional table, its position in the file and the fields it contains are determined, and the data format of each field is defined.
Preferably, in step D, the lists of all suppliers holding a "one paper certificate" are sorted out by material subclass. The full name of each supplier corresponds exactly, one to one, to the supplier name in the other two-dimensional tables, so the supplier name can be used as the primary key for matching between the two-dimensional tables.
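Matching two tables on the supplier name can be sketched as a simple keyed join. The function name `join_on_supplier`, the column names and the sample rows below are hypothetical; the text only states that the supplier full name serves as the primary key. Rows whose supplier name has no match are returned separately so they can be checked by hand.

```python
def join_on_supplier(basic_info, other_table, key="supplier_name"):
    """Join two tables (lists of dicts) on the supplier name.

    Returns (joined_rows, unmatched_names). The supplier name must be
    unique in basic_info because it acts as the primary key.
    """
    index = {}
    for row in basic_info:
        name = row[key]
        if name in index:
            raise ValueError(f"duplicate supplier name: {name!r}")
        index[name] = row

    joined, unmatched = [], []
    for row in other_table:
        base = index.get(row[key])
        if base is None:
            unmatched.append(row[key])
        else:
            # Columns of both tables are merged into one row.
            joined.append({**base, **row})
    return joined, unmatched


# Made-up sample tables, for illustration only.
basic_info = [
    {"supplier_name": "Example Electric Co., Ltd.", "legal_representative": "Zhang San"},
    {"supplier_name": "Sample Cable Co., Ltd.", "legal_representative": "Li Si"},
]
performance = [
    {"supplier_name": "Example Electric Co., Ltd.", "contracts_completed": 12},
    {"supplier_name": "Unknown Co.", "contracts_completed": 3},
]

joined, unmatched = join_on_supplier(basic_info, performance)
```

An exact string match only works if the names are written identically in every table, which is why the text requires the names to correspond completely.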
Preferably, the text data of the "one paper certificate" is extracted into the designed two-dimensional tables and filled into the corresponding positions.
Preferably, the data format and the unit of each field in the two-dimensional table are defined.
Preferably, step G specifically comprises:
G1. preliminary review: checking the technical deviations in the bid documents item by item against the tender documents;
G2. detailed review: comprehensively comparing the technical part and the commercial part of the bid documents, the enterprise qualifications and the supply capability of the bidders.
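One possible reading of the two review stages is a filter followed by a ranking. The sketch below is an assumption throughout: the source does not say how deviations are judged or how the four aspects are combined, so the pass rule (`max_deviations`), the equal `WEIGHTS`, the score scale and the sample bids are all hypothetical.

```python
# Hypothetical equal weights for the four aspects compared in G2.
WEIGHTS = {
    "technical": 0.25,
    "commercial": 0.25,
    "qualification": 0.25,
    "supply_capability": 0.25,
}


def preliminary_review(bids, max_deviations=0):
    """G1 (assumed rule): keep bids whose number of listed technical
    deviations does not exceed max_deviations."""
    return [b for b in bids if len(b["technical_deviations"]) <= max_deviations]


def detailed_review(bids, weights=WEIGHTS):
    """G2 (assumed rule): rank bids by a weighted sum of their scores,
    best first."""
    def total(bid):
        return sum(weights[k] * bid["scores"][k] for k in weights)
    return sorted(bids, key=total, reverse=True)


# Made-up bids, for illustration only.
bids = [
    {"bidder": "A", "technical_deviations": ["voltage rating"],
     "scores": {"technical": 95, "commercial": 90, "qualification": 90, "supply_capability": 90}},
    {"bidder": "B", "technical_deviations": [],
     "scores": {"technical": 80, "commercial": 70, "qualification": 90, "supply_capability": 60}},
    {"bidder": "C", "technical_deviations": [],
     "scores": {"technical": 90, "commercial": 80, "qualification": 70, "supply_capability": 80}},
]

ranked = detailed_review(preliminary_review(bids))
ranking = [b["bidder"] for b in ranked]
```

With structured tables in place, the inputs to both stages can be read from fields rather than re-read from each certificate.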
Preferably, the two-dimensional data tables include a basic enterprise information table, which contains fields such as supplier name (enterprise full name), enterprise short name, establishment time, registered capital, place of registration, plant location, legal representative, enterprise category, unit type and enterprise nature.
Preferably, the data format and unit of each field in the two-dimensional data table are chosen so that a program can read the data in a single pass, avoiding secondary processing.
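Defining a format and a unit for every field, so that a program can read each value once without later clean-up, might look like the sketch below. The schema name `BASIC_INFO_SCHEMA`, the English field names, the ISO date format and the unit "10,000 CNY" are assumptions chosen for illustration; the source does not fix any of them.

```python
from datetime import date
from decimal import Decimal

# Each field declares how its text is parsed and which unit it is stored in.
# Formats and units here are hypothetical examples.
BASIC_INFO_SCHEMA = {
    "supplier_name": {"parse": str, "unit": None},
    "enterprise_short_name": {"parse": str, "unit": None},
    "establishment_date": {"parse": date.fromisoformat, "unit": None},  # YYYY-MM-DD
    "registered_capital": {"parse": Decimal, "unit": "10,000 CNY"},
    "legal_representative": {"parse": str, "unit": None},
}


def parse_row(raw, schema):
    """Convert a row of raw strings into typed values in a single pass."""
    return {field: spec["parse"](raw[field].strip()) for field, spec in schema.items()}


# Made-up raw row, for illustration only.
raw_row = {
    "supplier_name": "Example Electric Co., Ltd.",
    "enterprise_short_name": "Example Electric",
    "establishment_date": "2005-03-01",
    "registered_capital": " 5000 ",
    "legal_representative": "Zhang San",
}

row = parse_row(raw_row, BASIC_INFO_SCHEMA)
```

Because the unit is fixed per field, the stored value is a bare number and no unit text has to be stripped or converted afterwards.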
As can be seen from the above, the document data structuring processing method provided by the present invention structures the "one paper certificate" text files stored in the ECP system. It extracts the key information from a large number of text files and, taking the bidding enterprise as the object, constructs two-dimensional data tables covering basic information, financial condition, existing performance, personnel composition, design software, design drawings, patents, certification certificates, type tests, manufacturing equipment, test equipment, manufacturing processes, production environment, production capacity and so on. The "one paper certificate" information is thereby structured, important information becomes structured data, and office efficiency is greatly improved.
Drawings
FIG. 1 is a flow chart of the steps of an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to specific embodiments and the accompanying drawings.
It should be noted that all uses of "first" and "second" in the embodiments of the present invention serve to distinguish two entities that share a name but are not the same entity, or two parameters that are not the same. "First" and "second" are used merely for convenience of description and should not be construed as limiting the embodiments of the present invention; this is not repeated in the following embodiments.
As shown in FIG. 1, a document data structuring processing method according to the present invention includes the following steps:
A. establishing a material master data list, which comprises material codes, short material descriptions, long material descriptions, large class codes, large class descriptions, middle class codes, middle class descriptions, small class codes and small class descriptions, the material master data being stored on a material master data platform;
B. according to the material master data, assigning every material to a class, the material subclass (small class) being the lowest level of the classification; submitting purchasing requirements, searching for suppliers and making purchasing plans are all carried out with the material subclass as the basic unit. In step B, a two-dimensional data table structure is designed according to the "one paper certificate" associated with the material subclass: the name of each two-dimensional table, its position in the file and the fields it contains are determined, and the data format of each field is defined. Preferably, the two-dimensional data tables include a basic enterprise information table, which contains fields such as supplier name (enterprise full name), enterprise short name, establishment time, registered capital, place of registration, plant location, legal representative, enterprise category, unit type and enterprise nature. The data format and unit of each field are chosen so that a program can read the data in a single pass, avoiding secondary processing. The text data of the "one paper certificate" is extracted into the designed two-dimensional tables and filled into the corresponding positions;
C. for material subclasses that are purchased on a unified basis, completing the purchase by centralized bidding: the purchased materials are divided into bid packages, bidding and bid evaluation are carried out publicly, and finally the supplier of each bid package is determined;
D. for important material subclasses with a large purchase amount, establishing a unified template, having each bidding participant fill in its enterprise qualification and sales and supply performance information according to the template, and, after approval by the quality supervision department of the material department, issuing a "one paper certificate" that states the qualifications and performance of the bidding participant. In step D, the lists of all suppliers holding a "one paper certificate" are sorted out by material subclass; the full name of each supplier corresponds exactly, one to one, to the supplier name in the other two-dimensional tables, so the supplier name can be used as the primary key for matching between the two-dimensional tables;
F. managing services such as material purchasing, contract signing, supply planning, waste material disposal and supplier management, and recording their data, on the ECP system, wherein the "one paper certificate" is stored in the ECP system in .doc format;
G. performing bid evaluation, wherein step G specifically comprises: G1, preliminary review, checking the technical deviations in the bid documents item by item against the tender documents; G2, detailed review, comprehensively comparing the technical part and the commercial part of the bid documents, the enterprise qualifications and the supply capability of the bidders, and selecting the bidder with the best overall performance.
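Extracting certificate text into a predefined table row can be sketched as below. The sketch assumes the .doc file has already been converted to plain text and that each item appears on its own line as "label: value"; neither assumption comes from the source. The mapping `LABEL_TO_FIELD`, the function `extract_fields` and the sample text are hypothetical.

```python
import re

# Hypothetical mapping from labels in the certificate to table fields.
LABEL_TO_FIELD = {
    "Enterprise full name": "supplier_name",
    "Legal representative": "legal_representative",
    "Registered place": "registered_place",
}

# Matches "label: value" with either an ASCII or a full-width colon.
LINE_PATTERN = re.compile(r"^\s*(?P<label>[^:：]+?)\s*[:：]\s*(?P<value>.*?)\s*$")


def extract_fields(text, label_to_field=LABEL_TO_FIELD):
    """Fill one table row from certificate text; fields not found stay None."""
    row = {field: None for field in label_to_field.values()}
    for line in text.splitlines():
        match = LINE_PATTERN.match(line)
        if match is None:
            continue  # not a "label: value" line
        field = label_to_field.get(match.group("label"))
        if field is not None:
            row[field] = match.group("value")
    return row


# Made-up certificate text, for illustration only.
sample_text = """One paper certificate
Enterprise full name: Example Electric Co., Ltd.
Legal representative: Zhang San
Remarks: none
"""

extracted = extract_fields(sample_text)
```

Fields left as `None` mark positions in the table that the certificate did not supply and that may need manual follow-up.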
Those of ordinary skill in the art will understand that the discussion of any embodiment above is exemplary only and is not intended to imply that the scope of the disclosure, including the claims, is limited to these examples. Within the idea of the invention, features of the above embodiments or of different embodiments may also be combined, steps may be implemented in any order, and there are many other variations of the different aspects of the invention as described above, which are not provided in detail for the sake of brevity.
In addition, well known power/ground connections to Integrated Circuit (IC) chips and other components may or may not be shown within the provided figures for simplicity of illustration and discussion, and so as not to obscure the invention. Furthermore, devices may be shown in block diagram form in order to avoid obscuring the invention, and also in view of the fact that specifics with respect to implementation of such block diagram devices are highly dependent upon the platform within which the present invention is to be implemented (i.e., specifics should be well within purview of one skilled in the art). Where specific details (e.g., circuits) are set forth in order to describe example embodiments of the invention, it should be apparent to one skilled in the art that the invention can be practiced without, or with variation of, these specific details. Accordingly, the description is to be regarded as illustrative instead of restrictive.
While the present invention has been described in conjunction with specific embodiments thereof, many alternatives, modifications, and variations of these embodiments will be apparent to those of ordinary skill in the art in light of the foregoing description. For example, other memory architectures (e.g., dynamic RAM (DRAM)) may use the discussed embodiments.
The embodiments of the invention are intended to embrace all such alternatives, modifications and variances that fall within the broad scope of the appended claims. Therefore, any omissions, modifications, substitutions, improvements and the like that may be made without departing from the spirit and principles of the invention are intended to be included within the scope of the invention.
Claims (8)
1. A document data structuring processing method, characterized by comprising the following steps:
A. establishing a material master data list, which comprises material codes, short material descriptions, long material descriptions, large class codes, large class descriptions, middle class codes, middle class descriptions, small class codes and small class descriptions, the material master data being stored on a material master data platform;
B. according to the material master data, assigning every material to a class, the material subclass (small class) being the lowest level of the classification; submitting purchasing requirements, searching for suppliers and making purchasing plans are all carried out with the material subclass as the basic unit;
C. for material subclasses that are purchased on a unified basis, completing the purchase by centralized bidding: the purchased materials are divided into bid packages, bidding and bid evaluation are carried out publicly, and finally the supplier of each bid package is determined;
D. for important material subclasses with a large purchase amount, establishing a unified template, having each bidding participant fill in its enterprise qualification and sales and supply performance information according to the template, and, after approval by a national network material company and the quality supervision department of the material department, issuing a "one paper certificate" that states the qualifications and performance of the bidding participant;
F. managing services such as material purchasing, contract signing, supply planning, waste material disposal and supplier management, and recording their data, on the ECP system, wherein the "one paper certificate" is stored in the ECP system in .doc format;
G. performing bid evaluation.
2. The method according to claim 1, wherein in step B a two-dimensional data table structure is designed according to the "one paper certificate" associated with the material subclass: the name of each two-dimensional table, its position in the file and the fields it contains are determined, and the data format of each field is defined.
3. The method according to claim 1, wherein in step D the lists of all suppliers holding a "one paper certificate" are sorted out by material subclass, the full name of each supplier corresponds one to one to the supplier name in the other two-dimensional tables, and the supplier name can be used as the primary key for matching between the two-dimensional tables.
4. The method according to claim 2, wherein the text data of the "one paper certificate" is extracted into the designed two-dimensional tables and filled into the corresponding positions.
5. The method according to claim 1, wherein the data format and unit of each field in the two-dimensional table are defined.
6. The method according to claim 1, wherein step G specifically comprises:
G1. preliminary review: checking the technical deviations in the bid documents item by item against the tender documents;
G2. detailed review: comprehensively comparing the technical part and the commercial part of the bid documents, the enterprise qualifications and the supply capability of the bidders.
7. The method according to claim 2, wherein the two-dimensional data tables include a basic enterprise information table comprising fields such as supplier name (enterprise full name), enterprise short name, establishment time, registered capital, place of registration, plant location, legal representative, enterprise category, unit type and enterprise nature.
8. The method according to claim 2, wherein the data format and unit of each field in the two-dimensional data table are chosen so that a program can read the data in a single pass, avoiding secondary processing.
Priority Applications (1)
Application Number | Publication | Priority Date | Filing Date | Title |
---|---|---|---|---|
CN201811487817.0A | CN111368019A (en) | 2018-12-06 | 2018-12-06 | Document data structured processing method |
Applications Claiming Priority (1)
Application Number | Publication | Priority Date | Filing Date | Title |
---|---|---|---|---|
CN201811487817.0A | CN111368019A (en) | 2018-12-06 | 2018-12-06 | Document data structured processing method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111368019A (en) | 2020-07-03 |
Family
ID=71203955
Family Applications (1)
Application Number | Status | Publication | Priority Date | Filing Date | Title |
---|---|---|---|---|---|
CN201811487817.0A | Pending | CN111368019A (en) | 2018-12-06 | 2018-12-06 | Document data structured processing method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111368019A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117573877A (en) * | 2024-01-17 | 2024-02-20 | 安徽省优质采科技发展有限责任公司 | Supply chain collaborative management platform material data processing method and system |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117573877A (en) * | 2024-01-17 | 2024-02-20 | 安徽省优质采科技发展有限责任公司 | Supply chain collaborative management platform material data processing method and system |
CN117573877B (en) * | 2024-01-17 | 2024-03-22 | 安徽省优质采科技发展有限责任公司 | Supply chain collaborative management platform material data processing method and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110069623B (en) | Abstract text generation method and device, storage medium and computer equipment | |
CN109344154B (en) | Data processing method, device, electronic equipment and storage medium | |
CN110879808B (en) | Information processing method and device | |
CN110879939A (en) | Method and device for generating response document | |
CN109062881A (en) | Purchase bidding documenting method and system | |
CN110765101A (en) | Label generation method and device, computer readable storage medium and server | |
CN110798567A (en) | Short message classification display method and device, storage medium and electronic equipment | |
CN110717754A (en) | Commodity transaction method, server, user side, laboratory side and system | |
CN113205402A (en) | Account checking method and device, electronic equipment and computer readable medium | |
CN112214508A (en) | Data processing method and device | |
CN112990713A (en) | Method, system and storage medium for evaluating engineering consultation service in whole process | |
CN112800755A (en) | Data management method and system | |
CN111368019A (en) | Document data structured processing method | |
CN108959289B (en) | Website category acquisition method and device | |
CN111951081A (en) | System for enabling each material to be attached with information attribute and constructing scene by using data | |
CN115618120B (en) | Public number information pushing method, system, terminal equipment and storage medium | |
Symeonidis et al. | Unsupervised consumer intention and sentiment mining from microblogging data as a business intelligence tool | |
CN115982241A (en) | Data processing method and device, electronic equipment and computer readable medium | |
CN105809453A (en) | Electronic documents based full supply chain information back-dating and controlling method | |
CN112612817B (en) | Data processing method, device, terminal equipment and computer readable storage medium | |
CN111026705B (en) | Building engineering file management method, system and terminal equipment | |
CN113763143A (en) | Auditing processing method, computer equipment and storage device | |
CN113626655A (en) | Method for extracting information in file, computer equipment and storage device | |
US20140270575A1 (en) | Methods and systems for capture processing | |
CN112765448A (en) | User label mining method, device, server and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | PB01 | Publication | |
 | WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20200703 |