CN109542969B - Text transformer test data structuring system and method - Google Patents

Publication number
CN109542969B
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811403566.3A
Other languages
Chinese (zh)
Other versions
CN109542969A (en)
Inventor
周正钦
许晓路
徐进霞
陈佳
倪辉
程林
范辉
高树国
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Electric Power Research Institute of State Grid Hebei Electric Power Co Ltd
Wuhan NARI Ltd
Original Assignee
Electric Power Research Institute of State Grid Hebei Electric Power Co Ltd
Wuhan NARI Ltd

Abstract

The invention discloses a text-type transformer test data structuring system and method. The system comprises an application layer, a logic layer and a data storage layer. The application layer is used for inputting the structuring requirement of text-type transformer test data; the logic layer structures the test data according to that requirement; and the data storage layer comprises a file result table and a file template table, both generated when the logic layer structures the test data. Text data is thereby converted into structured data, which improves the utilization rate of historical transformer test data, integrates transformer state data, and facilitates the establishment of complete transformer operation and maintenance files.

Description

Text transformer test data structuring system and method
Technical Field
The invention relates to the technical field of data processing, and in particular to a system and a method for structuring text-type transformer test data.
Background
Transformer test data is often stored as electronic text, in either a structured or an unstructured form. Unstructured data refers to data other than structured data, and it has the following defects:
(1) The data structure is not fixed, so the data cannot be stored in the tables of a relational database and can only be kept as files of various kinds, such as Word or txt files;
(2) The content of unstructured data usually cannot be read directly; the corresponding software must be used to open and browse it;
(3) A database can only store such data in a BLOB field, which makes data retrieval inconvenient;
(4) Unstructured data is not easy to understand, and its meaning cannot be obtained directly from the data;
(5) Unstructured data has no specified structure, cannot be standardized, and is not easy to manage, so a more intelligent system is required for querying, storing, updating, and using it.
Therefore, it is necessary to convert transformer test data into structured data in order to improve the utilization rate of historical transformer test data, integrate transformer state data, and establish complete transformer operation and maintenance files.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention provides a system and a method for structuring text-type transformer test data, which improve the utilization rate of historical transformer test data, integrate transformer state data, and facilitate the establishment of complete transformer operation and maintenance files.
In order to solve the above technical problems, the invention adopts the following technical scheme:
A text-type transformer test data structuring system comprises an application layer, a logic layer and a data storage layer, wherein the application layer is used for inputting the structuring requirement of text-type transformer test data, the logic layer is used for structuring the test data according to the structuring requirement, and the data storage layer comprises a file result table and a file template table which are generated when the logic layer structures the test data.
Further, the logic layer includes:
the file format definition module is used for converting the acquired text file into a standard structure file;
the metadata extraction module is used for extracting metadata of the standard structure file;
the template creating module is used for creating a file template according to the metadata and transmitting the file template to the template library module, and the template library module transmits the file template to the file template table;
the intermediate data representation module is used for creating a simulation result table by adopting a structure mapping and semantic mapping method according to the file template;
and the data conversion module is used for extracting the data content of the standard structure file and inserting the data content into the simulation result table.
Furthermore, the file format definition module comprises a data conversion interface, and the file format definition module converts the acquired text file into a standard structure file through the data conversion interface.
Further, the standard structure file is an unstructured file in an XML format.
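As an illustration only, a standard structure file in XML format might look like the sample below. The element name `test_record`, the `field` elements, and all values are assumptions made for this sketch; the patent does not specify the XML layout. The snippet reads the sample with Python's standard `xml.etree.ElementTree` module.

```python
import xml.etree.ElementTree as ET

# Hypothetical layout of a standard structure file; tag names and values are assumed.
STANDARD_FILE = """\
<test_record type="winding_resistance">
  <field name="device_id">T-001</field>
  <field name="test_date">2018-06-01</field>
  <field name="resistance_ohm">0.532</field>
</test_record>
"""

root = ET.fromstring(STANDARD_FILE)
# Collect every <field> as a name -> text pair.
record = {f.get("name"): f.text for f in root.findall("field")}
print(root.get("type"), record)
```

Every value is still plain text at this point; giving the fields names and types is left to the later template and table steps.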
Furthermore, the file template comprises structure information of a standard structure file, and the intermediate data representation module creates a simulation result table according to the structure information to obtain a file result table.
The invention also discloses a text-type transformer test data structuring method, which comprises the following steps:
S1: the file format definition module converts the acquired text file into a standard structure file;
S2: the metadata extraction module extracts metadata of the standard structure file;
S3: the template creating module creates a file template according to the metadata and transmits the file template to the template library module;
S4: the intermediate data representation module creates a simulation result table for the file template in the template library according to structure mapping and semantic mapping, and the template library module then transmits the file template to the file template table;
S5: the data conversion module extracts the data content of the standard structure file and inserts the data content into the simulation result table to obtain a file result table.
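The order of steps S1 to S5 can be sketched as a small pipeline object. This is a minimal sketch, not the patented implementation: the class name `StructuringPipeline`, the callable signatures, and the stub steps are all assumptions introduced to show the data flow.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class StructuringPipeline:
    """Wires the five logic-layer steps together; each step is a plain callable."""
    convert_format: Callable[[str], str]      # S1: text file -> standard structure file
    extract_metadata: Callable[[str], dict]   # S2: standard structure file -> metadata
    create_template: Callable[[dict], dict]   # S3: metadata -> file template
    create_table: Callable[[dict], str]       # S4: file template -> simulation result table name
    load_data: Callable[[str, str], int]      # S5: (standard file, table name) -> rows inserted
    file_template_table: List[dict] = field(default_factory=list)

    def run(self, raw_text: str) -> Tuple[str, int]:
        standard_file = self.convert_format(raw_text)
        metadata = self.extract_metadata(standard_file)
        template = self.create_template(metadata)
        table_name = self.create_table(template)
        # S4 also stores the template once its table exists.
        self.file_template_table.append(template)
        rows = self.load_data(standard_file, table_name)
        return table_name, rows


# Stub steps, only to show the order of the calls.
pipeline = StructuringPipeline(
    convert_format=lambda text: f"<record>{text}</record>",
    extract_metadata=lambda std: {"fields": ["raw"]},
    create_template=lambda meta: {"table": "demo", "fields": meta["fields"]},
    create_table=lambda tpl: tpl["table"],
    load_data=lambda std, table: 1,
)
result = pipeline.run("oil temperature 45")
```

Passing the steps in as callables keeps each module replaceable, which matches the way the description treats them as separate modules.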
Compared with the prior art, the invention has the following beneficial effects:
The text-type transformer test data structuring system and method convert text data into structured files, improve the utilization rate of historical transformer test data, integrate transformer state data, and facilitate the establishment of complete transformer operation and maintenance files.
The foregoing is only an overview of the technical solution of the present invention. So that the technical means of the invention can be understood more clearly and implemented according to the description, and so that the above and other objects, features, and advantages can be more readily understood, preferred embodiments are described in detail below with reference to the accompanying drawings.
Drawings
FIG. 1 is a schematic structural diagram of the text-type transformer test data structuring system according to the present invention;
FIG. 2 is a schematic flow chart of the text-type transformer test data structuring method according to the present invention;
wherein the reference numbers in FIG. 1 are:
1. an application layer; 2. a logic layer; 3. a data storage layer; 4. a file format definition module; 5. a metadata extraction module; 6. a template creation module; 7. an intermediate data representation module; 8. a data conversion module; 9. a data conversion interface; 10. a template library module; 11. a file result table; 12. a file template table.
Detailed Description
To further illustrate the technical means and effects adopted by the present invention to achieve its intended objects, the embodiments, structures, characteristics and effects of the invention are described in detail below with reference to the accompanying drawings and preferred embodiments:
As shown in FIG. 1, the text-type transformer test data structuring system includes an application layer 1, a logic layer 2 and a data storage layer 3. The application layer 1 is used for inputting the structuring requirement of text-type transformer test data, which makes it convenient for a user to enter that requirement; the logic layer 2 is used for structuring the test data according to the structuring requirement; the data storage layer 3 comprises a file result table 11 and a file template table 12 which are generated when the logic layer 2 structures the test data.
Specifically, the logic layer 2 includes a file format definition module 4, a metadata extraction module 5, a template creation module 6, an intermediate data representation module 7, and a data conversion module 8, where: the file format definition module 4 is used for converting the acquired text file into a standard structure file; the metadata extraction module 5 is used for extracting metadata of the standard structure file; the template creation module 6 is used for creating a file template according to the metadata and transmitting the file template to the template library module 10, and the template library module 10 transmits the file template to the file template table 12; the intermediate data representation module 7 is used for creating a simulation result table by structure mapping and semantic mapping according to the file template; the data conversion module 8 is used for extracting the data content of the standard structure file and inserting the data content into the simulation result table to obtain the file result table 11.
In the invention, the data conversion interface 9 of the file format definition module 4 distinguishes text-type transformer test data in different formats and, according to the result, uses a different program for each format to convert the data into a standard structure file with a uniform format, which greatly facilitates the structural conversion of the data.
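The format-specific conversion described above can be sketched as a dispatch table. The two input formats, the detection rule, and every function name here are assumptions made for illustration; the patent does not say which formats exist or how they are told apart.

```python
import xml.etree.ElementTree as ET
from typing import Callable, Dict


def _build(pairs) -> ET.Element:
    """Build the uniform layout: <record><field name="...">value</field>...</record>."""
    root = ET.Element("record")
    for name, value in pairs:
        ET.SubElement(root, "field", name=name.strip()).text = value.strip()
    return root


def from_colon_lines(text: str) -> ET.Element:
    # Format A (assumed): one "name: value" pair per line.
    lines = [ln for ln in text.splitlines() if ":" in ln]
    return _build(ln.split(":", 1) for ln in lines)


def from_header_row(text: str) -> ET.Element:
    # Format B (assumed): a comma-separated header line, then one value line.
    header, values = text.strip().splitlines()[:2]
    return _build(zip(header.split(","), values.split(",")))


CONVERTERS: Dict[str, Callable[[str], ET.Element]] = {
    "colon_lines": from_colon_lines,
    "header_row": from_header_row,
}


def detect_format(text: str) -> str:
    """Very rough rule: a first line with commas and no colon is a header row."""
    stripped = text.strip()
    first_line = stripped.splitlines()[0] if stripped else ""
    if "," in first_line and ":" not in first_line:
        return "header_row"
    return "colon_lines"


def to_standard_structure(text: str) -> ET.Element:
    return CONVERTERS[detect_format(text)](text)


a = to_standard_structure("device_id: T-001\noil_temp_c: 45.2")
b = to_standard_structure("device_id,oil_temp_c\nT-001,45.2")
```

Both inputs end up in the same element layout, so the later steps only have to handle one format. Supporting a new source format would mean adding one converter and one detection rule.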
In the invention, the standard structure file is an unstructured file in an XML format.
The invention also discloses a text-type transformer test data structuring method which, as shown in FIG. 2, comprises the following steps:
S1: the file format definition module 4 converts the acquired text file into a standard structure file.
S2: the metadata extraction module 5 extracts metadata of the standard structure file.
S3: the template creation module 6 creates a file template from the metadata and transmits the file template to the template library module 10.
It should be noted that the file template plays an important role in converting text-type transformer test data. The file template includes structure information of the standard structure file, and the intermediate data representation module 7 creates the simulation result table according to this structure information, which includes basic information used for creating the simulation result table such as field names, field types, and indexes. After the simulation result table has been created, the file template is transmitted to the file template table 12 for storage, so that no new template needs to be created when the system later converts text files of the same format.
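A minimal sketch of how a file template holding field names, field types, and index flags could drive table creation is given below, using SQLite through Python's standard `sqlite3` module. The XML layout, the type-guessing rule, the choice of `device_id` as the indexed field, and all function names are assumptions for illustration; the patent names no database or inference method.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical standard structure file; layout and values are assumed.
STANDARD_FILE = (
    '<test_record type="winding_resistance">'
    '<field name="device_id">T-001</field>'
    '<field name="tap_position">3</field>'
    '<field name="resistance_ohm">0.532</field>'
    "</test_record>"
)


def infer_type(value: str) -> str:
    """Guess a column type from one sample value (a simplification)."""
    for cast, sql_type in ((int, "INTEGER"), (float, "REAL")):
        try:
            cast(value)
            return sql_type
        except ValueError:
            continue
    return "TEXT"


def create_template(xml_text: str, indexed=("device_id",)) -> dict:
    """S2 + S3: read field names and types from the file and pack them into a template."""
    root = ET.fromstring(xml_text)
    fields = [
        {"name": f.get("name"), "type": infer_type(f.text or ""), "index": f.get("name") in indexed}
        for f in root.findall("field")
    ]
    return {"table": root.get("type"), "fields": fields}


def create_result_table(conn: sqlite3.Connection, template: dict) -> None:
    """Create the simulation result table (and its indexes) described by the template."""
    table = template["table"]
    columns = ", ".join('"{}" {}'.format(f["name"], f["type"]) for f in template["fields"])
    conn.execute('CREATE TABLE IF NOT EXISTS "{}" ({})'.format(table, columns))
    for f in template["fields"]:
        if f["index"]:
            conn.execute(
                'CREATE INDEX IF NOT EXISTS "idx_{0}_{1}" ON "{0}" ("{1}")'.format(table, f["name"])
            )


conn = sqlite3.connect(":memory:")
template = create_template(STANDARD_FILE)
create_result_table(conn, template)
```

Because the template is plain data, it can be stored and reused for later files of the same format without repeating the metadata extraction.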
S4: the intermediate data representation module 7 creates a simulation result table for the file template in the template library according to structure mapping and semantic mapping, and the template library module 10 then transmits the file template to the file template table 12.
It should be noted that the intermediate data representation module 7 performs a structure matching operation between the generated file template and the simulation result tables that have already been established. If structure conflicts, semantic conflicts, contact conflicts, or the like are found, they are eliminated according to predefined rules before the simulation result table is created.
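One possible reading of the structure matching step is sketched below. The synonym table stands in for the "predefined rules", and the handling shown (rename by synonym, add missing columns, report type mismatches) is an assumption; the patent does not define the rules, and contact conflicts are not covered by this sketch.

```python
from typing import Dict, List, Tuple

# Hypothetical predefined rule: alternative field names that mean the same thing.
SYNONYMS: Dict[str, str] = {"equipment_id": "device_id", "oil_temp": "oil_temp_c"}


def match_structure(
    template_fields: Dict[str, str], table_columns: Dict[str, str]
) -> Tuple[Dict[str, str], Dict[str, str], List[str]]:
    """Compare a template with an existing table.

    Returns (field -> column mapping, columns to add, columns whose types disagree).
    """
    mapping: Dict[str, str] = {}
    to_add: Dict[str, str] = {}
    type_conflicts: List[str] = []
    for name, sql_type in template_fields.items():
        column = SYNONYMS.get(name, name)      # semantic conflict: resolve via synonym rule
        mapping[name] = column
        if column not in table_columns:
            to_add[column] = sql_type          # structure conflict: column is missing
        elif table_columns[column] != sql_type:
            type_conflicts.append(column)      # structure conflict: types disagree
    return mapping, to_add, type_conflicts


existing = {"device_id": "TEXT", "oil_temp_c": "REAL"}
incoming = {"equipment_id": "TEXT", "oil_temp": "INTEGER", "humidity_pct": "REAL"}
mapping, to_add, type_conflicts = match_structure(incoming, existing)
```

In this example `equipment_id` maps onto the existing `device_id` column, `humidity_pct` is flagged as a column to add, and `oil_temp_c` is reported because the incoming type differs from the stored one.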
Moreover, if the format of a piece of text-type transformer test data is complex, a plurality of simulation result tables need to be established for that same piece of test data; if several pieces of text-type transformer test data have the same structure, only one simulation result table needs to be established for them.
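The one-to-many and many-to-one relationships between files and simulation result tables can be sketched by keying tables on a structure signature. Treating a complex file as a set of named sections, and the class and method names used here, are assumptions for illustration only.

```python
from typing import Dict, List, Tuple

Field = Tuple[str, str]  # (field name, field type)


class TemplateLibrary:
    """Maps each distinct structure to one simulation result table name."""

    def __init__(self) -> None:
        self._tables: Dict[Tuple[Field, ...], str] = {}

    def table_for(self, fields: List[Field]) -> str:
        signature = tuple(sorted(fields))  # field order does not matter
        if signature not in self._tables:
            self._tables[signature] = f"result_table_{len(self._tables) + 1}"
        return self._tables[signature]

    def tables_for_file(self, sections: Dict[str, List[Field]]) -> Dict[str, str]:
        """A complex file is treated as several sections, each with its own table."""
        return {name: self.table_for(fields) for name, fields in sections.items()}


library = TemplateLibrary()
report_1 = {
    "nameplate": [("device_id", "TEXT"), ("rated_kva", "REAL")],
    "insulation": [("device_id", "TEXT"), ("resistance_mohm", "REAL")],
}
report_2 = {"nameplate": [("rated_kva", "REAL"), ("device_id", "TEXT")]}
tables_1 = library.tables_for_file(report_1)
tables_2 = library.tables_for_file(report_2)
```

Here the first report needs two tables because its two sections differ in structure, while the second report reuses the first table because its fields match an existing signature.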
S5: the data conversion module 8 extracts the data content of the standard structure file and inserts the data content into the simulation result table to obtain the file result table 11.
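Step S5 can be sketched as reading the field values from the standard structure file and inserting them as one row. SQLite, the table definition, the XML layout, and the function name `load_record` are assumptions for this sketch; the table is presumed to exist already from step S4.

```python
import sqlite3
import xml.etree.ElementTree as ET

conn = sqlite3.connect(":memory:")
# Assumed to exist already from step S4.
conn.execute("CREATE TABLE winding_resistance (device_id TEXT, resistance_ohm REAL)")

# Hypothetical standard structure file; layout and values are assumed.
STANDARD_FILE = (
    "<test_record>"
    '<field name="device_id">T-001</field>'
    '<field name="resistance_ohm">0.532</field>'
    "</test_record>"
)


def load_record(conn: sqlite3.Connection, table: str, xml_text: str) -> None:
    """Insert the field values of one standard structure file as one row."""
    root = ET.fromstring(xml_text)
    values = {f.get("name"): f.text for f in root.findall("field")}
    # Use the table's own column list so fields without a column are ignored.
    columns = [row[1] for row in conn.execute(f"PRAGMA table_info({table})")]
    placeholders = ", ".join("?" for _ in columns)
    column_list = ", ".join(f'"{c}"' for c in columns)
    # Values are bound as parameters; the table name is trusted input in this sketch.
    conn.execute(
        f'INSERT INTO "{table}" ({column_list}) VALUES ({placeholders})',
        [values.get(c) for c in columns],
    )
    conn.commit()


load_record(conn, "winding_resistance", STANDARD_FILE)
row = conn.execute("SELECT device_id, resistance_ohm FROM winding_resistance").fetchone()
```

Once rows are in the table, the test data can be queried with ordinary SQL instead of being opened file by file.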
The above embodiments are only preferred embodiments of the present invention, and the scope of the present invention should not be limited thereby, and any insubstantial changes and substitutions made by those skilled in the art based on the present invention are intended to be covered by the claims.

Claims (1)

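1. A text-type transformer test data structuring system, characterized in that: the system comprises an application layer, a logic layer and a data storage layer, wherein the application layer is used for inputting the structuring requirement of text-type transformer test data, the logic layer is used for structuring the test data according to the structuring requirement, and the data storage layer comprises a file result table and a file template table which are generated when the logic layer structures the test data;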
the logic layer includes: the file format definition module is used for converting the acquired text file into a standard structure file; the metadata extraction module is used for extracting metadata of the standard structure file; the template creating module is used for creating a file template according to the metadata and transmitting the file template to the template library module, and the template library module transmits the file template to the file template table; the intermediate data representation module is used for creating a simulation result table by adopting a structure mapping and semantic mapping method according to the file template; the data conversion module is used for extracting the data content of the standard structure file and inserting the data content into the simulation result table to obtain a file result table;
the file format definition module comprises a data conversion interface, and converts the acquired text file into a standard structure file through the data conversion interface; the standard structure file is an unstructured file with an XML format;
the file template comprises structure information of a standard structure file, and the intermediate data representation module creates a simulation result table according to the structure information.
CN201811403566.3A 2018-11-23 2018-11-23 Text transformer test data structuring system and method Active CN109542969B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811403566.3A CN109542969B (en) 2018-11-23 2018-11-23 Text transformer test data structuring system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811403566.3A CN109542969B (en) 2018-11-23 2018-11-23 Text transformer test data structuring system and method

Publications (2)

Publication Number Publication Date
CN109542969A CN109542969A (en) 2019-03-29
CN109542969B true CN109542969B (en) 2023-02-07

Family

ID=65849299

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811403566.3A Active CN109542969B (en) 2018-11-23 2018-11-23 Text transformer test data structuring system and method

Country Status (1)

Country Link
CN (1) CN109542969B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113191719A (en) * 2021-04-01 2021-07-30 北京优易惠技术有限公司 File processing method and system in bidding field
CN114492361A (en) * 2022-04-06 2022-05-13 普锐斯(北京)科技有限公司 Laboratory detection data structuring method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105474166A (en) * 2013-03-15 2016-04-06 先进元素科技公司 Methods and systems for purposeful computing
CN107992597A (en) * 2017-12-13 2018-05-04 国网山东省电力公司电力科学研究院 A kind of text structure method towards electric network fault case

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030014406A1 (en) * 2001-06-07 2003-01-16 Urbanpixel Inc. Intelligent browser windows in a multi-browser environment

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
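Research on the conversion of unstructured data to structured data based on XML technology; 李爱民 et al.; 《铁路计算机应用》 (Railway Computer Application); 20121025; pp. 12-16 *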

Also Published As

Publication number Publication date
CN109542969A (en) 2019-03-29

Similar Documents

Publication Publication Date Title
CN111753099B (en) Method and system for enhancing relevance of archive entity based on knowledge graph
CN106202292B (en) Standard information analysis method based on structured data model
CN102163233A (en) Method and system for converting webpage markup language format
CN109542969B (en) Text transformer test data structuring system and method
CN102193906A (en) Method for automatically introducing examination paper in WORD format into database system
CN104035993B (en) Memory search method, e-book management system, the reading system of e-book
CN110716952A (en) Multi-source heterogeneous data processing method and device and storage medium
CN103729463A (en) Method for implementing full-text retrieval based on Lucene and Solr
CN108228726B (en) Incremental transaction content acquisition method and storage medium for distribution network red and black images
CN101430684A (en) Method and apparatus for mutual conversion between Chinese work office software document and documents with other format
EP3688624A1 (en) Intelligent inferences of authoring from document layout and formatting
US9626346B2 (en) Method of implementing structured and non-structured data in an XML document
CN109471957B (en) Metadata conversion method and device based on uniform tags
CN111666747A (en) Method for generating WORD document into description class data module conforming to S1000D standard
CN106570095B (en) XML data operation method and equipment
CN101183388A (en) Electronic component self-adapting data-base realization method
CN101645073A (en) Method for guiding prior database file into embedded type database
WO2011074942A1 (en) System and method of converting data from a multiple table structure into an edoc format
CN110889261A (en) Method for automating electronic official document service processing
CN102609410B (en) Authority file auxiliary writing system and authority file generating method
CN106209936B (en) Third party system data capture method and device
CN101799890A (en) Certificate data processing method and system
CN112395292B (en) Data feature extraction and matching method and device
CN104994169A (en) Floor-sweeping robot state information query method and server
CN115510834A (en) Laboratory data digital management system and method

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant