CN105786921B - A kind of the data module method for transformation and device of non-structured document - Google Patents

A kind of the data module method for transformation and device of non-structured document Download PDF

Info

Publication number
CN105786921B
CN105786921B CN201410829893.0A CN201410829893A CN105786921B CN 105786921 B CN105786921 B CN 105786921B CN 201410829893 A CN201410829893 A CN 201410829893A CN 105786921 B CN105786921 B CN 105786921B
Authority
CN
China
Prior art keywords
structured document
label
data
module
dmrl
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410829893.0A
Other languages
Chinese (zh)
Other versions
CN105786921A (en
Inventor
刘剑
梁伟杰
连光耀
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Aerospace Measurement and Control Technology Co Ltd
Original Assignee
Beijing Aerospace Measurement and Control Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Aerospace Measurement and Control Technology Co Ltd filed Critical Beijing Aerospace Measurement and Control Technology Co Ltd
Priority to CN201410829893.0A priority Critical patent/CN105786921B/en
Publication of CN105786921A publication Critical patent/CN105786921A/en
Application granted granted Critical
Publication of CN105786921B publication Critical patent/CN105786921B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The invention discloses the data module method for transformation and device of a kind of non-structured document.This method comprises: selecting non-structured document to be transformed;Pre- labeling processing is carried out to the non-structured document to be transformed, determines conversion data target classification;According to the conversion data target classification, the data module list of requirements DMRL for meeting interactive electronic technical manual IETM standard is generated;According to the DMRL, multiple data modules are converted by the non-structured document to be transformed.The present invention to extract various types of other content in non-structured document, and then is converted to a variety of data modules by being inserted into preset label in non-structured document.Present invention raising IETM's writes efficiency, reduces the workload of manual compiling IETM.

Description

A kind of the data module method for transformation and device of non-structured document
Technical field
The present invention relates to information data switching technology fields, turn more particularly to a kind of data module of non-structured document Change method and apparatus.
Background technique
In the after-sale service of the large scale equipments such as aircraft, railway, ship, customer service, equipment guarantee field, will use a large amount of Maintenance of equipment file, these technological documents in use, exist and occupy that memory space is big, is difficult to carry, manages, looks into The problems such as looking for, is inconvenient to use, service efficiency is low, in order to solve problem above, relatively good approach are exactly to use a kind of data Enterprise schema, data description method by the organizing again of data, recycle, by the data of diversified forms (such as text, video, Audio, three-dimensional) it integrates, synthesization shows the processes such as equipment, the maintenance of equipment, fault diagnosis.It uses among these A kind of technology be exactly interactive electronic technical manual (Interactive Electronic Technical Manual, referred to as IETM)。
IETM applies the failure of the electronization in technical data, standardization, integrated management, the instruction of equipment, equipment Maintenance, the training and examination of equipment, equipment maintenance record management etc. during, improving equipment, equipment fault diagnosis efficiency Meanwhile reducing its Support expense.
A kind of important tool of the IETM as equipment guarantee, still, IETM belongs to new things, does not pacify in the equipment development stage The production work for arranging IETM, so as to cause after the completion of equipment development, it has to rewrite IETM, this will undoubtedly bring huge Human input and workload, while the situation that data may also be caused inconsistent.How from the source documents etc. of equipment Data are extracted in a large amount of unstructured datas (such as WORD formatted file), the corresponding data module content of IETM are generated, by shadow Ring the manufacturing process of IETM.
Summary of the invention
Based on above-mentioned technical problem, the present invention provides the data module method for transformation and dress of a kind of non-structured document It sets.
In order to solve the above technical problems, the present invention solves by the following technical programs.
The present invention provides a kind of data module method for transformation of non-structured document, comprising: selectes non-knot to be transformed Structure document;Pre- labeling processing is carried out to the non-structured document to be transformed, determines conversion data target classification;According to The conversion data target classification generates the data module list of requirements for meeting interactive electronic technical manual IETM standard DMRL;According to the DMRL, multiple data modules are converted by the non-structured document to be transformed.
Wherein, pre- labeling processing is carried out to the non-structured document to be transformed, determines conversion data target classification, Include: the label of the corresponding position insertion preset kind in the non-structured document to be transformed, keeps each label corresponding The data module of respective type.
Wherein, the label includes following one:<system>,<descript>,<proced>,<fault>,< process>;After the label is inserted into the corresponding position in the non-structured document to be transformed, the label includes: Node type, nodename and node content.
Wherein, according to the conversion data target classification, the number for meeting interactive electronic technical manual IETM standard is generated According to module list of requirements DMRL, comprising: obtain label and be the node type and nodename of<system>, and configure corresponding SNS coding;Obtaining label is<descript>,<fault>,<process>node type and nodename, be respectively configured pair The type coding answered;According to the corresponding relationship of node type and data module, according to DMRL data format, automatically generate comprising every The DMRL of the node type of a label, nodename and coding.
Wherein, according to the DMRL, multiple data modules are converted by the non-structured document to be transformed, comprising: According to the DMRL of generation, node content extraction is carried out in non-structured document;For the multiple node contents extracted, according to IETM data format standard is separately converted to the data module of corresponding data format.
Wherein, the method also includes: according to pre-set sensitive word rule and synonym rule, in unstructured text Sensitive word and synonym are searched in shelves, are inserted into sensitive word label in the position of sensitive word and synonym;It is described according to the DMRL, Multiple data modules are converted by the non-structured document to be transformed, comprising: identification sensitive word label, by sensitive word label Node content be converted into the data module of corresponding data format.
The present invention also provides a kind of data module reforming units of non-structured document, comprising: chosen module, for selecting Fixed non-structured document to be transformed;Processing module, for being carried out at pre- labeling to the non-structured document to be transformed Reason, determines conversion data target classification;Generation module, for according to the conversion data target classification, generation to meet interactive mode The data module list of requirements DMRL of electronic technical manual IETM standard;Conversion module, for according to the DMRL, will it is described to The non-structured document of conversion is converted into multiple data modules.
Wherein, the processing module is used for: the corresponding position insertion in the non-structured document to be transformed is default The label of type makes each label correspond to the data module of respective type.
Wherein, the label includes following one:<system>,<descript>,<proced>,<fault>,< process>;After the label is inserted into the corresponding position in the non-structured document to be transformed by the processing module, The label includes: node type, nodename and node content.
Wherein, the generation module is used for: obtaining the node type and nodename that label is<system>, and configuration pair The SNS coding answered;Obtaining label is<descript>,<fault>,<process>node type and nodename, match respectively Set corresponding type coding;Packet is automatically generated according to DMRL data format according to the corresponding relationship of node type and data module The DMRL of node type, nodename and coding containing each label;
Wherein, the conversion module, is used for: according to the DMRL of generation, carrying out node content in non-structured document and mentions It takes;The data of corresponding data format are separately converted to according to IETM data format standard for the multiple node contents extracted Module.
The present invention has the beneficial effect that:
The present invention in non-structured document by being inserted into preset label, to extract various classifications in non-structured document Content, and then be converted to a variety of data modules.Present invention raising IETM's writes efficiency, reduces the work of manual compiling IETM It measures.
Detailed description of the invention
Fig. 1 is the flow chart of the data module method for transformation of the non-structured document of one embodiment of the invention;
Fig. 2 is flow chart the step of generating DMRL of one embodiment of the invention;
Fig. 3 is the step flow chart of the data module conversion of one embodiment of the invention;
Fig. 4 is the structure chart of the data module reforming unit of the non-structured document of one embodiment of the invention.
Specific embodiment
The present invention provides the data module method for transformation and device of a kind of non-structured document.Below in conjunction with attached drawing and Embodiment, the present invention will be described in further detail.It should be appreciated that specific embodiment described herein is only used to explain The present invention does not limit the present invention.
Fig. 1 is the flow chart of the data module method for transformation of non-structured document according to an embodiment of the invention.
Step S110 selectes non-structured document to be transformed.
Non-structured document to be transformed is the source documents of equipment, is maintenance of equipment file.It needs to tie up equipment It improves literature part, is converted into meeting the data module of IETM standard.
Step S120 carries out pre- labeling processing to non-structured document to be transformed, determines conversion data target classification.
Pre- labeling processing refers to: labeling description is carried out to non-structured document, it is corresponding in non-structured document Preset label is inserted into position.Label is used to identify the node in non-structured document, specifies inhomogeneity in non-structured document Other content.Such as: main body class content, description the class content, failure classes content, mistake in non-structured document are marked off with label Journey class content.Main body refers to the object mainly described in non-structured document.Such as: in an article, chapter 1 description master Machine, chapter 2 describe display, then main body is respectively host and display.
Conversion data target classification refers to: corresponding to the data content of different data module in non-structured document, that is to say It is inserted into default label, marks off the non-structured document after different classes of content.
Step S130 generates the data module demand column for meeting IETM standard according to determining conversion data target classification Table (Data Module Requirements List, abbreviation DMRL).
It include data module essential information needed for user in DMRL.The data module essential information includes number needed for user According to each label node type, nodename and the coding of module, the description of Fig. 2 is specifically please referred to.
Step S140 converts multiple data modules for non-structured document to be transformed according to the DMRL of generation.
Multiple data modules are divided into different types, such as: main body class data module, description class data module, failure classes Data module, process class data module.Further, data module can regard as will be different classes of in non-structured document in Appearance is converted respectively and is formed.
For step S120, specifically:
The label of corresponding position insertion preset kind in non-structured document to be transformed, forms multiple nodes (mark Label), so that the node type of each node is corresponded to the data module of respective type.
Non-structured document includes the formats such as WORD, PDF and CAJ.The present embodiment is preferred, non-structured document WORD Format.
It is inserted into the label of non-structured document, including but not limited to:<system>,<descript>,<proced>,< fault>,<process>.Wherein<system>be the theme class label,<descript>be description class label,<fault>is therefore Barrier class label,<process>for process class label.
Labeling requirement is predefined.Specifically, tag definition can be carried out according to data module, make label and data mould Block is mapped, in this way, the data content of non-structured document is just mapped with corresponding data module naturally.Further Ground, each label include node type, and according to the position that label is inserted into, label further includes nodename and node content.Node Type is configured according to the type of data module, and nodename is configured according to the document content of label insertion position, section Point content is document content.Wherein, the node content of each label can correspond to same or different data module.
Such as:<system>is the theme class label, the position where insertion main body (e.g., equipment cabinets), corresponding main body class Data module;<descript>is description class label, the position where insertion equipment description, corresponding description class data module;< Fault > and it is failure classes label, the position where insertion equipment fault description, corresponding failure classes data module;<process>for Process class label is inserted into the position where equipment use process, corresponding process class data module.
For label inserted mode, such as: the part content of text in selected parts non-structured document, and inserted in corresponding position Enter label:
……
1.3.6 safety
Driver task's terminal has self-destroying function, guarantees the safety of important information.
……
To above-mentioned content of text, following label can be set:
<system name=" totality ">
<descript>
1.3.6 safety
Driver task's terminal has self-destroying function, guarantees the safety of important information.
</descript>
</system>”。
For another example: the non-structured document after insertion label are as follows:
<system name=" software ">
The design of 4 software subsystems
<system name=" technical information acquisition subsystem ">
4.1 technical information acquisition subsystems (Infosys transformation and upgrade)
<descript name=" system structure ">
4.1.1 system structure
<Para>
Infosys uses the architecture of C/S, realizes durings development and production, sizing etc., the acquisition of technical information, The parameter of technical data, training, maintenance and spares provisioning etc., picture, design drawing, each class model, design are supported including providing The information such as document, technical manual.
</Para>
</descript>
</system>。
For another example: in an article, chapter 1 and chapter 2 describe different main bodys respectively, and chapter 1 and chapter 2 all wrap The description of first segment equipment, the second section equipment operation, third section equipment fault are included, then when being inserted into label: chapter 1 insertion < System>, chapter 2 insertion<system>, the first segment insertion of chapter 1<descript>, the second section insertion<process>, The insertion of third section<fault>, the first segment insertion of chapter 2<descript>, the second section insertion<process>, third section insertion <fault>。
It is inserted into label by the corresponding position in non-structured document, non-structured document is made to be changed into description category information Content, and achieved the purpose that be split non-structured document.Wherein, descriptor format can refer to the S1000D of IETM Standard, and suitably extended.
If non-structured document is WORD format, plug-in unit is accessed by the document data that office is provided, to non-knot Structure document accesses.
For step S130, specifically:
Specific steps flow chart the step of generating DMRL as shown in Figure 2.
Step S210 obtains label and is the node type and nodename of<system>, and match in non-structured document It sets corresponding system and divides coding (Standard numbering System, abbreviation SNS).
Wherein, SNS coding is based on standardized coding scheme, for marker rig and its code of distinguishing hierarchy.? I other words converting SNS structure for non-structured document.<system>is level-one label.
Step S220, in non-structured document, obtaining label is<descript>,<fault>,<process>section Vertex type and nodename, and corresponding type coding is respectively configured.
<descript>,<fault>,<process>equal labels are corresponding<system>second level label under label.
Step S230 automatically generates packet according to DMRL data format according to the corresponding relationship of node type and data module DMRL containing each label node type, nodename and coding.
Further, while generating DMRL, data module coding is corrected.Because each data module has One unique data module encodes, and includes that initial SNS coding and starting type encode (IC code) in data module coding, For<system>label configures SNS coding, is<descript>,<fault>,<process>type coding is respectively configured in label Afterwards, SNS coding and initial SNS coding and type coding and corresponding starting type coding have differences, therefore, it is desirable to will Initial SNS coding and starting type coding in data module coding replace with corresponding SNS coding and type coding.
For step S140, specifically:
According to DMRL, the content (node content) in non-structured document under each label is extracted, by each mark of extraction The content signed, be separately converted to corresponding types and meets the data module of IETM data format standard, and with data module The form of list is managed.
As shown in figure 3, for the step flow chart converted according to the data module of one embodiment of the invention.
Step S310 carries out node content extraction according to the DMRL of generation in non-structured document.
According to<descript>,<fault>,<process>equal labels, are partitioned into node content.In other words, divide unit For data module.
Such as: in an article, chapter 1 and chapter 2 describe different main bodys respectively, and chapter 1 and chapter 2 all wrap The description of first segment equipment, the second section equipment operation, third section equipment fault are included, then when dividing content, according to tag extraction Out: the node content A of chapter 1, the node content B of chapter 2, the node of the node content C of the first segment of chapter 1, the second section The node content E of content D, third section, the node content F of the first segment of chapter 2, the node content G of the second section, third section Node content H.
Step S320, according to IETM data format standard, is separately converted to corresponding for the multiple node contents extracted The data module of data format.
Such as: it is pressed for content of text<para>data format is converted, and is pressed for figure<graphic>data format It is converted, is converted for table according to<table>format.
Step S330 is managed in data module list for multiple data modules of acquisition.
Unlike data module list of requirements DMRL, data module list includes the basic letter of all data modules Breath, and DMRL only include user needs data module essential information, as include data module under each label node type, Nodename and corresponding coding.
For step S320, specifically:
During carrying out data module conversion to non-structured document, non-structured document can also be carried out into one Step ground labeling processing, to increase the accuracy of data module.
According to pre-set sensitive word rule and synonym rule, sensitive word and synonymous is searched in non-structured document Word is inserted into sensitive word label in the position of sensitive word and synonym;In the conversion process, sensitive word label is identified, by sensitive word The node content of label is converted into the data module of corresponding data format, in other words, by the content comprising sensitive word and synonym It is converted into the data module of corresponding data format.
Specifically, non-structured document is parsed, such as: selected non-structured document is parsed, is analyzed The purposes of non-structured document determines sensitive word and the corresponding synonym of sensitive word in non-structured document, establishes sensitive word Rule, synonym rule.Sensitive word rule is for example shown in table 1, but the content being not limited in table 1, synonym rule such as 2 institute of table The content shown, but be not limited in table 2.
Table 1 sensitive word rule
Document name Sensitive word Format Purposes
Driver behavior illustrates .doc Safety <security>safety</security>
Driver behavior illustrates .doc Task terminal <endItem>task terminal</endItem>
Table 2 synonym rule
Sensitive word Synonym one Synonym two Synonym X
Safety Safety Security feature
Task terminal Terminal Using terminal
Including the non-structured document title where multiple sensitive words, each sensitive word, Mei Gemin in sensitive word rule Feel the format etc. of word addition sensitive word label.
It include the corresponding one or more synonyms of sensitive word in synonym rule.Sensitive word mark is being added for synonym When label, using the identical addition format of the corresponding sensitive word of the synonym.
In one embodiment, the sensitive word rule of setting can also be realized quick to the sensitive word addition in preset range Feel word label.Further, conjunctive word and metadata are added in sensitive word rule.Conjunctive word is the range for limiting sensitive word Associated symbol, such as larger than, be less than etc.;Metadata is the value range of sensitive word, such as MPa, Min.Such as: air pressure is greater than 5MPa, Then, air pressure is sensitive word, greater than being metadata for conjunctive word, Mpa.
Further, preset sensitive word and synonym may be defined description in IETM standard, it is also possible in IETM Without definition description in standard;For the sensitive word and synonym of no definition description, needs to be extended IETM standard, make There are corresponding descriptions.Such as: " air pressure " without definition description, then needs to redefine in IETM data in IETM standard Hold, " air pressure " is made to there is definition description.
The present embodiment carries out labeling segmentation, sensitive word and synonym definition, DMRL generation, IETM to non-structured document The processes such as conversion tentatively solve heavy workload when non-structured document generates IETM, generate the problems such as data are inconsistent.
Present invention seek to address that the non-structural data such as maintenance of equipment file are converted into the maintenance support class data mould in IETM Block (for example describe class data module, program class data module, failure classes data module, maintenance project class data module etc.), have The problem of that improves IETM writes efficiency, reduces the workload of manual compiling IETM, specifically includes: selecting to be analyzed non-structural Change document (WORD document), the preliminary labeling of data is carried out to selected document content, conversion data is carried out by labeling The classification of target;The lteral data in document is deeply combed, data use is analyzed, defines metadata sensitive word in data, same Adopted word rule;After conversion data target classification determines, the DMRL (data module list of requirements) that IETM is required is generated;According to DMRL Convert analytic process data and result data to the data format for meeting IETM data format standard, and with data module list Form be managed.
The present invention also provides a kind of data module reforming units of non-structured document, as shown in Figure 4.
Chosen module 410, for selecting non-structured document to be transformed.
Processing module 420 determines turn over number for carrying out pre- labeling processing to the non-structured document to be transformed According to target classification.Further, processing module 420 are inserted for the corresponding position in the non-structured document to be transformed The label for entering preset kind makes each label correspond to the data module of respective type.The label includes following one: < system>,<descript>,<proced>,<fault>,<process>.The label is inserted into the processing module 420 Behind corresponding position in the non-structured document to be transformed, the label includes: in node type, nodename and node Hold.
Generation module 430, for according to the conversion data target classification, generation to meet interactive electronic technical manual The data module list of requirements DMRL of IETM standard.Further, the generation module 430 is<system>for obtaining label Node type and nodename, and configure corresponding SNS coding;Obtain label be<descript>,<fault>,< Process > node type and nodename, corresponding type coding is respectively configured;According to node type and data module Corresponding relationship automatically generates the node type comprising each label, nodename and coding according to DMRL data format DMRL。
Conversion module 440, for converting multiple data for the non-structured document to be transformed according to the DMRL Module.Further, conversion module 440 carry out node content in non-structured document and mention for the DMRL according to generation It takes;The data of corresponding data format are separately converted to according to IETM data format standard for the multiple node contents extracted Module.
The function of device described in the present embodiment is described in Fig. 1-embodiment of the method shown in Fig. 3, therefore Not detailed place, may refer to the related description in previous embodiment, this will not be repeated here in the description of the present embodiment.
Although for illustrative purposes, the preferred embodiment of the present invention has been disclosed, those skilled in the art will recognize It is various improve, increase and replace be also it is possible, therefore, the scope of the present invention should be not limited to the above embodiments.

Claims (10)

1. a kind of data module method for transformation of non-structured document characterized by comprising
Select non-structured document to be transformed;
Pre- labeling processing is carried out to the non-structured document to be transformed, determines conversion data target classification;The label Including following one:<system>,<descript>,<proced>,<fault>,<process>;
According to the conversion data target classification, the data module demand for meeting interactive electronic technical manual IETM standard is generated List DMRL;
According to the DMRL, multiple data modules are converted by the non-structured document to be transformed.
2. the method as described in claim 1, which is characterized in that carry out pre- labeling to the non-structured document to be transformed Processing, determines conversion data target classification, comprising:
The label of corresponding position insertion preset kind in the non-structured document to be transformed, makes each label correspond to phase Answer the data module of type.
3. method according to claim 2, which is characterized in that the label is being inserted into the unstructured text to be transformed Behind corresponding position in shelves, the label includes: node type, nodename and node content.
4. method as claimed in claim 3, which is characterized in that according to the conversion data target classification, generation meets interaction The data module list of requirements DMRL of formula electronic technical manual IETM standard, comprising:
It obtains label and is the node type and nodename of<system>, and configure corresponding SNS coding;
Obtaining label is<descript>,<fault>,<process>node type and nodename, be respectively configured corresponding Type coding;
According to the corresponding relationship of node type and data module, according to DMRL data format, automatically generate comprising each label The DMRL of node type, nodename and coding.
5. the method as described in claim 1, which is characterized in that according to the DMRL, by the unstructured text to be transformed Shelves are converted into multiple data modules, comprising:
According to the DMRL of generation, node content extraction is carried out in non-structured document;
The number of corresponding data format is separately converted to according to IETM data format standard for the multiple node contents extracted According to module.
6. the method as described in claim 1, which is characterized in that the method also includes:
According to pre-set sensitive word rule and synonym rule, sensitive word and synonym are searched in non-structured document, Sensitive word label is inserted into the position of sensitive word and synonym;
It is described according to the DMRL, convert multiple data modules for the non-structured document to be transformed, comprising: identification it is quick Feel word label, converts the node content of sensitive word label to the data module of corresponding data format.
7. a kind of data module reforming unit of non-structured document characterized by comprising
Chosen module, for selecting non-structured document to be transformed;
Processing module determines conversion data target for carrying out pre- labeling processing to the non-structured document to be transformed Classification;The label includes following one:<system>,<descript>,<proced>,<fault>,<process>;
Generation module, for according to the conversion data target classification, generation to meet interactive electronic technical manual IETM standard Data module list of requirements DMRL;
Conversion module, for converting multiple data modules for the non-structured document to be transformed according to the DMRL.
8. device as claimed in claim 7, which is characterized in that the processing module is used for:
The label of corresponding position insertion preset kind in the non-structured document to be transformed, makes each label correspond to phase Answer the data module of type.
9. device as claimed in claim 8, which is characterized in that in the processing module that label insertion is described to be transformed Non-structured document in corresponding position after, the label includes: node type, nodename and node content.
10. device as claimed in claim 9, which is characterized in that
The generation module is used for:
It obtains label and is the node type and nodename of<system>, and configure corresponding SNS coding;
Obtaining label is<descript>,<fault>,<process>node type and nodename, be respectively configured corresponding Type coding;
According to the corresponding relationship of node type and data module, according to DMRL data format, automatically generate comprising each label The DMRL of node type, nodename and coding;
The conversion module, is used for:
According to the DMRL of generation, node content extraction is carried out in non-structured document;
The number of corresponding data format is separately converted to according to IETM data format standard for the multiple node contents extracted According to module.
CN201410829893.0A 2014-12-26 2014-12-26 A kind of the data module method for transformation and device of non-structured document Active CN105786921B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410829893.0A CN105786921B (en) 2014-12-26 2014-12-26 A kind of the data module method for transformation and device of non-structured document

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410829893.0A CN105786921B (en) 2014-12-26 2014-12-26 A kind of the data module method for transformation and device of non-structured document

Publications (2)

Publication Number Publication Date
CN105786921A CN105786921A (en) 2016-07-20
CN105786921B true CN105786921B (en) 2019-06-18

Family

ID=56388701

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410829893.0A Active CN105786921B (en) 2014-12-26 2014-12-26 A kind of the data module method for transformation and device of non-structured document

Country Status (1)

Country Link
CN (1) CN105786921B (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106294551A (en) * 2016-07-25 2017-01-04 中国商用飞机有限责任公司 System and comprehensive establishment management system is managed for the CIR of technical publications
CN108021632B (en) * 2017-11-23 2020-07-07 中国移动通信集团河南有限公司 Mutual conversion processing method for unstructured data and structured data
CN110119984A (en) * 2018-02-07 2019-08-13 青岛农业大学 A kind of processing system for international trade tick financing
CN108710660A (en) * 2018-05-11 2018-10-26 上海核工程研究设计院有限公司 A kind of items property parameters modeling of database and storage method
CN110990636A (en) * 2019-12-18 2020-04-10 哈尔滨工程大学 Intelligent data module acquisition and conversion method for diesel engine interactive electronic technical manual
CN111666747A (en) * 2020-05-29 2020-09-15 中国工程物理研究院计算机应用研究所 Method for generating WORD document into description class data module conforming to S1000D standard
CN111859863A (en) * 2020-06-03 2020-10-30 远光软件股份有限公司 Document structure conversion method and device, storage medium and electronic equipment
CN112699641B (en) * 2021-03-25 2021-07-20 南京国睿信维软件有限公司 Method for quickly converting batch copy of WORD content to DM based on S1000D standard

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7849048B2 (en) * 2005-07-05 2010-12-07 Clarabridge, Inc. System and method of making unstructured data available to structured data analysis tools
CN101055578A (en) * 2006-04-12 2007-10-17 龙搜(北京)科技有限公司 File content dredger based on rule
CN102207975A (en) * 2011-06-24 2011-10-05 天津大学 Method for manufacturing and displaying extensive makeup language (xml) data module based on ietm standard
CN102982027A (en) * 2011-09-02 2013-03-20 北大方正集团有限公司 Method and device for abstracting contents in document
CN103678625A (en) * 2013-12-18 2014-03-26 北京航天测控技术有限公司 Method and device for transforming interactive electronic technical manual data

Also Published As

Publication number Publication date
CN105786921A (en) 2016-07-20

Similar Documents

Publication Publication Date Title
CN105786921B (en) A kind of the data module method for transformation and device of non-structured document
CN101122899B (en) Report generation method and device
CN101989256A (en) Typesetting method of document file and device
US20080155519A1 (en) Code translator
JP6090850B2 (en) Source program analysis system, source program analysis method and program
CN102722479A (en) A method and device for realizing language translation
US20130262987A1 (en) Document processing method, apparatus and editor
CN103885942B (en) A kind of rapid translation device and method
CN106547729A (en) A kind of dynamic creation method and system of data sheet
CN103064659A (en) Software as a service (SAAS) model based on metadata extraction user-defined worksheet system
CN103095726A (en) Processing method and device of protocol interpreter
US20130204875A1 (en) Automatic Configuration Of A Product Data Management System
CN104298705A (en) Converting method of relational data and unstructured data
CN111859053A (en) Data definition method of visual chart and chart library realized by data definition method
CN104063545A (en) Method and system for dynamically displaying process tracing diagram
CN104536947A (en) Layout document processing method and device
CN108228688B (en) Template generation method, system and server based on XBRL
CN102521359B (en) Interface data file comparison method and device
CN107203311B (en) Display method and device of multi-language menu
CN110968591A (en) Query statement generation method and device, storage medium and processor
CN104978379A (en) Method and device for building application program information station
CN105808595B (en) A kind of the data library generating method and device of authority file
CN108628862A (en) database addressing method, device and system
CN111401005B (en) Text conversion method and device and readable storage medium
CN109446295B (en) Svg data map editing tool, editing method and computer readable medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant