CN105786921B - Data module conversion method and device for unstructured documents - Google Patents
- Publication number
- CN105786921B (application CN201410829893.0A; also published as CN105786921A)
- Authority
- CN
- China
- Prior art keywords
- structured document
- label
- data
- module
- dmrl
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Landscapes
- Document Processing Apparatus (AREA)
Abstract
The invention discloses a data module conversion method and device for unstructured documents. The method comprises: selecting an unstructured document to be converted; performing pre-labeling processing on the unstructured document to be converted to determine the conversion data target classification; generating, according to the conversion data target classification, a data module requirements list (DMRL) that conforms to the interactive electronic technical manual (IETM) standard; and converting the unstructured document to be converted into multiple data modules according to the DMRL. By inserting preset labels into the unstructured document, the invention extracts the content of each category in the document and converts it into data modules of multiple types. The invention improves the efficiency of authoring IETMs and reduces the workload of writing them manually.
Description
Technical field
The present invention relates to the technical field of information data conversion, and more particularly to a data module conversion method and device for unstructured documents.
Background art
In the after-sales service, customer service and equipment support of large equipment such as aircraft, railways and ships, a large number of equipment maintenance documents are used. In use, these technical documents occupy a large amount of storage space and are difficult to carry, manage and search, which makes them inconvenient and inefficient to use. A good way to solve these problems is to adopt a data organization scheme and data description method that reorganizes and reuses data, integrates data in various forms (such as text, video, audio and 3D models), and presents processes such as equipment maintenance and fault diagnosis in a comprehensive way. One technology used for this purpose is the interactive electronic technical manual (Interactive Electronic Technical Manual, IETM for short).
IETMs are applied to the electronic, standardized and integrated management of technical data, and to equipment operating instructions, fault repair, training and examination, and maintenance record management. They improve the efficiency of equipment fault diagnosis while reducing support costs.
The IETM is an important tool for equipment support. However, because the IETM is relatively new, IETM production is often not scheduled during the equipment development stage, so the IETM has to be written from scratch after development is complete. This brings a huge investment of labor and workload, and may also lead to inconsistent data. How to extract data from large amounts of unstructured data such as the equipment's source documents (for example, WORD files) and generate the corresponding IETM data module content will therefore affect how IETMs are produced.
Summary of the invention
To address the above technical problem, the present invention provides a data module conversion method and device for unstructured documents.
The present invention solves the above technical problem through the following technical solutions.
The present invention provides a data module conversion method for unstructured documents, comprising: selecting an unstructured document to be converted; performing pre-labeling processing on the unstructured document to be converted to determine the conversion data target classification; generating, according to the conversion data target classification, a data module requirements list (DMRL) that conforms to the interactive electronic technical manual (IETM) standard; and converting the unstructured document to be converted into multiple data modules according to the DMRL.
Performing pre-labeling processing on the unstructured document to be converted to determine the conversion data target classification comprises: inserting labels of preset types at the corresponding positions in the unstructured document to be converted, so that each label corresponds to a data module of the corresponding type.
The labels include one of the following: <system>, <descript>, <proced>, <fault>, <process>. After a label is inserted at the corresponding position in the unstructured document to be converted, the label includes a node type, a node name and node content.
Generating the DMRL that conforms to the IETM standard according to the conversion data target classification comprises: obtaining the node type and node name of each <system> label and configuring the corresponding SNS code; obtaining the node type and node name of each <descript>, <fault> and <process> label and configuring the corresponding type code for each; and, according to the correspondence between node types and data modules and following the DMRL data format, automatically generating a DMRL that contains the node type, node name and code of each label.
Converting the unstructured document to be converted into multiple data modules according to the DMRL comprises: extracting node content from the unstructured document according to the generated DMRL; and converting each extracted node content into a data module of the corresponding data format according to the IETM data format standard.
The method further comprises: searching the unstructured document for sensitive words and synonyms according to preset sensitive word rules and synonym rules, and inserting sensitive word labels at the positions of the sensitive words and synonyms. In this case, converting the unstructured document to be converted into multiple data modules according to the DMRL comprises: identifying the sensitive word labels and converting the node content of each sensitive word label into a data module of the corresponding data format.
The present invention also provides a data module conversion device for unstructured documents, comprising: a selection module for selecting an unstructured document to be converted; a processing module for performing pre-labeling processing on the unstructured document to be converted to determine the conversion data target classification; a generation module for generating, according to the conversion data target classification, a data module requirements list (DMRL) that conforms to the interactive electronic technical manual (IETM) standard; and a conversion module for converting the unstructured document to be converted into multiple data modules according to the DMRL.
The processing module is used to insert labels of preset types at the corresponding positions in the unstructured document to be converted, so that each label corresponds to a data module of the corresponding type.
The labels include one of the following: <system>, <descript>, <proced>, <fault>, <process>. After the processing module inserts a label at the corresponding position in the unstructured document to be converted, the label includes a node type, a node name and node content.
The generation module is used to: obtain the node type and node name of each <system> label and configure the corresponding SNS code; obtain the node type and node name of each <descript>, <fault> and <process> label and configure the corresponding type code for each; and, according to the correspondence between node types and data modules and following the DMRL data format, automatically generate a DMRL that contains the node type, node name and code of each label.
The conversion module is used to: extract node content from the unstructured document according to the generated DMRL; and convert each extracted node content into a data module of the corresponding data format according to the IETM data format standard.
The beneficial effects of the present invention are as follows:
By inserting preset labels into the unstructured document, the present invention extracts the content of each category in the document and converts it into data modules of multiple types. The invention improves the efficiency of authoring IETMs and reduces the workload of writing them manually.
Brief description of the drawings
Fig. 1 is a flowchart of the data module conversion method for unstructured documents according to an embodiment of the present invention;
Fig. 2 is a flowchart of the steps for generating the DMRL according to an embodiment of the present invention;
Fig. 3 is a flowchart of the data module conversion steps according to an embodiment of the present invention;
Fig. 4 is a structural diagram of the data module conversion device for unstructured documents according to an embodiment of the present invention.
Detailed description of the embodiments
The present invention provides a data module conversion method and device for unstructured documents. The invention is described in further detail below with reference to the drawings and embodiments. It should be understood that the specific embodiments described here only explain the invention and do not limit it.
Fig. 1 is a flowchart of the data module conversion method for unstructured documents according to an embodiment of the present invention.
Step S110: select the unstructured document to be converted.
The unstructured document to be converted is a source document of the equipment, namely an equipment maintenance document. The equipment maintenance document needs to be converted into data modules that conform to the IETM standard.
Step S120: perform pre-labeling processing on the unstructured document to be converted and determine the conversion data target classification.
Pre-labeling processing means describing the unstructured document with labels, that is, inserting preset labels at the corresponding positions in the unstructured document. The labels identify nodes in the unstructured document and mark out the content of different categories. For example, labels are used to mark out the subject content, description content, fault content and process content of the unstructured document. The subject is the object mainly described in the unstructured document. For example, if chapter 1 of a document describes the host and chapter 2 describes the display, the subjects are the host and the display respectively.
The conversion data target classification refers to the data content in the unstructured document that corresponds to the different data modules, that is, the unstructured document after preset labels have been inserted and the content of different categories has been marked out.
Step S130: according to the determined conversion data target classification, generate a data module requirements list (Data Module Requirements List, DMRL for short) that conforms to the IETM standard.
The DMRL contains the basic information of the data modules required by the user, including the node type, node name and code of each label; see the description of Fig. 2 for details.
Step S140: convert the unstructured document to be converted into multiple data modules according to the generated DMRL.
The data modules are divided into different types, for example subject data modules, description data modules, fault data modules and process data modules. In other words, the data modules can be regarded as the result of converting the content of each category in the unstructured document separately.
Step S120 is detailed as follows.
Labels of preset types are inserted at the corresponding positions in the unstructured document to be converted, forming multiple nodes (labels), so that the node type of each node corresponds to a data module of the corresponding type.
Unstructured documents include formats such as WORD, PDF and CAJ. In this embodiment, the unstructured document is preferably in WORD format.
The labels inserted into the unstructured document include, but are not limited to: <system>, <descript>, <proced>, <fault>, <process>. Here <system> is the subject label, <descript> is the description label, <fault> is the fault label, and <process> is the process label.
Labels need to be defined in advance. Specifically, labels can be defined according to the data modules so that labels and data modules map to each other; the data content of the unstructured document is then naturally mapped to the corresponding data modules. Further, each label includes a node type and, depending on where the label is inserted, also a node name and node content. The node type is set according to the type of data module, the node name is set according to the document content at the insertion position, and the node content is the document content itself. The node content of different labels may correspond to the same data module or to different data modules.
For example: <system> is the subject label; it is inserted where a subject (for example, an equipment cabinet) appears and corresponds to the subject data module. <descript> is the description label; it is inserted where the equipment is described and corresponds to the description data module. <fault> is the fault label; it is inserted where an equipment fault is described and corresponds to the fault data module. <process> is the process label; it is inserted where the use of the equipment is described and corresponds to the process data module.
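The label-to-module correspondence described above can be written down as a simple lookup table. The following is a minimal Python sketch, not the patented implementation; the dictionary name, the function name and the module type strings are illustrative assumptions, and <proced> is left out because the text does not say which data module it maps to.

```python
# Hypothetical lookup table: label name -> data module type (names are assumptions).
LABEL_TO_MODULE_TYPE = {
    "system": "subject",
    "descript": "description",
    "fault": "fault",
    "process": "process",
}


def module_type_for(label: str) -> str:
    """Return the data module type for a label such as '<fault>' or '</fault>'."""
    name = label.strip("<>/ ").lower()  # drop angle brackets, slash and spaces
    return LABEL_TO_MODULE_TYPE[name]
```

An unknown label raises KeyError, which makes an undefined label visible early instead of silently producing a module of the wrong type.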
Labels can be inserted as follows. Take this excerpt from an unstructured document:
“……
1.3.6 Safety
The driver task terminal has a self-destruct function, which guarantees the security of important information.
……”
The following labels can be set for the above text:
“<system name="totality">
<descript>
1.3.6 Safety
The driver task terminal has a self-destruct function, which guarantees the security of important information.
</descript>
</system>”.
As another example, the unstructured document after label insertion is:
<system name="software">
4 Software subsystem design
<system name="technical information acquisition subsystem">
4.1 Technical information acquisition subsystem (information system transformation and upgrade)
<descript name="system structure">
4.1.1 System structure
<Para>
The information system adopts a C/S architecture and acquires technical information during development, production, design finalization and other stages, including parameters, pictures, design drawings, models of all kinds, design documents, technical manuals and other information that support technical data, training, maintenance and spares provisioning.
</Para>
</descript>
</system>
As another example, suppose chapter 1 and chapter 2 of a document describe different subjects, and each chapter contains a first section describing the equipment, a second section on equipment operation and a third section on equipment faults. When labels are inserted, <system> is inserted for chapter 1 and for chapter 2. Within chapter 1, <descript> is inserted for the first section, <process> for the second section and <fault> for the third section. Within chapter 2, <descript> is likewise inserted for the first section, <process> for the second section and <fault> for the third section.
Inserting labels at the corresponding positions turns the unstructured document into descriptive information content and at the same time splits the document. The description format can follow the S1000D standard for IETMs, with suitable extensions.
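As a rough illustration of the pre-labeling step, the sketch below wraps each chapter in a <system> label and each section in a <descript>, <process> or <fault> label. It assumes the chapters and sections have already been identified; the Chapter and Section classes and the insert_labels function are hypothetical names, not part of the patent or of S1000D.

```python
from dataclasses import dataclass, field


@dataclass
class Section:
    label: str  # "descript", "process" or "fault"
    name: str   # node name, taken from the section heading
    text: str   # node content


@dataclass
class Chapter:
    name: str   # node name of the <system> label
    sections: list = field(default_factory=list)


def insert_labels(chapters):
    """Return the document text with first- and second-level labels inserted."""
    lines = []
    for chapter in chapters:
        lines.append(f'<system name="{chapter.name}">')
        for section in chapter.sections:
            lines.append(f'<{section.label} name="{section.name}">')
            lines.append(section.text)
            lines.append(f"</{section.label}>")
        lines.append("</system>")
    return "\n".join(lines)
```

In practice, deciding which label fits each section is the part that needs human judgment or further rules; the sketch only shows how the labels end up around the text.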
If the unstructured document is in WORD format, it is accessed through the document data access plug-in provided by Office.
Step S130 is detailed as follows.
Fig. 2 shows the specific steps for generating the DMRL.
Step S210: in the unstructured document, obtain the node type and node name of each <system> label and configure the corresponding system breakdown code (Standard Numbering System, SNS for short).
The SNS code is based on a standardized coding scheme and is used to identify the equipment and its hierarchical breakdown. In other words, the unstructured document is converted into an SNS structure. <system> is a first-level label.
Step S220: in the unstructured document, obtain the node type and node name of each <descript>, <fault> and <process> label and configure the corresponding type code for each.
Labels such as <descript>, <fault> and <process> are second-level labels under the corresponding <system> label.
Step S230: according to the correspondence between node types and data modules and following the DMRL data format, automatically generate a DMRL containing the node type, node name and code of each label.
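Steps S210 to S230 can be sketched as a single pass over the labeled text. This is a simplified illustration under stated assumptions: the SNS codes are simply numbered in order of appearance, the type code values in TYPE_CODES are invented placeholders rather than values taken from the patent or from S1000D, and each DMRL entry is a plain dictionary rather than a real DMRL record.

```python
import re

# Placeholder type codes; the real values would come from the applicable standard.
TYPE_CODES = {"descript": "040", "fault": "410", "process": "100"}

# Matches opening labels only, with an optional name attribute.
OPEN_LABEL = re.compile(r'<(system|descript|fault|process)(?:\s+name="([^"]*)")?\s*>')


def generate_dmrl(labeled_text):
    """Collect node type, node name and code for every label in the text."""
    entries = []
    system_count = 0
    current_sns = None
    for match in OPEN_LABEL.finditer(labeled_text):
        node_type, node_name = match.group(1), match.group(2) or ""
        if node_type == "system":
            # S210: first-level label, configure an SNS code.
            system_count += 1
            current_sns = f"{system_count:02d}-00-00"
            code = current_sns
        else:
            # S220: second-level label, configure a type code under its system.
            if current_sns is None:
                raise ValueError(f"<{node_type}> appears outside any <system>")
            code = f"{current_sns}/{TYPE_CODES[node_type]}"
        # S230: one DMRL entry per label.
        entries.append({"node_type": node_type, "node_name": node_name, "code": code})
    return entries
```

Nested <system> labels are treated here as consecutive systems; a fuller version would track nesting depth and fill in the lower SNS levels instead.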
Further, the data module codes are corrected while the DMRL is generated. Each data module has a unique data module code, which contains an initial SNS code and an initial type code (IC code). After an SNS code has been configured for a <system> label and type codes have been configured for the <descript>, <fault> and <process> labels, the configured SNS code differs from the initial SNS code, and the configured type code differs from the corresponding initial type code. The initial SNS code and initial type code in the data module code therefore need to be replaced with the configured SNS code and type code.
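The correction described above amounts to replacing two fields of the data module code. The sketch below assumes a deliberately simplified code layout (model identifier, SNS code, type code); this layout and the names DataModuleCode and correct_code are illustrative assumptions and do not reproduce the full S1000D data module code structure.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class DataModuleCode:
    model: str      # hypothetical model identifier
    sns: str        # SNS code, initially a placeholder
    type_code: str  # type code (IC code), initially a placeholder

    def __str__(self):
        return f"{self.model}-{self.sns}-{self.type_code}"


def correct_code(initial, sns, type_code):
    """Return a copy of the code with the initial SNS and type codes replaced."""
    return replace(initial, sns=sns, type_code=type_code)
```

For example, an initial code DEMO-00-00-00-000 becomes DEMO-01-00-00-040 once the SNS code 01-00-00 and the type code 040 have been configured.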
Step S140 is detailed as follows.
According to the DMRL, the content under each label in the unstructured document (the node content) is extracted, each piece of extracted content is converted into a data module of the corresponding type that conforms to the IETM data format standard, and the data modules are managed in the form of a data module list.
Fig. 3 is a flowchart of the data module conversion steps according to an embodiment of the present invention.
Step S310: extract node content from the unstructured document according to the generated DMRL.
The node content is split out according to labels such as <descript>, <fault> and <process>. In other words, the unit of splitting is the data module.
For example, suppose chapter 1 and chapter 2 of a document describe different subjects, and each chapter contains a first section describing the equipment, a second section on equipment operation and a third section on equipment faults. When the content is split, the following are extracted according to the labels: node content A of chapter 1; node content B of chapter 2; node content C, D and E of the first, second and third sections of chapter 1; and node content F, G and H of the first, second and third sections of chapter 2.
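A minimal sketch of the extraction in step S310 follows. It assumes the second-level labels are not nested inside one another and uses a regular expression rather than an XML parser, because the labeled document is not guaranteed to be well-formed XML. The function name extract_nodes is hypothetical, and chapter-level (<system>) content is not extracted here.

```python
import re

# One second-level node: opening label, optional name, content, matching closing label.
NODE = re.compile(
    r'<(descript|fault|process)(?:\s+name="([^"]*)")?\s*>(.*?)</\1>',
    re.DOTALL,
)


def extract_nodes(labeled_text):
    """Return (node type, node name, node content) for each second-level label."""
    return [
        (match.group(1), match.group(2) or "", match.group(3).strip())
        for match in NODE.finditer(labeled_text)
    ]
```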
Step S320: convert each extracted node content into a data module of the corresponding data format according to the IETM data format standard.
For example, text content is converted according to the <para> data format, figures according to the <graphic> data format, and tables according to the <table> format.
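The per-content-type conversion can be illustrated with Python's standard xml.etree.ElementTree module. Only the element names para, graphic and table come from the text; the src attribute and the row and entry child elements are assumptions made for this sketch and may not match the schema actually used.

```python
import xml.etree.ElementTree as ET


def to_element(kind, content):
    """Build an XML element for one piece of node content.

    kind is "text", "figure" or "table"; content is a string for text,
    a file reference for figures, or a list of rows for tables.
    """
    if kind == "text":
        element = ET.Element("para")
        element.text = content
    elif kind == "figure":
        element = ET.Element("graphic", {"src": content})  # attribute name assumed
    elif kind == "table":
        element = ET.Element("table")
        for row in content:
            row_element = ET.SubElement(element, "row")       # child names assumed
            for cell in row:
                ET.SubElement(row_element, "entry").text = cell
    else:
        raise ValueError(f"unknown content kind: {kind}")
    return element
```

Calling ET.tostring(to_element("text", "hello"), encoding="unicode") yields the string <para>hello</para>.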
Step S330: manage the resulting data modules in a data module list.
Unlike the data module requirements list (DMRL), the data module list contains the basic information of all data modules, whereas the DMRL contains only the basic information of the data modules the user needs, namely the node type, node name and corresponding code of each label.
Step S320 is detailed as follows.
While the unstructured document is being converted into data modules, it can also be labeled further to increase the accuracy of the data modules.
According to preset sensitive word rules and synonym rules, sensitive words and synonyms are searched for in the unstructured document, and sensitive word labels are inserted at their positions. During conversion, the sensitive word labels are identified and the node content of each sensitive word label is converted into a data module of the corresponding data format; in other words, the content containing sensitive words and synonyms is converted into data modules of the corresponding data format.
Specifically, the selected unstructured document is parsed and its purpose is analyzed to determine the sensitive words in the document and the synonyms of each sensitive word, and the sensitive word rules and synonym rules are established. Table 1 shows an example of sensitive word rules and Table 2 an example of synonym rules; the rules are not limited to the contents of these tables.
Table 1: Sensitive word rules
Document name | Sensitive word | Format | Purpose |
Driver behavior illustrates .doc | Safety | <security>safety</security> | |
Driver behavior illustrates .doc | Task terminal | <endItem>task terminal</endItem> | |
… |
Table 2: Synonym rules
Sensitive word | Synonym one | Synonym two | Synonym X |
Safety | Safety | Security feature | … |
Task terminal | Terminal | Using terminal | … |
The sensitive word rules include multiple sensitive words, the name of the unstructured document in which each sensitive word appears, the format in which the sensitive word label is added for each sensitive word, and so on.
The synonym rules include one or more synonyms for each sensitive word. When a sensitive word label is added for a synonym, the same format is used as for the corresponding sensitive word.
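The way sensitive word rules and synonym rules work together can be sketched as follows. The two dictionaries mirror Table 1 and Table 2 in simplified form; their names, the function names and the choice of a single case-insensitive regular expression are assumptions made for illustration. Longer terms are tried first so that "task terminal" is labeled as a whole rather than only its synonym "terminal".

```python
import re

# Sensitive word -> label name (simplified from Table 1).
SENSITIVE_RULES = {"safety": "security", "task terminal": "endItem"}
# Sensitive word -> synonyms (simplified from Table 2).
SYNONYM_RULES = {
    "safety": ["security feature"],
    "task terminal": ["terminal", "using terminal"],
}


def build_pattern(rules, synonyms):
    """Compile one pattern for all terms; synonyms reuse their sensitive word's label."""
    term_to_label = {}
    for word, label in rules.items():
        for term in [word, *synonyms.get(word, [])]:
            term_to_label[term.lower()] = label
    terms = sorted(term_to_label, key=len, reverse=True)  # longest term first
    pattern = re.compile(
        r"\b(" + "|".join(map(re.escape, terms)) + r")\b", re.IGNORECASE
    )
    return pattern, term_to_label


def label_sensitive_words(text, rules=SENSITIVE_RULES, synonyms=SYNONYM_RULES):
    """Wrap every sensitive word or synonym found in text in its label."""
    pattern, term_to_label = build_pattern(rules, synonyms)

    def wrap(match):
        label = term_to_label[match.group(1).lower()]
        return f"<{label}>{match.group(1)}</{label}>"

    return pattern.sub(wrap, text)
```

Word-boundary matching suits English text; for Chinese source documents, which have no spaces between words, a different matching strategy would be needed.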
In one embodiment, the sensitive word rules can also be set so that sensitive word labels are added only to sensitive words within a preset range. To this end, conjunctive words and metadata are added to the sensitive word rules. A conjunctive word is a relational term that limits the range of the sensitive word, such as "greater than" or "less than"; metadata describes the value range of the sensitive word, for example MPa or Min. For example, in "air pressure greater than 5 MPa", "air pressure" is the sensitive word, "greater than" is the conjunctive word and "MPa" is the metadata.
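One possible reading of this range rule is a pattern that looks for the sensitive word followed by a conjunctive word, a number and a unit. The sketch below is only an interpretation of the "air pressure" example in the text; the RangeMatch type, the match_range function and the optional "is" between the words are assumptions.

```python
import re
from typing import NamedTuple, Optional


class RangeMatch(NamedTuple):
    sensitive_word: str
    conjunctive_word: str
    value: float
    metadata: str  # the unit, such as "MPa"


def match_range(text, sensitive_word, conjunctive_words, units) -> Optional[RangeMatch]:
    """Find '<sensitive word> [is] <conjunctive word> <number><unit>' in text."""
    conjunctive = "|".join(map(re.escape, conjunctive_words))
    unit = "|".join(map(re.escape, units))
    pattern = re.compile(
        rf"({re.escape(sensitive_word)})\s+(?:is\s+)?({conjunctive})"
        rf"\s+(\d+(?:\.\d+)?)\s*({unit})",
        re.IGNORECASE,
    )
    match = pattern.search(text)
    if match is None:
        return None
    return RangeMatch(match.group(1), match.group(2), float(match.group(3)), match.group(4))
```

A sensitive word label would then be added only where match_range returns a result, so that occurrences of "air pressure" without a stated limit are left unlabeled.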
Further, a preset sensitive word or synonym may or may not already be defined in the IETM standard. For sensitive words and synonyms without a definition, the IETM standard needs to be extended so that a corresponding description exists. For example, if "air pressure" has no definition in the IETM standard, the IETM data content needs to be redefined so that "air pressure" has one.
Through labeling and splitting the unstructured document, defining sensitive words and synonyms, generating the DMRL and converting to IETM, this embodiment provides an initial solution to the heavy workload and inconsistent data that arise when IETMs are generated from unstructured documents.
The present invention aims to solve the problem of converting unstructured data such as equipment maintenance documents into maintenance support data modules in an IETM (for example description, procedure, fault and maintenance planning data modules), effectively improving the efficiency of authoring IETMs and reducing the workload of writing them manually. Specifically: select the unstructured document to be analyzed (a WORD document) and perform preliminary labeling of the selected document content, classifying the conversion data targets by means of the labels; comb through the text data in the document, analyze how the data is used, and define the metadata, sensitive word and synonym rules for the data; once the conversion data target classification is determined, generate the DMRL (data module requirements list) required by the IETM; and, according to the DMRL, convert the analysis process data and result data into data formats that conform to the IETM data format standard and manage them in the form of a data module list.
The present invention also provides a data module conversion device for unstructured documents, as shown in Fig. 4.
The selection module 410 is used to select the unstructured document to be converted.
The processing module 420 is used to perform pre-labeling processing on the unstructured document to be converted and determine the conversion data target classification. Further, the processing module 420 is used to insert labels of preset types at the corresponding positions in the unstructured document to be converted, so that each label corresponds to a data module of the corresponding type. The labels include one of the following: <system>, <descript>, <proced>, <fault>, <process>. After the processing module 420 inserts a label at the corresponding position in the unstructured document to be converted, the label includes a node type, a node name and node content.
The generation module 430 is used to generate, according to the conversion data target classification, a data module requirements list (DMRL) that conforms to the interactive electronic technical manual (IETM) standard. Further, the generation module 430 is used to: obtain the node type and node name of each <system> label and configure the corresponding SNS code; obtain the node type and node name of each <descript>, <fault> and <process> label and configure the corresponding type code for each; and, according to the correspondence between node types and data modules and following the DMRL data format, automatically generate a DMRL that contains the node type, node name and code of each label.
The conversion module 440 is used to convert the unstructured document to be converted into multiple data modules according to the DMRL. Further, the conversion module 440 is used to: extract node content from the unstructured document according to the generated DMRL; and convert each extracted node content into a data module of the corresponding data format according to the IETM data format standard.
The functions of the device described in this embodiment have already been described in the method embodiments shown in Fig. 1 to Fig. 3. For details not covered in this embodiment, refer to the related description in the preceding embodiments, which is not repeated here.
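The division of the device into four modules can be pictured as four callables chained in the order of steps S110 to S140. This is a structural sketch only; the class name ConversionDevice, the field names and the function signatures are assumptions, and the modules are left as plug-in functions rather than real implementations.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class ConversionDevice:
    select: Callable[[str], str]          # selection module 410
    pre_label: Callable[[str], str]       # processing module 420
    generate_dmrl: Callable[[str], list]  # generation module 430
    convert: Callable[[str, list], list]  # conversion module 440

    def run(self, source: str) -> list:
        """Run the four modules in the order of steps S110 to S140."""
        document = self.select(source)
        labeled = self.pre_label(document)
        dmrl = self.generate_dmrl(labeled)
        return self.convert(labeled, dmrl)
```

Keeping each module behind a plain function signature means one stage, for example the labeling rules, can be replaced without touching the others.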
Although preferred embodiments of the present invention have been disclosed for illustrative purposes, those skilled in the art will recognize that various improvements, additions and substitutions are also possible. The scope of the present invention should therefore not be limited to the above embodiments.
Claims (10)
1. A data module conversion method for unstructured documents, characterized by comprising:
selecting an unstructured document to be converted;
performing pre-labeling processing on the unstructured document to be converted to determine the conversion data target classification, wherein the labels include one of the following: <system>, <descript>, <proced>, <fault>, <process>;
generating, according to the conversion data target classification, a data module requirements list (DMRL) that conforms to the interactive electronic technical manual (IETM) standard;
converting the unstructured document to be converted into multiple data modules according to the DMRL.
2. The method according to claim 1, characterized in that performing pre-labeling processing on the unstructured document to be converted to determine the conversion data target classification comprises:
inserting labels of preset types at the corresponding positions in the unstructured document to be converted, so that each label corresponds to a data module of the corresponding type.
3. The method according to claim 2, characterized in that, after a label is inserted at the corresponding position in the unstructured document to be converted, the label includes a node type, a node name and node content.
4. The method according to claim 3, characterized in that generating, according to the conversion data target classification, the DMRL that conforms to the IETM standard comprises:
obtaining the node type and node name of each <system> label and configuring the corresponding SNS code;
obtaining the node type and node name of each <descript>, <fault> and <process> label and configuring the corresponding type code for each;
according to the correspondence between node types and data modules and following the DMRL data format, automatically generating a DMRL that contains the node type, node name and code of each label.
5. The method according to claim 1, characterized in that converting the unstructured document to be converted into multiple data modules according to the DMRL comprises:
extracting node content from the unstructured document according to the generated DMRL;
converting each extracted node content into a data module of the corresponding data format according to the IETM data format standard.
6. The method according to claim 1, characterized in that the method further comprises:
searching the unstructured document for sensitive words and synonyms according to preset sensitive word rules and synonym rules, and inserting sensitive word labels at the positions of the sensitive words and synonyms;
wherein converting the unstructured document to be converted into multiple data modules according to the DMRL comprises: identifying the sensitive word labels and converting the node content of each sensitive word label into a data module of the corresponding data format.
7. a kind of data module reforming unit of non-structured document characterized by comprising
Chosen module, for selecting non-structured document to be transformed;
Processing module determines conversion data target for carrying out pre- labeling processing to the non-structured document to be transformed
Classification;The label includes following one:<system>,<descript>,<proced>,<fault>,<process>;
Generation module, for according to the conversion data target classification, generation to meet interactive electronic technical manual IETM standard
Data module list of requirements DMRL;
Conversion module, for converting multiple data modules for the non-structured document to be transformed according to the DMRL.
8. device as claimed in claim 7, which is characterized in that the processing module is used for:
The label of corresponding position insertion preset kind in the non-structured document to be transformed, makes each label correspond to phase
Answer the data module of type.
9. device as claimed in claim 8, which is characterized in that in the processing module that label insertion is described to be transformed
Non-structured document in corresponding position after, the label includes: node type, nodename and node content.
10. The device of claim 9, characterized in that
The generation module is used for:
Obtaining the node type and node name of each label that is <system>, and configuring the corresponding SNS code;
Obtaining the node type and node name of each label that is <descript>, <fault> or <process>, and configuring the corresponding
type code for each;
Automatically generating, according to the correspondence between node types and data modules and in the DMRL data format,
a DMRL that contains the node type, node name and code of each label;
The conversion module is used for:
Extracting node content from the non-structured document according to the generated DMRL;
Converting the extracted node contents, according to the IETM data format standard, into data modules of the corresponding
data formats.
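The generation and conversion steps of claim 10 might be sketched as follows. This is an assumption-laden illustration rather than the patented implementation: the code tables hold placeholder values that come from neither the patent nor any standard, and the `dataModule`, `title` and `content` element names are invented for the example and do not follow the real IETM schema.

```python
import xml.etree.ElementTree as ET

# Placeholder code tables (hypothetical values, not real SNS or type codes).
SNS_CODES = {"Fuel system": "SNS-001"}
TYPE_CODES = {"descript": "T-DESC", "fault": "T-FAULT", "process": "T-PROC"}


def generate_dmrl(nodes):
    """Build DMRL entries from (node_type, name, content) tuples."""
    dmrl = []
    for node_type, name, _content in nodes:
        # <system> labels get an SNS code keyed by name; others a type code.
        if node_type == "system":
            code = SNS_CODES.get(name, "")
        else:
            code = TYPE_CODES.get(node_type, "")
        dmrl.append({"node_type": node_type, "name": name, "code": code})
    return dmrl


def convert(dmrl, nodes):
    """Extract node content per DMRL entry and emit one XML string each."""
    content_of = {(t, n): c for t, n, c in nodes}
    modules = []
    for entry in dmrl:
        # Element and attribute names are illustrative only.
        dm = ET.Element("dataModule", type=entry["node_type"], code=entry["code"])
        ET.SubElement(dm, "title").text = entry["name"]
        ET.SubElement(dm, "content").text = content_of[
            (entry["node_type"], entry["name"])
        ]
        modules.append(ET.tostring(dm, encoding="unicode"))
    return modules
```

Keeping the DMRL as a list of plain dictionaries makes it easy to review or edit the list before conversion, which matches the claim's separation of the generation module from the conversion module.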
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410829893.0A CN105786921B (en) | 2014-12-26 | 2014-12-26 | A kind of the data module method for transformation and device of non-structured document |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105786921A (en) | 2016-07-20 |
CN105786921B (en) | 2019-06-18 |
Family
ID=56388701
Family Applications (1)
Application Number | Status | Publication | Priority Date | Filing Date | Title |
---|---|---|---|---|---|
CN201410829893.0A | Active | CN105786921B (en) | 2014-12-26 | 2014-12-26 | A kind of the data module method for transformation and device of non-structured document |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105786921B (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106294551A (en) * | 2016-07-25 | 2017-01-04 | 中国商用飞机有限责任公司 | System and comprehensive establishment management system is managed for the CIR of technical publications |
CN108021632B (en) * | 2017-11-23 | 2020-07-07 | 中国移动通信集团河南有限公司 | Mutual conversion processing method for unstructured data and structured data |
CN110119984A (en) * | 2018-02-07 | 2019-08-13 | 青岛农业大学 | A kind of processing system for international trade tick financing |
CN108710660A (en) * | 2018-05-11 | 2018-10-26 | 上海核工程研究设计院有限公司 | A kind of items property parameters modeling of database and storage method |
CN110990636A (en) * | 2019-12-18 | 2020-04-10 | 哈尔滨工程大学 | Intelligent data module acquisition and conversion method for diesel engine interactive electronic technical manual |
CN111666747A (en) * | 2020-05-29 | 2020-09-15 | 中国工程物理研究院计算机应用研究所 | Method for generating WORD document into description class data module conforming to S1000D standard |
CN111859863A (en) * | 2020-06-03 | 2020-10-30 | 远光软件股份有限公司 | Document structure conversion method and device, storage medium and electronic equipment |
CN112699641B (en) * | 2021-03-25 | 2021-07-20 | 南京国睿信维软件有限公司 | Method for quickly converting batch copy of WORD content to DM based on S1000D standard |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7849048B2 (en) * | 2005-07-05 | 2010-12-07 | Clarabridge, Inc. | System and method of making unstructured data available to structured data analysis tools |
CN101055578A (en) * | 2006-04-12 | 2007-10-17 | 龙搜(北京)科技有限公司 | File content dredger based on rule |
CN102207975A (en) * | 2011-06-24 | 2011-10-05 | 天津大学 | Method for manufacturing and displaying extensive makeup language (xml) data module based on ietm standard |
CN102982027A (en) * | 2011-09-02 | 2013-03-20 | 北大方正集团有限公司 | Method and device for abstracting contents in document |
CN103678625A (en) * | 2013-12-18 | 2014-03-26 | 北京航天测控技术有限公司 | Method and device for transforming interactive electronic technical manual data |
Also Published As
Publication number | Publication date |
---|---|
CN105786921A (en) | 2016-07-20 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||