CN112232032B - Automatic conversion method for content style of docx document - Google Patents

Automatic conversion method for content style of docx document Download PDF

Info

Publication number
CN112232032B
CN112232032B CN202010923972.3A CN202010923972A CN112232032B CN 112232032 B CN112232032 B CN 112232032B CN 202010923972 A CN202010923972 A CN 202010923972A CN 112232032 B CN112232032 B CN 112232032B
Authority
CN
China
Prior art keywords
chart
original
template
style
file
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010923972.3A
Other languages
Chinese (zh)
Other versions
CN112232032A (en
Inventor
徐鹏
刘琦
郑维
邱枫
章恒靖
朱得元
乔正宇
刘恒昌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Comprehend Suzhou Information Technology Co ltd
Original Assignee
Comprehend Suzhou Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Comprehend Suzhou Information Technology Co ltd filed Critical Comprehend Suzhou Information Technology Co ltd
Priority to CN202010923972.3A priority Critical patent/CN112232032B/en
Publication of CN112232032A publication Critical patent/CN112232032A/en
Application granted granted Critical
Publication of CN112232032B publication Critical patent/CN112232032B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/151Transformation
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The application discloses a content style automatic conversion method of docx document, which pre-stores a template file, wherein the template file comprises template charts, when the original charts in the original document are converted, style characteristic information of the original charts is compared with style characteristic information of the template charts, after the matched template charts are found, the original charts are converted according to the template chart format, so that automatic batch conversion of the original charts is realized, manual operation is reduced, and efficiency is improved.

Description

Automatic conversion method for content style of docx document
Technical Field
The present application relates to the field of software technology, and in particular, to a method for automatically converting a content style of a docx document and a computer readable storage medium.
Background
In daily life and work, users often need to change the style in the document content. For example: the method comprises the steps of converting the content with the styles of Song Ti, xiaoqu and the like in text content into the styles of regular script, xiaoqu and the like, and converting the styles of charts of white background, black word and the like in documents into the styles of orange background, white word and the like. Such a need to present the same document in different fonts or colors is quite common in work and life.
The inventor finds that the conversion operation for the document content at present needs manual operation by a user in the process of realizing the application, and the batch operation of characters, tables and charts cannot be realized, so that the conversion mode is too complicated and the efficiency is lower.
The information disclosed in this background section is only for enhancement of understanding of the general background of the application and should not be taken as an acknowledgement or any form of suggestion that this information forms the prior art already known to a person of ordinary skill in the art.
Disclosure of Invention
The application aims to provide a content style automatic conversion method of a docx document, which can automatically convert chart styles.
In order to achieve the above object, the present application provides a method for automatically converting a content style of a docx document, comprising: and automatically converting the chart style of the docx document. Wherein, the automatic conversion of chart styles of docx documents comprises: reading an original chart in an original docx document, and extracting style characteristic information in the original chart; reading a template chart in a preset template file in the original docx document, and extracting style characteristic information in the template chart; comparing the style characteristic information of the extracted original chart with style characteristic information of a preset template chart, and selecting a template chart matched with the original chart; reading a chart.xml file in the original chart, reading a chart.xml file corresponding to the original chart in the template chart, deleting data of each < c: ser > node in the chart.xml file in the template chart, and writing data contained in style characteristic information of the original chart into a corresponding position of the template chart matched with the original chart; writing data contained in the non-style characteristic information in the original chart into the corresponding position of the template chart matched with the original chart; replacing the data reference of the template chart matched with the original chart with the data reference of the original chart, and storing the template chart with the replaced data reference into a specified folder; judging whether the template file has media file references and/or the me file references, if so, updating a char.xml.rels file in a template chart in the specified folder, and copying the media file and/or the me file referenced by the char.xml.rels file to the specified folder, otherwise, directly entering the next step; judging whether an unprocessed original chart exists in the original docx document, if so, returning to the first step for processing, otherwise, directly entering the next step; adding nodes which are included in the [ content_types ]. Xml file of the original docx document but are not included in the [ content_types ]. Xml file of the original docx document to the [ content_types ]. Xml file of the original docx document so as to form a first report, and storing the first report in the specified folder; and copying the original docx document, and compressing all contents in the designated folder into the copied original docx document according to the internal structure of the docx file, thereby obtaining a second report.
In an embodiment of the present application, reading an original chart in the original docx document, and extracting style characteristic information in the original chart includes: reading an uncompressed docx document by adopting a zip file packet realized based on a python code, and extracting an original chart file in the file; and analyzing the xml file in the original chart file by adopting the lxml package, thereby extracting the style characteristic information in the original chart file.
In an embodiment of the present application, reading a template chart in a preset template file in the original docx document, and extracting style feature information in the template chart includes: reading an uncompressed docx document by adopting a zip file package realized based on a python code, and extracting a preset template file and a template chart file in the template file; and analyzing the xml file in the template chart file by adopting the lxml package, thereby extracting the style characteristic information in the template chart file.
In one embodiment of the present application, the style characteristics of the original chart and the style characteristics of the template chart both adopt json data structures.
In one embodiment of the application, the style characteristics include one or more of barChart, lineChart, pie3Chart, pieChart, num, ser — num, dPt, marker, grouping, wherein barChart represents a bar graph, lineChart represents a line graph, pie3Chart represents a 3D Pie graph, pieChart represents a 2D Pie graph, num represents the number of axes, ser_num represents the number of graphics, dPt represents whether the graphics are marked, marker represents the style of the dots in the line graph, and grouping represents the placement of the columns in the bar graph.
In one embodiment of the present application, comparing the style characteristic information of the extracted original chart with style characteristic information of a preset template chart, and selecting a template chart matched with the original chart includes: selecting a template chart with the same num value as the original chart from preset template charts; selecting a template chart with a ser_num value larger than that of the original chart from the selected template charts; and selecting the template chart with the most identical style characteristic information with the original chart from the re-selected template charts as the template chart matched with the original chart.
In an embodiment of the present application, the automatic content style conversion method of the docx document further includes: the second report is automatically converted into text and form style,
wherein automatically converting text and form styles for the second report includes: reading a preset style corresponding relation file, wherein the style corresponding relation file records a text style conversion corresponding relation and a form style conversion corresponding relation between the original docx document and the template file; reading the text of the template file and the style information of the table by using a python_docx package; copying the text of the template file and the style information of the form to a style library of the second report; reading a paragraph list of the second report, and modifying style names applied to paragraphs in the second report according to the text style conversion corresponding relation; and reading a table list of the second report, and modifying the style names applied by the tables in the second report according to the table style conversion corresponding relation.
The application also provides a computer readable storage medium for executing the automatic conversion method of the content style of the docx document according to any one of the above embodiments.
Compared with the prior art, the automatic conversion method for the content style of the docx document pre-stores the template file, wherein the template file comprises the template chart, when the original chart in the original document is converted, the style characteristic information of the original chart is compared with the style characteristic information of the template chart, and after the matched template chart is found, the original chart is converted according to the template chart format, so that the automatic batch conversion of the original chart is realized, the manual operation is reduced, and the efficiency is improved. Preferably, in an embodiment, the template file further includes a style correspondence file, so that the text and the table in the original document are automatically converted in batches according to the mapping relationship, and the conversion efficiency is further improved.
Drawings
FIG. 1 is a step composition of a method for automatically converting a content style of a docx document according to an embodiment of the present application;
FIG. 2 is an original graph according to an embodiment of the present application;
FIG. 3 is a template chart of matching according to an embodiment of the present application;
FIG. 4 is a diagram after conversion of an original diagram according to an embodiment of the present application;
FIG. 5 is a diagram of an automatic conversion step composition of text and form styles for the second report, according to an embodiment of the present application;
fig. 6 is a style correspondence file according to an embodiment of the present application.
Description of the embodiments
The following detailed description of embodiments of the application is, therefore, to be taken in conjunction with the accompanying drawings, and it is to be understood that the scope of the application is not limited to the specific embodiments.
Throughout the specification and claims, unless explicitly stated otherwise, the term "comprise" or variations thereof such as "comprises" or "comprising", etc. will be understood to include the stated element or component without excluding other elements or components.
Fig. 1 is a schematic diagram of a docx document according to an embodiment of the present application, and the method for automatically converting a content style of the docx document mainly includes steps S1 to S10.
In step S1, an original chart in an original docx document is read, and style characteristic information in the original chart is extracted. Specifically, reading an original chart in the original docx document, and extracting style characteristic information in the original chart includes: reading an uncompressed docx document by adopting a zip file packet realized based on a python code, and extracting an original chart file in the file; and analyzing the xml file in the original chart file by adopting the lxml package, thereby extracting the style characteristic information in the original chart file.
And in step S2, reading a template chart in a preset template file in the original docx document, and extracting style characteristic information in the template chart.
Specifically, reading a template chart in a preset template file in the original docx document, and extracting style characteristic information in the template chart includes: reading an uncompressed docx document by adopting a zip file package realized based on a python code, and extracting a preset template file and a template chart file in the template file; and analyzing the xml file in the template chart file by adopting the lxml package, thereby extracting the style characteristic information in the template chart file.
It should be noted that, the style features of the original chart and the style features of the template chart all adopt json data structures. Style characteristics include one or more of barChart, lineChart, pie3Chart, pieChart, num, ser — num, dPt, marker, grouping, where barChart represents a histogram, lineChart represents a line graph, pie3Chart represents a 3D Pie graph, pieChart represents a 2D Pie graph, num represents the number of axes, e.g., "num=0" may represent no style type in the graph, "num=1" may represent a single y-axis as the style type, "num=2" may represent dual y-axes as the style type, ser_num represents the number of graphics, e.g., the number of polylines, etc., dPt represents whether a portion of the graphics is marked, e.g., a dashed line is used for a portion of the polyline, marker represents the style of a point in the line graph, e.g., the size, color, shape, etc., and grouping represents the columnar placement in the histogram, e.g., horizontal placement, stacking placement, etc.
Selecting a template chart matched with the original chart style to be converted in the step S3: specifically, the style characteristic information of the extracted original chart is compared with style characteristic information of a preset template chart, and the template chart matched with the original chart is selected.
Specifically, comparing the style characteristic information of the extracted original chart with style characteristic information of a preset template chart, and selecting a template chart matched with the original chart includes: selecting a template chart with the same num value as the original chart from preset template charts; selecting a template chart with a ser_num value larger than that of the original chart from the selected template charts; and selecting the template chart with the most identical style characteristic information with the original chart from the re-selected template charts as the template chart matched with the original chart.
It should also be noted that if there is no matching template chart, the outer frame of the chart is removed, prompting the user that the chart has no format conversion because there is no suitable template.
Writing corresponding style characteristic data into the template chart in step S4: reading a chart.xml file in the original chart, reading a chart.xml file corresponding to the original chart in the template chart, deleting data of each < c: ser > node in the chart.xml file in the template chart, and writing data contained in style characteristic information of the original chart into a corresponding position of the template chart matched with the original chart. Wherein the < c: ser > node contains data of one broken line in the broken line graph. The corresponding location of the writing of the template chart is determined by the format of the char.xml file.
Writing corresponding non-style feature data into the template chart in step S5: and writing the data contained in the non-style characteristic information in the original chart into the corresponding position of the template chart matched with the original chart. The data in the non-style characteristic information includes: legend text, axis label, axis unit, etc.
Replacing the data references in the template chart in step S6: replacing the data reference of the template chart matched with the original chart with the data reference of the original chart, and storing the template chart with the replaced data reference in a specified folder.
One point to be noted is that the chart file in the docx file has an associated file chart. Xml. Rels, and an excel file must be referred to in the chart. Xml, and the chart data is stored, and the reference mode is that: the method comprises the steps of writing an excel in a chat.xml, then declaring an Id, and then referencing the Id in the chat.xml, wherein the data referencing refers to the reference of the excel, and replacing the data referencing Id of the graph template with the data referencing Id of the original report in the chat.xml in conversion, so that the chat.xml of the original graph is replaced by the chat.xml of the modified template graph, and the correct referencing relationship of the new graph is ensured.
The char.xml.rels file in the template chart is updated in step S7: judging whether the media file references and/or the me file references exist in the template files, if so, updating the char.xml.rels files in the template chart in the specified folder, copying the media files and/or the me files referenced by the char.xml.rels files to the specified folder, and otherwise, directly entering the next step S8.
In step S8, it is determined whether an unprocessed original chart still exists in the original docx document, if so, the first step is returned to process, otherwise, the next step S9 is directly performed.
In step S9, a node included in the [ content_types ]. Xml file of the original docx document but not included in the [ content_types ]. Xml file of the original docx document is added to the [ content_types ]. Xml file of the original docx document to form a first report, and is saved to the specified folder.
In step S10, the original docx document is copied, all contents in the designated folder are compressed into the docx document according to the internal structure of the docx file, and the same file is replaced or a new file is added, so that a second report is obtained. The second report thus far completes the automatic conversion of the chart style.
For ease of understanding, fig. 2 shows an original chart of the present embodiment, fig. 3 is a template chart to which the original chart is matched, and fig. 4 is a chart after the original chart is converted into a pattern.
Through the embodiment, automatic conversion of chart styles and conversion of batch charts are realized, manual operation can be greatly reduced, and conversion efficiency is improved.
Preferably, in an embodiment, in order to further improve the conversion efficiency, the automatic conversion method for content styles of the docx document further includes: and carrying out automatic conversion of text and form style on the second report. Fig. 5 is a diagram showing an automatic conversion procedure of text and form style for the second report according to the present embodiment.
In step S20, a preset style correspondence file is read, where a text style conversion correspondence and a table style conversion correspondence between the original docx document and the template file are recorded in the style correspondence file. Fig. 6 is a style correspondence file of the present embodiment, where column a is the text and form style list of the original docx document and column B is the text and form style list of the template file.
The python_docx package is used to read the text of the template file and the style information of the table in step S21.
And in step S22, copying the text of the template file and the style information of the table into the style library of the second report.
In step S23, the paragraph list of the second report is read, and the style name applied to the paragraphs in the second report is modified according to the text style conversion correspondence.
In step S24, the table list of the second report is read, and the style names applied by the tables in the second report are modified according to the table style conversion correspondence.
The final processed second report is saved in step S25, thereby completing the automatic conversion of the text and the form.
The embodiment not only realizes the batch automatic conversion of the charts, but also realizes the batch automatic conversion of the characters and the tables, thereby greatly improving the conversion efficiency and reducing the manual operation.
Based on the same inventive concept, the present embodiment also provides a computer readable storage medium for performing the content style automatic conversion method of the docx document according to any one of the above embodiments.
It will be appreciated by those skilled in the art that embodiments of the present application may be provided as a method, system, or computer program product. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The foregoing descriptions of specific exemplary embodiments of the present application are presented for purposes of illustration and description. It is not intended to limit the application to the precise form disclosed, and obviously many modifications and variations are possible in light of the above teaching. The exemplary embodiments were chosen and described in order to explain the specific principles of the application and its practical application to thereby enable one skilled in the art to make and utilize the application in various exemplary embodiments and with various modifications as are suited to the particular use contemplated. It is intended that the scope of the application be defined by the claims and their equivalents.

Claims (6)

1. A method for automatically converting a content style of a docx document, comprising: chart style automatic conversion is carried out on the docx document,
wherein, the automatic conversion of chart styles of docx documents comprises:
reading an original chart in an original docx document, and extracting style characteristic information in the original chart;
reading a template chart in a preset template file in the original docx document, and extracting style characteristic information in the template chart;
comparing the style characteristic information of the extracted original chart with style characteristic information of a preset template chart, and selecting a template chart matched with the original chart;
reading a chart.xml file in the original chart, reading a chart.xml file corresponding to the original chart in the template chart, deleting data of each < c: ser > node in the chart.xml file in the template chart, and writing data contained in style characteristic information of the original chart into a corresponding position of the template chart matched with the original chart;
writing data contained in the non-style characteristic information in the original chart into the corresponding position of the template chart matched with the original chart;
replacing the data reference of the template chart matched with the original chart with the data reference of the original chart, and storing the template chart with the replaced data reference into a specified folder;
judging whether the template file has media file references and/or the me file references, if so, updating a char.xml.rels file in a template chart in the specified folder, and copying the media file and/or the me file referenced by the char.xml.rels file to the specified folder, otherwise, directly entering the next step;
judging whether an unprocessed original chart exists in the original docx document, if so, returning to the first step for processing, otherwise, directly entering the next step;
adding nodes which are included in the [ content_types ]. Xml file of the original docx document but are not included in the [ content_types ]. Xml file of the original docx document to the [ content_types ]. Xml file of the original docx document so as to form a first report, and storing the first report in the specified folder; and
copying the original docx document, and compressing all contents in the designated folder into the copied original docx document according to the internal structure of the docx file, so as to obtain a second report;
wherein the style characteristics include one or more of barChart, lineChart, pie3Chart, pieChart, num, ser — num, dPt, marker, grouping, wherein barChart represents a histogram, lineChart represents a line graph, pie3Chart represents a 3D Pie Chart, pieChart represents a 2D Pie Chart, num represents the number of axes, ser_num represents the number of graphics, dPt represents whether the graphics are marked, marker represents the style of the dots in the line graph, and grouping represents the placement of the columns in the histogram;
comparing the style characteristic information of the extracted original chart with style characteristic information of a preset template chart, and selecting a template chart matched with the original chart comprises the following steps:
selecting a template chart with the same num value as the original chart from preset template charts;
selecting a template chart with a ser_num value larger than that of the original chart from the selected template charts;
and selecting the template chart with the most identical style characteristic information with the original chart from the re-selected template charts as the template chart matched with the original chart.
2. The method for automatically converting a content style of a docx document according to claim 1, wherein reading an original chart in the original docx document, and extracting style characteristic information in the original chart comprises:
reading an uncompressed docx document by adopting a zip file packet realized based on a python code, and extracting an original chart file in the file;
and analyzing the xml file in the original chart file by adopting the lxml package, thereby extracting the style characteristic information in the original chart file.
3. The method for automatically converting content styles of docx documents according to claim 1, wherein reading a template chart in a preset template file in the original docx document, and extracting style characteristic information in the template chart comprises:
reading an uncompressed docx document by adopting a zip file package realized based on a python code, and extracting a preset template file and a template chart file in the template file;
and analyzing the xml file in the template chart file by adopting the lxml package, thereby extracting the style characteristic information in the template chart file.
4. The method for automatically converting content styles of docx documents according to claim 1, wherein style characteristics of the original chart and style characteristics of the template chart all adopt json data structures.
5. The method for automatically converting a content style of a docx document according to claim 1, wherein the method for automatically converting a content style of a docx document further comprises: the second report is automatically converted into text and form style,
wherein automatically converting text and form styles for the second report includes:
reading a preset style corresponding relation file, wherein the style corresponding relation file records a text style conversion corresponding relation and a form style conversion corresponding relation between the original docx document and the template file;
reading the text of the template file and the style information of the table by using a python_docx package;
copying the text of the template file and the style information of the form to a style library of the second report;
reading a paragraph list of the second report, and modifying style names applied to paragraphs in the second report according to the text style conversion corresponding relation;
and reading a table list of the second report, and modifying the style names applied by the tables in the second report according to the table style conversion corresponding relation.
6. A computer-readable storage medium for performing the content style automatic conversion method of the docx document according to any one of claims 1 to 5.
CN202010923972.3A 2020-09-04 2020-09-04 Automatic conversion method for content style of docx document Active CN112232032B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010923972.3A CN112232032B (en) 2020-09-04 2020-09-04 Automatic conversion method for content style of docx document

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010923972.3A CN112232032B (en) 2020-09-04 2020-09-04 Automatic conversion method for content style of docx document

Publications (2)

Publication Number Publication Date
CN112232032A CN112232032A (en) 2021-01-15
CN112232032B true CN112232032B (en) 2023-08-18

Family

ID=74116477

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010923972.3A Active CN112232032B (en) 2020-09-04 2020-09-04 Automatic conversion method for content style of docx document

Country Status (1)

Country Link
CN (1) CN112232032B (en)

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1801137A (en) * 2004-12-20 2006-07-12 微软公司 Method and system for applying a global formatting scheme to a chart in an electronic document
CN102147791A (en) * 2011-04-02 2011-08-10 数字时代出版设计有限公司 Automatic book typesetting method and system
CN103186510A (en) * 2011-12-30 2013-07-03 北大方正集团有限公司 Document format transforming method and device
CN104239284A (en) * 2014-09-15 2014-12-24 广州市西美信息科技有限公司 Method and device for automatic image-text composition
CN104699664A (en) * 2015-03-19 2015-06-10 徐州工业职业技术学院 Type setting system and method with independent templates
CN108647305A (en) * 2018-05-09 2018-10-12 天津字节跳动科技有限公司 document format conversion method and device
CN108733638A (en) * 2017-04-17 2018-11-02 北大方正集团有限公司 The structural method of WORD contributions and the structurizer of WORD contributions
CN109815450A (en) * 2017-11-20 2019-05-28 北大方正集团有限公司 The method and device of XML file conversion Word file
CN110032720A (en) * 2018-12-28 2019-07-19 万康源(天津)基因科技有限公司 A kind of visualization report typesetting and automatic generation method and system based on XML
CN110738035A (en) * 2019-09-18 2020-01-31 平安科技(深圳)有限公司 document template generation method and device
CN111062187A (en) * 2019-11-27 2020-04-24 北京计算机技术及应用研究所 Structured parsing method and system for docx format document
CN111159995A (en) * 2020-01-16 2020-05-15 江西金格科技股份有限公司 Method for generating word document in template mode
CN111368523A (en) * 2018-12-26 2020-07-03 嘉太科技(北京)有限公司 Method and device for converting layout format of movie and television script

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1801137A (en) * 2004-12-20 2006-07-12 微软公司 Method and system for applying a global formatting scheme to a chart in an electronic document
CN102147791A (en) * 2011-04-02 2011-08-10 数字时代出版设计有限公司 Automatic book typesetting method and system
CN103186510A (en) * 2011-12-30 2013-07-03 北大方正集团有限公司 Document format transforming method and device
CN104239284A (en) * 2014-09-15 2014-12-24 广州市西美信息科技有限公司 Method and device for automatic image-text composition
CN104699664A (en) * 2015-03-19 2015-06-10 徐州工业职业技术学院 Type setting system and method with independent templates
CN108733638A (en) * 2017-04-17 2018-11-02 北大方正集团有限公司 The structural method of WORD contributions and the structurizer of WORD contributions
CN109815450A (en) * 2017-11-20 2019-05-28 北大方正集团有限公司 The method and device of XML file conversion Word file
CN108647305A (en) * 2018-05-09 2018-10-12 天津字节跳动科技有限公司 document format conversion method and device
CN111368523A (en) * 2018-12-26 2020-07-03 嘉太科技(北京)有限公司 Method and device for converting layout format of movie and television script
CN110032720A (en) * 2018-12-28 2019-07-19 万康源(天津)基因科技有限公司 A kind of visualization report typesetting and automatic generation method and system based on XML
CN110738035A (en) * 2019-09-18 2020-01-31 平安科技(深圳)有限公司 document template generation method and device
CN111062187A (en) * 2019-11-27 2020-04-24 北京计算机技术及应用研究所 Structured parsing method and system for docx format document
CN111159995A (en) * 2020-01-16 2020-05-15 江西金格科技股份有限公司 Method for generating word document in template mode

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
论文自动排版系统的设计与实现;郑根创;中国优秀硕士学位论文,信息科技辑(2013年第05期);I138-1608 *

Also Published As

Publication number Publication date
CN112232032A (en) 2021-01-15

Similar Documents

Publication Publication Date Title
JP5502745B2 (en) Merging documents
US20050165847A1 (en) Data processing method and apparatus
US20070136660A1 (en) Creation of semantic objects for providing logical structure to markup language representations of documents
CN110705237B (en) Automatic document generation method, data processing device and storage medium
CN104636428A (en) Trademark recommendation method and device
CN105488471A (en) Character pattern recognition method and device
CN115391439A (en) Document data export method, device, electronic equipment and storage medium
CN113032580B (en) Associated file recommendation method and system and electronic equipment
CN114529933A (en) Contract data difference comparison method, device, equipment and medium
CN111984666A (en) Database access method and device, computer readable storage medium and computer equipment
WO2011074942A1 (en) System and method of converting data from a multiple table structure into an edoc format
CN112232032B (en) Automatic conversion method for content style of docx document
CN105512096A (en) Optimization method and device based on file embedded font
CN110516220B (en) Report data input method, system and related equipment
CN111833408A (en) Hierarchical coding and tracing method for image operation information
CN116110065A (en) Method, device, computer equipment and storage medium for generating question card data
CN106897434A (en) A kind of data lead-in method, apparatus and system
JP2016018279A (en) Document file search program, document file search device, document file search method, document information output program, document information output device, and document information output method
CN112947908A (en) Code generation method, device, equipment and storage medium
CN113127408A (en) Data conversion method and device
CN111143356A (en) Report retrieval method and device
CN114048352B (en) Method for converting XML file into structure with indefinite length
CN112818642B (en) Method and device for converting PDF (Portable document Format) file into form file, computer equipment and storage medium
JP2022045559A (en) Document retrieval system, document retrieval method, and program
CN103019665A (en) Receipt opening method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant