CN100365621C - Files conversion system and method - Google Patents

Files conversion system and method

Info

Publication number
CN100365621C
Authority
CN
China
Prior art keywords
file
document
format
conversion
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB2003101125843A
Other languages
Chinese (zh)
Other versions
CN1627288A (en)
Inventor
李忠一
林海洪
罗宝胜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hongfujin Precision Industry Shenzhen Co Ltd
Hon Hai Precision Industry Co Ltd
Original Assignee
Hongfujin Precision Industry Shenzhen Co Ltd
Hon Hai Precision Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2003-12-10
Filing date
2003-12-10
Publication date
2008-01-30
Application filed by Hongfujin Precision Industry Shenzhen Co Ltd and Hon Hai Precision Industry Co Ltd
Priority to CNB2003101125843A
Publication of CN1627288A
Application granted
Publication of CN100365621C
Anticipated expiration
Expired - Fee Related

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The present invention provides a file conversion system and method. The system comprises a plurality of client computers, an application server, a file receiving server, and a database. The method comprises the following steps: sending a file transfer request; obtaining the corresponding file; checking the file's format; converting the input file into an Extensible Markup Language (XML) format file; merging the XML-format file with the graphic files; and returning the complete XML-format file. With this system and method, a Word-format file can be converted into an XML-format file, improving the user's work efficiency.

Description

Document conversion system and method
[technical field]
The invention relates to file format conversion technology, and in particular to a technology that automatically converts a Word-format file into an Extensible Markup Language (XML) format file.
[background technology]
With the arrival of the information age, enterprises and users need to exchange information with one another more and more frequently. However, because usage habits and software differ between enterprises and between users, file formats differ as well, which makes exchanging files inconvenient.
The prior art includes several file format conversion methods. For example, the patent application published by the State Intellectual Property Office of China on December 6, 2000 under publication number CN 1275752A, titled "Method and system for automatic conversion of database storage on the World Wide Web", discloses a method that converts files uploaded by World Wide Web users into the format prescribed by a database and stores them. The method checks and parses the uploaded file and then rearranges it into a file of a fixed format. Although this method can convert file formats, it can only convert to the format prescribed by that database, which is a significant limitation. It also splits the user's file apart, so the consistency and integrity of the file cannot be preserved.
As another example, the patent application published by the State Intellectual Property Office of China on September 26, 2001 under publication number CN1314634A, titled "Document conversion method, file converter, and document display system", relates to a document conversion method that first extracts part of the data from a file made up of a plurality of data segments and then presents that data on a device with limited display capability. Its shortcomings are that it shows only part of the data on such a device rather than the complete data, that it applies only to Hypertext Markup Language (HTML) in World Wide Web browsers, and that it cannot convert a text-format file into an XML-format file, so its limitations are considerable.
A further example is the patent application published by the State Intellectual Property Office of China on March 19, 2003 under publication number CN 1403950A, titled "System and method for automatic conversion and transmission of electronic files". This application discloses a conversion method for electronic files that converts a file's character encoding, for example from Simplified Chinese to Traditional Chinese or from Traditional Chinese to Simplified Chinese. Its shortcoming is that it only converts character encodings and cannot convert a text-format file into an XML-format file.
A final example is the patent application published by the State Intellectual Property Office of China on April 15, 1998 under publication number CN 1178948A, titled "File format conversion method". The technology it discloses can convert file resources on a personal computer (PC) or notebook computer into a format that can be read by pocket-sized devices such as CD players. Its shortcoming is likewise that it cannot convert a text-format file into an XML-format file.
However, on some occasions a file must be submitted in a fixed format. In such cases the user often has to re-enter and re-edit the file, which wastes the user's time and creates unnecessary work.
[summary of the invention]
The main purpose of the present invention is to provide a document conversion method that converts a Word-format file edited by the user into an XML-format file, meeting the user's different needs.
The invention provides a document conversion system comprising a plurality of client computers, a network, an application server, a file receiving server, and a database. Each client computer provides a graphical user interface for editing files; when a file needs to be edited, the client computer sends a file transfer request. The database stores files of various formats, including Word-format files, together with summary information for those files. The application server receives the file transfer requests sent by the client computers, transmits the corresponding files, checks their format, analyzes their content, performs the format conversion, and carries out the file merge.
The application server comprises: a transfer request receiving module, which receives the file transfer requests sent by the client computers; a file acquisition module, which obtains the corresponding file from the database according to the file transfer request; a file checking module, which identifies and inspects the format of the obtained file and judges whether it is the Word format; a file analysis module, which analyzes the content of the obtained file to obtain its different paragraphs, for example the abstract paragraph, the text paragraph, and the detailed description paragraph; a format conversion module, which converts the Word-format file into an XML-format file by means of a background program written in the Visual Basic programming language; and a file merging module, which merges the converted XML-format file with the graphic files attached to the Word file to form a complete XML file.
The file receiving server receives the file transmitted from the application server, which is the XML-format file produced by the format conversion.
The present invention also provides a document conversion method that converts a Word-format file edited by the user into an XML-format file. The method comprises the following steps: sending a file transfer request; obtaining the corresponding file; checking the file format to judge whether it is the Word format; if the file is judged to be a Word-format file, converting the input file into an XML-format file; merging the XML file with the graphic files; and returning the complete XML-format file. If the file is judged to be in some other, non-Word format, the operation flow ends directly.
With the document conversion system and method provided by the invention, a user's Word-format file can be converted into an XML-format file.
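The method steps summarized above can be sketched as a single control function. This is a minimal illustration, not the patent's implementation (which the description attributes to a Visual Basic background program). The function name `convert_document` and its four callable parameters are hypothetical stand-ins for the acquisition, checking, conversion, and merging steps.

```python
from typing import Callable, Optional


def convert_document(
    request_id: str,
    fetch: Callable[[str], bytes],
    is_word: Callable[[bytes], bool],
    to_xml: Callable[[bytes], str],
    merge_images: Callable[[str, bytes], str],
) -> Optional[str]:
    """Run the conversion steps in order; return None for non-Word input.

    All four callables are assumed helpers supplied by the caller.
    """
    data = fetch(request_id)             # obtain the requested file
    if not is_word(data):                # check the file format
        return None                      # non-Word input: end the flow
    xml_text = to_xml(data)              # convert Word content to XML
    return merge_images(xml_text, data)  # merge graphics, return complete XML
```

Passing the steps in as callables keeps the control flow separate from any particular storage or parsing technique, which the source does not specify.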
[description of drawings]
Fig. 1 is a diagram of the implementation environment of the document conversion system of the present invention.
Fig. 2 is a functional block diagram of the application server of the document conversion system of the present invention.
Fig. 3 is a schematic diagram of the summary information in the database of the document conversion system of the present invention.
Fig. 4 is a flowchart of file conversion and merging in the document conversion system and method of the present invention.
[embodiment]
Referring to Fig. 1, a diagram of the implementation environment of the document conversion system of the present invention is shown. The system comprises a plurality of client computers 10, a network 11, an application server 12, a database 13, and a file receiving server 14. Each client computer 10 provides a graphical user interface (not shown) for editing files; when a file needs to be edited, the client computer sends a file transfer request (not shown), which is transmitted to the application server 12. The database 13 stores files of various formats, including Word-format files, together with summary information for those files. The application server 12 receives the file transfer requests sent by the client computers and performs the file format conversion; it is located on the file sender side. The file receiving server 14 receives the file transmitted from the application server 12, which is the XML-format file produced by the format conversion; it is located on the file receiver side.
Referring to Fig. 2, a functional block diagram of the application server of the document conversion system of the present invention is shown. The application server 12 is the control center of the file format conversion and receives the file transfer requests sent from the client computers 10. It comprises a transfer request receiving module 121, a file acquisition module 122, a file checking module 123, a file analysis module 124, a format conversion module 125, and a file merging module 126. The transfer request receiving module 121 receives the file transfer requests sent by the client computers 10. The file acquisition module 122 obtains the corresponding file from the database 13 according to the file transfer request.
The file checking module 123 checks the format of the file stored in the database 13, including identifying and inspecting the file format, and judges whether the file is a Word-format file. When the obtained file is in Word format, the file analysis module 124 analyzes its content to obtain the file's different paragraphs, for example the abstract paragraph, the text paragraph, and the detailed description paragraph. The format conversion module 125 performs the format conversion, converting the Word-format file into an XML-format file; the conversion is carried out by a background program written in the Visual Basic programming language.
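The description does not say how the file checking module recognizes a Word file. One common approach, shown here purely as an assumption, is to test the leading bytes: legacy binary .doc files are stored in the OLE2 Compound File container, which begins with the eight-byte signature D0 CF 11 E0 A1 B1 1A E1. The function name `looks_like_word_doc` is hypothetical.

```python
# Signature that opens every OLE2 Compound File, the container used by
# legacy binary Word (.doc) documents.
OLE2_SIGNATURE = bytes.fromhex("D0CF11E0A1B11AE1")


def looks_like_word_doc(data: bytes) -> bool:
    """Return True if the data starts with the OLE2 container signature.

    Assumption: this is a necessary but not sufficient test, because other
    legacy Office formats (for example .xls and .ppt) share the container.
    """
    return data[:8] == OLE2_SIGNATURE
```

A stricter check would inspect the streams inside the container; that is outside the scope of this sketch.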
The file merging module 126 merges the converted XML-format file with the graphic files attached to the Word file to form a complete XML file. The attached graphic files are the graphic files embedded in the Word file, and they may be in formats such as Tagged Image File (TIF), Tagged Image File Format (TIFF), bitmap (BMP), Graphics Interchange Format (GIF), or Joint Photographic Experts Group (JPEG).
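The source does not describe how the graphic files are carried inside the merged XML file. As one possible illustration, the sketch below appends each graphic as a base64-encoded element. The function name `merge_figures` and the element names `figures` and `figure` are assumptions made for this example, not names from the patent.

```python
import base64
import xml.etree.ElementTree as ET
from typing import Dict


def merge_figures(xml_text: str, figures: Dict[str, bytes]) -> str:
    """Append graphic files to a converted XML document.

    `figures` maps a file name (e.g. "fig1.gif") to the raw image bytes.
    The <figures>/<figure> layout is a hypothetical schema.
    """
    root = ET.fromstring(xml_text)
    container = ET.SubElement(root, "figures")
    for name, payload in figures.items():
        fig = ET.SubElement(
            container, "figure", {"name": name, "encoding": "base64"}
        )
        # Binary image data is not valid XML text, so encode it as base64.
        fig.text = base64.b64encode(payload).decode("ascii")
    return ET.tostring(root, encoding="unicode")
```

Embedding keeps the result a single self-contained file; referencing the graphics by path instead would be an equally plausible reading of "merge".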
Referring to Fig. 3, a schematic diagram of the summary information in the database of the document conversion system of the present invention is shown. This is the summary information 300 for the unstructured data in the database 13, and it comprises a data number 301, a data title 302, a data position 303, a file directory 304, and a conversion date 305. The data number 301 is an identifying number that the application server 12 uses to identify files; the numbers are sequential and are arranged in order in the database 13. The data title 302 is the title of an item of unstructured data, such as a file title, image title, sound title, or video title. The data position 303 records where each item of unstructured data is stored in the database 13, giving its detailed storage location; for example, the data position of the file 123.doc is C:\Winnt\System32\123.doc. The file directory 304 records the storage directory of an item of data, and the conversion date 305 records the date on which a Word-format file was converted into an XML-format file.
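The five summary fields can be pictured as one record per stored item. The sketch below is an assumed in-memory representation only; the class name `SummaryInfo`, the field names, and the helper `next_data_number` are hypothetical, and the source does not describe the actual database schema.

```python
from dataclasses import dataclass
from datetime import date
from typing import Iterable, Optional


@dataclass
class SummaryInfo:
    data_number: int      # 301: sequential identifying number
    data_title: str       # 302: title of the unstructured data item
    data_position: str    # 303: detailed storage location
    file_directory: str   # 304: storage directory
    conversion_date: Optional[date] = None  # 305: unset until converted


def next_data_number(records: Iterable[SummaryInfo]) -> int:
    """Return the next sequential data number (1 for an empty collection)."""
    return max((r.data_number for r in records), default=0) + 1
```

Leaving `conversion_date` empty until conversion happens is a design choice of this sketch, so that unconverted Word files can be told apart from converted ones.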
Referring to Fig. 4, a flowchart of file conversion and merging in the document conversion system and method of the present invention is shown. First, the transfer request receiving module 121 receives the file transfer request sent by a client computer 10 (step S40). The file acquisition module 122 then obtains the corresponding file from the database 13 through the network 11 (step S41), and the file checking module 123 identifies and inspects the format of the obtained file (step S42) and judges whether it is the Word format (step S43). If the check shows that the file is not a Word-format file, the conversion flow ends directly. If the check shows that the file is indeed a Word file, the file analysis module 124 identifies the file's content to obtain its different paragraphs, for example the abstract paragraph, the text paragraph, and the detailed description paragraph, and the format conversion module 125 then converts the file from Word format to XML format (step S44). The format conversion module 125 proceeds as follows: according to the analysis result, it sets up the corresponding sections in the XML file, then copies the paragraph text under each title in the Word file and pastes it into the corresponding title section of the XML-format file, which completes the format conversion. The format conversion in step S44 is carried out under the control of a background program written in Visual Basic. Next, the file merging module 126 merges the converted XML-format file with the images in the Word file to form a complete XML file (step S45). Finally, the XML file is returned to the client computer 10 (step S46), and the flow ends.
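The analysis and conversion in step S44 amount to grouping paragraph text under its title and copying each group into the matching section of the XML file. The sketch below illustrates that idea on plain text lines that are assumed to have been extracted from the Word file already; it does not parse Word files, and it is written in Python rather than the Visual Basic named in the source. The title list, the tag names, and both function names are hypothetical.

```python
import xml.etree.ElementTree as ET
from typing import Dict, List

# Assumed mapping from section titles in the Word file to XML tag names.
SECTION_TAGS = {
    "Abstract": "abstract",
    "Text": "text",
    "Detailed Description": "detailed-description",
}


def split_sections(lines: List[str]) -> Dict[str, List[str]]:
    """Group non-empty lines under the most recent recognized title."""
    sections: Dict[str, List[str]] = {}
    current = None
    for line in lines:
        stripped = line.strip()
        if stripped in SECTION_TAGS:
            current = stripped
            sections[current] = []
        elif stripped and current is not None:
            sections[current].append(stripped)
    return sections


def sections_to_xml(sections: Dict[str, List[str]]) -> str:
    """Copy each section's paragraphs into the matching XML element."""
    root = ET.Element("document")
    for title, paragraphs in sections.items():
        node = ET.SubElement(root, SECTION_TAGS[title])
        for paragraph in paragraphs:
            ET.SubElement(node, "p").text = paragraph
    return ET.tostring(root, encoding="unicode")
```

Calling `sections_to_xml(split_sections(lines))` yields one element per recognized title, each holding its paragraphs in their original order.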

Claims (4)

1. A document conversion system that can convert a Word-format file into an Extensible Markup Language (XML) format file, characterized in that it comprises:
A plurality of client computers, used to send file transfer requests;
A database, which stores files of different formats and also stores summary information for those files;
An application server, comprising:
A transfer request receiving module, used to receive the file transfer requests sent by the client computers;
A file acquisition module, used to obtain the file to be transmitted according to the file transfer request;
A file checking module, used to identify and inspect the format of the obtained file and judge whether it is a Word-format file;
A file analysis module, used, when the file is a Word-format file, to analyze the content of the file that has passed the format check and obtain the file's different paragraphs;
A format conversion module, used to copy the file's different paragraphs and paste them into the corresponding title sections of the XML-format file, completing the format conversion;
A file merging module, used to merge the converted XML-format file with the graphic files of the Word-format file to form a complete XML-format file; and
A file receiving server, used to receive the XML-format file transmitted from the application server.
2. The document conversion system as claimed in claim 1, characterized in that the format conversion module completes the format conversion under the control of a background program written in the Visual Basic language.
3. A document conversion method that can convert a Word-format file into an Extensible Markup Language (XML) format file, characterized in that it comprises the following steps:
Sending a file transfer request;
Obtaining the corresponding file;
Checking the file format and judging whether the file is in Word format;
If the file is judged to be a Word-format file, analyzing the content of the file that has passed the format check to obtain the file's different paragraphs, copying those paragraphs and pasting them into the corresponding title sections of the XML-format file, merging the XML-format file with the graphic files, and returning the complete XML-format file; and
If the file is judged to be in some other, non-Word format, ending the operation flow directly.
4. The document conversion method as claimed in claim 3, characterized in that the graphic files are the graphic files contained in the Word file being converted.
CNB2003101125843A 2003-12-10 2003-12-10 Files conversion system and method Expired - Fee Related CN100365621C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2003101125843A CN100365621C (en) 2003-12-10 2003-12-10 Files conversion system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2003101125843A CN100365621C (en) 2003-12-10 2003-12-10 Files conversion system and method

Publications (2)

Publication Number Publication Date
CN1627288A CN1627288A (en) 2005-06-15
CN100365621C true CN100365621C (en) 2008-01-30

Family

ID=34759828

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2003101125843A Expired - Fee Related CN100365621C (en) 2003-12-10 2003-12-10 Files conversion system and method

Country Status (1)

Country Link
CN (1) CN100365621C (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101610277B (en) * 2008-06-18 2012-07-04 中兴通讯股份有限公司 Method for processing information transmission
CN102111569B (en) * 2009-12-28 2015-06-17 新奥特(北京)视频技术有限公司 Method and system for collecting and broadcasting stock information
CN101833567A (en) * 2010-03-31 2010-09-15 北京志腾新诺科技有限公司 Document conversion method, device and system
CN101867397A (en) * 2010-03-31 2010-10-20 宇龙计算机通信科技(深圳)有限公司 Data transmission method based on Bluetooth and system, receiving terminal and transmitting terminal
CN102402541B (en) * 2010-09-14 2015-02-11 赛恩倍吉科技顾问(深圳)有限公司 File analysis system and method
CN101980183B (en) * 2010-09-17 2013-12-18 深圳万兴信息科技股份有限公司 Method for analyzing Word file information and system thereof
CN105159869B (en) * 2011-05-23 2020-06-16 成都科创知识产权研究所 Picture editing method and system
CN102831151B (en) * 2012-06-28 2015-07-08 华为技术有限公司 Method and device for generating electronic document
CN103399842A (en) * 2013-07-03 2013-11-20 惠州Tcl移动通信有限公司 File processing method and system in wireless communication equipment
CN106534267A (en) * 2016-10-19 2017-03-22 中国银行股份有限公司 File uploading and resolving method and device
CN106557657A (en) * 2016-11-21 2017-04-05 北京市农林科学院 A kind of GWAS analysis methods and device based on GEMMA

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003196267A (en) * 2001-12-13 2003-07-11 Bebright Corp Method and system for on-line distributing printed matter such as teaching material
JP2003216626A (en) * 2002-01-21 2003-07-31 Mitsubishi Electric Corp Structured document processing apparatus, method and program

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
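Building a graphic-and-text information system using XML technology. Yu Changhong, Li Yong. Journal of Liaoning Normal University (Natural Science Edition), Vol. 24, No. 4. 2001 *
Semantics-based data format conversion. Hao Yanan. Hebei University collected papers (June 2003). 2003 *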

Also Published As

Publication number Publication date
CN1627288A (en) 2005-06-15

Similar Documents

Publication Publication Date Title
JP4687803B2 (en) Communication terminal device
US20060101007A1 (en) Information processing apparatus and method, and recording medium
JP3918230B2 (en) Data update monitoring server
CN100365621C (en) Files conversion system and method
CN101866342B (en) Method and device for generating or displaying webpage label and information sharing system
CN101551800B (en) Marked information generation device, inquiry unit and sharing system
CN100440127C (en) Method and apparatus for printing web page
US20060230100A1 (en) Web content transcoding system and method for small display device
EP3226159A1 (en) System and method for managing browsing histories of web browser
JP2004258911A (en) Server, method for collecting information, and program
CN102177515A (en) Methods, systems and devices for transcoding and displaying electronic documents
CN101087308A (en) Information processing system, information processing apparatus, information processing method, and computer program
US20020184269A1 (en) Document management systems for and methods of sharing documents
US20010002471A1 (en) System and program for processing special characters used in dynamic documents
JP2005234837A (en) Structured document processing method, structured document processing system and its program
JP4073536B2 (en) Securities information display device
JP5466133B2 (en) Document search apparatus with image and document search program with image
CN101145936B (en) A method and system for adding tags in Web pages
JP5245629B2 (en) Relay device, communication relay method, program thereof, and relay system
JP4752020B2 (en) Character string acquisition method and character string acquisition system
JP4241757B2 (en) Communication terminal device
JP4890310B2 (en) Information processing system, information output device, information registration device, information processing method in information processing system, and information processing program in information processing system
JP4571849B2 (en) Content providing system, content providing method, program for executing the method, and recording medium storing the program
JP2007087426A (en) Communication terminal apparatus
KR101864291B1 (en) Method of Managing and Optimizing Page Coorperating with PageSpeedInsights

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20080130

Termination date: 20141210

EXPY Termination of patent right or utility model