TWI273443B - System and method for converting file's format - Google Patents

System and method for converting file's format Download PDF

Info

Publication number
TWI273443B
TWI273443B TW092134652A TW92134652A TWI273443B TW I273443 B TWI273443 B TW I273443B TW 092134652 A TW092134652 A TW 092134652A TW 92134652 A TW92134652 A TW 92134652A TW I273443 B TWI273443 B TW I273443B
Authority
TW
Taiwan
Prior art keywords
document
format
conversion
file
word
Prior art date
Application number
TW092134652A
Other languages
Chinese (zh)
Other versions
TW200519637A (en
Inventor
Chung-I Lee
Hai-Hong Lin
Bao-Sheng Luo
Original Assignee
Hon Hai Prec Ind Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hon Hai Prec Ind Co Ltd filed Critical Hon Hai Prec Ind Co Ltd
Priority to TW092134652A priority Critical patent/TWI273443B/en
Publication of TW200519637A publication Critical patent/TW200519637A/en
Application granted granted Critical
Publication of TWI273443B publication Critical patent/TWI273443B/en

Links

Abstract

A system and method for converting file's format is disclosed. The system comprises a plurality of client computers, an application programme server, a file receiving server and a database. The method comprises following steps: sending out a converting request of file to the application programme server; getting the corresponding file from the database; checking and judging the file's format; converting the file into format of extensible markup language; merging the converted file and the image file; returning an integrated extensible markup language format file to the database.

Description

12734431273443

案號 92134652 五、發明說明(1) 【發明所屬之技術領域】 本發明係關於-種文槽格式轉換技術,尤指一種 動將Word格式文檔轉換為可延伸性標示語言格式文檔 術。 ^ 【先前技術】 隨著資訊時代的到纟’不同的企業、用戶之間所 進打資訊的交流頻率越來越多,I是不同的企業、不 用戶之間由於使用習慣和軟體的不同,造成文槽的格式不 一,從而在進行文檔交換時造成不便。 現有技術中已有一些關於文槽格式的轉換方法,如 =國家知識產權局於2_年12月6日公開之公開號為 = 7575 2A之名稱為”網際網絡上資料庫的自動轉換 方法與系統"之專利巾請案,該巾請案揭露了一種可以= 網際網路用戶上傳之文標轉換^資料冑規定格式之文標進 =存儲的方法。該方法將用戶上傳的文檔進行檢查鱼^ Π = Ϊ排序組成固定格式的文檀。該方法雖然可以 I文檔格式的轉換,但是只能轉換為該資料庫所規定 =式,局限性較大,而且還將用戶的文檔重新拆分, 保持文檔的一致性與完整性。 月匕 再如中國國家知識產權局於2〇〇丨年9月26日公開之八 =號為CN 1 3 1 46 34Α之名稱為,,文件轉換方法、文件轉換Α ° 、以及文件顯示系統π之專利申請案,該申請案、♦及、一 ㈠文件轉換…先從由多個數據段: 、 文件中提取部分數據,然後將該部分數據顯示在Case No. 92134652 V. Description of the Invention (1) Technical Field of the Invention The present invention relates to a text format conversion technique, and more particularly to a method for converting a Word format document into an extensible markup language format. ^ [Prior Art] With the arrival of the information age, there are more and more exchanges of information between different companies and users. I are different companies, and users are not using the habits and software. The format of the text slot is different, which causes inconvenience in the exchange of documents. There are some methods for converting the format of the stencil in the prior art, for example, the public number of the database disclosed by the State Intellectual Property Office on December 6, 2, is 7575 2A. The system " patent towel request, the towel disclosure reveals a method that can be used by the Internet user to upload the document conversion ^ data 胄 the specified format of the text = storage method. This method will check the user uploaded document Fish ^ Π = Ϊ Sorting constitutes a fixed format of Wen Tan. Although this method can convert I document format, but can only be converted to the definition of the database, the limitation is greater, and the user's document is also split. , to maintain the consistency and integrity of the document. The same as the name of CN 1 3 1 46 34Α published by the State Intellectual Property Office of China on September 26, 2000, the file conversion method, File conversion Α °, and the patent application system of the file display system π, the application, ♦ and one (1) file conversion... first extract part of the data from multiple data segments: , and then display the partial data

第6頁 1273443Page 6 1273443

:個有限顯示能力的裝置上進行顯示。該申請案之不足在 =其只能在一個能力有限的顯示裝置上顯示部分數據,不 忐顯示完整的數據,且只能應用在網際網路瀏覽器上的超 文本鏈接標示 ^^(Hypertext Markup Language ’HTML) 上,而不能將文本格式文檔轉換為XML格式的文檔,局限 性較大,。 再一例子如中國國家知識產權局於2〇〇3年3月i 9曰公 ^之公開號為CN 1 403 95 0A之名稱為”電子文件自動轉換發 送的系統與方法”之專利申請案,該申請案揭露一種電子 文件的轉換方法,其可以將文件的編碼資訊進行轉換,例 如將簡體中文轉換為繁體中文,或者將繁體中文轉換為簡 體中文。該專利申請案之不足在於其只是進行文字編碼之 轉換,而不能將文本格式文檔轉換為XML格式的文槽。 最後一個例子如中國國家知識產權局於1 99 8年4月15 曰公開之公開號為CN 1 1 78948A之名稱為,’文件格式轉換方 法”之專利申請案’該申請案所揭露之技術可將個人電腦 (Personal Computer,PC)或筆記型電腦(N〇teb〇〇k: Display on a device with limited display capability. The shortcoming of this application is that it can only display part of the data on a limited-capacity display device, and does not display the complete data, and can only be applied to the hypertext link mark on the Internet browser ^^ (Hypertext Markup Language 'HTML', but can not convert text format documents into XML format documents, the limitations are greater. Another example is the patent application filed by the State Intellectual Property Office of China in March 2003, the publication number of CN 1 403 95 0A is "System and Method for Automatic Conversion and Transmission of Electronic Documents". The application discloses a method for converting an electronic file, which can convert the encoded information of the file, for example, convert Simplified Chinese to Traditional Chinese, or convert Traditional Chinese to Simplified Chinese. The shortcoming of this patent application is that it is only a conversion of text encoding, and cannot convert a text format document into a text formatted XML format. The last example is the patent application filed by the State Intellectual Property Office of China on April 15, 1999. The publication number of CN 1 1 78948A is 'the file format conversion method'. The technology disclosed in the application can be disclosed. PC (Personal Computer, PC) or laptop (N〇teb〇〇k

Personal Computer)上的文件資源轉換為一種可被袖珍型 個人電腦如CD機可讀取的格式。該專利申請案所揭露技術 之不足同樣在於不能將文本格式文檔轉換為XML格式的文 檔0 但是,在某些場合需要提交一種固定格式的文播,在 這種情況下’用戶往往需要重新進行文檔的重新錄^與編 輯,從而浪費用戶時間,造成不必要的工作量。The file resource on Personal Computer is converted to a format that can be read by a pocket-sized personal computer such as a CD player. The deficiencies of the technology disclosed in the patent application also lie in the inability to convert a text format document into a document in XML format. However, in some cases it is necessary to submit a fixed format text broadcast, in which case the user often needs to re-document Re-recording and editing, thus wasting user time and causing unnecessary work.

第7頁 1273443Page 7 1273443

案號 92134652 五、發明說明(3) 【發明内容】 本發明之要目的在於提供一種文樓轉換方法,其能將 用戶編輯過的Word格式的文檔轉換為〇L格式的文檔,滿 足用戶的不同需要。 / 本發明提供一種文檔轉換系統,該文檔轉換系統包括 複數客戶端電腦,一網路、一應用程式伺服器、一文檔接 收伺服器及一資料庫。每一客戶端電腦均提供一圖形^戶 "面,係用於進行文檔編輯,當需要進行文檔編輯時,客 戶端電腦發出一文檔傳輸請求。資料庫中存儲有各種格式 j檔:,包括Word格式之文檔,同時,在該資料庫中亦存 儲有文檔之摘要資訊。應用程式伺服器用於接收客戶 月I發送之文檔傳輸請求,傳輸對應文檔,執行文槽格式檢 =,分析文槽内容,並進行格式轉換,執行文權合併作 應用程式伺服器包括一傳輸請求接收模組,用於接收 =端電腦發出的文檔傳輸請求;一文檔獲取模組,用於 依^文檔傳輸請求從資料庫中獲取對應之文檔,·一文檔檢 查模組,用於對獲取之文檔格式進行檢查,包括文檔^式 =識別和檢查,判斷該文檔之格式是否為w〇rd格式·二文 =模組,用於對獲取之文檔内容進行分析,從而得到 m · 一 洛、正文段落、詳細描述 格犬的文1^轉換模組,用於將word格式的文檔轉換為 XML秸式的文檔,該格式轉換模組藉由一後臺運 執行文檀格式轉換,該後臺運行 . 式編寫語言編寫;—文#入供rf式係由Vlsuai “川程Case No. 92134652 V. SUMMARY OF THE INVENTION (3) SUMMARY OF THE INVENTION The object of the present invention is to provide a text conversion method capable of converting a user-edited document in Word format into a document in the 〇L format to satisfy different users. need. / The present invention provides a document conversion system including a plurality of client computers, a network, an application server, a document receiving server, and a database. Each client computer provides a graphic "face, which is used for document editing. When document editing is required, the client computer issues a document transfer request. The database contains various formats j files: including documents in Word format, and also contains summary information of the documents in the database. The application server is configured to receive a document transmission request sent by the customer's monthly I, transmit the corresponding document, perform a stencil format check, analyze the contents of the slot, and perform format conversion, and perform literary merging as the application server including a transmission request. The receiving module is configured to receive a document transmission request sent by the computer; a document obtaining module is configured to obtain a corresponding document from the database according to the document transmission request, and a document checking module is configured to obtain the The document format is checked, including the document type=identification and check, and the format of the document is determined to be w〇rd format·two text=module, which is used for analyzing the content of the obtained document, thereby obtaining m · one lo, the text Paragraph, detailed description of the dog's text 1 ^ conversion module, used to convert the word format document into XML straw type document, the format conversion module is executed by a background transfer text sandal format conversion, the background run. Writing language; - text #入为rf式系 by Vlsuai "川程

文检口併挺組’用於將轉換後的XML 1273443The text check port and the group 'for the converted XML 1273443

二上 文檔之附加圖檔合併,構成-個完整的 文檔。文檔接收伺服器係用於接收從應用程式伺服器 傳輸之文檔,該文檔係為經過格式轉換后之XML格式文 本發明還提供-種文槽轉換方法,其能將用戶編輯過 的Word格式的文檔轉換為XML格式的文檔,該文檔轉換方 法包括如下步驟:發出文檔傳輪請求;獲得對應文檔;檢 查文檔格式,判斷該文檔之格式是否為w〇rd格式;若經過 判斷知出該文檔為Word格式文檔,則將輸入文檔轉換為 XML格式文檔;合併該可延伸性標示語言文檔與圖檔;返 回完整的XML格式文檔。若判斷得出該文檔格式為其他非 Word格式之文檔,則直接結束操作流程。 藉由本發明提供之文檔轉換系統與方法,可實現將用 戶之Word格式文檔轉換為xML格式的文播。 【實施方式】 參閱第一圖所示,係本發明文檔轉換系統之實施環境 圖。該文檔轉換系統包括複數客戶端電腦丨〇,一網路丨i、 一應用程式伺服器1 2、一資料庫丨3及一文檔接收伺服器 14。每一客戶端電腦1〇均提供一圖形用戶介面(圖中未示 出)’係用於進行文檔編輯,當需要進行文檔編輯時,客 戶端電腦發出一文檔傳輸請求(圖中未示出),該文檔傳輸 請求被傳輸到應用程式伺服器1 2。資料庫1 3中存儲有各種 格式之文檔,包括Word格式之文檔,同時,在該資料庫13 中亦存儲有該文檔之摘要資訊。應用程式伺服器12用於接The additional files of the document on the second document are merged to form a complete document. The document receiving server is configured to receive a document transmitted from an application server, and the document is a formatted XML format text invention. The invention also provides a text slot conversion method, which can edit a user-edited document in Word format. Converting to a document in XML format, the document conversion method comprises the following steps: issuing a document routing request; obtaining a corresponding document; checking a document format, determining whether the format of the document is a w〇rd format; if it is determined that the document is Word The format document converts the input document into an XML format document; merges the extensibility markup language document with the image file; and returns the complete XML format document. If it is judged that the document format is another document other than Word format, the operation flow is directly ended. The document conversion system and method provided by the present invention can realize the conversion of the user's Word format document into the xML format. [Embodiment] Referring to the first figure, it is an implementation environment diagram of the document conversion system of the present invention. The document conversion system includes a plurality of client computers, a network, an application server 1, a database 丨3, and a document receiving server 14. Each client computer provides a graphical user interface (not shown) for editing documents. When document editing is required, the client computer issues a document transmission request (not shown). The document transfer request is transmitted to the application server 12. The database 13 stores documents in various formats, including documents in the Word format, and also stores summary information of the documents in the database 13. Application server 12 is used to connect

第9頁 1273443 __案號92134652 羊Γ月ί 〇曰 絛正_ 五、發明說明(5) 收客戶端電腦發送之文檔傳輸請求,執行文檔格式轉換, 該應用程式伺服器1 2位于文檔發送方。文檔接收伺服器j 4 係用於接收從應用程式祠服器1 2傳輸之文槽,該文樓係為 經過格式轉換后之XML格式文檔,該文檔接收伺服器丨4位 于文檔接收方。 參閱第二圖所示,係本發明文播轉換系統應用程式伺 服器之功能模組圖。該應用程式伺服器丨2係為文檔格式轉 換之控制中心,其接收從客戶端電腦1 〇傳輸之文槽傳輸請 求,該應用程式伺服器1 2包括一傳輸請求接收模組丨2 i ' 一文槽獲取模組122、一文槽檢查模組123、一文播分析模 組1 2 4、一格式轉換模組1 2 5及一文權合併模組1 2 6。傳輸 請求接收模組1 2 1係用於接收客戶端電腦1 〇傳輸之文槽傳 輸請求。文檔獲取模組1 22係用於依據文檔傳輸請求從資 料庫1 3中獲得對應之文檔。 ' 文檔檢查模組123用於對資料庫13中儲存之文播格式 進行檢查,包括文檔格式的識別和檢查,判斷該文槽是否 為Word格式之文檔。文檔分析模組1 24用於對獲得之文檀 内容進行分析,從而得到該文檔不同的段落,例如摘要段 落、正文段落、詳細描述段落等。格式轉換模組丨2 5用於 執行文檔格式轉換,將Word格式的文檔轉換為格式的 文檔,該格式轉換模組藉由一後臺運行之程式執行文播格 式轉換,該後臺運行程式係由Visual Basic程式編寫g古 編寫。 文檔合併模組126用於將轉換後的XML格式文標與w〇rdPage 9 1273344 __ Case No. 92134652 羊Γ月 〇曰绦 _ _ 5, invention description (5) Receive the document transmission request sent by the client computer, perform document format conversion, the application server 12 is located in the document transmission square. The document receiving server j 4 is for receiving a slot transmitted from the application server 12, which is a format-converted XML format document, and the document receiving server 4 is located at the document receiving side. Referring to the second figure, it is a functional module diagram of the server of the text broadcast conversion system application program of the present invention. The application server 丨 2 is a control center for document format conversion, and receives a creek transmission request transmitted from the client computer 1 , the application server 12 includes a transmission request receiving module 丨 2 i ' The slot acquisition module 122, the stencil checking module 123, the spoofing analysis module 142, a format conversion module 152 and a text merging module 126. The transmission request receiving module 1 2 1 is for receiving a request for transmission of the client computer 1 〇 transmission. The document acquisition module 1 22 is configured to obtain a corresponding document from the material library 13 according to the document transmission request. The document checking module 123 is configured to check the cipher format stored in the database 13, including the identification and checking of the document format, and determine whether the stencil is a document in the Word format. The document analysis module 1 24 is configured to analyze the obtained content of the text, thereby obtaining different paragraphs of the document, such as a summary paragraph, a body paragraph, a detailed description paragraph, and the like. The format conversion module 丨 2 5 is configured to perform a document format conversion, and convert a document in a Word format into a formatted document. The format conversion module performs a text format conversion by a program running in the background, and the background running program is executed by Visual. Basic programming is written in g ancient. The document merge module 126 is configured to convert the converted XML format logo with w〇rd

第10頁 1273443 ^ . _案號92134652 6/Γ朱β月,〇曰 條正_ 五、發明說明(6) 文檔之附加圖檔合併,構成一個完整的XML文檔,該附加 圖播系為Word文播内附加之圖播,該圖棺之格式可為標簽 圖像文件格式(Tagged Image Fi le,TIF)、標記圖像文件 格式(Tagged Image File Format,T IFF)點陣圖文件(B 土 tPage 10 1273344 ^ . _ Case No. 92134652 6 / Γ Zhu β month, 〇曰条正 _ five, invention description (6) document additional files merged to form a complete XML document, the additional map is a Word The image is attached to the text broadcast. The format of the image can be Tagged Image Fier (TIF) or Tagged Image File Format (T IFF) bitmap file (B t

Map,BMP)、圖像交換格式(Graphics InterchangeMap, BMP), image exchange format (Graphics Interchange

Format,GIF)、聯合圖形圖像專家組(j〇int ph〇t〇 Graphic Experts Group,JPEG)等格式。Format, GIF), J图形int ph〇t〇 Graphic Experts Group (JPEG) and other formats.

參閱第三圖所示,係本發明文檔轉換系統之資料庫中 摘要資訊表示意圖。該摘要資訊係為資料庫丨3中非結構化 資料之摘要資訊30 0,該摘要資訊3〇〇包括資料編號3〇1、 資料標題302、資料位置303、資料目錄3〇4及轉換日期 3。〇5。資料編號3〇1為一資料標示編號,用於應用程式伺服 裔1 2識別文檔之用,該資料編號為順序編號,且在資料庫 1 3中有序排列。資料標題30 2係為各種非結構化資料之標 題,包括文擋標題、圖像標題、聲音標題及影像標題。資 料位置303用於記錄資料庫13中不同的非結構化資料之存、 儲位置,該存儲位置表明了某項資料之詳細館存位置,例 如文檔123· doc之資料位置為c: \WinnUSystem32Referring to the third figure, it is a schematic diagram of a summary information table in the database of the document conversion system of the present invention. The summary information is the summary information 30 of the unstructured data in the database ,3, which includes the data number 3〇1, the data title 302, the data location 303, the data directory 3〇4, and the conversion date 3 . 〇 5. The data number 3〇1 is a data identification number for the application server to identify the document. The material number is sequential number and is ordered in the database 13. The title of the material 30 2 is the title of various unstructured materials, including the title of the document, the title of the image, the title of the sound, and the title of the image. The data location 303 is used to record the storage location of different unstructured data in the database 13, and the storage location indicates the detailed storage location of a certain data. For example, the data location of the document 123·doc is c:\WinnUSystem32

\123.doc。資料目錄304記錄某項資料之儲存目錄,轉換 曰』3 0 5記錄心以格式文檔轉換為XML格式文檔之轉換日 矜鳇示,係、本發明文檔轉換系統與方法之文 程圖。首先,傳輸請求接收模組⑴接收 客戶端電㈣發出的文檀傳輸請求( =\123.doc. The data directory 304 records the storage directory of a certain data, and converts the conversion date of the document into a document of the XML format, which is a textual diagram of the document conversion system and method of the present invention. First, the transmission request receiving module (1) receives the message transmission request sent by the client (4) (=

第11頁 1273443 案號 92134652 年g月/'次曰 條正_ 五、發明說明(7) 檔獲取模組1 22透過網路11從資料庫13獲取對應之文檔(步 驟S41),文播檢查模組123對上述所獲得之文棺執行格式 識別與檢查(步驟S42);判斷該文檔格式是否gW〇rd格式 (步驟S43);若經過檢查,判斷該文檔格式為#w〇rd格式 文槽’則直接結束轉換流程。若經過檢查,判斷該文標確 為Word文檔,則由文檔分析模組124執行文檔内容識別, 從而得到該文稽之不同段落,例如:摘要段落、正文段 落、詳細描述段落等,接著由格式轉換模組丨2 5將該文檔 從Word格式轉換為XML格式(步驟S44)。上述的格式轉換模 組125執行包括如下步驟:首先,由格式轉換模組丨25根據 上述的分析結果設定XML文檔中對應段落,將該w〇rd文標 中每一資料標題下對應段落文字複製並粘貼到XML格式文 播中對應的資料標題段落下,完成文權格式轉換,上述步 驟S44中文檔格式轉換係在一後臺運行程式之控制下完 成j該後臺運行程式係用Visuai Basic語言編寫。接著由 文槽合併模組1 2 6將轉換後的xml格式文檔與Word文檔中的 圖像進行合併,以構成一個完整的XML文檔(步驟S45),最 後返回該XML文槽到客戶端電腦丨〇(步驟S46),流程結束。 斤“上所述,本發明所提出之文檔轉換系統與方法確實 可符合發明專利要件,爰依法提出專利申請。惟,以上所 述f僅為本發明文檔轉換系統與方法之較佳實施例,舉凡 七悉本案技藝之人士,在參照本發明精神所作之等效修飾 或變化,皆應包含於以下之申請專利範圍内。 _ 111 1 11 11 11 11 圓 第12頁 1273443 修正 案號 92134652 圖式簡單說明 第一圖係本發明文檔轉換系統之實施環境圖。 第二圖係本發明文檔轉換系統應用程式伺服器之功能模組 圖。 第三圖係本發明文檔轉換系統之資料庫中摘要資訊表示意 圖。 第四圖係本發明文檔轉換系統與方法之文檔轉換與合併流 程圖。Page 11 1273344 Case No. 92134652 g month / 'Secondary 正 _ _ _, invention description (7) file acquisition module 1 22 through the network 11 from the database 13 to obtain the corresponding document (step S41), the text check The module 123 performs format identification and check on the obtained document (step S42); determines whether the document format is gW〇rd format (step S43); if it is checked, determines that the document format is #w〇rd format slot 'The process ends directly. If it is checked that the document is confirmed to be a Word document, the document analysis module 124 performs document content recognition, thereby obtaining different paragraphs of the document, such as: summary paragraph, body paragraph, detailed description paragraph, etc., followed by format The conversion module 丨 25 converts the document from the Word format to the XML format (step S44). The format conversion module 125 performs the following steps: First, the format conversion module 丨25 sets the corresponding paragraph in the XML document according to the analysis result, and copies the corresponding paragraph text in each data title in the w〇rd text label. And paste it into the corresponding data title paragraph in the XML format text broadcast to complete the text right format conversion. In the above step S44, the document format conversion is completed under the control of a background running program. The background running program is written in the Visuai Basic language. Then, the converted xml format document is merged with the image in the Word document by the squaring merge module 1 26 to form a complete XML document (step S45), and finally the XML snippet is returned to the client computer. 〇 (Step S46), the flow ends. As described above, the document conversion system and method proposed by the present invention can indeed meet the requirements of the invention patent, and the patent application is filed according to law. However, the above f is only a preferred embodiment of the document conversion system and method of the present invention. Equivalent modifications or variations made by those skilled in the art will be included in the scope of the following claims. _ 111 1 11 11 11 11 Round 12 1273343 Amendment 92134652 BRIEF DESCRIPTION OF THE DRAWINGS The first figure is an implementation environment diagram of the document conversion system of the present invention. The second figure is a functional module diagram of the application server of the document conversion system of the present invention. The third figure is a summary information in the database of the document conversion system of the present invention. The fourth figure is a flow chart of document conversion and merge of the document conversion system and method of the present invention.

第13頁Page 13

Claims (1)

12734431273443 伸性“ ’其可將word格式文檔轉換為可延 = ί = :式文檔,該文檔轉換系統包括: 1. 腦’係用於發出文檔傳輸請求; 貝’ ’,、中存儲不同格式之文檔; 一應用程式伺服器,包括: 一ίίΐί接收模組’用於接收客戶端電腦發送之 文檔傳輸請求; # 取模、且,用於根據文槽傳輸請求獲得所需 傳輸之文檔; 一 查模組,用於對上述所獲得之文檔進行文 棺格式的識別和檢查; 一文檔分析模組,用於對經過文檔格式檢查后之文 槽内容進行分析,獲得該文檔不同的段落; 一格式轉換模組,用於將經過文檔内容分析后之word 格式文檔轉換為可延伸性標示語言格式的文檔; 一文檔合併模組,用於將轉換後的可延伸性標示語 吕格式文檔與Word格式文檔中之圖檔合併,構成 一個完整的可延伸性標示語言格式文檔; 一文檔接收伺服器,用於接收從應用程式伺服器傳輸 之可延伸性標示語言格式文檔。 & 2 ·如申請專利範圍第1項所述之文檔轉換系統,其中的資 料庫中存儲之不同格式文檔包括Word格式之文槽。貝 3 ·如申請專利範圍第1項所述之文檔轉換系統,其中的格 式轉換模組係在一後臺運行程式之控制下完成格式轉°Extensibility "' converts word format documents into deferrable = ί = : style documents, the document conversion system includes: 1. The brain 'is used to issue document transfer requests; the shell ' ', stores documents in different formats An application server, comprising: an ίίΐί receiving module for receiving a document transmission request sent by a client computer; # modulo, and a document for obtaining a required transmission according to the slot transmission request; a group for identifying and checking the document format obtained by the above document; a document analysis module for analyzing the contents of the document after the document format check to obtain different paragraphs of the document; a module for converting a word format document that has been analyzed by the document content into a document in an extensible markup language format; a document merge module for converting the extensible markup language format document and the Word format document The files in the file merge to form a complete extensible markup language format document; a document receiving server for receiving the slave application The server transmits the extensibility markup language format document. & 2 · The document conversion system described in claim 1, wherein the different format documents stored in the database include the Word format. For example, the document conversion system described in claim 1 is characterized in that the format conversion module completes the format conversion under the control of a background running program. 第14頁 1273443Page 14 1273443 伸^ t檔轉換方法,其可將勖1^格式文檔轉換為可延 •払示m 3格式文檔,該文檔轉換方法包括以下步a conversion method for converting a 格式1^ format document into a descriptive m3 format document, the document conversion method comprising the following steps 該後苴運行程式係用Visual Basic語言編寫。 發出文檔傳輸請求; 獲取對應文檔; 'U 二文槽格式,判斷該文檔之格式是否為word格式; 若判斷得出該文檔確實為心以格式文檔,則將輸入 文播轉換為可延伸性標示語言格式文檔,合併該 可延伸性標示語言格式文檔與圖檔,返回完整的 可延伸性標示語言格式文檔; 右判斷得出該文檔格式為其它非word格式之文檔, 5 則直接結束操作流程。 申請專利範圍第4項所述之文檔轉換方法,其中的圖 ‘係為轉換之Word文檔中所包含之圖槽。 b ·如由 j .甲凊專利範圍第4項所述之文槽轉換方法,其中的文 槽轉換操作包括以下步驟: 根據文槽分析模組對所獲取文槽之分析結果設定可延 伸性標示語言格式文檔中對應之段落; 將該文檔中不同資料標題下對應段落文字複製並粘貼 到可延伸性標示語言格式文檔中對應的資料標題段 落下,完成文檔格式轉換。 7 ·如申請專利範圍第4項所述之文槽轉換方法,其中的 Word文檔係用美國微軟公司(MicrosoftThe post-running program is written in Visual Basic. Issuing a document transmission request; obtaining a corresponding document; 'U two-slot format, determining whether the format of the document is a word format; if it is determined that the document is indeed a format document, converting the input text to an extensibility mark The language format document merges the extensible markup language format document and the image file, and returns the complete extensible markup language format document; the right judgment results that the document format is other non-word format documents, and 5 directly ends the operation flow. The document conversion method described in claim 4 of the patent scope, wherein the figure ‘ is a groove included in the converted Word document. b. The stencil conversion method according to item 4 of the patent scope of the invention, wherein the slot conversion operation comprises the following steps: setting the extensibility indication according to the analysis result of the acquired stencil according to the stencil analysis module The corresponding paragraph in the language format document; copy and paste the corresponding paragraph text under the different material headings in the document to the corresponding data title paragraph in the extensible markup language format document, and complete the document format conversion. 7 · As described in the patent application scope 4, the word conversion method, the Word document is US Microsoft Corporation (Microsoft 1273443 修正 案號 92134652 六、申請專利範圍 Corporation,MS) 之文播編輯軟體MS Office系列軟 體編輯過的文檔。 ϋϋ 第16頁 1273443 修正 曰 -^MuJ2\U%2 车广日 / 旧、中文發明摘要(發明名稱:文槽轉換系統與方法) 續勺i ί=供—種文檔轉換系統與方法。該文檔轉換李 ,,'充。括稷數客戶端電腦、-應用程式伺服器、—文ί ϊ糸 文檔傳輸請求;ί取換:;包括以下步驟:發出 輸入文檔轉換為可延伸性標示語言:式文可: 伸性::語言格式文檔與圖檔;$回完整的性 語言格式文襠。藉由本發明提供之文播轉換系統及方J了 可將Word格式X標轉&為可延伸性標示語 高用戶工作效率。 五、(一)、本案代表圖為:第一一^一圖 (二)、本案代表圖之元件代表符號簡單說明: 無 六、英文發明摘要(發明名稱:System and Method for Converting File,s Format) A system and method for converting file,s format is disclosed· The system comprises a plurality of client computers 、 an application programe server 、 a file receiving server and a database· The method comprises following steps: sending out a converting request of file to the application programe server; getting the corresponding file from the database; checking and judging the1273443 Amendment Case No. 92134652 VI. Patent Application The document of the MS Office series software edited by the company, MS). Ϋϋ Page 16 1273443 Correction 曰 -^MuJ2\U%2 Che Guangri / Old, Chinese invention summary (invention name: slot conversion system and method) Continuation spoon i ί=Supply-document conversion system and method. The document converts Li,, 'charge. Including the client computer, the application server, the document transfer request, and the following steps: the input document is converted into an extensibility markup language: the text can be: Extensibility: Language format document and image file; $ back to the complete language format document. By means of the text-to-speech conversion system and the method provided by the present invention, the Word format X can be converted to & as an extensible markup for high user work efficiency. V. (1) The representative figure of the case is: the first one figure (2), the representative symbol of the representative figure of the case is a simple description: No. 6. English invention summary (invention name: System and Method for Converting File, s Format A system and method for converting file, s format is disclosed, the system includes a plurality of client computers, an application programe server, a file receiving server and a database· the method with the following steps: sending out a converting request of file to The application programe server; getting the corresponding file from the database; checking and judging the 第3頁Page 3
TW092134652A 2003-12-09 2003-12-09 System and method for converting file's format TWI273443B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW092134652A TWI273443B (en) 2003-12-09 2003-12-09 System and method for converting file's format

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW092134652A TWI273443B (en) 2003-12-09 2003-12-09 System and method for converting file's format

Publications (2)

Publication Number Publication Date
TW200519637A TW200519637A (en) 2005-06-16
TWI273443B true TWI273443B (en) 2007-02-11

Family

ID=38621529

Family Applications (1)

Application Number Title Priority Date Filing Date
TW092134652A TWI273443B (en) 2003-12-09 2003-12-09 System and method for converting file's format

Country Status (1)

Country Link
TW (1) TWI273443B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
MX2014008560A (en) 2012-01-23 2014-09-26 Microsoft Corp Formula detection engine.
US9330070B2 (en) 2013-03-11 2016-05-03 Microsoft Technology Licensing, Llc Detection and reconstruction of east asian layout features in a fixed format document

Also Published As

Publication number Publication date
TW200519637A (en) 2005-06-16

Similar Documents

Publication Publication Date Title
CN1801149B (en) Systems and methods for converting a formatted document to a web page
Asakawa et al. Annotation-based transcoding for nonvisual web access
Shreeves et al. Is “quality” metadata “shareable” metadata? The implications of local metadata practices for federated collections
US7996767B2 (en) System and method for generating electronic patent application files
EP1452966A2 (en) Method and system for enhancing the paste functionality of a software application
US7797389B2 (en) Monitoring and reporting usage of non-hypertext markup language e-mail campaigns
US20100281353A1 (en) Automated Annotating Hyperlinker
US7558830B2 (en) Method for tagging and tracking non-hypertext markup language based e-mail
US9177263B2 (en) Identifying and tracking grouped content in e-mail campaigns
US20050166143A1 (en) System and method for collection and conversion of document sets and related metadata to a plurality of document/metadata subsets
US7424669B2 (en) Automatic bibliographical information within electronic documents
US20110246452A1 (en) Trademark report with store layout diagram
US20090100023A1 (en) Information processing apparatus and computer readable information recording medium
CN105760501A (en) Document format conversion method and device
US20080022327A1 (en) System, method, and computer program product for remote printing
TWI273443B (en) System and method for converting file's format
Maurer et al. Transclusions in an html-based environment
TW201337605A (en) Multipurpose network editing page automatic conversion mechanism
KR101975111B1 (en) Mass webpage document transforming method, and system thereof
CN107066437B (en) Method and device for labeling digital works
US8170270B2 (en) Universal reader
PRADHAN Developing digital libraries: technologies and challenges
Coleman et al. SGML as a Framework for Digital Preservation and Access.
Hodge et al. Formats for digital preservation: A review of alternatives and issues
Exchange Portable Document Format (PDF)—Finally, a Universal Document Exchange Technology

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees