TW201411372A - Multi-file checking system and method thereof - Google Patents

Multi-file checking system and method thereof Download PDF

Info

Publication number
TW201411372A
TW201411372A TW101132921A TW101132921A TW201411372A TW 201411372 A TW201411372 A TW 201411372A TW 101132921 A TW101132921 A TW 101132921A TW 101132921 A TW101132921 A TW 101132921A TW 201411372 A TW201411372 A TW 201411372A
Authority
TW
Taiwan
Prior art keywords
file
format
vocabulary
rule
decoding
Prior art date
Application number
TW101132921A
Other languages
Chinese (zh)
Other versions
TWI456413B (en
Inventor
Chao-Lung Chiang
Chun-Liang Chen
yu-hua Yu
Chi-Tsung Huang
Original Assignee
Univ Nan Kai Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Univ Nan Kai Technology filed Critical Univ Nan Kai Technology
Priority to TW101132921A priority Critical patent/TWI456413B/en
Publication of TW201411372A publication Critical patent/TW201411372A/en
Application granted granted Critical
Publication of TWI456413B publication Critical patent/TWI456413B/en

Links

Landscapes

  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)

Abstract

A multi-file checking system and method thereof is disclosed to solve the problem without spell-checking for multi-file. By analyzing format of the multi-file and loading decoding rule, and then comparing content of the multi-file and words for generating a spelling report. The mechanism is to improve the efficiency and convenience of spell-checking.

Description

多文件檢查系統及其方法 Multi-file inspection system and method thereof

一種文件檢查系統及其方法,特別是指一種同時對多文件進行拼寫檢查的多文件檢查系統及其方法。 A file inspection system and method thereof, in particular, a multi-file inspection system and method for spell checking multiple files at the same time.

近年來,隨著資訊科技的蓬勃發展以及環保意識的抬頭,使得以往紙本文件逐漸邁向電子化,因此,處理電子化文件(以下簡稱文件)的各種文書處理軟體便如雨後春筍般的產生。 In recent years, with the rapid development of information technology and the rise of environmental awareness, the paper documents have gradually become electronic. Therefore, various software processing software for processing electronic documents (hereinafter referred to as documents) has sprung up.

一般而言,文書處理軟體的基本功能便是提供使用者進行文字的編輯,甚至能夠進行圖表或簡易試算表的處理。不過,隨著文書處理軟體的發展,人們已經不能滿足文書處理軟體所提供的基本功能,因此,如何增加文書處理軟體的功能便成為各家廠商急欲解決的難題。 In general, the basic function of the word processing software is to provide the user with the ability to edit the text, and even to process the chart or simple spreadsheet. However, with the development of software processing software, people can no longer meet the basic functions provided by the software processing software. Therefore, how to increase the function of the software processing software has become an urgent problem for various manufacturers.

有鑑於此,有廠商提出具有拼寫檢查功能的文書處理軟體,以便使用者進行文字編輯時,能夠同時提示使用者所輸入的詞彙是否拼寫錯誤,以降低拼寫錯誤的可能性。然而,由於文書處理軟體僅能針對當前所開啟的文件進行拼寫檢查,故具有無法同時對多文件進行拼寫檢查之問題。另一方面,當多文件具有不同格式時,因為各種文書處理軟體所支援的文件格式不一,同樣容易因相容性造成拼寫檢查錯誤,且進行拼寫檢查若需實際開啟文件,除了耗費效能之外,更造成使用者的不便。 In view of this, some manufacturers have proposed a word processing software with a spell check function, so that when the user performs text editing, the user can simultaneously prompt the user whether the entered vocabulary is misspelled to reduce the possibility of misspelling. However, since the word processing software can only perform spell checking on the currently opened file, there is a problem that it is impossible to perform spell checking on multiple files at the same time. On the other hand, when multiple files have different formats, because the file formats supported by various word processing software are different, it is also easy to cause spell checking errors due to compatibility, and if the file is actually opened, it is not only cost effective. In addition, it is more inconvenient for the user.

綜上所述,可知先前技術中長期以來一直存在無法同時對多文件進行拼寫檢查之問題,因此有必要提出改進的技術手段,來解決此一問題。 In summary, it can be seen that there has been a problem in the prior art that it is impossible to perform spell checking on multiple files at the same time, and therefore it is necessary to propose an improved technical means to solve this problem.

有鑒於先前技術存在的問題。本發明遂揭露一種多文件檢查系統及其方法。 In view of the problems of the prior art. The present invention discloses a multi-file inspection system and method thereof.

本發明所揭露之多文件檢查系統,包含:拼字資料庫、載入模組、分析模組及檢查模組。其中,拼字資料庫用以儲存詞彙;載入模組用以載入文件檔案;分析模組用以分析並記錄文件檔案的格式於分析結果;檢查模組根據分析結果載入相應的解碼規則,並且透過解碼規則對相應的文件檔案進行解碼後與詞彙作比對,用以生成並顯示拼寫報告。 The multi-file inspection system disclosed in the present invention comprises: a spelling database, a loading module, an analysis module and an inspection module. The spelling database is used to store the vocabulary; the loading module is used to load the file file; the analysis module is used to analyze and record the format of the file file in the analysis result; and the checking module loads the corresponding decoding rule according to the analysis result. And decoding the corresponding file file through the decoding rule and comparing with the vocabulary to generate and display the spelling report.

至於本發明之多文件檢查方法,其步驟包括:建立詞彙於拼字資料庫;載入文件檔案;分析並記錄文件檔案的格式於分析結果;根據分析結果載入相應的解碼規則,並且透過解碼規則對相應的文件檔案進行解碼後與詞彙作比對,用以生成並顯示拼寫報告。 As for the multi-file inspection method of the present invention, the steps include: creating a vocabulary in the spelling data database; loading the file file; analyzing and recording the format of the file file in the analysis result; loading the corresponding decoding rule according to the analysis result, and decoding The rules decode the corresponding file file and compare it with the vocabulary to generate and display the spelling report.

本發明所揭露之系統與方法如上,與先前技術之間的差異在於本發明透過分析各文件的格式並分別載入相應的解碼規則進行文件解碼後,將解碼後的文件內容與詞彙進行比對並產生拼寫報告。 The system and method disclosed in the present invention are as above, and the difference from the prior art is that the present invention compares the decoded file content with the vocabulary by analyzing the format of each file and respectively loading the corresponding decoding rule to decode the file. And generate a spelling report.

透過上述的技術手段,本發明可以達成提高拼寫檢查的修正效率及便利性的技術功效。 Through the above technical means, the present invention can achieve the technical effect of improving the efficiency and convenience of spelling correction.

以下將配合圖式及實施例來詳細說明本發明之實施方式,藉此對本發明如何應用技術手段來解決技術問題並達成技術功效的實現過程能充分理解並據以實施。 The embodiments of the present invention will be described in detail below with reference to the drawings and embodiments, so that the application of the technical means to solve the technical problems and achieve the technical effects can be fully understood and implemented.

首先,在詳細說明實施例之前,先對本發明所提出的多文件檢查系統及其方法之自訂名詞進行說明,本發明所述的文件是以電腦檔案的型態存在,故可稱之為文件檔案,而且此文件檔案具有用以識別格 式的副檔名,如:“doc”、“pdf”、“html”、“ppt”、......等等,以便搭配專門處理該副檔名的文書處理軟體作進一步的文書處理。而編碼規則及解碼規則是用以對具有不同副檔名的文件進行編碼及解碼的處理。舉例來說,對應“doc”副檔名的解碼規則能夠對副檔名為“doc”的文件檔案進行解碼以顯示其中的文字內容;對應“pdf”副檔名的解碼規則能夠對副檔名為“pdf”的文件檔案進行解碼並顯示其中的文字內容......並以此類推,反之,編碼規則如同解碼規則,其差異僅在於編碼規則用於編碼,而解碼規則用於對已編碼的文件進行解碼,故在此不對編碼規則作贅述,一般而言,不同副檔名對應不同的編碼規則及解碼規則,且所對應的編碼規則及解碼規則能夠進行相應的編/解碼處理。 First, before describing the embodiment in detail, the self-defined nouns of the multi-file inspection system and the method thereof according to the present invention are described. The file according to the present invention exists in the form of a computer file, so it can be called a file. File, and this file file has a file for identification The name of the auxiliary file, such as: "doc", "pdf", "html", "ppt", ..., etc., in order to cooperate with the processing software that specializes in the processing of the file name for further paper processing. . The encoding rule and the decoding rule are processes for encoding and decoding files having different file names. For example, the decoding rule corresponding to the "doc" file name can decode the file file with the file name "doc" to display the text content therein; the decoding rule corresponding to the "pdf" file name can be used for the file name. Decode the file file of "pdf" and display the text content thereof... and so on. Otherwise, the encoding rule is like the decoding rule, the difference is only that the encoding rule is used for encoding, and the decoding rule is used for The encoded file is decoded, so the encoding rules are not described here. In general, different sub-file names correspond to different encoding rules and decoding rules, and the corresponding encoding rules and decoding rules can be correspondingly encoded/decoded. .

接下來,在說明本發明的實施例之前,先配合圖式對本發明多文件檢查系統及其方法作進一步說明,請參閱「第1圖」,「第1圖」為本發明多文件檢查系統之方塊圖,包含:拼字資料庫101、載入模組102、分析模組103及檢查模組104。其中,拼字資料庫101用以儲存詞彙,所述詞彙包含各國語言,如:中文、英文、法文、......等等的單字,以及程式語言,如:c、java、html、......等等的關鍵字至少其中之一。另外,拼字資料庫101更可包含文法規則,用以分析文字內容的文法及程式語言的語法至少其中之一是否拼寫有誤,而此文法規則可透過網路或檔案進行更新,由於其更新方式為習知技術,故在此不多作贅述。 Next, prior to describing an embodiment of the present invention, the multi-document inspection system and method thereof of the present invention will be further described with reference to the drawings. Please refer to "FIG. 1", which is a multi-file inspection system of the present invention. The block diagram includes: a spelling database 101, a loading module 102, an analysis module 103, and an inspection module 104. The spelling database 101 is used to store vocabulary, and the vocabulary includes words in various languages, such as Chinese, English, French, ..., and programming languages, such as: c, java, html, ...and at least one of the keywords. In addition, the spelling database 101 may further include a grammar rule for analyzing whether the grammar of the text content and the syntax of the programming language are at least one of the spelling errors, and the grammar rule may be updated through the network or the file due to the update thereof. The method is a conventional technique, so it will not be repeated here.

載入模組102用以載入文件檔案,其文件檔案的格式至少包含文書檔案格式(例如:副檔名為“doc”、“ppt”、......等等)、純文字格式(例如:副檔名為“txt”)、可攜式文件格式(例如:副檔名為“pdf ”)、超文字標記語言格式(例如:副檔名為“html”)及開放文件格式(例如:副檔名為“odf”、“oxml”、......等等)其中之一,由於所述文件檔案已於前面自訂名詞中進行說明,故在此不再多作贅述。 The loading module 102 is configured to load a file file, and the format of the file file includes at least a file file format (for example, the file name is "doc", "ppt", ..., etc.), and the text format is (Example: the file name is "txt"), portable file format (for example: the file name is "pdf" "), the super-text markup language format (for example: the extension file name "html") and the open file format (for example: the extension file name "odf", "oxml", ..., etc.) Since the file file has been explained in the previous custom nouns, it will not be repeated here.

分析模組103用以分析並記錄文件檔案的格式於分析結果,其分析方式可根據文件檔案的檔案結構或直接根據文件檔案的副檔名來進行判斷。一般而言,文件檔案的檔案結構中具有記錄文件格式之欄位,因此,在實際實施上,可透過判斷此欄位的值(例如:字串值為“word文件”)得知文件檔案的格式,並且將格式記錄於分析結果中;而直接根據副檔名(例如:“doc”)來判斷文件檔案之格式,容易因誤判而產生錯誤,其原因在於副檔名容易遭到竄改,話雖如此,卻仍然不失為一個判斷格式的簡便方式,同樣地,在判斷出格式後,將判斷的結果記錄於分析結果中。 The analysis module 103 is configured to analyze and record the format of the file file in the analysis result, and the analysis manner can be determined according to the file structure of the file file or directly according to the file name of the file file. Generally speaking, the file structure of the file file has a field for recording the file format. Therefore, in actual implementation, the value of the field can be determined by judging the value of the field (for example, the string value is "word file"). Format, and record the format in the analysis results; and directly determine the format of the file file according to the file name (for example: "doc"), it is easy to generate errors due to misjudgment, because the file name is easily falsified, However, it is still a simple way to judge the format. Similarly, after judging the format, the result of the judgment is recorded in the analysis result.

檢查模組104根據分析結果載入相應的解碼規則,並且在透過所載入的解碼規則對相應的文件檔案進行解碼後,將文字檔案中的文字內容與拼字資料庫101中的所有詞彙作比對,用以生成並顯示拼寫報告,舉例來說,分析結果中記錄為“word文件”,檢查模組104將根據此分析結果載入處理“word文件”的解碼規則以便對文件檔案進行解碼,由於解碼後即可得到此文件檔案的文字內容,例如:文字內容中具有“good”及“goood”之詞彙,於是檢查模組104接著將文字內容與拼字資料庫101中的所有詞彙進行比對,並列出比對不相符的文字內容作為拼寫報告後進行顯示,以此例而言,詞彙“goood”為錯字,所以不存在於拼字資料庫101中,因此在進行比對後,此詞彙“goood”為比對不相符並記錄於拼寫報告中。另外,上述拼寫報告記錄可記錄多個文件檔案的比對結果,且包含比對相符的詞彙及比 對不相符的詞彙至少其中之一的統計,並可以搭配圖形及表格的方式進行顯示。而在實際實施上,檢查模組104更可透過預設的拼字檢查規則,進行客製化的拼字檢查,所述拼字檢查規則可記錄拼字檢查的方式,例如:只檢查開頭為字母“b”的詞彙,甚至是設定忽略檢查的詞彙。 The checking module 104 loads the corresponding decoding rule according to the analysis result, and after decoding the corresponding file file through the loaded decoding rule, the text content in the text file and all the words in the spelling database 101 are made. The comparison is used to generate and display a spelling report. For example, the analysis result is recorded as a “word file”, and the checking module 104 loads the decoding rule of the “word file” according to the analysis result to decode the file file. Since the text content of the file file is obtained after decoding, for example, the word content has "good" and "goood" words, and then the checking module 104 then performs the text content and all the words in the spelling database 101. The comparison is performed, and the text content that does not match the comparison is displayed as a spelling report. In this example, the word “goood” is a typo, so it does not exist in the spelling database 101, so after the comparison, The word "goood" does not match the match and is recorded in the spelling report. In addition, the above spelling report record can record the comparison results of multiple file files, and includes the matching words and ratios. Statistics on at least one of the inconsistent words, and can be displayed in the form of graphics and tables. In actual implementation, the check module 104 can perform a customized spell check by using a preset spell check rule, and the spell check rule can record the spell check mode, for example, only the check start is The vocabulary of the letter "b" is even a vocabulary that ignores checking.

承上所述,此多文件檢查系統更可包含:接收模組105、解碼模組106及修訂模組107。其中,接收模組105用以接收選擇條件及修正文字,所述選擇條件及修正文字皆可透過按壓功能鍵或游標點選的方式進行輸入,分別用以選擇進行編輯的文件檔案,以及選擇或輸入正確的詞彙以進行修正。 As described above, the multi-file inspection system may further include: a receiving module 105, a decoding module 106, and a revision module 107. The receiving module 105 is configured to receive the selection condition and the modified text, and the selection condition and the modified text can be input by pressing a function key or a cursor selection manner, respectively, for selecting a file file to be edited, and selecting or Enter the correct vocabulary to correct it.

解碼模組106用以根據選擇條件選擇所載入的文件檔案之一,並且透過相應於所選擇的文件檔案之解碼規則進行解碼後,顯示此文件檔案之文字內容。特別要說明的是,解碼模組106與檢查模組104的差異在於解碼模組106會顯示解碼後所得到的文字內容,而檢查模組104則是將解碼後的文字內容與拼字資料庫101中的所有詞彙進行比對,僅顯示根據比對結果所產生的拼寫報告。 The decoding module 106 is configured to select one of the loaded file files according to the selection condition, and display the text content of the file file after being decoded by the decoding rule corresponding to the selected file file. In particular, the difference between the decoding module 106 and the inspection module 104 is that the decoding module 106 displays the decoded text content, and the inspection module 104 is the decoded text content and the spelling database. All the words in 101 are compared, and only the spelling reports generated based on the comparison results are displayed.

修訂模組107根據修訂文字對文件檔案之文字內容進行編輯,並於編輯完成後透過相應於文件檔案之格式的編碼規則進行編碼及寫入文件檔案。從前面的說明已得知,文件檔案根據不同的副檔名而有不同的編碼格式,因此,當要進行寫入文件檔案時,必須先將已解碼並且修訂完畢的文字內容透過相應的編碼規則進行編碼後再寫入文件檔案。 The revision module 107 edits the text content of the file file according to the revised text, and after the editing is completed, encodes and writes the file file through the encoding rule corresponding to the format of the file file. As can be seen from the foregoing description, file files have different encoding formats according to different file names. Therefore, when writing file files, the decoded and revised text content must first be transmitted through the corresponding encoding rules. Write the file and then write it to the file.

如「第2圖」所示,「第2圖」為本發明多文件檢查方法之流程圖,其步驟包含:建立詞彙於拼字資料庫(步驟201);載入文件檔案(步 驟202);分析並記錄文件檔案的格式於分析結果(步驟203);根據分析結果載入相應的解碼規則,並且透過解碼規則對相應的文件檔案進行解碼後與詞彙作比對,用以生成並顯示拼寫報告(步驟204)。透過上述步驟即可於分析各文件的格式並分別載入相應的解碼規則進行文件解碼後,將解碼後的文件內容與詞彙進行比對並產生拼寫報告。 As shown in "Fig. 2", "Fig. 2" is a flowchart of the multi-file checking method of the present invention, the steps of which include: creating a vocabulary in the spelling database (step 201); loading the file file (step Step 202); analyzing and recording the format of the file file in the analysis result (step 203); loading the corresponding decoding rule according to the analysis result, and decoding the corresponding file file through the decoding rule and comparing with the vocabulary to generate A spelling report is displayed (step 204). Through the above steps, the format of each file can be analyzed and the corresponding decoding rules are respectively loaded to decode the file, and the decoded file content is compared with the vocabulary and a spelling report is generated.

除此之外,該多文件檢查方法更可包含:接收選擇條件及修正文字(步驟205);根據選擇條件選擇文件檔案之一,並且透過相應於文件檔案的解碼規則進行解碼後,顯示文件檔案之文字內容(步驟206);根據修正文字對文件檔案之文字內容進行編輯,並於編輯完成後透過相應於文件檔案之格式的編碼規則進行編碼及寫入文件檔案(步驟207)。藉由上述步驟即可於拼寫報告產生後,選擇所載入的文件檔案之一進行詞彙修訂並儲存。 In addition, the multi-file checking method may further include: receiving a selection condition and correcting the text (step 205); selecting one of the file files according to the selection condition, and displaying the file file after decoding by the decoding rule corresponding to the file file The text content (step 206); editing the text content of the file file according to the modified text, and after the editing is completed, encoding and writing the file file through the encoding rule corresponding to the format of the file file (step 207). By the above steps, after the spelling report is generated, one of the loaded file files is selected for vocabulary revision and storage.

以下將配合「第3圖」至「第6圖」以實施例方式進行如下說明,請先參閱「第3圖」至「第5圖」,其為應用本發明進行多文件檢查之示意圖。如「第3圖」所示,包含:檢查視窗300、載入元件301、載入顯示區塊302、拼寫檢查元件303、報告輸出元件304、拼寫報告顯示區塊305、前一詞彙元件306、次一詞彙元件307、修正文字選擇區塊308、替換詞彙元件309及全部恢復元件310,所述檢查視窗300的形式與包含元件並不以此為限。 The following description will be made by way of example with reference to "3rd" to "6th". Please refer to "3rd" to "5th", which is a schematic diagram of multi-file inspection using the present invention. As shown in FIG. 3, the inspection window 300, the loading component 301, the loading display block 302, the spell checking component 303, the report output component 304, the spelling report display block 305, the previous vocabulary component 306, The next vocabulary component 307, the modified text selection block 308, the replacement vocabulary component 309, and all of the recovery components 310, the form and inclusion of the inspection window 300 are not limited thereto.

當使用者欲對多文件進行拼寫檢查時,可執行檢查視窗300並且點選載入元件301,用以產生選擇對話框(圖中未示)以提供使用者選擇並載入欲進行拼寫檢查的文件檔案,例如:選擇載入文件檔案為“a.htm”及“b.doc”。接著,載入模組102載入所選擇的文件檔案,且經由分析模組103分析並記錄文件檔案的格式於分析結果,以此例 而言,可根據“a.htm”及“b.doc”的檔案結構或副檔名得知這兩個檔案的格式分別為“html文件”及“word文件”,故分析結果記錄文件檔案“a.htm”的格式為“html文件”、文件檔案“b.doc”的格式為“word文件”。特別要注意的是,本發明並未限定分析結果所記錄的格式,也就是說,只要能夠得知文件檔案及所對應的格式,便不脫離本發明的精神,舉例來說,分析結果亦可以數值來代表不同的格式,例如:數值“1”代表“html文件”、數值“2”代表“word文件”。 When the user wants to perform a spell check on multiple files, the inspection window 300 can be executed and the loading component 301 can be clicked to generate a selection dialog (not shown) to provide the user with the choice and loading of the spell check. File files, for example: Select to load file files as "a.htm" and "b.doc". Then, the loading module 102 loads the selected file file, and analyzes and records the format of the file file through the analysis module 103 to analyze the result. In fact, according to the file structure or the auxiliary file name of "a.htm" and "b.doc", the format of the two files is "html file" and "word file" respectively, so the analysis result records the file file. The format of a.htm" is "html file", and the format of file file "b.doc" is "word file". It should be noted that the present invention does not limit the format recorded by the analysis result, that is, as long as the file file and the corresponding format can be known, the spirit of the present invention can be omitted. For example, the analysis result can also be Values represent different formats, for example: the value "1" stands for "html file" and the value "2" stands for "word file".

如「第4圖」所示,當分析模組103分析並記錄文件檔案的格式於分析結果後,亦可將所載入的文件檔案之相關訊息,例如:檔案名稱、類型及大小,顯示於載入顯示區塊302中。接著,當使用者點選拼寫檢查元件303後,透過檢查模組104根據分析結果載入相應的解碼規則(例如:載入處理“html文件”的解碼規則及處理“word文件”的解碼規則),並且透過所載入的解碼規則對相應的文件檔案進行解碼後與拼字資料庫101中的所有詞彙作比對,用以生成並顯示拼寫報告,以此例而言,檢查模組104載入處理“html文件”的解碼規則用以對文件檔案“a.htm”進行解碼,以及載入處理“word文件”的解碼規則用以對文件檔案“b.doc”進行解碼,並且在進行比對後記錄比對不相符的詞彙“boook”及“goood”於拼寫報告,並可透過點選報告輸出元件304將拼寫報告顯示於如「第5圖」所示意的拼寫報告顯示區塊305中,此拼寫報告更可記錄統計每一個文件檔案中比對不相符的詞彙總數。 As shown in Figure 4, when the analysis module 103 analyzes and records the format of the file file and analyzes the result, the related information of the file file to be loaded, such as the file name, type and size, can also be displayed. Loaded into display block 302. Then, when the user clicks on the spell check component 303, the check module 104 loads the corresponding decoding rule according to the analysis result (for example, the decoding rule of the loading process "html file" and the decoding rule of processing the "word file"). And decoding the corresponding file file through the loaded decoding rule, and comparing with all the words in the spelling database 101 to generate and display a spelling report. For example, the checking module 104 The decoding rule for processing the "html file" is used to decode the file file "a.htm", and the decoding rule for loading the "word file" is used to decode the file file "b.doc" and compare The words "boook" and "goood" which are inconsistent with the post-recording are recorded in the spelling report, and the spelling report can be displayed in the spelling report display block 305 as shown in "Fig. 5" by clicking the report output component 304. This spelling report can also record the total number of words that do not match the matching in each file file.

接著,請參閱「第6圖」,「第6圖」為應用本發明進行多文件修訂之示意圖。前面提到,本發明除了可進行多文件檢查以產生拼寫報 告之外,更可在拼寫報告產生後,選擇所載入的文件檔案之一進行詞彙修訂並儲存。因此,以下將針對修訂進行說明,而產生拼寫報告之前的步驟流程便不再作贅述。 Next, please refer to "Figure 6", "Figure 6" is a schematic diagram of multi-file revision using the present invention. As mentioned earlier, the present invention can perform multi-file checking to generate spellings. In addition, after the spelling report is generated, one of the loaded file files is selected for vocabulary revision and storage. Therefore, the revision will be explained below, and the flow of the steps before the spelling report is generated will not be repeated.

當使用者欲對拼寫檢查完畢的文件檔案進行修訂時,可透過按壓功能鍵的方式產生選擇條件,用以選擇已載入的文件檔案,舉例來說,可按壓鍵盤的數字鍵“1”來載入第一個文件檔案;按壓鍵盤的數字鍵“2”來載入第二個文件檔案......等等,以此類推。而所產生的選擇條件(例如:數字鍵“2”)透過接收模組105接收後,解碼模組106便根據此選擇條件載入所選擇的文件檔案(例如:文件檔案“b.doc”),並且透過相應於此文件檔案的解碼規則(例如:處理“word文件”的解碼規則)進行解碼後,可將原本拼寫報告顯示區塊305切換為如「第6圖」所示的編輯區塊350,用以顯示此文件檔案的文字內容。此時,使用者可於編輯區塊350中輸入修正文字,並且透過接收模組105進行接收後,修訂模組107根據所輸入的修正文字對文字檔案之文字內容進行編輯,並於編輯完成後透過相應於文件檔案之格式的編碼規則進行編碼及寫入此文件檔案(例如:文件檔案“b.doc”),至此完成文件檔案的修訂。 When the user wants to revise the file file that has been spell-checked, a selection condition can be generated by pressing a function key to select a file file that has been loaded. For example, the numeric key "1" of the keyboard can be pressed. Load the first file file; press the number key "2" on the keyboard to load the second file file... and so on, and so on. After the generated selection condition (for example, the number key "2") is received by the receiving module 105, the decoding module 106 loads the selected file file according to the selection condition (for example, the file file "b.doc"). And after decoding by the decoding rule corresponding to the file file (for example, the decoding rule for processing the "word file"), the original spelling report display block 305 can be switched to the editing block as shown in "Fig. 6". 350, used to display the text content of this file file. At this time, the user can input the corrected text in the editing block 350, and after receiving the received text through the receiving module 105, the revision module 107 edits the text content of the text file according to the input corrected text, and after editing, Encoding and writing to the file file (for example, file file "b.doc") through the encoding rules corresponding to the format of the file file, thus completing the revision of the file file.

另外,編輯區塊350更可根據拼寫報告中所記錄的不相符之詞彙,於文字內容相應處以醒目的顏色加以註記,例如:增加紅色底線(圖中未示)於第一錯誤詞彙351及第二錯誤詞彙352,以便使用者能夠快速得知拼寫有誤的詞彙之出處並進行編輯以完成修訂,在實際實施上,亦可透過前一詞彙元件306及次一詞彙元件307來調整顯示當前欲修正之詞彙,舉例來說,假設當前欲修正之詞彙為第一錯誤詞彙351,當點選次一詞彙元件307後,其當前欲修正之詞彙將調整為第 二錯誤詞彙352,甚至在當前欲修正之詞彙為第一錯誤詞彙351時,更可於修正文字選擇區塊308中顯示相似於第一錯誤詞彙351的相似詞彙(例如:book、booker及bookie),用以提供使用者修正時的參考。 In addition, the editing block 350 can be further marked with a conspicuous color according to the inconsistent vocabulary recorded in the spelling report, for example, adding a red bottom line (not shown) to the first wrong vocabulary 351 and the first The second error vocabulary 352, so that the user can quickly know the source of the misspelled vocabulary and edit it to complete the revision. In actual implementation, the current vocabulary component 306 and the next vocabulary component 307 can also be used to adjust the current desire. The revised vocabulary, for example, assumes that the current vocabulary to be corrected is the first erroneous vocabulary 351. When the vocabulary component 307 is selected, the vocabulary to be corrected is adjusted to The second erroneous vocabulary 352, even when the vocabulary to be corrected is the first erroneous vocabulary 351, the similar vocabulary similar to the first erroneous vocabulary 351 (for example: book, booker, and bookie) can be displayed in the modified text selection block 308. Used to provide a reference for the user to correct.

而除了上述作為修正時的參考外,亦可於修正文字選擇區塊308中選取所列的相似詞彙之一,並透過點選替換詞彙元件309來替換當前欲修正之詞彙,舉例來說,假設當前欲修正之詞彙為第一錯誤詞彙351時,當使用者於修正文字選擇區塊308中選取相似詞彙為“book”,並且點選替換詞彙元件309後,其第一錯誤詞彙351將由錯誤的“boook”替換為正確的“book”。若使用者想要取消替換,則可點選全部恢復元件310取消所有的替換。特別要說明的是,相似詞彙之判斷是透過當前欲修正之詞彙與拼字資料庫101中的所有詞彙進行模糊比對後所產生,由於此方式為習知技術,故在此不多作贅述。 In addition to the above reference as a correction, one of the listed similar words may be selected in the modified text selection block 308, and the currently modified word may be replaced by clicking the replacement vocabulary element 309, for example, assuming When the current vocabulary to be corrected is the first erroneous vocabulary 351, when the user selects the similar vocabulary as "book" in the modified text selection block 308, and clicks the replacement vocabulary element 309, the first erroneous vocabulary 351 will be wrong. Replace "boook" with the correct "book". If the user wants to cancel the replacement, all of the restoration elements 310 can be clicked to cancel all replacements. In particular, the judgment of the similar vocabulary is generated by performing a fuzzy comparison between the vocabulary to be corrected and all the vocabulary in the spelling database 101. Since this method is a conventional technique, it is not described here. .

綜上所述,可知本發明與先前技術之間的差異在於透過分析各文件的格式並分別載入相應的解碼規則進行文件解碼後,將解碼碼後的文件內容與詞彙進行比對並產生拼寫報告的技術手段,藉由此一技術手段可以解決先前技術無法同時對多文件進行拼寫檢查的問題,進而達成提高拼寫檢查的修正效率及便利性的技術功效。 In summary, it can be seen that the difference between the present invention and the prior art is that by analyzing the format of each file and separately loading the corresponding decoding rules for file decoding, the file content after the decoded code is compared with the vocabulary and the spelling is generated. The technical means of the report can solve the problem that the prior art cannot perform spell check on multiple files at the same time by using a technical means, thereby achieving the technical effect of improving the efficiency and convenience of the spell check.

雖然本發明所揭露之實施方式如上,惟所述之內容並非用以直接限定本發明之專利保護範圍。任何本發明所屬技術領域中具有通常知識者,在不脫離本發明所揭露之精神和範圍的前提下,可以在實施的形式上及細節上作些許之更動。本發明之專利保護範圍,仍須以所附之申請專利範圍所界定者為準。 While the embodiments of the present invention have been described above, the above description is not intended to limit the scope of the invention. Any changes in the form and details of the embodiments may be made without departing from the spirit and scope of the invention. The scope of the invention is to be determined by the scope of the appended claims.

101‧‧‧拼字資料庫 101‧‧‧Spelling database

102‧‧‧載入模組 102‧‧‧Loading module

103‧‧‧分析模組 103‧‧‧Analysis module

104‧‧‧檢查模組 104‧‧‧Check module

105‧‧‧接收模組 105‧‧‧ receiving module

106‧‧‧解碼模組 106‧‧‧Decoding module

107‧‧‧修訂模組 107‧‧‧Modification module

300‧‧‧檢查視窗 300‧‧‧Check window

301‧‧‧載入元件 301‧‧‧Loading components

302‧‧‧載入顯示區塊 302‧‧‧Load display block

303‧‧‧拼寫檢查元件 303‧‧‧Spell check component

304‧‧‧報告輸出元件 304‧‧‧Report output component

305‧‧‧拼寫報告顯示區塊 305‧‧‧Spelling report display block

306‧‧‧前一詞彙元件 306‧‧‧Previous vocabulary components

307‧‧‧次一詞彙元件 307‧‧‧ vocabulary components

308‧‧‧修正文字選擇區塊 308‧‧‧Revised text selection block

309‧‧‧替換詞彙元件 309‧‧‧Replacement of vocabulary components

310‧‧‧全部恢復元件 310‧‧‧All recovery components

350‧‧‧編輯區塊 350‧‧‧Editing block

351‧‧‧第一錯誤詞彙 351‧‧‧First wrong vocabulary

352‧‧‧第二錯誤詞彙 352‧‧‧ second wrong vocabulary

步驟201‧‧‧建立至少一詞彙於一拼字資料庫 Step 201‧‧‧Create at least one vocabulary in a spelling database

步驟202‧‧‧載入至少一文件檔案 Step 202‧‧‧ Load at least one file file

步驟203‧‧‧分析並記錄所述文件檔案的格式於一分析結果 Step 203‧‧‧ Analyze and record the format of the file file in an analysis result

步驟204‧‧‧根據該分析結果載入相應的至少一解碼規則,並且透過所述解碼規則對相應的所述文件檔案進行解碼後與所述詞彙作比對,用以生成並顯示一拼寫報告 Step 204‧‧‧ loading corresponding at least one decoding rule according to the analysis result, and decoding the corresponding file file through the decoding rule, and comparing with the vocabulary to generate and display a spelling report

步驟205‧‧‧接收一選擇條件及一修正文字 Step 205‧‧‧ Receive a selection condition and a correction text

步驟206‧‧‧根據該選擇條件選擇所述文件檔案之一,並且透過相應於該文件檔案的該解碼規則進行解碼後,顯示該文件檔案之文字內容 Step 206‧‧‧ select one of the file files according to the selection condition, and display the text content of the file file after decoding by the decoding rule corresponding to the file file

步驟207‧‧‧根據該修正文字對該文件檔案之文字內容進行編輯,並於編輯完成後透過相應於該文件檔案之格式的一編碼規則進行編碼及寫入該文件檔案 Step 207‧‧‧ edit the text content of the file file according to the modified text, and after the editing is completed, encode and write the file file through an encoding rule corresponding to the format of the file file

第1圖為本發明多文件檢查系統之方塊圖。 Figure 1 is a block diagram of a multi-file inspection system of the present invention.

第2圖為本發明多文件檢查方法之流程圖。 Figure 2 is a flow chart of the multi-file checking method of the present invention.

第3圖至第5圖為應用本發明進行多文件檢查之示意圖。 Figures 3 through 5 are schematic views of multi-file inspection using the present invention.

第6圖為應用本發明進行多文件修訂之示意圖。 Figure 6 is a schematic diagram of multi-document revision using the present invention.

101‧‧‧拼字資料庫 101‧‧‧Spelling database

102‧‧‧載入模組 102‧‧‧Loading module

103‧‧‧分析模組 103‧‧‧Analysis module

104‧‧‧檢查模組 104‧‧‧Check module

105‧‧‧接收模組 105‧‧‧ receiving module

106‧‧‧解碼模組 106‧‧‧Decoding module

107‧‧‧修訂模組 107‧‧‧Modification module

Claims (10)

一種多文件檢查系統,包含:一拼字資料庫,用以儲存至少一詞彙;一載入模組,用以載入至少一文件檔案;一分析模組,用以分析並記錄所述文件檔案的格式於一分析結果;及一檢查模組,根據該分析結果載入相應的至少一解碼規則,並且透過所述解碼規則對相應的所述文件檔案進行解碼後與所述詞彙作比對,用以生成並顯示一拼寫報告。 A multi-file inspection system includes: a spelling database for storing at least one vocabulary; a loading module for loading at least one file file; and an analysis module for analyzing and recording the file file The format is an analysis result; and an inspection module loads the corresponding at least one decoding rule according to the analysis result, and decodes the corresponding file file through the decoding rule, and compares the vocabulary with the vocabulary. Used to generate and display a spelling report. 如申請專利範圍第1項所述之多文件檢查系統,其中該系統更包含:一接收模組,用以接收一選擇條件及一修正文字;一解碼模組,用以根據該選擇條件選擇所述文件檔案之一,並且透過相應於該文件檔案的該解碼規則進行解碼後,顯示該文件檔案之文字內容;及一修訂模組,根據該修正文字對該文件檔案之文字內容進行編輯,並於編輯完成後透過相應於該文件檔案之格式的一編碼規則進行編碼及寫入該文件檔案。 The multi-document inspection system of claim 1, wherein the system further comprises: a receiving module for receiving a selection condition and a modified text; and a decoding module for selecting the selection according to the selection condition Descripting one of the file files, and displaying the text content of the file file after decoding by the decoding rule corresponding to the file file; and a revision module, editing the text content of the file file according to the modified text, and After the editing is completed, the encoding file is encoded and written by an encoding rule corresponding to the format of the file file. 如申請專利範圍第1項所述之多文件檢查系統,其中該拼字資料庫更包含儲存至少一文法規則,所述文法規則透過網路或檔案進行更新,所述詞彙為各國語言的單字及程式語言的關鍵字至少其中之一,所述文件檔案的格式至少包含文書檔案格式、純文字格式、可攜式文件格式、超文字標記語言格式及開放文件格式其中之一。 The multi-document inspection system of claim 1, wherein the spelling database further comprises storing at least one grammar rule, the grammar rule being updated through a network or a file, the vocabulary being a word in each language and At least one of the keywords of the programming language, the file file format includes at least one of a document file format, a plain text format, a portable file format, a hypertext markup language format, and an open file format. 如申請專利範圍第2項所述之多文件檢查系統,其中所述編碼規則及所述解碼規則為處理不同格式之依據,用以能夠正確讀寫所述文件檔案。 The multi-file inspection system of claim 2, wherein the encoding rule and the decoding rule are based on processing different formats for being able to correctly read and write the file file. 如申請專利範圍第1項所述之多文件檢查系統,其中該拼寫報告記錄各該文件檔案的比對結果,包含比對相符的所述詞彙及比對不相符的所述詞彙至少其中之一的統計。 The multi-document inspection system of claim 1, wherein the spelling report records a comparison result of each of the file archives, and includes at least one of the vocabulary that matches the vocabulary and the collocation that does not match the comparison. registration. 一種多文件檢查方法,其步驟包括:建立至少一詞彙於一拼字資料庫;載入至少一文件檔案;分析並記錄所述文件檔案的格式於一分析結果;及根據該分析結果載入相應的至少一解碼規則,並且透過所述解碼規則對相應的所述文件檔案進行解碼後與所述詞彙作比對,用以生成並顯示一拼寫報告。 A multi-file checking method, the method comprising: establishing at least one vocabulary in a spelling database; loading at least one file file; analyzing and recording the format of the file file in an analysis result; and loading the corresponding result according to the analysis result At least one decoding rule, and decoding the corresponding file file through the decoding rule, and comparing the vocabulary to generate and display a spelling report. 如申請專利範圍第6項所述之多文件檢查方法,其中該方法更包含:接收一選擇條件及一修正文字;根據該選擇條件選擇所述文件檔案之一,並且透過相應於該文件檔案的該解碼規則進行解碼後,顯示該文件檔案之文字內容;及根據該修正文字對該文件檔案之文字內容進行編輯,並於編輯完成後透過相應於該文件檔案之格式的一編碼規則進行編碼及寫入該文件檔案。 The multi-document inspection method of claim 6, wherein the method further comprises: receiving a selection condition and a correction text; selecting one of the file files according to the selection condition, and transmitting the file file corresponding to the file file After the decoding rule is decoded, the text content of the file file is displayed; and the text content of the file file is edited according to the modified text, and after editing, encoding is performed by an encoding rule corresponding to the format of the file file and Write to the file archive. 如申請專利範圍第6項所述之多文件檢查方法,其中該拼字資料庫更包含儲存至少一文法規則,所述文法規則透過網路或檔 案進行更新,所述詞彙為各國語言的單字及程式語言的關鍵字至少其中之一,所述文件檔案的格式至少包含文書檔案格式、純文字格式、可攜式文件格式、超文字標記語言格式及開放文件格式其中之一。 The multi-file inspection method of claim 6, wherein the spelling database further comprises storing at least one grammar rule, the grammar rule is transmitted through a network or a file. The file is updated, and the vocabulary is at least one of a word of a national language and a keyword of a programming language, and the format of the file file includes at least a document file format, a plain text format, a portable file format, and a hypertext markup language format. And one of the open file formats. 如申請專利範圍第7項所述之多文件檢查方法,其中所述編碼規則及所述解碼規則為處理不同格式之依據,用以能夠正確讀寫所述文件檔案。 The multi-file inspection method of claim 7, wherein the encoding rule and the decoding rule are based on processing different formats for being able to correctly read and write the file file. 如申請專利範圍第6項所述之多文件檢查方法,其中該拼寫報告記錄各該文件檔案的比對結果,包含比對相符的所述詞彙及比對不相符的所述詞彙至少其中之一的統計。 The multi-document inspection method of claim 6, wherein the spelling report records a comparison result of each of the file archives, and includes at least one of the vocabulary that matches the vocabulary and the collocation that does not match the comparison. registration.
TW101132921A 2012-09-10 2012-09-10 Multi-file checking system and method thereof TWI456413B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW101132921A TWI456413B (en) 2012-09-10 2012-09-10 Multi-file checking system and method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW101132921A TWI456413B (en) 2012-09-10 2012-09-10 Multi-file checking system and method thereof

Publications (2)

Publication Number Publication Date
TW201411372A true TW201411372A (en) 2014-03-16
TWI456413B TWI456413B (en) 2014-10-11

Family

ID=50820846

Family Applications (1)

Application Number Title Priority Date Filing Date
TW101132921A TWI456413B (en) 2012-09-10 2012-09-10 Multi-file checking system and method thereof

Country Status (1)

Country Link
TW (1) TWI456413B (en)

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI237779B (en) * 2003-11-14 2005-08-11 Infoacer Corp Context searching system of computer files and method thereof
US7996769B2 (en) * 2006-04-05 2011-08-09 Research In Motion Limited Handheld electronic device and method for performing spell checking during text entry and for providing a spell-check learning feature
TW201035784A (en) * 2009-03-31 2010-10-01 Inventec Corp Complicated Chinese and simplified Chinese conversion system with marked prompt and method thereof
TW201118602A (en) * 2009-11-20 2011-06-01 Inventec Corp Translation resource management system and method thereof

Also Published As

Publication number Publication date
TWI456413B (en) 2014-10-11

Similar Documents

Publication Publication Date Title
US20200257848A1 (en) System and method for generating task-embedded documents
US7240279B1 (en) XML patterns language
Schmidt The inadequacy of embedded markup for cultural heritage texts
US20080133444A1 (en) Web-based collocation error proofing
US20080288239A1 (en) Localization and internationalization of document resources
EP2092447A1 (en) Email document parsing method and apparatus
US8078451B2 (en) Interface and methods for collecting aligned editorial corrections into a database
US11768677B2 (en) Method and system for software application optimization using natural language-based queries
US11158118B2 (en) Language model, method and apparatus for interpreting zoning legal text
US10120843B2 (en) Generation of parsable data for deep parsing
JP2014240884A (en) Content creation assist device, method, and program
US7996768B2 (en) Operations on document components filtered via text attributes
CA2504111A1 (en) Critiquing clitic pronoun ordering in french
CN113159969A (en) Financial long text rechecking system
CN101464875B (en) Method for representing electronic dictionary catalog data by XML
JP6568968B2 (en) Document review device and program
CN116360794A (en) Database language analysis method, device, computer equipment and storage medium
CN115794057A (en) Command line function code generation method, device and computer readable storage medium
CN116243901A (en) Method and system for rapidly developing multilingual project of front-end page
TW201411372A (en) Multi-file checking system and method thereof
JP2014137613A (en) Translation support program, method and device
US9965453B2 (en) Document transformation
JP5994150B2 (en) Document creation method, document creation apparatus, and document creation program
JP2004234402A (en) Web screen creating tool and term checking tool
Rodríguez TEITOK, a visual solution for XML/TEI encoding: editing, annotating and hosting linguistic corpora

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees