TWM602240U - Suspected money laundering transaction report conversion system - Google Patents
- Publication number
- TWM602240U
- Authority
- TW
- Taiwan
- Prior art keywords
- module
- character
- conversion system
- money laundering
- symbol
- Prior art date
Abstract
The present creation discloses a suspected money laundering transaction report conversion system comprising a recognition module, a capture module, and a processing module. The recognition module recognizes characters in a word processing document to generate a recognition result. The capture module captures at least one character from the word processing document according to the recognition result. The processing module converts the word processing document into a web page format document, wherein the web page format document contains the at least one character that the capture module captured from the word processing document.
Description
The present creation relates to a file conversion system, and in particular to a conversion system for suspected money laundering transaction reports.
Electronic workflows are now widespread and work in every industry has been digitized, yet during these workflows the data is still typed into text files by hand.
In addition, electronic filing usually involves an electronic filing program that contains many forms whose data must be entered manually. In other words, besides typing the data into a text file by hand, the same data must also be entered into the corresponding forms of the electronic filing program.
However, the data to be entered is voluminous and complicated. Entering every item by hand and then entering the same data again into the forms of the electronic filing program consumes considerable time and labor and invites data entry errors. Although some electronic filing programs can import files in specific formats and fill in the corresponding forms automatically, none of the importable formats is an ordinary text file of the kind used for manual data entry.
Accordingly, providing a suspected money laundering transaction report conversion system that converts a text file suitable for manual data entry into a file in the specific format accepted by the electronic filing program has become a pressing problem.
In view of the above problems, the present creation discloses a suspected money laundering transaction report conversion system comprising a recognition module, a capture module, and a processing module. The recognition module recognizes characters in a word processing document to generate a recognition result. The capture module captures at least one character from the word processing document according to the recognition result. The processing module converts the word processing document into a web page format document, wherein the web page format document contains the at least one character that the capture module captured from the word processing document.
As described above, the suspected money laundering transaction report conversion system of the present creation offers a flexible data capture function that is not restricted by input order, data type, number of entries, or fixed data position. It captures the required characters from the word processing document and converts them into a web page format document, which saves the time otherwise spent re-entering the data into the web page format document by hand, improves efficiency, and further avoids data entry errors.
Please refer to FIG. 1, which is a block diagram of the suspected money laundering transaction report conversion system of the present creation. The suspected money laundering transaction report conversion system 1 comprises a recognition module 11, a capture module 12, and a processing module 13. The recognition module 11 recognizes characters in a word processing document to generate a recognition result. The capture module 12 captures at least one character from the word processing document according to the recognition result. The processing module 13 converts the word processing document into a web page format document, wherein the web page format document contains the at least one character that the capture module 12 captured from the word processing document.
In one embodiment of the present creation, the word processing document includes a Microsoft Office word processing document, and the web page format document includes an Extensible Markup Language (XML) document and a HyperText Markup Language (HTML) document. The present creation is not limited thereto, however; any software document that requires text input and processing falls within its scope.
Please refer to FIG. 2A and FIG. 2B, which are schematic diagrams of the word processing document and of the web page format document, respectively, of the suspected money laundering transaction report conversion system of the present creation. The embodiment of FIG. 2A takes a Microsoft Office Word document as an example. After the user has entered the data for fields (1) to (10) of the table in the Word document, the Word document is to be converted into the XML document format of FIG. 2B. The XML document of FIG. 2B is a standard-form table document: apart from the data for field (1) Name/Corporate body name, field (2) Birthday/Registration date, and so on through field (10) Nationality, which has not yet been filled in, the remaining content is already laid out as a standard form and needs no input. With the suspected money laundering transaction report conversion system 1, therefore, the number, text, and colon of each field in the Word document are set as preset text, for example the number and text of field (1) Name/Corporate body name together with its colon, and so on. The recognition module 11 then recognizes the at least one character in the field that follows the preset text, and stops once it recognizes the termination character.
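The preset-text idea can be illustrated with a minimal Python sketch. It is not the patented implementation: it assumes the Word document has already been reduced to plain text, that the preset labels are known in order, and that each field ends where the next label begins. The function name `extract_fields`, the English labels, and the sample values are hypothetical stand-ins for the content of FIG. 2A.

```python
def extract_fields(text, labels):
    """Return {label: raw value}, where each value runs from the end of its
    preset label to the start of the next label (or to the end of the text)."""
    fields = {}
    for i, label in enumerate(labels):
        start = text.find(label)
        if start == -1:
            continue  # label not present in the document: skip this field
        start += len(label)
        end = len(text)
        if i + 1 < len(labels):
            next_start = text.find(labels[i + 1], start)
            if next_start != -1:
                end = next_start  # the next label acts as the terminator
        fields[label] = text[start:end].strip()
    return fields


# Hypothetical plain-text rendering of part of the form in FIG. 2A.
doc = ("(1) Name/Corporate body name: A XX "
       "(2) Birthday/Registration date: 78/01/01 "
       "(3) Type: 1")
labels = [
    "(1) Name/Corporate body name:",
    "(2) Birthday/Registration date:",
    "(3) Type:",
]
fields = extract_fields(doc, labels)
```

Because the labels rather than fixed positions delimit each value, the sketch does not depend on where a field sits in the document, which matches the flexibility the description claims.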
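Furthermore, when the data to be recognized contains multiple entries of character data, a cut symbol can be placed between the entries to help the recognition module 11 recognize them. The recognition module 11 recognizes characters according to a start symbol, a cut symbol, and an end symbol. The start symbol includes the number, text, or colon of a field in the word processing document; the cut symbol includes a hash sign, an exclamation mark, or any other symbol distinguishable from ordinary characters; and the end symbol includes the number and text of a field. After the recognition module 11 recognizes the start symbol, the capture module 12 captures the characters between the start symbol and the end symbol. If at least one cut symbol lies between the start symbol and the end symbol, the capture module 12 splits the captured characters into multiple strings at the at least one cut symbol.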
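A minimal sketch of the start symbol, cut symbol, and end symbol logic follows, under the assumption that the document is available as plain text. The function name `capture`, the default cut symbols, and the English sample text are illustrative choices, not part of the original disclosure.

```python
import re


def capture(text, start_symbol, end_symbol, cut_symbols="#!"):
    """Capture the characters between start_symbol and end_symbol and
    split them into strings wherever a cut symbol appears."""
    begin = text.find(start_symbol)
    if begin == -1:
        return []  # start symbol never recognized: nothing to capture
    begin += len(start_symbol)
    stop = text.find(end_symbol, begin)
    if stop == -1:
        stop = len(text)  # no end symbol: capture to the end of the text
    raw = text[begin:stop].strip()
    if not raw:
        return []  # blank field
    # Build a character class such as [\#!] from the cut symbols.
    pattern = "[" + re.escape(cut_symbols) + "]"
    return [part.strip() for part in re.split(pattern, raw)]


# Hypothetical English stand-in for the name field of FIG. 2A.
sample = ("(1) Name/Corporate body name: A XX#B Co., Ltd.#C XX "
          "(2) Birthday/Registration date: 78/01/01")
names = capture(sample,
                "(1) Name/Corporate body name:",
                "(2) Birthday/Registration date:")
```

A field without any cut symbol simply yields a one-element list, so callers can treat single and multiple entries uniformly.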
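Taking the table in FIG. 2A as an example, when the recognition module 11 recognizes the start symbol "(1) Name/Corporate body name:" and the end symbol "(2) Birthday/Registration date:", the capture module 12 captures the at least one character between the start symbol and the end symbol, and thus captures the characters "甲XX#乙有限公司#丙XX" (placeholder names, roughly "A XX", "B Co., Ltd.", and "C XX"). Because the captured characters contain the hash-sign cut symbol, the capture module 12 splits them at the cut symbol "#" into the strings "甲XX", "乙有限公司", and "丙XX". The processing module 13 then enters the captured strings, in order, into the Name/Corporate body name field of the first frame F1, the second frame F2, and a third frame (not shown) of the web page format document of FIG. 2B, and so on.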
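How the split strings might be placed into successive frames can be sketched with Python's standard `xml.etree.ElementTree`. The element names `report`, `frame`, and `name`, and the `id` attribute, are assumptions made for illustration; the actual XML schema of FIG. 2B is not given in the text, and the real system fills a pre-existing standard-form template rather than building one from scratch.

```python
import xml.etree.ElementTree as ET


def fill_frames(field_name, values):
    """Create one <frame> per captured string and store the string in a
    child element named after the field (hypothetical schema)."""
    root = ET.Element("report")
    for index, value in enumerate(values, start=1):
        frame = ET.SubElement(root, "frame", id="F" + str(index))
        ET.SubElement(frame, field_name).text = value
    return root


# English stand-ins for the three names captured from FIG. 2A.
root = fill_frames("name", ["A XX", "B Co., Ltd.", "C XX"])
xml_text = ET.tostring(root, encoding="unicode")
```

The first string lands in frame F1, the second in F2, and the third in F3, mirroring the ordered mapping described above.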
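Similarly, taking field (2) Birthday/Registration date in FIG. 2A as an example, when the recognition module 11 recognizes the start symbol "(2) Birthday/Registration date:" and the end symbol "(3) Type:", the capture module 12 captures the at least one character between the start symbol and the end symbol, and thus captures the characters "78/01/01#102/01/02#57/01/03". Because the captured characters contain the cut symbol, the capture module 12 splits them at the cut symbol "#" into the strings "78/01/01", "102/01/02", and "57/01/03". The processing module 13 then converts the captured strings and enters them, in order, into the Birthday/Registration date field of the first frame F1, the second frame F2, and a third frame (not shown) of the web page format document of FIG. 2B, and so on. Note that a computer program automatically converts the year of each date from the Republic of China (Minguo) calendar to the Gregorian calendar (ROC year plus 1911, so year 78 becomes 1989).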
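The year conversion is simple arithmetic: a Republic of China (Minguo) year plus 1911 gives the Gregorian year. The sketch below assumes the `year/month/day` layout of the example strings and keeps the same separator in the output; the output format and the function name `roc_to_gregorian` are assumptions, since the text does not specify them.

```python
def roc_to_gregorian(roc_date):
    """Convert a 'year/month/day' date whose year is in the ROC (Minguo)
    calendar into the same layout with a Gregorian year."""
    year, month, day = roc_date.split("/")
    return "{}/{}/{}".format(int(year) + 1911, month, day)


# The three dates captured from field (2) in the example.
converted = [roc_to_gregorian(d) for d in ["78/01/01", "102/01/02", "57/01/03"]]
# converted == ["1989/01/01", "2013/01/02", "1968/01/03"]
```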
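Furthermore, taking field (10) Nationality in FIG. 2A as an example, only the single character 0 is entered in the field. The recognition module 11 uses "(10) Nationality:" as the start symbol and "0: national; 1: foreigner with residence permit; 2: foreigner without residence permit" as the end symbol, so the capture module 12 captures only the character "0". Field (1) Name/Corporate body name shows, however, that the table in FIG. 2A holds at least three entries, corresponding to "甲XX", "乙有限公司", and "丙XX". In this embodiment, although only the character "0" is read and no cut symbol splits it into multiple strings, the processing module 13 knows from the earlier field that there should be three entries, so it enters the character "0" into the corresponding area of each of the three entries. For example, the processing module 13 enters the character "0" captured by the capture module 12 into the Nationality field of the first frame F1, the second frame F2, and a third frame (not shown) of the web page format document of FIG. 2B, and so on. In other words, three nationality digits, one for each name in field (1), should strictly have been entered, but the conversion system automatically expands the single digit 0 into three repeated 0s. When the person entering the data omits input so that the number of entries differs between fields, the system automatically fills the same character into each frame of the XML document format of FIG. 2B.
In addition, fields left blank in the Word document of FIG. 2A remain blank after conversion into the XML document format of FIG. 2B. The remaining fields are converted in the manner described above, which is not repeated here.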
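The repeat-fill behavior can be sketched as a small helper that aligns a field's captured strings with the number of entries found in the name field. The function name `align_to_records` is hypothetical, and raising an error when the counts disagree in any other way is an assumption of this sketch; the text only describes the single-value case.

```python
def align_to_records(values, record_count):
    """Return one value per record, repeating a single captured value
    across all records when the operator entered it only once."""
    if not values:
        return [""] * record_count       # a blank field stays blank
    if len(values) == record_count:
        return list(values)              # already one value per record
    if len(values) == 1:
        return list(values) * record_count
    # Behavior for other mismatches is not described in the source.
    raise ValueError("cannot align {} values to {} records".format(
        len(values), record_count))


# Three names were captured from field (1), but only "0" from field (10).
nationalities = align_to_records(["0"], 3)
```

With this helper the single nationality digit is written into all three frames, as in the embodiment above.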
Please refer to FIG. 3A and FIG. 3B, which are schematic diagrams of the transaction account table and of the web page format document, respectively, of the suspected money laundering transaction report conversion system of the present creation. In this embodiment, to save the time spent manually re-entering data into the web page format document, the numbers in the transaction account table can also be recognized and captured. As shown in FIG. 3A, the manually entered data are the numbers following fields (1) and (2) of the OBU account and the numbers following fields (1) and (2) of the DOM account. The conversion system therefore sets "OBU account: (1)" in the transaction account table as a start symbol with "(2)" as its end symbol, and sets "(2)" as a start symbol with "DOM account: (1)" as its end symbol. The recognition module 11 recognizes the at least one character between each start symbol and its end symbol, the capture module 12 captures it, and the processing module 13 converts and enters the captured characters into the corresponding frame of the web page format document. Likewise, the system sets "DOM account: (1)" as a start symbol with "(2)" as its end symbol, and sets "(2)" as a start symbol with the line break as its end symbol; the characters between each pair are recognized, captured, and entered into the frames of the web page format document in the same way.
As described above, taking "OBU account:" in this embodiment as an example, the recognition module 11 recognizes the number following parenthesis (1), the capture module 12 captures that number, and the processing module 13 enters it at the corresponding position in the first field L3 (<Transaction account> OBU) of FIG. 3B. Similarly, the number following parenthesis (2) of the OBU account is recognized by the recognition module 11, captured by the capture module 12, and entered by the processing module 13 at the corresponding position in the second field L4 (<Transaction account> OBU) of FIG. 3B. The number following parenthesis (1) of the DOM account is recognized, captured, and entered at the corresponding position in the third field L5 (<Transaction account> DOM) of FIG. 3B, and the number following parenthesis (2) of the DOM account is recognized, captured, and entered at the corresponding position in the fourth field L6 (<Transaction account> DOM) of FIG. 3B, and so on.
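Because the marker "(2)" occurs twice in the account table, a sketch of this embodiment has to scan the text in order rather than search from the beginning each time. The following Python sketch does so with a moving cursor. The function name `capture_sequence`, the English labels, and the account numbers are invented for illustration; only the four start/end pairs and the target fields L3 to L6 come from the description.

```python
def capture_sequence(text, pairs):
    """Walk through text once, capturing the characters between each
    (start_symbol, end_symbol) pair in the order the pairs are given."""
    values = []
    cursor = 0
    for start_symbol, end_symbol in pairs:
        begin = text.find(start_symbol, cursor)
        if begin == -1:
            values.append("")  # start symbol missing: leave the field blank
            continue
        begin += len(start_symbol)
        stop = text.find(end_symbol, begin)
        if stop == -1:
            stop = len(text)
        values.append(text[begin:stop].strip())
        cursor = stop  # the end symbol may be the next start symbol
    return values


# Hypothetical plain-text form of the table in FIG. 3A.
table = ("OBU account: (1) 0011223344 (2) 0055667788\n"
         "DOM account: (1) 1011223344 (2) 1055667788\n")
pairs = [
    ("OBU account: (1)", "(2)"),
    ("(2)", "DOM account: (1)"),
    ("DOM account: (1)", "(2)"),
    ("(2)", "\n"),  # the line break ends the last number
]
accounts = dict(zip(["L3", "L4", "L5", "L6"], capture_sequence(table, pairs)))
```

Leaving the cursor on the end symbol, instead of after it, lets one marker close a field and open the next, which is how the description reuses "(2)" and "DOM account: (1)".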
In summary, the suspected money laundering transaction report conversion system of the present creation offers a flexible data capture function that is not restricted by input order, data type, number of entries, or fixed data position. It captures the required characters from the word processing document and converts them into a web page format document, which saves the time otherwise spent re-entering the data into the web page format document by hand, improves efficiency, and further avoids data entry errors.
1: Suspected money laundering transaction report conversion system
11: Recognition module
12: Capture module
13: Processing module
F1: Frame
F2: Frame
L1: Field
L2: Field
L3: Field
L4: Field
L5: Field
L6: Field
FIG. 1 is a block diagram of the suspected money laundering transaction report conversion system of the present creation;
FIG. 2A is a schematic diagram of the word processing document of the suspected money laundering transaction report conversion system of the present creation;
FIG. 2B is a schematic diagram of the web page format document of the suspected money laundering transaction report conversion system of the present creation;
FIG. 3A is a schematic diagram of the transaction account table of the suspected money laundering transaction report conversion system of the present creation; and
FIG. 3B is a schematic diagram of the web page format document of the suspected money laundering transaction report conversion system of the present creation.
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW109209838U TWM602240U (en) | 2020-07-30 | 2020-07-30 | Suspected money laundering transaction report conversion system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW109209838U TWM602240U (en) | 2020-07-30 | 2020-07-30 | Suspected money laundering transaction report conversion system |
Publications (1)
Publication Number | Publication Date |
---|---|
TWM602240U (en) | 2020-10-01 |
Family ID: 74094906
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW109209838U TWM602240U (en) | 2020-07-30 | 2020-07-30 | Suspected money laundering transaction report conversion system |
Country Status (1)
Country | Link |
---|---|
TW (1) | TWM602240U (en) |
- 2020-07-30: TW TW109209838U (TWM602240U), status unknown