TWI659369B - Message processing device - Google Patents

Message processing device Download PDF

Info

Publication number
TWI659369B
TWI659369B TW106123390A TW106123390A TWI659369B TW I659369 B TWI659369 B TW I659369B TW 106123390 A TW106123390 A TW 106123390A TW 106123390 A TW106123390 A TW 106123390A TW I659369 B TWI659369 B TW I659369B
Authority
TW
Taiwan
Prior art keywords
module
data
string
information
universal
Prior art date
Application number
TW106123390A
Other languages
Chinese (zh)
Other versions
TW201909043A (en
Inventor
林美玲
Original Assignee
金腦數位股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 金腦數位股份有限公司 filed Critical 金腦數位股份有限公司
Priority to TW106123390A priority Critical patent/TWI659369B/en
Priority to CN201810750467.6A priority patent/CN109254963A/en
Publication of TW201909043A publication Critical patent/TW201909043A/en
Application granted granted Critical
Publication of TWI659369B publication Critical patent/TWI659369B/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language

Abstract

一種訊息處理裝置,包括一控制模組、一儲存模組、一訊息擷取模組、一分類模組、一標準化模組及一翻譯模組。控制模組用於接收一處理指令。儲存模組儲存有一外部表單。外部表單具有至少一編碼資料。訊息擷取模組依處理指令與至少一個資料來源連結,並擷取多個資料。各該資料具有一關鍵字串與一通用字串。分類模組依通用字串將資料分類,以生成一資料表單,並儲存於儲存模組。其中,關鍵字串與通用字串排列於資料表單。標準化模組用於對資料表單進行標準化。翻譯模組則對標準化後的該資料表單進行翻譯。An information processing device includes a control module, a storage module, a message retrieval module, a classification module, a standardized module, and a translation module. The control module is used for receiving a processing instruction. The storage module stores an external form. The external form has at least one encoded data. The message extraction module is connected to at least one data source according to processing instructions, and retrieves multiple data. Each of the data has a keyword string and a universal string. The classification module classifies data according to a common string to generate a data form and stores it in a storage module. The keyword string and the universal string are arranged in the data form. The standardization module is used to standardize the data form. The translation module translates the standardized data sheet.

Description

訊息處理裝置Information processing device

本發明涉及一種訊息處理裝置,特別是一種可以收集個人和或企業實體訊息,並對訊息分類、標準化、翻譯、並彙整的訊息處理裝置。The invention relates to an information processing device, in particular to an information processing device that can collect personal and or corporate entity information, and classify, standardize, translate, and integrate the information.

對於企業而言,為了獲得市場認同以及成就,需要做出消息即時且正確的商業決策。在做出這些商業決策之前,企業經由搜尋取得相關訊息以探索或是判定關於商業及其他類型實體之身分及相關資訊,尤其是在風險管理的部分。For companies, in order to gain market recognition and success, they need to make timely and correct business decisions. Before making these business decisions, companies search for relevant information to explore or determine the identity and related information of business and other types of entities, especially in the risk management part.

換言之,這些商業決策的成功主要是取決於所取得訊息的資訊品質。所述品質是由資訊是否準確、完整、及時來決定。因此,如何提供符合企業需求的訊息品質是一重要課題。In other words, the success of these business decisions depends primarily on the quality of the information obtained. The quality is determined by whether the information is accurate, complete and timely. Therefore, how to provide information quality that meets the needs of enterprises is an important issue.

本發明之目的在於提供一訊息處理裝置,以取得準確、完整、及時的訊息。The purpose of the present invention is to provide an information processing device to obtain accurate, complete and timely information.

為達上述目的,本發明提供一種訊息處理裝置,其包括一控制模組、一儲存模組、一訊息擷取模組、一分類模組、一標準化模組及一翻譯模組。控制模組用於接收一處理指令。該處理指令包括個人和或企業實體的名稱。其中,個人和或企業實體的名稱是以一第一語言呈現。儲存模組儲存有一外部表單,該外部表單具有至少一編碼資料以及對應各該編碼資料的至少一關聯訊息。訊息擷取模組則依據該處理指令與至少一個資料來源連結,並擷取關於個人或企業實體的多個資料。各該資料具有一關鍵字串與一通用字串。分類模組則依據通用字串將該些資料分類,以生成一資料表單,並儲存於該儲存模組。其中,該關鍵字串與該通用字串排列於該資料表單,且該關鍵字串與該通用字串有對應關係。標準化模組用於對該資料表單進行標準化。翻譯模組則對標準化後的該資料表單進行翻譯,使該資料表單包含一第二語言。To achieve the above object, the present invention provides an information processing device including a control module, a storage module, a message retrieval module, a classification module, a standardized module, and a translation module. The control module is used for receiving a processing instruction. The processing instruction includes the names of individuals and or business entities. The names of individuals and / or business entities are presented in a first language. The storage module stores an external form. The external form has at least one coded data and at least one associated message corresponding to each of the coded data. The message retrieval module is linked to at least one data source according to the processing instruction, and retrieves multiple data about individuals or business entities. Each of the data has a keyword string and a universal string. The classification module classifies the data according to the universal string to generate a data form and stores the data form in the storage module. The keyword string and the universal string are arranged in the data form, and the keyword string and the universal string have a corresponding relationship. The standardization module is used to standardize the data sheet. The translation module translates the standardized data sheet so that the data sheet includes a second language.

在本發明之一實施例中,該翻譯模組對該資料表單進行翻譯後的該第二語言資訊與原本的該第一語言資訊同時存在該資料表單,而該資料表單包含一系譜樹之所有訊息。In one embodiment of the present invention, the second language information translated by the translation module to the data form and the original first language information exist in the data form at the same time, and the data form includes all of a genealogical tree. message.

在本發明之一實施例中,該系譜樹之所有訊息包括該企業實體的總部、其所有或控股的任何子公司以及任何次級位置或分支機構的的名稱、編碼、成立時間、營業狀況、行業別、股本、組成、負責人、地址、郵件地址、網址、電話號碼、傳真號碼、法律訊息或產業訊息。In one embodiment of the present invention, all the information of the genealogical tree includes the name, coding, establishment time, business status of the corporate entity's headquarters, any subsidiaries owned or controlled by it, and any secondary locations or branches, Industry category, share capital, composition, person in charge, address, email address, website address, phone number, fax number, legal information or industry information.

在本發明之一實施例中,該控制模組會定期地對該系譜樹進行關聯更新。In one embodiment of the present invention, the control module periodically updates the pedigree tree.

在本發明之一實施例中,該資料表單中的通用字串包括個人和或企業實體的名稱、編碼、成立時間、營業狀況、行業別、股本、組成、負責人、地址、郵件地址、網址、電話號碼、傳真號碼、法律訊息或產業訊息,該關鍵字串為該通用字串前方或該通用字串後方的數字或是文字。In one embodiment of the present invention, the common strings in the information form include names and codes of individuals and or business entities, establishment time, business status, industry category, share capital, composition, person in charge, address, email address, and website address. , Phone number, fax number, legal message, or industry message, the keyword string is the number or text in front of the universal string or behind the universal string.

在本發明之一實施例中,訊息處理裝置更包括一比對模組,比對該資料表單的該第二語言資訊與位於該儲存模組的該外部表單,並於比對後產生一訊息。In an embodiment of the present invention, the message processing device further includes a comparison module, which compares the second language information of the data form with the external form located in the storage module, and generates a message after the comparison. .

在本發明之一實施例中,該編碼資料是資料通用編碼系統(DUNS)編碼。In one embodiment of the present invention, the encoded data is a data universal encoding system (DUNS) encoding.

在本發明之一實施例中,該分類模組係切割該關鍵字串與該通用字串,而該關鍵字串為該通用字串前或後的數字或是文字,該關聯訊息是該個人或企業實體的商業交易訊息。In one embodiment of the present invention, the classification module cuts the keyword string and the universal string, and the keyword string is a number or text before or after the universal string, and the associated message is the individual Or business transaction information for a corporate entity.

在本發明之一實施例中,該通用字串包括國、市、區、村、里、鄰、路、段、巷、弄、號或樓。In one embodiment of the present invention, the universal string includes a country, a city, a district, a village, a neighborhood, a neighborhood, a road, a section, an alley, a lane, a house, or a building.

在本發明之一實施例中,訊息處理裝置,更包括一切割模組,用以切割該關鍵字串與該通用字串。In one embodiment of the present invention, the information processing device further includes a cutting module for cutting the keyword string and the universal character string.

為了讓上述目的、技術特徵以及實際實施後之增益性更為明顯易懂,於下文中將係以較佳之實施範例輔佐對應相關之圖式來進行更詳細之說明。In order to make the above-mentioned purpose, technical characteristics, and gain after actual implementation more obvious and easy to understand, the following will be explained in more detail with better implementation examples supplemented by corresponding diagrams.

為更清楚瞭解本發明之特徵、內容與優點及其所能達成之功效,茲將本發明配合附圖,並以實施例之表達形式詳細說明如下,而其中所使用之圖式,其主旨僅為示意及輔助說明書之用,未必為本發明實施後之真實比例與精準配置,故不應就所附之圖式的比例與配置關係解讀、侷限本發明於實際實施上的權利範圍。In order to better understand the features, contents and advantages of the present invention and the effects that can be achieved, the present invention is described in detail with the accompanying drawings in the form of embodiments, and the schematics used therein are only For the purpose of illustration and supplementary description, the actual scale and precise configuration after the implementation of the present invention may not necessarily be interpreted and limited to the scope of rights of the present invention in actual implementation based on the relationship between the proportions and configuration of the attached drawings.

本發明之優點、特徵以及達到之技術方法將參照例示性實施例及所附圖式進行更詳細地描述而更容易理解,且本發明或可以不同形式來實現,故不應被理解僅限於此處所陳述的實施例,相反地,對所屬技術領域具有通常知識者而言,所提供的實施例將使本揭露更加透徹與全面且完整地傳達本發明的範疇,且本發明將僅為所附加的申請專利範圍所定義。The advantages, features, and technical methods of the present invention will be described in more detail with reference to the exemplary embodiments and the accompanying drawings for easier understanding, and the present invention may be implemented in different forms, so it should not be understood to be limited to this The embodiments described herein, on the contrary, to those having ordinary knowledge in the technical field, the embodiments provided will make this disclosure more thoroughly and comprehensively and completely convey the scope of the invention, and the invention will only be appended As defined by the scope of patent applications.

圖1繪示本發明一實施例的訊息處理裝置的方塊圖。請參考圖1,本實施例的訊息處理裝置100,其適於自外部收集相關商業、產業、法律訴訟等重大訊息,並可將該些訊息彙整後提出一準確、完整、及時的訊息資訊,而相關企業可以依據本發明之訊息處理裝置100所提出的訊息資訊做出對應的準確、完整、及時商業決策,進而獲得市場認同以及成就。FIG. 1 is a block diagram of an information processing apparatus according to an embodiment of the present invention. Please refer to FIG. 1. The information processing device 100 of this embodiment is suitable for collecting relevant important business, industry, legal proceedings and other important information from the outside, and can consolidate these information to present an accurate, complete and timely information message. Relevant companies can make corresponding accurate, complete, and timely business decisions based on the information provided by the message processing device 100 of the present invention, thereby gaining market recognition and success.

詳細地說,本實施例的訊息處理裝置100主要包括一控制模組110、一儲存模組120、一訊息擷取模組130、一分類模組140、一標準化模組150及一翻譯模組160。在本實施例中,控制模組110例如是本實施例的訊息處理裝置100的作業核心,其用於接收一處理指令,以進行相關的訊息處理作業。進一步地說,處理指令例如是包括個人或是企業實體的名稱,進而讓訊息處理裝置100可以針對個人或是企業實體來作業。即,訊息處理裝置100可自外部收集個人或是企業實體的相關商業、產業、法律訴訟等重大訊息,並可將該些訊息彙整後提出一準確、完整、及時的訊息資訊。In detail, the information processing device 100 of this embodiment mainly includes a control module 110, a storage module 120, a message acquisition module 130, a classification module 140, a standardized module 150, and a translation module. 160. In this embodiment, the control module 110 is, for example, an operation core of the information processing apparatus 100 of this embodiment, and is configured to receive a processing instruction to perform related information processing operations. Further, the processing instruction includes, for example, the name of an individual or an enterprise entity, so that the information processing apparatus 100 can operate on the individual or the enterprise entity. That is, the information processing device 100 can collect important commercial, industrial, and legal proceedings related to an individual or a corporate entity from the outside, and can consolidate these information to present accurate, complete, and timely information.

此外,訊息擷取模組130則可依據控制模組110所接收的處理指令來與至少一個資料來源連結,並擷取關於個人或企業實體的多個資料。在本實施例中,由於本實施例的訊息處理裝置100係針對個人或是企業實體的相關商業、產業、法律訴訟等重大訊息進行收集,所以訊息擷取模組130所連結資料來源例如是與商業、產業、法律相關的網站。舉例來說,資料來源可以是財政部、經濟部、法務部、內政部、外交部等官方網站,而所擷取的訊息例如是可公開的公司登記、工商股東、財務稅務資料、進出口資料、戶政資料、入出境資料等訊息。當然,訊息擷取模組130所連結資料來源也可以是搜尋網站或是例如是人力銀行的私人公司網站,本發明在此並不限制資料來源的類型。上述由訊息擷取模組130所擷取的訊息例如可以儲存於儲存模組120。In addition, the message retrieval module 130 may be connected to at least one data source according to the processing instruction received by the control module 110 and retrieve multiple data about individuals or business entities. In this embodiment, since the information processing device 100 of this embodiment collects important information such as related business, industry, and lawsuits of an individual or an enterprise entity, the data source linked to the information extraction module 130 is, for example, related to Business, industry, and law related websites. For example, the data source can be official websites such as the Ministry of Finance, the Ministry of Economy, the Ministry of Justice, the Ministry of the Interior, the Ministry of Foreign Affairs, and the information retrieved is, for example, publicly available company registration, business shareholders, financial and tax information, and import and export data , Household administration information, entry and exit information, etc. Of course, the data source linked by the message extraction module 130 may also be a search website or a private company website such as a human bank, and the present invention does not limit the type of the data source here. The messages captured by the message capturing module 130 can be stored in the storage module 120, for example.

此外,由於訊息擷取模組130所擷取的訊息係包括了各種類型,因此本實施例是應用分類模組140來進行上述的訊息分類。在本實施例中,上述訊息擷取模組130所擷取的各筆資料可定義一關鍵字串與一通用字串,而本實施例可以依據通用字串將這些資料分類,以生成一資料表單,並儲存於該儲存模組120。其中,本實施例的通用字串例如可以包括「個人和或企業實體」的名稱、編碼、成立時間、營業狀況、行業別、股本、組成、負責人、地址、郵件地址、網址、電話號碼、傳真號碼、法律訊息或產業訊息,而關鍵字串例如為通用字串前方或通用字串後方的數字或是文字。In addition, since the messages captured by the message capturing module 130 include various types, this embodiment uses the classification module 140 to perform the above-mentioned message classification. In this embodiment, each piece of data captured by the above-mentioned message extraction module 130 can define a keyword string and a universal string, and this embodiment can classify these data based on the universal string to generate a data The form is stored in the storage module 120. The universal string in this embodiment may include, for example, the name, code, establishment time, business status, industry category, share capital, composition, person in charge, address, email address, web address, telephone number, A fax number, legal message, or industry message, and the keyword string is, for example, a number or text before or after the universal string.

舉例來說,訊息擷取模組130可以自相關的商業網站、搜尋網站或是財稅網站中擷取有關該「個人或企業實體」的「地址」訊息,並將字串為「地址」的文字列於資料表單中。同時,位於通用字串「地址」後方的關鍵字串可對應地一併列於資料表單。須考量的是,無論個人或企業實體,都可能有多筆設立的「地址」訊息,而分類模組140即是依據通用字串「地址」的類別將其彙整於至少一欄位中。同時,會有至少一筆屬於關鍵字串的「地址」訊息描述。圖2A與圖2B即繪示本發明一實施例的訊息處理裝置的工作示意圖。如圖2A與圖2B所示,關鍵字串的「地址」的描述可以是「A1市B1區C1路D1號」以及「E1市F1區G1路H1號」。在其他較佳實施例中,通用字串也可以是「負責人」,而關鍵字串即為「負責人」後面的名字。在本實施例中,若一間設立公司在不同的資料來源中有不同的「負責人」,本實施例的訊息處理裝置100也可以自多種資料來源將其彙整出,並列於資料表單中。For example, the message extraction module 130 can retrieve the "address" information about the "person or business entity" from the relevant business website, search website, or financial and tax website, and use the word "address" as the text. Listed in a data sheet. At the same time, the keyword strings behind the universal string "address" can be correspondingly listed in the data form. It must be considered that no matter whether an individual or a corporate entity, there may be multiple "address" messages, and the classification module 140 aggregates them in at least one field according to the category of the universal string "address". At the same time, there will be at least one description of the "address" message belonging to the keyword string. FIG. 2A and FIG. 2B are schematic diagrams illustrating the operation of an information processing device according to an embodiment of the present invention. As shown in FIG. 2A and FIG. 2B, the description of the "address" of the keyword string may be "No. D1, C1 Road, B1 District, A1 City" and "No. H1, G1 Road, F1 District, E1 City". In other preferred embodiments, the universal string can also be "responsible person", and the keyword string is the name after "responsible person". In this embodiment, if an established company has different "persons in charge" in different data sources, the information processing device 100 of this embodiment can also integrate them from multiple data sources and list them in a data form.

再進一步地說,由圖2A可知,通用字串也可以更細部地是地址中的「國」、「市」、「區」、「村」、「里」、「鄰」、「路」、「段」、「巷」、「弄」、「號」或「樓」,而關鍵字串即為「國」、「市」、「區」、「村」、「里」、「鄰」、「路」、「段」、「巷」、「弄」、「號」或「樓」前面的字串或是數字。關於通用字串的類型,本發明在此亦不做任何限制。其中,上述關鍵字串與通用字串例如亦可以藉由分類模組140來切割或區隔。當然,在一較佳實施例中,訊息處理裝置100亦可額外包括一切割模組170,用以切割關鍵字串與通用字串。Furthermore, as can be seen from FIG. 2A, the universal string can also be more detailed in the address of "country", "city", "district", "village", "li", "neighbor", "road", "Duan", "Alley", "Alley", "Number" or "Floor", and the keyword strings are "Country", "City", "District", "Village", "Li", "Neighbor", The string or number in front of "Road", "Segment", "Alley", "Alley", "Number" or "Floor". Regarding the type of the universal string, the present invention is not limited in any way. The keyword string and the general character string can also be cut or separated by the classification module 140, for example. Of course, in a preferred embodiment, the information processing device 100 may further include a cutting module 170 for cutting a keyword string and a universal word string.

另外,本實施例的標準化模組150可用於對資料表單進行標準化。以通用字串「地址」為例,在不同的資料來源中,關鍵字串可能是數字,也可能是文字。例如,可能以「111」號表示,亦可能以可能以「一一一」號表示。因此,本實施例的標準化模組150可以將不同型態的表示方式以標準的方式來呈現。例如,均以阿拉伯數字的方式來表示。In addition, the standardization module 150 of this embodiment may be used to standardize the data sheet. Take the universal string "address" as an example. In different data sources, the keyword string may be a number or a text. For example, it may be represented by the number "111" or it may be represented by the "one-to-one" number. Therefore, the standardization module 150 of this embodiment can present different types of representations in a standard manner. For example, they are all represented in Arabic numerals.

在本實施例中,於進行本地端的企業服務時,個人和或企業實體的名稱例如是以一第一語言呈現,且訊息擷取模組130於本地端所擷取的「地址」資料亦例如是以第一語言呈現。第一語言例如是中文。因此,在與國際接軌的過程中,為能讓訊息上能與國際同步,並進行對應的比較或交流,本實施例可以透過翻譯模組160來對標準化後的資料表單進行翻譯,以使資料表單包含以一第二語言來呈現的資訊。第二語言例如是英文。In this embodiment, when performing local enterprise services, the names of individuals and / or business entities are presented in a first language, for example, and the "address" data retrieved by the message extraction module 130 on the local side is also, for example, Is presented in the first language. The first language is, for example, Chinese. Therefore, in the process of being in line with international standards, in order to enable the information to be synchronized with the international and to make corresponding comparisons or exchanges, in this embodiment, the standardized data form can be translated through the translation module 160 in order to make the data The form contains information presented in a second language. The second language is, for example, English.

因此,在翻譯模組160對資料表單進行翻譯後的第二語言資訊與原本的該第一語言資訊即可同時存在資料表單。如圖2B所示,第一語言所呈現的第一筆地址「A1市B1區C1路D1號」經翻譯模組160翻譯後即以第二語言「No.D2, C2 Rd., B2 District, A2 City」來呈現。其中,數字D1例如是等於D2。相同的,第一語言所呈現的第二筆地址「E1市F1區G1路H1號」經翻譯模組160翻譯後即以第二語言「No.H2, G2 Rd., F2 District, E2 City」來呈現。其中,數字H1例如是等於H2。其中,翻譯模組160的翻譯依據例如可以依據外交部翻譯界面、郵政地址翻譯界面等政府或標準單位。Therefore, the second language information after the data form is translated by the translation module 160 and the original first language information may exist in the data form at the same time. As shown in FIG. 2B, the first address “No. D1, C1 Road, B1 District, A1 City” in the first language is translated by the translation module 160, and then the second language is “No.D2, C2 Rd., B2 District, A2 City ". The number D1 is, for example, equal to D2. Similarly, the second address "H1, G1 Road, F1 District, E1 City" presented in the first language is translated by the translation module 160, and then the second language is "No.H2, G2 Rd., F2 District, E2 City" To render. The number H1 is, for example, equal to H2. The translation basis of the translation module 160 may be, for example, a government or standard unit such as the translation interface of the Ministry of Foreign Affairs and the translation address of a postal address.

由圖2A與圖2B可知,本實施例可以透過訊息擷取模組130自相關網站擷取多筆「個人或企業實體」的「地址」訊息,並藉由分類模組140或是切割模組170來將「地址」訊息中的通用字串和關鍵字串區隔(如圖2A所示)。後續,再藉由翻譯模組160來將多筆「地址」訊息由第一語言翻譯為第二語言,並進而再整合為一完整的「地址」字串(如圖2B所示),以達到與國際接軌的目的。As can be seen from FIG. 2A and FIG. 2B, in this embodiment, multiple “personal or corporate entity” “address” messages can be retrieved from the relevant website through the message retrieval module 130, and the classification module 140 or the cutting module can be used. 170 to separate the general string and the keyword string in the "address" message (as shown in FIG. 2A). Subsequently, the translation module 160 is used to translate multiple "address" messages from the first language into the second language, and then integrate them into a complete "address" string (as shown in Figure 2B) to achieve The purpose of being in line with international standards.

特別的是,為讓資料表單有較佳的資訊完整度,本實施例的資料表單例如包含一系譜樹之所有訊息。該系譜樹之所有訊息包括處理指令所提及之「企業實體」的總部、其所有或控股的任何子公司以及任何次級位置或分支機構的的名稱、編碼、成立時間、營業狀況、行業別、股本、組成、負責人、地址、郵件地址、網址、電話號碼、傳真號碼、法律訊息或產業訊息。換言之,當處理指令所提及之內容為一特定「企業實體」時,本實施例的訊息處理裝置100可依上述的實施說明建立一包括該特定「企業實體」的所有相關經歷的資料表單。當然,在一較佳實施例中,控制模組110可以依據設定來定期地對系譜樹進行關聯更新,進而讓本實施例的訊息處理裝置100所生成的資料表單有準確、完整、及時的訊息資訊。In particular, in order to make the data form have better information integrity, the data form of this embodiment contains, for example, all the information of a genealogical tree. All information of the genealogical tree includes the name, number, establishment time, business status, industry type of the headquarters of the `` enterprise entity '' mentioned in the processing instruction, any subsidiaries owned or controlled by it, and any secondary locations or branches , Share capital, composition, person in charge, address, email address, website address, telephone number, fax number, legal information or industrial information. In other words, when the content mentioned in the processing instruction is a specific "enterprise entity", the information processing device 100 of this embodiment may create a data form including all relevant experiences of the specific "enterprise entity" according to the above-mentioned implementation description. Of course, in a preferred embodiment, the control module 110 may periodically update the genealogical tree according to the settings, so that the data form generated by the information processing device 100 of this embodiment has accurate, complete, and timely information. Information.

值得一提的是,本實施例的儲存模組120還儲存有一外部表單,而訊息處理裝置100更可包括一比對模組180。其中,外部表單具有至少一編碼資料以及對應各編碼資料的至少一關聯訊息。在本實施例中,編碼資料例如是資料通用編碼系統(DUNS)編碼,而關聯訊息例如是個人或企業實體的商業交易訊息。因此,本實施例的比對模組180可以比對資料表單的第二語言資訊與位於儲存模組120的外部表單,並於比對後產生一訊息。舉例來說,外部表單例如是國際上相關商業犯罪名單,該名單上是以資料通用編碼系統(DUNS)進行編碼,依據該編碼可得到對應的商業犯罪訊息。如此一來,本實施例的訊息處理裝置100可藉由本地端所擷取的相關資訊(資料表單)來與國際資訊(外部表單)比對,更能有效達到與國際接軌的目的。其中,上述通用編碼系統(DUNS)編碼亦可於本地端的「企業實體」的公司編碼連結,以建立完整的資訊資料庫。上述「企業實體」的公司編碼可以是統一編號。It is worth mentioning that the storage module 120 in this embodiment further stores an external form, and the information processing device 100 may further include a comparison module 180. The external form has at least one coded data and at least one associated message corresponding to each coded data. In this embodiment, the coded data is, for example, a Data Universal Coding System (DUNS) code, and the associated information is, for example, a business transaction message of an individual or an enterprise entity. Therefore, the comparison module 180 in this embodiment can compare the second language information of the data form with the external form located in the storage module 120 and generate a message after the comparison. For example, the external form is, for example, a list of relevant international commercial crimes. The list is encoded by the Universal Data Encoding System (DUNS), and corresponding commercial crime information can be obtained according to the code. In this way, the information processing device 100 of this embodiment can compare with the international information (external form) by using the relevant information (data form) retrieved from the local end, which can more effectively achieve the purpose of integrating with the international. Among them, the above-mentioned universal coding system (DUNS) coding can also be linked with the company coding of the "enterprise entity" on the local side to establish a complete information database. The company code of the "enterprise entity" may be a uniform number.

當然,上述比對模組180於比對後所產生的訊息亦可一併地彙整於資料表單中,以一併地提供予需求單位,進而讓需求單位接收準確、完整、及時的訊息資訊,而做出對應的準確、完整、及時商業決策,進而獲得市場認同以及成就。在另一較佳實施例中,上述比對模組180於比對後所產生的訊息亦可為一獨立報告,而作為需求單位的參考資訊。Of course, the information generated by the above-mentioned comparison module 180 after the comparison can also be integrated in the data form and provided to the demand unit together, so that the demand unit can receive accurate, complete and timely information. And make corresponding accurate, complete and timely business decisions, and then gain market recognition and achievements. In another preferred embodiment, the information generated by the above-mentioned comparison module 180 after the comparison may also be an independent report, which is used as reference information for the demand unit.

圖3繪示本發明一實施例的訊息處理方法的流程圖。請再參考圖3,本實施例的訊息處理方法包括下列步驟。首先,如步驟S310所示,提供一控制模組,接收一處理指令,該處理指令包括個人和或企業實體的名稱,其中個人和或企業實體的名稱是以一第一語言呈現。接著,如步驟S320所示,提供一儲存模組,儲存有一外部表單,該外部表單具有至少一編碼資料以及對應該編碼資料的至少一關聯訊息。接著,如步驟S330所示,提供一訊息擷取模組,依據該處理指令與至少一個資料來源連結,並擷取關於個人或企業實體的多個資料,各該資料具有一關鍵字串與一通用字串。FIG. 3 is a flowchart of an information processing method according to an embodiment of the invention. Please refer to FIG. 3 again. The message processing method of this embodiment includes the following steps. First, as shown in step S310, a control module is provided to receive a processing instruction. The processing instruction includes names of individuals and / or business entities. The names of individuals and / or business entities are presented in a first language. Next, as shown in step S320, a storage module is provided to store an external form, the external form having at least one encoded data and at least one associated message corresponding to the encoded data. Then, as shown in step S330, a message retrieval module is provided, which is connected to at least one data source according to the processing instruction, and retrieves multiple data about individuals or business entities, each of which has a keyword string and a Universal string.

在步驟S330之後,接著如步驟S340所示,提供一分類模組,依據通用字串將該些資料分類,以生成一資料表單,並儲存於該儲存模組,其中該關鍵字串與該通用字串排列於該資料表單,且該關鍵字串與該通用字串有對應關係。接著,如步驟S350所示,提供一標準化模組,標準化對該資料表單。之後,如步驟S360所示,提供一翻譯模組,對標準化後的該資料表單進行翻譯,使該資料表單包含一第二語言。在一較佳實施例中,在步驟S360之後,還可進行一步驟S370,提供一比對模組,比對該資料表單的該第二語言資訊與位於該儲存模組的該外部表單,並於比對後產生一訊息。After step S330, as shown in step S340, a classification module is provided to classify the data according to a universal string to generate a data form and store it in the storage module, where the keyword string and the universal The strings are arranged in the data form, and the keyword string has a corresponding relationship with the universal string. Next, as shown in step S350, a standardized module is provided to standardize the data sheet. After that, as shown in step S360, a translation module is provided to translate the standardized data sheet so that the data sheet includes a second language. In a preferred embodiment, after step S360, a step S370 may be performed to provide a comparison module to compare the second language information of the data form with the external form located in the storage module, and A message is generated after the comparison.

綜上所述,本發明的訊息處理裝置可以經由外部收集相關商業、產業、法律訴訟等重大訊息,並可將該些訊息彙整後提出一準確、完整、及時的訊息資訊,而相關企業可以依據本發明之訊息處理裝置所提出的訊息資訊做出對應的準確、完整、及時商業決策,進而獲得市場認同以及成就。To sum up, the information processing device of the present invention can collect relevant important business, industry, legal proceedings and other important information through the outside, and can aggregate these information to provide an accurate, complete and timely information, and the relevant enterprises can The message information provided by the message processing device of the present invention makes corresponding accurate, complete, and timely business decisions, thereby gaining market recognition and achievement.

此外,本發明的訊息處理裝置亦可以與國際接軌,以與國際相關商業訊息連結,以獲得更完整的訊息。如此一來,更是大幅降低了國際商業操作的風險。In addition, the information processing device of the present invention can also be in line with international standards, so as to link with international related commercial information to obtain more complete information. As a result, the risks of international business operations have been greatly reduced.

以上所述之實施例僅係為說明本發明之技術思想及特點,其目的在使熟習此項技藝之人士能夠瞭解本創作之內容並據以實施,當不能以之限定本發明之專利範圍,即凡依本發明所揭示之精神所作之均等變化或修飾,仍應涵蓋在本發明之專利範圍內。The above-mentioned embodiments are only for explaining the technical ideas and characteristics of the present invention. The purpose is to enable those skilled in the art to understand the content of this creation and implement it accordingly. When the scope of the patent of the present invention cannot be limited, That is, any equivalent change or modification made according to the spirit disclosed in the present invention should still be covered by the patent scope of the present invention.

100:訊息處理裝置 110:控制模組 120:儲存模組 130:訊息擷取模組 140:分類模組 150:標準化模組 160:翻譯模組 170:切割模組 180:比對模組100: message processing device 110: control module 120: storage module 130: message retrieval module 140: classification module 150: standardized module 160: translation module 170: cutting module 180: comparison module

圖1繪示本發明一實施例的訊息處理裝置的方塊圖。 圖2A與圖2B繪示本發明一實施例的訊息處理裝置的工作示意圖。 圖3繪示本發明一實施例的訊息處理方法的流程圖。FIG. 1 is a block diagram of an information processing apparatus according to an embodiment of the present invention. FIG. 2A and FIG. 2B are schematic diagrams illustrating an operation of an information processing apparatus according to an embodiment of the present invention. FIG. 3 is a flowchart of an information processing method according to an embodiment of the invention.

Claims (9)

一種訊息處理裝置,包括:一控制模組,接收一處理指令,該處理指令包括個人和或企業實體的名稱,其中個人和或企業實體的名稱是以一第一語言呈現;一儲存模組,儲存有一外部表單,該外部表單具有至少一編碼資料以及對應各該編碼資料的至少一關聯訊息;一訊息擷取模組,依據該處理指令與至少一個資料來源連結,並擷取關於個人或企業實體的多個資料,各該資料具有一關鍵字串與一通用字串;一分類模組,依據通用字串將該些資料分類,以生成一資料表單,並儲存於該儲存模組,其中該關鍵字串與該通用字串排列於該資料表單,且該關鍵字串與該通用字串有對應關係;一標準化模組,對該資料表單進行標準化;一翻譯模組,對標準化後的該資料表單進行翻譯,使該資料表單包含一第二語言;以及一比對模組,比對該資料表單的該第二語言資訊與位於該儲存模組的該外部表單,並於比對後產生一訊息。An information processing device includes: a control module that receives a processing instruction, the processing instruction includes names of individuals and or business entities, wherein the names of individuals and or business entities are presented in a first language; a storage module, An external form is stored, the external form has at least one coded data and at least one associated message corresponding to each of the coded data; an information extraction module, which is linked to at least one data source according to the processing instruction, and retrieves information about individuals or businesses Multiple pieces of data in the entity, each of which has a keyword string and a universal string; a classification module that sorts the data based on the universal string to generate a data form and store it in the storage module, where The keyword string and the universal string are arranged in the data form, and the keyword string and the universal string have a corresponding relationship; a standardization module to standardize the data form; a translation module to standardize the Translate the data sheet so that the data sheet includes a second language; and a comparison module that compares the second language of the data sheet Located outside of the form information and the storage module, and to generate a message than the latter pair. 如申請專利範圍第1項所述之訊息處理裝置,其中該翻譯模組對該資料表單進行翻譯後的該第二語言資訊與原本的該第一語言資訊同時存在該資料表單,而該資料表單包含一系譜樹之所有訊息。The information processing device according to item 1 of the scope of patent application, wherein the second language information translated by the translation module to the data form and the original first language information exist in the data form at the same time, and the data form Contains all the information of a genealogical tree. 如申請專利範圍第2項所述之訊息處理裝置,其中該系譜樹之所有訊息包括該企業實體的總部、其所有或控股的任何子公司以及任何次級位置或分支機構的的名稱、編碼、成立時間、營業狀況、行業別、股本、組成、負責人、地址、郵件地址、網址、電話號碼、傳真號碼、法律訊息或產業訊息。The information processing device as described in item 2 of the scope of patent application, wherein all the information of the genealogical tree includes the name, code, the headquarters of the enterprise entity, any subsidiaries owned or controlled by it, and any sub-locations or branches, Time of establishment, business status, industry type, share capital, composition, person in charge, address, email address, website address, telephone number, fax number, legal information or industry information. 如申請專利範圍第2項所述之訊息處理裝置,其中該控制模組會定期地對該系譜樹進行關聯更新。The information processing device described in item 2 of the scope of patent application, wherein the control module periodically updates the genealogical tree. 如申請專利範圍第1項所述之訊息處理裝置,其中該資料表單中的通用字串包括個人和或企業實體的名稱、編碼、成立時間、營業狀況、行業別、股本、組成、負責人、地址、郵件地址、網址、電話號碼、傳真號碼、法律訊息或產業訊息,該關鍵字串為該通用字串前方或該通用字串後方的數字或是文字。The information processing device described in item 1 of the scope of patent application, wherein the common string in the data form includes the name, code, establishment time, business status, industry category, share capital, composition, person in charge, Address, mail address, web address, telephone number, fax number, legal information or industry information, the keyword string is the number or text in front of the universal string or behind the universal string. 如申請專利範圍第1項所述之訊息處理裝置,其中該編碼資料是資料通用編碼系統(DUNS)編碼。The information processing device according to item 1 of the scope of patent application, wherein the encoded data is a data universal encoding system (DUNS) encoding. 如申請專利範圍第1項所述之訊息處理裝置,其中該分類模組係切割該關鍵字串與該通用字串,而該關鍵字串為該通用字串前或後的數字或是文字,該關聯訊息是該個人或企業實體的商業交易訊息。The information processing device according to item 1 of the scope of patent application, wherein the classification module cuts the keyword string and the universal string, and the keyword string is a number or text before or after the universal string, The associated message is a commercial transaction message for the individual or business entity. 如申請專利範圍第7項所述之訊息處理裝置,其中該通用字串包括國、市、區、村、里、鄰、路、段、巷、弄、號或樓。The information processing device according to item 7 of the scope of patent application, wherein the universal string includes a country, a city, a district, a village, a neighborhood, a neighborhood, a road, a section, an alley, a lane, a house, or a building. 如申請專利範圍第1項所述之訊息處理裝置,更包括一切割模組,用以切割該關鍵字串與該通用字串。The information processing device described in item 1 of the scope of patent application, further includes a cutting module for cutting the keyword string and the universal character string.
TW106123390A 2017-07-12 2017-07-12 Message processing device TWI659369B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
TW106123390A TWI659369B (en) 2017-07-12 2017-07-12 Message processing device
CN201810750467.6A CN109254963A (en) 2017-07-12 2018-07-10 Message processing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW106123390A TWI659369B (en) 2017-07-12 2017-07-12 Message processing device

Publications (2)

Publication Number Publication Date
TW201909043A TW201909043A (en) 2019-03-01
TWI659369B true TWI659369B (en) 2019-05-11

Family

ID=65051601

Family Applications (1)

Application Number Title Priority Date Filing Date
TW106123390A TWI659369B (en) 2017-07-12 2017-07-12 Message processing device

Country Status (2)

Country Link
CN (1) CN109254963A (en)
TW (1) TWI659369B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112286986B (en) * 2020-10-14 2021-08-03 北京乾唐伟业科技股份公司 Property right analysis management system based on enterprise genealogy

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201142628A (en) * 2010-05-20 2011-12-01 Greenflower Intercode Holding B V Method and system for compiling a unique sample code for specific web content
US20130103669A1 (en) * 2010-09-27 2013-04-25 International Business Machine Corporation Search Engine Indexing
CN104376406A (en) * 2014-11-05 2015-02-25 上海计算机软件技术开发中心 Enterprise innovation resource management and analysis system and method based on big data
CN106484855A (en) * 2016-09-30 2017-03-08 广州特道信息科技有限公司 A kind of big data concerning taxes intelligence analysis system

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7788293B2 (en) * 2005-03-02 2010-08-31 Google Inc. Generating structured information
US20080065694A1 (en) * 2006-09-08 2008-03-13 Google Inc. Local Search Using Address Completion
WO2013113135A1 (en) * 2012-02-03 2013-08-08 Google Inc. Translating application resources
CN104615792A (en) * 2015-03-12 2015-05-13 浪潮集团有限公司 One household type querying and showing method for enterprise internet data
TWI590177B (en) * 2015-12-04 2017-07-01 鴻達積股份有限公司 Store messages integration system and method using the same
CN106682145A (en) * 2016-12-22 2017-05-17 北京览群智数据科技有限责任公司 Enterprise information processing method, server and client

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201142628A (en) * 2010-05-20 2011-12-01 Greenflower Intercode Holding B V Method and system for compiling a unique sample code for specific web content
US20130103669A1 (en) * 2010-09-27 2013-04-25 International Business Machine Corporation Search Engine Indexing
CN104376406A (en) * 2014-11-05 2015-02-25 上海计算机软件技术开发中心 Enterprise innovation resource management and analysis system and method based on big data
CN106484855A (en) * 2016-09-30 2017-03-08 广州特道信息科技有限公司 A kind of big data concerning taxes intelligence analysis system

Also Published As

Publication number Publication date
CN109254963A (en) 2019-01-22
TW201909043A (en) 2019-03-01

Similar Documents

Publication Publication Date Title
He et al. Constructing a Chinese patent database of listed firms in China: Descriptions, lessons, and insights
US7636719B2 (en) Contact schema
CN106372798B (en) A kind of customized contract generation method of user based on risk and system
CN112131275A (en) Enterprise portrait construction method of holographic city big data model and knowledge graph
CN110597870A (en) Enterprise relation mining method
US20020156817A1 (en) System and method for extracting information
CN105045847B (en) A kind of method that Chinese institutional units title is extracted from text message
GB2496120A (en) Analysis of emails using a hidden Markov model to recognize sections of the email, e.g. header, body, signature block and disclaimer
CN107967313A (en) A kind of method for merging different industries data based on field data and coordinate general character
CN109508458A (en) The recognition methods of legal entity and device
Brookes Labour as a transnational actor: Alliances, activism and the protection of labour rights in the Philippines and Pakistan
CN112650858A (en) Method and device for acquiring emergency assistance information, computer equipment and medium
TWI659369B (en) Message processing device
CN110880142A (en) Risk entity acquisition method and device
KR20190139037A (en) Text mining based construction document analysis method and system
CN106095754B (en) A kind of medical terminology dictionary part-of-speech tagging method
CN110825919A (en) ID data processing method and device
JP5253447B2 (en) Risk information generation system and program
CN109388648B (en) Method for extracting personnel information and relation person from electronic record
TWM549904U (en) Message processing device
CN113407734B (en) Method for constructing knowledge graph system based on real-time big data
US11681966B2 (en) Systems and methods for enhanced risk identification based on textual analysis
CN110766091B (en) Method and system for identifying trepanning loan group partner
CN112910923A (en) Intelligent financial big data processing system
Lu et al. A method of SNS topic models extraction based on self-adaptively LDA modeling