TWI718543B - Data exchange platform based on text exploration and method of using it - Google Patents

Data exchange platform based on text exploration and method of using it Download PDF

Info

Publication number
TWI718543B
TWI718543B TW108118233A TW108118233A TWI718543B TW I718543 B TWI718543 B TW I718543B TW 108118233 A TW108118233 A TW 108118233A TW 108118233 A TW108118233 A TW 108118233A TW I718543 B TWI718543 B TW I718543B
Authority
TW
Taiwan
Prior art keywords
information
data
module
data exchange
person
Prior art date
Application number
TW108118233A
Other languages
Chinese (zh)
Other versions
TW202004523A (en
Inventor
吳俊逸
Original Assignee
吳俊逸
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 吳俊逸 filed Critical 吳俊逸
Publication of TW202004523A publication Critical patent/TW202004523A/en
Application granted granted Critical
Publication of TWI718543B publication Critical patent/TWI718543B/en

Links

Images

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

本發明提供一種基於文字探勘的資料交換平台,其特徵在於該資料交換平台包括一資訊蒐集模組、一文字探勘模組、一資料儲存模組、一資料交換模組、以及一資料集模組;本發明亦提供一種利用該資料交換平台進行資料交換的方法,用戶端將個人資訊上傳至該資料交換平台後,該資料交換平台通過文字探勘以集成用戶端的個人資訊,使供應服務或商品的供應商得以通過此資料交換平台取得用戶端的資料,用戶則可透過個人資訊的交換取得供應商的服務或商品。The present invention provides a data exchange platform based on text exploration, which is characterized in that the data exchange platform includes an information collection module, a text exploration module, a data storage module, a data exchange module, and a data collection module; The present invention also provides a method for data exchange using the data exchange platform. After the user terminal uploads personal information to the data exchange platform, the data exchange platform integrates the personal information of the user terminal through text exploration, so as to provide services or goods. Businesses can obtain client data through this data exchange platform, and users can obtain supplier services or products through the exchange of personal information.

Description

基於文字探勘的資料交換平台及利用其的方法Data exchange platform based on text exploration and method of using it

本發明涉及一種資料交換平台,尤指一種基於文字探勘(Text Mining)的資料交換平台以及利用其進行資料交換的方法,俾使用戶通過此平台將其個人資訊與不同領域的資料進行交換。The invention relates to a data exchange platform, in particular to a data exchange platform based on text mining and a method for data exchange using it, so that users can exchange their personal information with data in different fields through this platform.

目前的資料交換平台大多以金融相關數據蒐集為主流,通過取得用戶的銀行帳戶及花費支出等資料並進行統整,或者進一步透過分析計算以對用戶的財務管理提出建議。Most of the current data exchange platforms focus on financial-related data collection, by obtaining users' bank accounts and expenditures and other data and integrating them, or further analyzing and calculating to make suggestions for users' financial management.

而智慧型手機,智慧家庭,智能車,穿戴式裝置與帳單無紙化的普及對於這類資料交換平台應用程式的開發有推波助瀾的效果。然而,目前常見的應用程式僅能仰賴用戶將其每筆花費輸入載具中,方能對該些財務資料進行統計整理,在大多數的情況中,手動輸入容易出現錯誤、或者漸漸地怠於將每筆資料輸入,使得資料庫的完整性不佳,造成統計內容的失真與缺乏資料可追溯性。The popularity of smart phones, smart homes, smart cars, wearable devices, and paperless billing have contributed to the development of such data exchange platform applications. However, the current common applications can only rely on the user to input each of their expenses into the vehicle in order to collect statistics on the financial data. In most cases, manual input is prone to errors or gradually neglecting Entering each piece of data makes the completeness of the database poor, resulting in distortion of statistical content and lack of data traceability.

為了解決上述問題,譬如美國發明專利公開號US20130211986A1提出一種個人金融整合系統和方法。上述系統可集合來自一個或多個信貸來源而獲得信用報告或其他的財務數據,並且從該些信用報告或財務數據中獲得財務帳務訊息。該系統可進一步將上述的財務帳務訊息格式化或翻譯為可與用戶的個人財務軟體相容的格式並將該財務帳務訊息傳遞給用戶的個人財務軟體。如此一來,用戶可以在付出最少努力的前提下,將與其相關聯的多個帳務訊息添加到用戶的個人財務軟體中。In order to solve the above problems, for example, the US Invention Patent Publication No. US20130211986A1 proposes a personal financial integration system and method. The above-mentioned system can collect credit reports or other financial data from one or more credit sources, and obtain financial accounting information from these credit reports or financial data. The system can further format or translate the above-mentioned financial accounting information into a format compatible with the user's personal financial software and deliver the financial accounting information to the user's personal financial software. In this way, the user can add multiple accounting information associated with it to the user's personal financial software with the least effort.

另一個問題是,諸如這類資料交換平台的資料庫內容較為單一,因此僅能在相同或相似領域的公司資料之間進行資料交換,譬如將個人財務數據或信用報告與銀行或貸款機構進行交換。然而,當數據時代到來,將面臨資料與其資料庫應用在與資料庫來源不同的領域的可能性、資料交換平台的應用性拓展等,應成為未來資料交換平台的開發上重要的思考議題。Another problem is that the database content of such data exchange platforms is relatively single, so data can only be exchanged between company data in the same or similar fields, such as personal financial data or credit reports with banks or lending institutions. . However, when the data age comes, it will face the possibility of data and its database application in areas different from the source of the database, the application expansion of the data exchange platform, etc., which should become an important consideration in the development of the future data exchange platform.

本發明的目的之一,在於解決習知資料交換平台僅能通過用戶手動輸入資訊而建置資料庫,卻容易因為輸入錯誤、或者漸漸地怠於輸入每筆資料等人為因素使資料庫的完整性不佳,統計內容也有一定程度的失真的缺點。One of the objectives of the present invention is to solve the problem that the conventional data exchange platform can only build a database by manually inputting information by the user, but it is easy to make the database complete due to human factors such as input errors or gradually neglecting to input each data. The performance is not good, and the statistical content also has the disadvantage of a certain degree of distortion.

本發明的另一目的,在於解決習知資料交換平台資訊內容較為單一而不夠多元的缺點。Another purpose of the present invention is to solve the shortcomings that the information content of the conventional data exchange platform is relatively single and not diversified.

本發明的又一目的,在於解決習知進行資料交換時,因為資料內容單一性高,大多僅限於在相同或相似領域的公司之間進行資料交換,使資料無法被靈活應用的缺點。Another purpose of the present invention is to solve the disadvantage of conventional data exchange, which is limited to data exchange between companies in the same or similar fields due to the high unity of data content, which prevents the data from being flexibly used.

為了達到上述目的,本發明提供一種基於文字探勘(Text Mining)的資料交換平台,包括:一資訊蒐集模組、一文字探勘模組、一資料儲存模組、一資料交換模組、以及一資料集模組。該資訊蒐集模組接收來自一用戶端的複數個人資訊;該文字探勘模組連接該資訊蒐集模組,取得該些個人資訊並判斷每一該些個人資訊的檔案格式後,根據該些檔案格式分別輸出一相對應的擷取內容;該資料儲存模組連接該文字探勘模組以取得、分類並儲存該些擷取內容,其中,任意兩筆的該些擷取內容具有不同的時間點、不同的資訊內容、或其組合;該資料交換模組接收來自一供應商的請求而發出一請求資料交換訊息;且該資料集模組係連接該資料儲存模組,該資料集模組響應該請求資料交換訊息後,由與該資料集模組連接的該資料儲存模組取得對應該請求資料交換訊息之該些擷取內容以生成一資料交易紀錄。In order to achieve the above objectives, the present invention provides a data exchange platform based on text mining, including: an information collection module, a text mining module, a data storage module, a data exchange module, and a data collection Module. The information collection module receives a plurality of personal information from a client; the text exploration module connects to the information collection module, obtains the personal information and determines the file format of each personal information, and then separates them according to the file formats Output a corresponding captured content; the data storage module is connected to the text exploration module to obtain, classify and store the captured content, wherein any two of the captured content have different time points and differences The information content of, or a combination thereof; the data exchange module receives a request from a supplier and sends a request data exchange message; and the data set module is connected to the data storage module, and the data set module responds to the request After the data exchange message, the data storage module connected to the data set module obtains the captured content corresponding to the requested data exchange message to generate a data transaction record.

本發明還提供一種基於文字探勘的資料交換平台進行資料交換之方法,包括:The present invention also provides a method for data exchange on a data exchange platform based on text exploration, which includes:

蒐集,透過一資訊蒐集模組,蒐集來自一用戶端之複數個人資訊;Collect, collect plural personal information from a client through an information collection module;

擷取,透過一文字探勘模組,該文字探勘模組連接該資訊蒐集模組以取得該些個人資訊,並判斷每一該些個人資訊的檔案格式後,根據該些檔案格式分別輸出一相對應的擷取內容;Retrieval, through a text exploration module, the text exploration module is connected to the information collection module to obtain the personal information, and after judging the file format of each of the personal information, output a corresponding one according to the file format Extracted content of;

儲存(repository),透過一資料儲存模組,該資料儲存模組連接該文字探勘模組以取得、分類並儲存該些擷取內容,其中,任意兩筆的該些擷取內容具有不同的時間點、不同的資訊內容、或其組合;Storage (repository), through a data storage module, the data storage module is connected to the text exploration module to obtain, classify and store the captured content, wherein any two of the captured content have different times Points, different information content, or a combination thereof;

交換,透過一資料交換模組,該資料交換模組接收來自一供應商的請求而發出一請求資料交換訊息;以及Exchange through a data exchange module that receives a request from a supplier and sends out a request data exchange message; and

匯出,透過一資料集模組,該資料集模組係連接該資料儲存模組,該資料集模組響應該請求資料交換訊息後,由與該資料集模組連接的該資料儲存模組取得對應該請求資料交換訊息之該些擷取內容以生成一資料交易紀錄。Export through a data set module that is connected to the data storage module. After the data set module responds to the request data exchange message, the data storage module connected to the data set module Obtain the extracted content corresponding to the requested data exchange message to generate a data transaction record.

故相較於習知技術,本發明所能達到的功效在於:Therefore, compared with the conventional technology, the effects that the present invention can achieve are:

(1) 本發明的資料交換平台包括文字探勘模組,用戶同意將個人資訊上傳後,經由文字探勘模組判斷上傳的個人資訊之檔案格式,並根據檔案格式分別輸出相對應之擷取內容而儲存於資料儲存模組中,相較於僅能通過用戶手動輸入資訊而獲得相關資訊的習知資料交換平台而言,本發明可獲得一錯誤率低且完整性高的個人資料庫。(1) The data exchange platform of the present invention includes a text exploration module. After the user agrees to upload personal information, the text exploration module determines the file format of the uploaded personal information, and outputs the corresponding extracted content according to the file format. Stored in the data storage module, compared to a conventional data exchange platform that can only obtain relevant information by manually inputting information by the user, the present invention can obtain a personal data database with a low error rate and high integrity.

(2) 本發明的資料交換平台資訊涵蓋廣泛,除了用戶的銀行收支或信用卡消費等金融相關資訊外,尚包括就醫資訊、用電資訊、用水資訊、用瓦斯資訊、網路使用紀錄、收視紀錄,甚至是經由如穿戴式裝置所獲取的生理資訊等,提供一完善的個人可追溯資料庫可予在不同的行業之間進行資料交換,譬如,電力公司可在通過本發明的資料交換平台,經用戶同意後,取得用戶的異質資訊(就醫/用水/用瓦斯/網路/收視等),生成資料交易紀錄後,原產品或服務業者就該產品或服務提供相對應之優惠,例如兩個月的用電折扣交換三個月的網路使用紀錄。(2) The data exchange platform information of the present invention covers a wide range of information, in addition to financial-related information such as user's bank receipts and payments or credit card consumption, it also includes medical information, electricity usage information, water usage information, gas usage information, Internet usage records, and ratings Records, even physiological information obtained through wearable devices, etc., provide a complete personal traceability database for data exchange between different industries. For example, power companies can use the data exchange platform of the present invention After the user’s consent is obtained, the user’s heterogeneous information (medical treatment/water use/gas use/internet/viewing, etc.) is obtained, and after the data transaction record is generated, the original product or service provider provides corresponding discounts for the product or service, such as two The monthly electricity discount is exchanged for the three-month internet usage record.

涉及本發明的詳細說明及技術內容,現就配合圖式及具體實施例說明如下:Involving the detailed description and technical content of the present invention, the following descriptions are provided in conjunction with the drawings and specific embodiments:

『圖1』為本發明一實施例的一種基於文字探勘的資料交換平台,該資料交換平台主要包括一資訊蒐集模組10、一文字探勘模組20、一資料儲存模組30、一資料交換模組40、以及一資料集模組50。"Figure 1" is a data exchange platform based on text mining according to an embodiment of the present invention. The data exchange platform mainly includes an information collection module 10, a text mining module 20, a data storage module 30, and a data exchange module. Group 40, and a data set module 50.

該資訊蒐集模組10係用以接收一用戶端的個人資訊。換言之,單一用戶可在不同的時間點、持續地將其個人資訊上傳至該資料交換平台,使個人資訊具有連續性與完整性,該些個人資訊將持續地被該資訊蒐集模組10接收。The information collection module 10 is used to receive personal information from a client. In other words, a single user can continuously upload his personal information to the data exchange platform at different time points, so that the personal information has continuity and integrity, and the personal information will be continuously received by the information collection module 10.

本實施例中,該個人資訊舉例可為一銀行收支資訊、一銀行存款貸款紀錄資訊、一信用卡消費資訊、一用電資訊、一用水資訊、一用瓦斯資訊、一個人家中數位電視資訊、一個人遊戲平台數位資訊、一個人數位音樂資訊、一生活習慣資訊、一駕駛資訊、一車險資訊、一就醫資訊、一勞保資訊、一健保資訊、一生理資訊、一貸款資料、一個人股票交易資訊、一手機使用資訊、一投保資訊、一生理數據量測資訊、一個人履歷資訊、或上述的任意組合。In this embodiment, examples of the personal information can be a bank's income and expenditure information, a bank deposit and loan record information, a credit card consumption information, an electricity usage information, a water usage information, a gas usage information, a person's digital TV information, a person Game platform digital information, one person’s music information, one lifestyle information, one driving information, one car insurance information, one medical treatment information, one labor insurance information, one health insurance information, one physiological information, one loan information, one person’s stock trading information, one mobile phone Usage information, an insurance application information, a physiological data measurement information, a person's history information, or any combination of the above.

該個人資訊還可來自於一由互聯設備所提供的資訊,譬如,許多穿戴式裝置可擷取用戶的健康狀態,如心跳、睡眠情況、走路步數等生理資訊;又譬如,目前有些如冷暖氣、插頭等智慧家電具有儲存開關機時間、記錄用戶的使用高峰期、平均每日/每月使用時數等功能;又或者,智慧電視可記錄使用者愛好頻道、愛好的節目種類、以及平均每日/每月收看時數,反映用戶生活習慣;再或者,用戶在開車時的資訊,如平均速度、每月公里數、維修數據等駕駛資訊;甚至用戶就醫的就醫資料、健保資料等,亦涵蓋在本實施例的「個人資訊」的範疇中。The personal information can also come from information provided by an interconnected device. For example, many wearable devices can capture the user's health status, such as heartbeat, sleep status, walking steps, and other physiological information; for example, some of them are currently cold and warm. Smart home appliances such as gas, plugs, etc. have the functions of storing switch time, recording the user’s peak usage period, average daily/monthly usage hours, etc.; or, smart TV can record the user’s favorite channels, favorite program types, and average Daily/monthly viewing hours reflect the user’s living habits; or, the user’s information while driving, such as average speed, monthly kilometers, maintenance data and other driving information; even the user’s medical information, health insurance information, etc., It is also included in the category of "personal information" in this embodiment.

該文字探勘模組20連接該資訊蒐集模組10而取得該些個人資訊,可在一一判斷該些個人資訊的檔案格式後,執行一文字探勘技術,輸出一相對應的擷取內容。適用於本實施例的檔案格式舉例可為純文本TXT格式、HTML格式、RTF格式、WORD格式、CSV格式、或PDF格式,但不僅限於此。The text exploration module 20 is connected to the information collection module 10 to obtain the personal information. After determining the file format of the personal information one by one, it can execute a text exploration technique to output a corresponding captured content. Examples of file formats applicable to this embodiment can be plain text TXT format, HTML format, RTF format, WORD format, CSV format, or PDF format, but are not limited to this.

在本發明中,該文字探勘模組20接收該些個人資訊後係對該些個人資訊操作一文字探勘技術。舉例來說,該文字探勘模組20判斷該些個人資訊中的各項書面特徵,例如文字、數字、顏色、表格數量、欄列數量等,並且進一步針對該規格書的內容的關鍵字進行擷取,使得該擷取文件中包括但不限於上述所列舉的資訊的至少一種被擷取。In the present invention, the text mining module 20 receives the personal information and operates a text mining technology on the personal information. For example, the text mining module 20 determines various written features in the personal information, such as text, numbers, colors, number of tables, number of columns, etc., and further extracts keywords in the content of the specification. So that at least one of the above-listed information in the retrieved file is retrieved.

該文字探勘模組20可通過一內建辭典、至少一資料格式範本以及一預設的斷詞法則來該判斷該些個人資訊的數字與文字,該資料格式範本的非限制性實例包括各家信用卡帳單格式、電力帳單與數據格式、智慧家電數據格式、健保卡數據格式、或個人保單數據格式等。The text exploration module 20 can determine the numbers and texts of the personal information through a built-in dictionary, at least one data format template, and a predetermined word hyphenation rule. Non-limiting examples of the data format template include various Credit card bill format, electricity bill and data format, smart home appliance data format, health insurance card data format, or personal insurance policy data format, etc.

該資料儲存模組30連接該文字探勘模組20後,取得該些擷取內容,並根據一規則將該些擷取內容進行分類並儲存。該「規則」可為該資料交換平台預設的規則,此時,該規則的依據舉例可為關鍵詞、同義詞等;除此之外,該「規則」亦可為該用戶於該資料交換平台上自行選擇設定的規則,譬如設定並分類為「健康狀態」、「交通工具」、「網路用量」、「日常生活」等,根據其判斷將該些擷取內容加以分類。本實施例中,該資料儲存模組30所儲存的任意兩筆的該些擷取內容可具有不同的時間點、不同的資訊內容、或其組合。After the data storage module 30 is connected to the text exploration module 20, the captured content is obtained, and the captured content is classified and stored according to a rule. The "rule" can be a preset rule of the data exchange platform. In this case, examples of the basis for the rule can be keywords, synonyms, etc.; in addition, the "rule" can also be used by the user on the data exchange platform The rules you choose to set on the Internet, such as setting and categorizing it into "health status", "transportation", "network usage", "daily life", etc., and classify the captured content based on its judgment. In this embodiment, any two pieces of the captured content stored in the data storage module 30 may have different time points, different information content, or a combination thereof.

該資料交換模組40分別連接該資料儲存模組30以及該資料集模組50。當該資料交換模組40接收來自一供應商的請求後,發出一請求資料交換訊息予該資料集模組50。本實施例中,該供應商可為一供應服務或一供應商品的機構,譬如金融機構、廣告業者、百貨公司、車險公司、信用卡發卡機構、或不動產商,且該供應商在發出該請求時可附帶描述所欲取得的資料。The data exchange module 40 is connected to the data storage module 30 and the data collection module 50 respectively. After the data exchange module 40 receives a request from a supplier, it sends a request data exchange message to the data collection module 50. In this embodiment, the supplier may be an organization that supplies services or supplies goods, such as financial institutions, advertising agencies, department stores, auto insurance companies, credit card issuers, or real estate companies, and the supplier sends the request when making the request. It can be accompanied by a description of the information to be obtained.

該資料集模組50響應該請求資料交換訊息,由與該資料集模組50連接的該資料儲存模組30取得對應該請求資料交換訊息的該些擷取內容後,生成一資料交易紀錄予該資料交換模組40。本實施例中,該資料集模組50可被配置使該供應商得以經該資料交換平台訪問並取得該資料交易紀錄。The data set module 50 responds to the request data exchange message, and after the data storage module 30 connected to the data set module 50 obtains the captured content corresponding to the requested data exchange message, a data transaction record is generated for The data exchange module 40. In this embodiment, the data collection module 50 can be configured so that the supplier can access through the data exchange platform and obtain the data transaction record.

請續參考『圖2』,本發明一實施例可通過上述的資料交換平台,提供一種進行資料交換的方法,包括:Please continue to refer to "Figure 2". An embodiment of the present invention can provide a method for data exchange through the above-mentioned data exchange platform, including:

步驟(S1):蒐集(collection),透過該資訊蒐集模組10以蒐集來自一用戶端的個人資訊,本實施例中,該個人資訊可包括以下任一或組合:一銀行收支資訊、一銀行存款貸款紀錄資訊、一信用卡消費資訊、一用電資訊、一用水資訊、一用瓦斯資訊、一個人家中數位電視資訊、一個人遊戲平台數位資訊、一個人數位音樂資訊、一生活習慣資訊、一駕駛資訊、一車險資訊、一就醫資訊、一勞保資訊、一健保資訊、一生理資訊、一貸款資料、一個人股票交易資訊、一手機使用資訊、一投保資訊、一生理數據量測資訊或一個人履歷資訊。Step (S1): collection. The information collection module 10 is used to collect personal information from a client. In this embodiment, the personal information may include any one or a combination of the following: a bank's income and expenditure information, a bank Deposit and loan record information, one credit card consumption information, one electricity usage information, one water usage information, one gas usage information, one person’s home digital TV information, one person’s digital TV information, one person’s game platform digital information, one person’s music information, one lifestyle information, one driving information, One car insurance information, one medical treatment information, one labor insurance information, one health insurance information, one physiological information, one loan information, one person’s stock transaction information, one mobile phone usage information, one insurance information, one physiological data measurement information, or one person’s biographical information.

步驟(S2):擷取(extraction),透過該文字探勘模組20,該文字探勘模組20連接該資訊蒐集模組10以取得該些個人資訊,並判斷每一該些個人資訊的檔案格式後,根據該些檔案格式分別輸出一相對應的擷取內容。其中,該文字探勘模組20可包括一內建辭典、至少一資料格式範本以及一預設的斷詞法則而判斷出該些個人資訊的數字與文字。關於上述「擷取」,請搭配參考『圖3』,係以一信用卡帳單作為例子說明。該文字探勘模組20取得該信用卡帳單後,若為第一次輸入該信用卡帳單的格式,將會由該文字探勘模組20的該內建辭典進行一詞彙解析程序,並建構出該資料格式範本;若該信用卡帳單的格式已輸入過,即,該資料格式範本已被建構,則由內建的該至少一資料格式範本選擇與該信用卡帳單對應的該資料格式範本。隨後,從該信用卡帳單中挑選出至少一特徵文字(步驟S21),該特徵文字舉例可為包括姓名、數字、及其他預設的特定詞彙(譬如deposits、withdrawals)的組合,此步驟稱為實體標示(entity identification)(步驟S22)。接下來,根據該預設的斷詞法則,進一步對該些特徵文字進行標籤進而分類為,譬如,姓名、金額、收入、及支出,此步驟亦稱為實體解析(entity resolution) (步驟S23)。最後再將經過實體解析的資訊重新組合整理為一結構化訊息(步驟S24)。Step (S2): extract, through the text exploration module 20, the text exploration module 20 is connected to the information collection module 10 to obtain the personal information, and determine the file format of each of the personal information Afterwards, a corresponding captured content is output according to the file formats. Wherein, the text exploration module 20 may include a built-in dictionary, at least one data format template, and a predetermined word hyphenation rule to determine the numbers and texts of the personal information. Regarding the above "retrieving", please refer to "Figure 3", taking a credit card bill as an example. After the text exploration module 20 obtains the credit card statement, if the format of the credit card statement is entered for the first time, the built-in dictionary of the text exploration module 20 will perform a vocabulary analysis process and construct the Data format template; if the format of the credit card bill has been entered, that is, the data format template has been constructed, the data format template corresponding to the credit card bill is selected from the at least one built-in data format template. Subsequently, at least one characteristic character is selected from the credit card bill (step S21). The characteristic character may be a combination of names, numbers, and other preset specific words (such as deposits, withdrawals). This step is called Entity identification (step S22). Next, according to the preset rule of word segmentation, the characteristic characters are further labeled and classified into, for example, name, amount, income, and expenditure. This step is also called entity resolution (step S23) . Finally, the entity-analyzed information is rearranged into a structured message (step S24).

步驟(S3):儲存(repository),透過一資料儲存模組30,該資料儲存模組30連接該文字探勘模組20以取得、分類並儲存該些擷取內容,其中,任意兩筆的該些擷取內容具有不同的時間點、不同的資訊內容、或其組合。Step (S3): storage (repository), through a data storage module 30, the data storage module 30 is connected to the text exploration module 20 to obtain, classify and store the captured content, where any two of the Some captured content has different time points, different information content, or a combination thereof.

步驟(S4):交換(exchange),透過一資料交換模組40,該資料交換模組40接收來自一供應商的請求而發出一請求資料交換訊息。其中,該供應商係為一供應服務或一供應商品的機構。Step (S4): Exchange, through a data exchange module 40, which receives a request from a supplier and sends a request data exchange message. Among them, the supplier is an organization that supplies services or supplies goods.

步驟(S5):匯出(export),透過一資料集模組50,該資料集模組50係連接該資料儲存模組30,該資料集模組50響應該請求資料交換訊息後,由與該資料集模組50連接的該資料儲存模組30取得對應該請求資料交換訊息的該些擷取內容以生成一資料交易紀錄。本實施例中,該資料集模組50可被配置使該供應商得以經該資料交換平台訪問並取得該資料交易紀錄。Step (S5): Export, through a data set module 50, the data set module 50 is connected to the data storage module 30, the data set module 50 responds to the request data exchange message, and then The data storage module 30 connected to the data collection module 50 obtains the extracted content corresponding to the requested data exchange message to generate a data transaction record. In this embodiment, the data collection module 50 can be configured so that the supplier can access through the data exchange platform and obtain the data transaction record.

以一車險公司為例,為了要了解用戶的駕車習慣及健康資訊,該車險公司可透過該資料交換模組40發出一請求資料交換訊息,希望取得用戶的健保資料、以及諸如心跳、脈搏、血壓等生理資訊,亦希望獲取用戶駕駛時的平均速度、維修數據等由汽車所提供的個人資訊。Take a car insurance company as an example. In order to understand the user’s driving habits and health information, the car insurance company can send a request data exchange message through the data exchange module 40, hoping to obtain the user’s health insurance data, as well as information such as heartbeat, pulse, and blood pressure. It also hopes to obtain personal information provided by the car, such as the average speed of the user while driving, and maintenance data.

當用戶同意提供上述個人資訊後,該請求資料交換訊息傳送至連接該資料儲存模組30的該資料集模組50後,該資料集模組50響應該請求資料交換訊息並由與該資料集模組50連接的該資料儲存模組30取得對應該請求資料交換訊息的該些擷取內容,從而生成一資料交易紀錄(Data Transaction Log)予該資料交換模組40。如此一來,該供應商得以經該資料交換平台訪問並取得該資料交易紀錄,該供應商將提供相對應之優惠,例如一年的車險折扣交換兩個月的生理資訊或駕駛習慣。After the user agrees to provide the above-mentioned personal information, the request data exchange message is sent to the data set module 50 connected to the data storage module 30, and the data set module 50 responds to the request data exchange message and exchanges information with the data set. The data storage module 30 connected to the module 50 obtains the extracted content corresponding to the requested data exchange message, thereby generating a data transaction log (Data Transaction Log) for the data exchange module 40. In this way, the supplier can access and obtain the data transaction record through the data exchange platform, and the supplier will provide corresponding discounts, such as a one-year car insurance discount for two months of physiological information or driving habits.

以上已將本發明做一詳細說明,惟以上所述者,僅爲本發明的一較佳實施例而已,當不能限定本發明實施的範圍。即凡依本發明申請範圍所作的均等變化與修飾等,皆應仍屬本發明的專利涵蓋範圍內。The present invention has been described in detail above, but what is described above is only a preferred embodiment of the present invention, and should not limit the scope of implementation of the present invention. That is to say, all equal changes and modifications made according to the scope of application of the present invention should still fall within the scope of the patent of the present invention.

10‧‧‧資訊蒐集模組 20‧‧‧文字探勘模組 30‧‧‧資料儲存模組 40‧‧‧資料交換模組 50‧‧‧資料集模組10‧‧‧Information Collection Module 20‧‧‧Text Exploration Module 30‧‧‧Data Storage Module 40‧‧‧Data Exchange Module 50‧‧‧Data Set Module

『圖1』,為本發明一實施例的資料交換平台的方塊圖。 『圖2』,為本發明一實施例的資料交換方法的流程圖。 『圖3』,為本發明一實施例的資料交換方法的「擷取」示意圖。"Figure 1" is a block diagram of a data exchange platform according to an embodiment of the present invention. "Figure 2" is a flowchart of a data exchange method according to an embodiment of the present invention. "Figure 3" is a schematic diagram of "capture" of a data exchange method according to an embodiment of the present invention.

10‧‧‧資訊蒐集模組 10‧‧‧Information Collection Module

20‧‧‧文字探勘模組 20‧‧‧Text Exploration Module

30‧‧‧資料儲存模組 30‧‧‧Data Storage Module

40‧‧‧資料交換模組 40‧‧‧Data Exchange Module

50‧‧‧資料集模組 50‧‧‧Data Set Module

Claims (10)

一種基於文字探勘的資料交換平台,包括: 一資訊蒐集模組,該資訊蒐集模組接收來自一用戶端的複數個人資訊; 一文字探勘模組,該文字探勘模組連接該資訊蒐集模組,取得該些個人資訊並判斷每一該些個人資訊的檔案格式後,根據該些檔案格式分別輸出一相對應的擷取內容; 一資料儲存模組,該資料儲存模組連接該文字探勘模組以取得、分類並儲存該些擷取內容,其中,任意兩筆的該些擷取內容具有不同的時間點、不同的資訊內容、或其組合; 一資料交換模組,該資料交換模組接收來自一供應商的請求而發出一請求資料交換訊息;以及 一資料集模組,該資料集模組係連接該資料儲存模組,該資料集模組響應該請求資料交換訊息後,由與該資料集模組連接的該資料儲存模組取得對應該請求資料交換訊息的該些擷取內容以生成一資料交易紀錄。A data exchange platform based on text exploration, including: An information collection module, the information collection module receives plural personal information from a client; A text exploration module, the text exploration module is connected to the information collection module, obtains the personal information and determines the file format of each personal information, and outputs a corresponding extracted content according to the file format; A data storage module, the data storage module is connected to the text exploration module to obtain, classify and store the captured content, wherein any two of the captured content have different time points and different information content , Or a combination thereof; A data exchange module which receives a request from a supplier and sends out a request data exchange message; and A data set module connected to the data storage module. After the data set module responds to the request data exchange message, the data storage module connected to the data set module obtains the corresponding request The captured content of the data exchange message is used to generate a data transaction record. 如申請專利範圍第1項所述的資料交換平台,其中該個人資訊擇自於由一銀行收支資訊、一銀行存款貸款紀錄資訊、一信用卡消費資訊、一用電資訊、一用水資訊、一用瓦斯資訊、一個人家中數位電視資訊、一個人遊戲平台數位資訊、一個人數位音樂資訊、一生活習慣資訊、一駕駛資訊、一車險資訊、一就醫資訊、一勞保資訊、一健保資訊、一生理資訊、一貸款資料、一個人股票交易資訊、一手機使用資訊、一投保資訊、一生理數據量測資訊、一個人履歷資訊及上述組合所組成的群組。For example, in the data exchange platform described in item 1 of the scope of patent application, the personal information is selected from a bank's income and expenditure information, a bank deposit and loan record information, a credit card consumption information, an electricity usage information, a water usage information, and a Use gas information, a person’s home digital TV information, a person’s game platform digital information, a person’s music information, a lifestyle information, a driving information, a car insurance information, a medical treatment information, a labor insurance information, a health insurance information, a physiological information, A group consisting of a loan information, a person's stock transaction information, a mobile phone usage information, an insurance application information, a physiological data measurement information, a person's biographical information, and the above combination. 如申請專利範圍第1項所述的資料交換平台,其中該文字探勘模組包括一內建辭典、至少一資料格式範本以及一預設的斷詞法則。For example, in the data exchange platform described in item 1 of the scope of patent application, the text exploration module includes a built-in dictionary, at least one data format template, and a predetermined word hyphenation rule. 如申請專利範圍第1項所述的資料交換平台,其中該資料集模組被配置使該供應商得以經該資料交換平台訪問該資料交易紀錄。For example, the data exchange platform described in item 1 of the scope of patent application, wherein the data set module is configured to enable the supplier to access the data transaction record through the data exchange platform. 如申請專利範圍第1項所述的資料交換平台,其中該供應商係為一供應服務或一供應商品的機構。For example, in the data exchange platform described in item 1 of the scope of patent application, the supplier is an organization that supplies services or supplies goods. 一種基於文字探勘的資料交換平台進行資料交換的方法,包括: 蒐集,透過一資訊蒐集模組,蒐集來自一用戶端的複數個人資訊; 擷取,透過一文字探勘模組,該文字探勘模組連接該資訊蒐集模組以取得該些個人資訊,並判斷每一該些個人資訊的檔案格式後,根據該些檔案格式分別輸出一相對應的擷取內容; 儲存,透過一資料儲存模組,該資料儲存模組連接該文字探勘模組以取得、分類並儲存該些擷取內容,其中,任意兩筆的該些擷取內容具有不同的時間點、不同的資訊內容、或其組合; 交換,透過一資料交換模組,該資料交換模組接收來自一供應商的請求而發出一請求資料交換訊息;以及 匯出,透過一資料集模組,該資料集模組係連接該資料儲存模組,該資料集模組響應該請求資料交換訊息後,由與該資料集模組連接的該資料儲存模組取得對應該請求資料交換訊息的該些擷取內容以生成一資料交易紀錄。A method for data exchange on a data exchange platform based on text exploration, including: Collect, collect plural personal information from a client through an information collection module; Retrieval, through a text exploration module, the text exploration module is connected to the information collection module to obtain the personal information, and after judging the file format of each of the personal information, output a corresponding one according to the file format Extracted content of; Storage, through a data storage module, the data storage module is connected to the text exploration module to obtain, classify and store the captured content, where any two of the captured content have different time points and differences The information content of, or a combination thereof; Exchange through a data exchange module that receives a request from a supplier and sends out a request data exchange message; and Export through a data set module that is connected to the data storage module. After the data set module responds to the request data exchange message, the data storage module connected to the data set module Obtain the extracted content corresponding to the requested data exchange message to generate a data transaction record. 如申請專利範圍第6項所述的方法,其中,該個人資訊包括一銀行收支資訊、一銀行存款貸款紀錄資訊、一信用卡消費資訊、一用電資訊、一用水資訊、一用瓦斯資訊、一個人家中數位電視資訊、一個人遊戲平台數位資訊、一個人數位音樂資訊、一生活習慣資訊、一駕駛資訊、一車險資訊、一就醫資訊、一勞保資訊、一健保資訊、一生理資訊、一貸款資料、一個人股票交易資訊、一手機使用資訊、一投保資訊、一生理數據量測資訊、一個人履歷資訊、或上述的任意組合。For example, the method described in item 6 of the scope of patent application, wherein the personal information includes a bank's income and expenditure information, a bank deposit and loan record information, a credit card consumption information, an electricity usage information, a water usage information, a gas usage information, One person’s home digital TV information, one person’s game platform digital information, one person’s music information, one lifestyle information, one driving information, one car insurance information, one medical treatment information, one labor insurance information, one health insurance information, one physiological information, one loan information, A person’s stock trading information, a mobile phone usage information, an insurance application information, a physiological data measurement information, a person’s biographical information, or any combination of the above. 如申請專利範圍第6項所述的方法,其中,該文字探勘模組包括一內建辭典至少一資料格式範本、以及一預設的斷詞法則。Such as the method described in item 6 of the scope of patent application, wherein the text exploration module includes a built-in dictionary at least one data format template, and a predetermined word hyphenation rule. 如申請專利範圍第6項所述的方法,其中,該資料集模組被配置使該供應商得以經該資料交換平台訪問該資料交易紀錄。Such as the method described in item 6 of the scope of patent application, wherein the data set module is configured to enable the supplier to access the data transaction record through the data exchange platform. 如申請專利範圍第6項所述的方法,其中,該供應商係為一供應服務或一供應商品的機構。Such as the method described in item 6 of the scope of patent application, wherein the supplier is an organization that supplies services or supplies goods.
TW108118233A 2018-05-28 2019-05-27 Data exchange platform based on text exploration and method of using it TWI718543B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810521810.X 2018-05-28
CN201810521810.XA CN110609894A (en) 2018-05-28 2018-05-28 Data exchange platform based on character mining and method for utilizing same

Publications (2)

Publication Number Publication Date
TW202004523A TW202004523A (en) 2020-01-16
TWI718543B true TWI718543B (en) 2021-02-11

Family

ID=68887556

Family Applications (1)

Application Number Title Priority Date Filing Date
TW108118233A TWI718543B (en) 2018-05-28 2019-05-27 Data exchange platform based on text exploration and method of using it

Country Status (2)

Country Link
CN (1) CN110609894A (en)
TW (1) TWI718543B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI780416B (en) * 2020-03-13 2022-10-11 兆豐國際商業銀行股份有限公司 Method and system for identifying transaction remarks

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201104484A (en) * 2009-07-17 2011-02-01 Gpsstar Taiwan Inc System and method for converting cross-platform information
CN102043982A (en) * 2009-10-13 2011-05-04 西尼卡那国际咨询(北京)有限公司 Citizen individual oriented electronic health record system
TW201510918A (en) * 2014-06-05 2015-03-16 Joiiup Technology Inc Information exchange system and method for interactive health record
US20160357932A1 (en) * 2010-09-29 2016-12-08 Humana Inc. System and method for analysis of distributed electronic medical record data to detect potential health concerns

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107423279B (en) * 2017-04-11 2021-01-15 美林数据技术股份有限公司 Information extraction and analysis method for financial credit short message
CN107203872B (en) * 2017-05-26 2020-06-02 山东省科学院情报研究所 Regional talent demand quantitative analysis method based on big data
CN107239892B (en) * 2017-05-26 2021-06-15 山东省科学院情报研究所 Regional talent supply and demand balance quantitative analysis method based on big data
TWM555996U (en) * 2017-10-30 2018-02-21 兆豐國際商業銀行股份有限公司 Data exchange system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201104484A (en) * 2009-07-17 2011-02-01 Gpsstar Taiwan Inc System and method for converting cross-platform information
CN102043982A (en) * 2009-10-13 2011-05-04 西尼卡那国际咨询(北京)有限公司 Citizen individual oriented electronic health record system
US20160357932A1 (en) * 2010-09-29 2016-12-08 Humana Inc. System and method for analysis of distributed electronic medical record data to detect potential health concerns
TW201510918A (en) * 2014-06-05 2015-03-16 Joiiup Technology Inc Information exchange system and method for interactive health record

Also Published As

Publication number Publication date
CN110609894A (en) 2019-12-24
TW202004523A (en) 2020-01-16

Similar Documents

Publication Publication Date Title
US11720615B2 (en) Self-executing protocol generation from natural language text
Brynjolfsson et al. Crowd-Squared
Pesaran et al. A bias‐adjusted LM test of error cross‐section independence
Kripfganz et al. Instrumental-variable estimation of large-T panel-data models with common factors
WO2019109918A1 (en) Abstract text generation method, computer readable storage medium and computer device
US20150161606A1 (en) Method and system for assessing financial condition of a merchant
Chapman et al. No global crisis of trust: A longitudinal and multinational examination of public trust in nonprofits
CN109584034A (en) Legal document generation method and system
Manoharan A three dimensional assessment of US county e-government
Potnis et al. Analysing slow growth of mobile money market in India using a market separation perspective
Zhu Adult children’s characteristics and intergenerational financial transfers in urban China
Utamachant et al. An analysis of high-value datasets: a case study of Thailand’s open government data
CN109871861B (en) System and method for providing coding for target data
Lanosga et al. Journalists, sources, and policy outcomes: insights from three-plus decades of investigative reporting contest entries
CN112116103A (en) Method, device and system for evaluating personal qualification based on federal learning and storage medium
CN112328868A (en) Credit evaluation and credit granting application system and method based on information data
CN114357020A (en) Service scene data extraction method and device, computer equipment and storage medium
TWI718543B (en) Data exchange platform based on text exploration and method of using it
Guo et al. Testing for moderate explosiveness
US20180053204A1 (en) Auto-population of discount information into an e-invoice
Shon et al. Demographic Heterogeneity, Political Ideology, and Nonprofit Dissolution
Brasher Listening to hearings: Legislative hearings and legislative outcomes
CN118153964A (en) Vendor enterprise risk assessment method and system based on big data technology
CN112102069A (en) Personal property mortgage loan information input analysis system
Arcos et al. A novel calibration estimator in social surveys