TWI776370B - Investment risk scoring method and system for fund commodities - Google Patents

Investment risk scoring method and system for fund commodities Download PDF

Info

Publication number
TWI776370B
TWI776370B TW110102713A TW110102713A TWI776370B TW I776370 B TWI776370 B TW I776370B TW 110102713 A TW110102713 A TW 110102713A TW 110102713 A TW110102713 A TW 110102713A TW I776370 B TWI776370 B TW I776370B
Authority
TW
Taiwan
Prior art keywords
risk
score
fund
data
scoring
Prior art date
Application number
TW110102713A
Other languages
Chinese (zh)
Other versions
TW202230232A (en
Inventor
謝信誠
Original Assignee
第一商業銀行股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 第一商業銀行股份有限公司 filed Critical 第一商業銀行股份有限公司
Priority to TW110102713A priority Critical patent/TWI776370B/en
Publication of TW202230232A publication Critical patent/TW202230232A/en
Application granted granted Critical
Publication of TWI776370B publication Critical patent/TWI776370B/en

Links

Images

Landscapes

  • Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)
  • Separation By Low-Temperature Treatments (AREA)

Abstract

一種基金商品的投資風險評分系統中,資料伺服器蒐集客戶與基金交易行為、基金損益紀錄、其他資產狀況、數位瀏覽行為和信用卡消費行為其中至少一者相關聯的歷史金融行為資料以及與個人基本資料、財務情況和投資經驗相關聯的參考風險屬性資料;評分伺服器利用機器學習演算法分析並演算多個由該資料伺服器所蒐集用於模型訓練的樣本資料集,以建立對應於目標參數的評分模型,將該歷史金融行為資料饋入該評分模型以獲得模型評分,且根據該模型評分及對應於該參考風險屬性資料的參考評分,產生該客戶對應於該目標參數的風險評分。In an investment risk scoring system for fund commodities, the data server collects the historical financial behavior data related to at least one of the customer's fund transaction behavior, fund profit and loss records, other asset status, digital browsing behavior and credit card consumption behavior, as well as personal basic information. Reference risk attribute data related to data, financial situation and investment experience; the scoring server uses machine learning algorithms to analyze and calculate a number of sample data sets collected by the data server for model training to establish parameters corresponding to the target The historical financial behavior data is fed into the scoring model to obtain a model score, and according to the model score and the reference score corresponding to the reference risk attribute data, the customer's risk score corresponding to the target parameter is generated.

Description

對於基金商品的投資風險評分方法及系統Investment risk scoring method and system for fund commodities

本發明是有關於投資風險評估,特別是指一種對於基金商品的投資風險評分方法及系統。 The present invention relates to investment risk assessment, in particular to an investment risk scoring method and system for fund commodities.

隨著投資資訊爆炸及投資理財商品的推陳出新,投資人的投資行為更趨多元化且更難以捉模,因此,若僅根據投資人如性別、年齡、年收入等現有基本資料來評估投資人,特別是對於基金商品的投資風險等級或偏好,以此方式所獲得的評估結果往往難以充分反映投資人對於基金商品的投資風險等級或偏好。 With the explosion of investment information and the introduction of new investment and wealth management products, the investment behavior of investors has become more diversified and more elusive. Therefore, if investors are only evaluated based on existing basic information such as gender, age, annual income, etc. Especially for the investment risk level or preference of fund commodities, the evaluation results obtained in this way are often difficult to fully reflect the investor's investment risk level or preference for fund commodities.

因此,為了有效推展如基金商品的金融業務,如何發想出一種能夠有效反映客戶對於基金商品的投資風險評分方式遂成為目前金融機構急需解決的議題之一。 Therefore, in order to effectively promote financial services such as fund commodities, how to develop a risk scoring method that can effectively reflect customers' investment in fund commodities has become one of the urgent issues that financial institutions need to solve.

因此,本發明的目的,即在提供一種對於基金商品的投資風險評分方法及系統,其能克服現有技術至少一個缺點。 Therefore, the purpose of the present invention is to provide an investment risk scoring method and system for fund commodities, which can overcome at least one disadvantage of the prior art.

於是,本發明所提供的一種對於基金商品的投資風險評 分方法用於對一客戶在投資基金商品時的風險評估並利用一電腦系統來執行,並包含以下步驟:(A)蒐集該客戶的歷史金融行為資料與參考風險屬性資料,以及多個用於模型訓練的樣本資料集,該歷史金融行為資料與基金交易行為、基金損益紀錄、其他資產狀況、數位瀏覽行為和信用卡消費行為其中至少一者相關聯,該參考風險屬性資料與個人基本資料、財務情況和投資經驗相關聯,每一樣本資料集包含多個分別對應於M個與一目標參數有關的特徵參數的參數值,該M個特徵參數與基金交易行為、基金損益紀錄、其他資產狀況、數位瀏覽行為和信用卡消費行為相關聯;(B)利用機器學習演算法分析並演算該等樣本資料集,以建立一對應於該目標參數的評分模型,其中該評分模型包含多個決策樹,每一決策樹是根據該M個特徵參數中隨機選取的m個特徵參數來分析由預定數量的樣本資料集構成的訓練資料集而訓練成,並可操作來回應於輸入資料而產生一分類結果;(C)將該歷史金融行為資料饋入該評分模型,並根據每一決策樹回應於該歷史金融行為資料所產生的分類結果估算出一模型評分;及(D)至少根據該模型評分、及對應於該參考風險屬性資料的參考評分,產生該客戶對應於該目標參數的風險評分。 Therefore, the present invention provides an investment risk assessment for fund commodities. The sub-method is used to evaluate a client's risk when investing in fund commodities and is implemented by a computer system, and includes the following steps: (A) collecting the client's historical financial behavior data and reference risk attribute data, and a plurality of A sample data set for model training. The historical financial behavior data is associated with at least one of fund transaction behavior, fund profit and loss records, other asset status, digital browsing behavior, and credit card consumption behavior. The reference risk attribute data is related to personal basic data, financial The situation is related to investment experience, and each sample data set contains a plurality of parameter values corresponding to M characteristic parameters related to a target parameter. The M characteristic parameters are related to fund transaction behavior, fund profit and loss records, other asset status, Digital browsing behavior and credit card consumption behavior are correlated; (B) use machine learning algorithms to analyze and calculate the sample data sets to establish a scoring model corresponding to the target parameter, wherein the scoring model includes a plurality of decision trees, each A decision tree is trained by analyzing a training data set consisting of a predetermined number of sample data sets according to m characteristic parameters randomly selected from the M characteristic parameters, and is operable to generate a classification result in response to the input data; (C) feeding the historical financial behavior data into the scoring model, and estimating a model score based on the classification results produced by each decision tree in response to the historical financial behavior data; and (D) at least based on the model score, and Corresponding to the reference score of the reference risk attribute data, a risk score of the client corresponding to the target parameter is generated.

本發明對於基金商品的投資風險評分方法,還包含以下步驟:(E)輸出對應於該客戶且含有該風險評分的風險評估結果。 The investment risk scoring method for fund commodities of the present invention further includes the following steps: (E) outputting a risk assessment result corresponding to the client and containing the risk score.

本發明對於基金商品的投資風險評分方法中,在步驟(B)中,該機器學習演算法包含一隨機森林演算法。 In the investment risk scoring method for fund commodities of the present invention, in step (B), the machine learning algorithm includes a random forest algorithm.

本發明對於基金商品的投資風險評分方法中,在步驟(B)中,該目標參數為配置於基金商品所佔的資產比例。 In the investment risk scoring method for fund commodities of the present invention, in step (B), the target parameter is the proportion of assets allocated to fund commodities.

本發明對於基金商品的投資風險評分方法中,在步驟(D)中,該電腦系統還根據分別指派給該參考評分與該模型評分的一預定第一權重及一預定第二權重來產生該風險評分,以使該風險評分=(該參考評分×該預定第一權重)+(該模型評分×該預定第二權重)。 In the investment risk scoring method for fund commodities of the present invention, in step (D), the computer system further generates the risk according to a predetermined first weight and a predetermined second weight assigned to the reference score and the model score respectively Score, so that the risk score=(the reference score×the predetermined first weight)+(the model score×the predetermined second weight).

於是,本發明所提供的一種對於基金商品的投資風險評分系統用於對一客戶在投資基金商品時的風險評估,並包含一資料伺服器、及一評分伺服器。 Therefore, an investment risk scoring system for fund commodities provided by the present invention is used for risk assessment when a client invests in fund commodities, and includes a data server and a scoring server.

該資料伺服器操作來蒐集該客戶的歷史金融行為資料與參考風險屬性資料,以及用於模型訓練的多個樣本資料集。該歷史金融行為資料與基金交易行為、基金損益紀錄、其他資產狀況、數位瀏覽行為和信用卡消費行為其中至少一者相關聯。該參考風險屬性資料與個人基本資料、財務情況和投資經驗相關聯。每一樣本資料集包含多個分別對應於M個與一目標參數有關的特徵參數的參數值,該等特徵參數與基金交易行為、基金損益紀錄、其他資產狀況、數位瀏覽行為和信用卡消費行為相關聯。 The data server operates to collect the client's historical financial behavior data and reference risk attribute data, as well as a plurality of sample data sets for model training. The historical financial behavior data is associated with at least one of fund transaction behavior, fund profit and loss records, other asset status, digital browsing behavior and credit card spending behavior. The reference risk attribute information is associated with personal basic information, financial situation and investment experience. Each sample data set includes a plurality of parameter values respectively corresponding to M characteristic parameters related to a target parameter, and these characteristic parameters are related to fund transaction behavior, fund profit and loss records, other asset status, digital browsing behavior and credit card consumption behavior. link.

該評分伺服器連接該資料伺服器以接收該歷史金融行為 資料、該參考風險屬性資料和該等樣本資料集,並包括一建模模組、及一評分模組。該建模模組操作來利用機器學習演算法分析並演算該等樣本資料集,以建立一對應於該目標參數的評分模型,該評分模型包含多個決策樹,每一決策樹是根據該M個特徵參數中隨機選取的m個特徵參數來分析由預定數量的樣本資料集構成的訓練資料集而訓練成,並可操作來回應於輸入資料而產生一分類結果。該評分模組連接該建模模組且操作來將該歷史金融行為資料饋入該評分模型,以估算出一模型評分,並且至少根據該模型評分、及對應於該參考風險屬性資料的參考評分,產生該客戶對應於該目標參數的風險評分。 The scoring server is connected to the data server to receive the historical financial activity data, the reference risk attribute data, and the sample data sets, and include a modeling module, and a scoring module. The modeling module operates to analyze and calculate the sample data sets using machine learning algorithms to establish a scoring model corresponding to the target parameter. The scoring model includes a plurality of decision trees, each decision tree is based on the M The m feature parameters randomly selected from the feature parameters are trained by analyzing a training data set consisting of a predetermined number of sample data sets, and are operable to generate a classification result in response to the input data. The scoring module is connected to the modeling module and operates to feed the historical financial behavior data into the scoring model to estimate a model score based on at least the model score and a reference score corresponding to the reference risk attribute data , which generates the risk score for the client corresponding to the target parameter.

本發明對於基金商品的投資風險評分系統中,該評分伺服器還包含一輸出模組。該輸入模組輸連接該評分模組,並操作來輸出對應於該客戶且含有該風險評分的風險評估結果。 In the investment risk scoring system for fund commodities of the present invention, the scoring server further includes an output module. The input module is connected to the scoring module and operates to output a risk assessment result corresponding to the client and containing the risk score.

本發明對於基金商品的投資風險評分系統中,該機器學習演算法包含一隨機森林演算法。 In the investment risk scoring system for fund commodities of the present invention, the machine learning algorithm includes a random forest algorithm.

本發明對於基金商品的投資風險評分系統中,該目標參數為配置於基金商品所佔的資產比例。 In the investment risk scoring system for fund commodities of the present invention, the target parameter is the proportion of assets allocated to fund commodities.

本發明對於基金商品的投資風險評分系統中,該評分模組還根據分別指派給該參考評分與該模型評分的一預定第一權重及一預定第二權重來產生該風險評分,以使該風險評分=(該參考評 分×該預定第一權重)+(該模型評分×該預定第二權重)。 In the investment risk scoring system for fund commodities of the present invention, the scoring module further generates the risk score according to a predetermined first weight and a predetermined second weight respectively assigned to the reference score and the model score, so that the risk Rating = (The reference rating score × the predetermined first weight) + (the model score × the predetermined second weight).

本發明的功效在於:根據客戶與基金交易行為、基金損益紀錄、其他資產狀況、數位瀏覽行為及/或信用卡消費行為相關聯的歷史金融行為資料,並利用經由機器學習演算法所建立對應於目標參數的評分模型而估算出的模型評分;然後根據模型評分和對應於客戶之參考風險屬性資料的參考評分所產生的風險評分,相對於現有風險評分方式所得的評分,能夠充分且真實地反映客戶對於基金商品所能承受的風險偏好,因而藉此風險評分能在有效地避免投資風險的前提下對客戶提供後續的基金理財及/或投資規劃服務。 The effect of the present invention is: according to the historical financial behavior data associated with the customer's fund transaction behavior, fund profit and loss records, other asset status, digital browsing behavior and/or credit card consumption behavior, and use the machine learning algorithm to establish corresponding targets. The model score estimated by the scoring model of the parameters; then the risk score generated according to the model score and the reference score corresponding to the reference risk attribute data of the customer, compared with the score obtained by the existing risk scoring method, can fully and truly reflect the customer As for the risk appetite that fund commodities can bear, this risk score can provide customers with follow-up fund management and/or investment planning services on the premise of effectively avoiding investment risks.

100:投資風險評分系統 100: Investment Risk Scoring System

1:資料伺服器 1: Data server

2:評分伺服器 2: Scoring Server

21:建模模組 21: Modeling Modules

22:評分模組 22: Scoring Module

23:輸出模組 23: Output module

S21-S25:步驟 S21-S25: Steps

本發明的其他的特徵及功效,將於參照圖式的實施方式中清楚地呈現,其中:圖1是一方塊圖,示例地說明本發明實施例對於基金商品的投資風險評分系統的架構;圖2是一流程圖,示例地說明該實施例對於想要投資基金商品的一客戶如何執行一投資風險評分程序;及圖3是一示意圖,示例地說明本發明實施例所建立的一評分模型的架構。 Other features and effects of the present invention will be clearly presented in the embodiments with reference to the drawings, wherein: FIG. 1 is a block diagram illustrating the structure of an investment risk scoring system for fund commodities according to an embodiment of the present invention; FIG. 2 is a flow chart illustrating how the embodiment performs an investment risk scoring procedure for a client who wants to invest in fund commodities; and FIG. 3 is a schematic diagram illustrating an example of a scoring model established by an embodiment of the present invention. Architecture.

在本發明被詳細描述之前,應當注意在以下的說明內容中,類似的元件是以相同的編號來表示。 Before the present invention is described in detail, it should be noted that in the following description, similar elements are designated by the same reference numerals.

參閱圖1,本發明實施例的一種對於基金商品的投資風險評分系統100用於對一客戶在投資基金商品時的風險評估,並包含一資料伺服器1、及一評分伺服器2。 Referring to FIG. 1 , an investment risk scoring system 100 for fund commodities according to an embodiment of the present invention is used for risk assessment when a client invests in fund commodities, and includes a data server 1 and a scoring server 2 .

在本實施例中,該資料伺服器1例如可以一電腦裝置來實施,且可連接外部資料庫(圖未示),例如金融服務機構的資料庫,並用於資料蒐集,特別是用來蒐集如銀行機構的金融服務機構之(待評分)客戶的相關資料,以及用於建模所需的所有資料。 In this embodiment, the data server 1 can be implemented by, for example, a computer device, and can be connected to an external database (not shown), such as a database of a financial service institution, and used for data collection, especially for collecting data such as Information about the (to be scored) customers of the banking institution's financial services organization, and all the information needed for modeling.

在本實施例中,該評分伺服器2亦可以一電腦裝置來實施,其連接該資料伺服器1,且例如可包含一建模模組21、一連接該建模模組21的評分模組22、及一連接該評分模組22的輸出模組23。該建模模組21和該評分模組22其中每一者可以硬體、軟體、韌體或其組合來實施,而該輸出模組23例如可實施成但不限於一顯示器。 In this embodiment, the scoring server 2 can also be implemented as a computer device, which is connected to the data server 1 , and can include, for example, a modeling module 21 and a scoring module connected to the modeling module 21 . 22, and an output module 23 connected to the scoring module 22. Each of the modeling module 21 and the scoring module 22 may be implemented in hardware, software, firmware, or a combination thereof, and the output module 23 may be implemented as, but not limited to, a display, for example.

以下,將參閱圖1及圖2來示例地詳細說明該投資風險評分系統100對於例如屬於一銀行機構且想要投資基金商品的一客戶如何執行一投資風險評分程序。值得注意的是,此處連接該資料伺服器1的外部資料庫例如是該銀行機構所提供的一資料庫。較佳地, 此資料庫可提供有關於該銀行機構之所有客戶且更豐富及多元化的金融資料。該客戶個資管理程序例如可包含以下步驟S21~S25。 Hereinafter, referring to FIG. 1 and FIG. 2 , how the investment risk scoring system 100 performs an investment risk scoring procedure for a customer who belongs to a banking institution and wants to invest in fund commodities, for example, will be described in detail by way of example. It should be noted that, the external database connected to the data server 1 is, for example, a database provided by the banking institution. Preferably, This database provides richer and more diverse financial information about all of the banking institution's customers. The client personal information management program may include, for example, the following steps S21 to S25.

首先,在步驟S21中,該資料伺服器1搜尋例如該銀行機構的資料庫,以蒐集該客戶的歷史金融行為資料與參考風險屬性資料,以及用於模型訓練的多個樣本資料集,並且將蒐集到的該歷史金融行為資料、該參考風險屬性資料和該等樣本資料集傳送給該評分伺服器2。值得注意的是,該歷史金融行為資料與基金交易行為、基金損益紀錄、其他資產狀況、數位瀏覽行為和信用卡消費行為其中至少一者相關聯,該參考風險屬性資料與個人基本資料、財務情況和投資經驗相關聯,每一樣本資料集包含多個分別對應於M個與一目標參數有關的特徵參數的參數值,該等特徵參數與基金交易行為、基金損益紀錄、其他資產狀況、數位瀏覽行為和信用卡消費行為相關聯。 First, in step S21, the data server 1 searches, for example, the database of the banking institution to collect historical financial behavior data and reference risk attribute data of the customer, as well as multiple sample data sets for model training, and store The collected historical financial behavior data, the reference risk attribute data and the sample data sets are sent to the scoring server 2 . It is worth noting that the historical financial behavior information is associated with at least one of fund transaction behavior, fund profit and loss records, other asset status, digital browsing behavior and credit card consumption behavior, and the reference risk attribute data is related to personal basic information, financial situation and The investment experience is related, and each sample data set contains a plurality of parameter values corresponding to M characteristic parameters related to a target parameter. These characteristic parameters are related to fund transaction behavior, fund profit and loss records, other asset status, and digital browsing behavior. associated with credit card spending.

更明確地,該歷史金融行為資料可包含該客戶已購買之基金商品及其損益的相關資料(例如基金商品的平均持有天數、偏好的基金商品的類型、最大虧損金額、報酬續之波動程度)、該客戶持有該銀行機構所發行的其他金融商品以及金融帳戶之帳戶餘額的相關資料、該客戶在該銀行機構所提供如基金理財之服務網站的瀏覽足跡(例如,長、短期的瀏覽次數以及每次瀏覽的時間)的相關資料,以及該客戶使用該銀行機構所發行之信用卡的相關資料(例 如,刷卡情況以及是否使用循環利率)。該參考風險屬性資料例如是在該客戶首次申購基金商品前由該客戶透過人為填寫或輸入的方式作答的一風險屬性問卷,此風險屬性問卷的內容可包含例如基本資料(如年齡、職業)、財務背景資料(如、收入、資金來源)和投資經驗資料(如曾購買的金融商品),且根據此風險屬性問卷的內容經由預定的評分方式,可獲得該客戶的一參考評分。在本實施例中,該目標參數例如為「配置於基金商品所佔的資產比例」,並且例如M=35,但不限於此例。於是,每一樣本資料集可包含例如35個參數值,其中含有:16個與基金交易行為相關聯之「近二年曾經持有債券型基金數量」、「近二年曾經持有股票型基金數量」、「近二年曾經持有多重資產型基金數量」、「近二年曾經持有貨幣型基金數量」、「二年曾經持有平衡型基金數量」、「近二年曾經持有組合型基金數量」、「近二年曾經持有ETF型基金數量」、「平均申購金額」、「平均申購金額佔總資產比例」、「平均持有天數」、「申購次數佔所有交易行為的百分比」、「申購次數佔所有交易行為的百分比」、「申購次數/贖回次數」、「最大獲利金額」、「最大獲利金額-最大虧損金額」、「資產管理規模(AUM)排行分級」等特徵參數的參數值;7個與基金損益紀錄相關聯之「歷史交易基金的平均報酬率」、「歷史交易基金的平均報酬率之波動度」、「歷史持有的基金產品數」、「歷史最大虧損報酬率的波動度」、「歷 史最大虧損金額」、「歷史最大虧損金額的波動度」、「獲利因子」等特徵參數的參數值;5個與其他資產狀況相關聯之「台幣活存餘額」、「台幣定存餘額」、「支票存款餘額」、「外幣活存餘額」、「外幣定存餘額」等特徵參數的參數值;4個與數位瀏覽行為相關聯之「最近一個月訪問基金理財網次數」、「最近一個月於基金理財網平均停留時間」、「最近五個月訪問基金理財網次數」、「最近五個月於基金理財網平均停留時間」等特徵參數的參數值;及3個與信用卡消費行為相關聯之「信用額度/年收入」、「是否使用循環利率」、「平均信用額度使用率」等特徵參數的參數值。 More specifically, the historical financial behavior data may include information about the fund commodities that the client has purchased and their profits and losses (such as the average holding days of the fund commodities, the type of the preferred fund commodities, the maximum loss amount, and the degree of volatility of the return). ), other financial products issued by the banking institution and the relevant information of the account balance of the financial account held by the customer, the customer's browsing footprint on the service website provided by the banking institution such as fund wealth management (for example, long-term and short-term browsing information about the number of times and the time of each visit), and information about the customer's use of the credit card issued by the banking institution (e.g. such as card swiping and whether revolving rates are used). The reference risk attribute data is, for example, a risk attribute questionnaire answered by the customer through manual filling or input before the customer first subscribes for fund commodities. The content of the risk attribute questionnaire may include, for example, basic information (such as age, occupation), Financial background information (such as income, source of funds) and investment experience data (such as purchased financial products), and according to the content of the risk attribute questionnaire through a predetermined scoring method, a reference score of the customer can be obtained. In this embodiment, the target parameter is, for example, "the proportion of assets allocated to fund commodities", and for example, M=35, but it is not limited to this example. Therefore, each sample data set can include, for example, 35 parameter values, including: 16 “number of bond funds held in the past two years” associated with fund trading behavior, “stock funds held in the past two years” Quantity”, “The number of multi-asset funds held in the past two years”, “The number of currency funds held in the past two years”, “The number of balanced funds held in the past two years”, “The portfolio held in the past two years” number of ETF funds, number of ETF funds held in the past two years, average subscription amount, average subscription amount as a percentage of total assets, average holding days, and subscription times as a percentage of all transactions ", "Number of purchases as a percentage of all transactions", "Number of purchases/redemptions", "Maximum profit amount", "Maximum profit amount - Maximum loss amount", "Assets under management (AUM) ranking" Parameter values of such characteristic parameters; 7 "average rate of return of historical exchange-traded funds", "volatility of average rate of return of historical exchange-traded funds", "number of fund products held in history", " Volatility of historically largest loss return”, “Historical The parameter values of characteristic parameters such as the largest loss in history, the volatility of the largest loss in history, and the profit factor; 5 "TWD living balance" and "TWD fixed deposit balance" associated with other asset status , "check deposit balance", "foreign currency active deposit balance", "foreign currency fixed deposit balance" and other characteristic parameters parameter values; 4 parameters associated with digital browsing behaviors are "number of visits to the fund wealth management network in the last month", "the most recent one" The parameter values of characteristic parameters such as the average stay time on the fund wealth management network in the last five months, the number of visits to the fund wealth management network in the last five months, and the average stay time on the fund wealth management network in the last five months; and three parameters related to credit card consumption behavior The parameter values of the characteristic parameters such as "Credit Limit/Annual Income", "Whether to Use Revolving Interest Rate", "Average Credit Limit Utilization Rate" and so on.

當該評分伺服器2接收到來自該資料伺服器2的該歷史金融行為資料、該參考風險屬性資料和該等樣本資料集時,在步驟S22中,該建模模組21操作來根據該等樣本資料集,利用機器學習演算法建立一對應於該目標參數的評分模型。更具體地,在本實施例中,該機器學習演算法包含一隨機森林演算法,該評分模型包含多個決策樹,每一決策樹是依據m個選自該M個特徵參數的對應特徵參數進行決策演算且m<M。值得注意的是,隨機森林演算法屬於一種「監督式學習」的演算法,相較於深度學習的神經網絡引算法通常具有交高的模型解釋能力,並且在建模前已定義出該目標參數(即,「配置於基金商品所佔的資產比例」),如此能利用該等樣本資料集在學習過程中不斷修正以發展出相對正確的評分模型。 When the scoring server 2 receives the historical financial behavior data, the reference risk attribute data and the sample data sets from the data server 2, in step S22, the modeling module 21 operates to A sample data set, and a scoring model corresponding to the target parameter is established by using a machine learning algorithm. More specifically, in this embodiment, the machine learning algorithm includes a random forest algorithm, the scoring model includes a plurality of decision trees, and each decision tree is based on m corresponding characteristic parameters selected from the M characteristic parameters. Do the decision calculus and m<M. It is worth noting that the random forest algorithm is a kind of "supervised learning" algorithm. Compared with the neural network algorithm of deep learning, it usually has a high model interpretation ability, and the target parameters have been defined before modeling. (ie, "the proportion of assets allocated to fund commodities"), so that a relatively accurate scoring model can be developed by using these sample data sets to be continuously revised during the learning process.

更明確地,參閱圖3,該建模模組21利用裝袋(Bagging)演算法每次以隨機方式從該等樣本資料集中選取預定數量的樣本資料集作為該次的訓練資料集,並一共執行n次,如此可獲n個訓練資料集。對於每一訓練資料集,為了生成不同的隨機向量θ i,i=1,...,n,對從該M個特徵參數隨機選取的m個特徵參數其中每一者進行節點分割,以選擇達到最小的Gini係數的分割方式進行分裂而最後生成一棵決策樹。在此情況下,每一棵決策樹可用作一分類器,並可回應於輸入資料而產生一分類結果;然後,該建模模組21還利用分類或回歸演算法匯整並演算由n棵決策樹所產生的n個分類結果,以從多個預定的分類等級決定出一目標分類等級,並輸出對應於該目標分類等級的評分。值得注意的是,由於該評分模型結合有袋裝演算法及分類或回歸演算法,因而可提高模型準確率和穩定性,同時可避免過擬合(Overfitting)的發生而保有模型集成(Ensemble)特性。 More specifically, referring to FIG. 3 , the modeling module 21 uses the bagging algorithm to randomly select a predetermined number of sample data sets from the sample data sets each time as the training data set for this time, and a total of Execute n times, so that n training data sets can be obtained. For each training data set, in order to generate different random vectors θ i, i=1, . . . , n, node segmentation is performed on each of the m feature parameters randomly selected from the M feature parameters to select The division method that achieves the smallest Gini coefficient is divided and finally a decision tree is generated. In this case, each decision tree can be used as a classifier and can generate a classification result in response to the input data; then, the modeling module 21 also uses a classification or regression algorithm to assemble and calculate the number of n The n classification results generated by a decision tree are used to determine a target classification level from a plurality of predetermined classification levels, and output a score corresponding to the target classification level. It is worth noting that because the scoring model combines the bagging algorithm and the classification or regression algorithm, the accuracy and stability of the model can be improved, and the occurrence of overfitting can be avoided while maintaining the model ensemble (Ensemble). characteristic.

之後,在步驟S23中,該評分模組22操作來將該歷史金融行為資料作為輸入資料饋入該評分模型,以估算出對應於該客戶的一模型評分。由於該評分模型的目標參數為「配置於基金商品所佔的資產比例」,若比例越高代表該客戶對基金商品的偏好越高同時亦代表持有基金商品的風險越高,則該模型評分會越高,反之則越低。 Then, in step S23, the scoring module 22 operates to feed the historical financial behavior data into the scoring model as input data to estimate a model score corresponding to the customer. Since the target parameter of the scoring model is "proportion of assets allocated to fund commodities", if the ratio is higher, it means that the client has a higher preference for fund commodities and also represents a higher risk of holding fund commodities. will be higher, and vice versa.

然後,在步驟S24中,該評分模組22根據該模型評分、對應於該參考風險屬性資料的該參考評分、一預定第一權重及一預定第二權重,產生該客戶對應於該目標參數的風險評分。更明確地,該預定第一權重及該預定第二權重式分別指派給該參考評分與該模型評分,以使該風險評分=(該參考評分×該預定第一權重)+(該模型評分×該預定第二權重)。 Then, in step S24, the scoring module 22 generates the customer's score corresponding to the target parameter according to the model score, the reference score corresponding to the reference risk attribute data, a predetermined first weight and a predetermined second weight Risk Score. More specifically, the predetermined first weight and the predetermined second weight formula are respectively assigned to the reference score and the model score, so that the risk score=(the reference score×the predetermined first weight)+(the model score× the predetermined second weight).

最後,在步驟S25中,該輸出模組23輸出含有該風險評分的風險評分結果。舉例來說,該輸出模組23可以顯示出該風險評分結果,或者在其他實施例中,該輸出模組23可以是一通訊模組以將該風險評分結果傳送給一特定使用終端(如銀行機構的一電腦終端或該客戶持有之手機)。 Finally, in step S25, the output module 23 outputs a risk score result including the risk score. For example, the output module 23 can display the risk score result, or in other embodiments, the output module 23 can be a communication module to transmit the risk score result to a specific user terminal (such as a bank a computer terminal of the institution or a mobile phone held by the client).

於是,該銀行機構可依據該客戶的風險評分提供該客戶後續在基金理財或投資規劃時在基金商品配置(如股債比例)上的建議。 Therefore, the banking institution can provide the customer with suggestions on the allocation of fund commodities (such as the ratio of stocks and debts) in the subsequent fund management or investment planning according to the risk score of the customer.

綜上所述,由於該評分模型是利用與基金交易行為、基金損益紀錄、其他資產狀況、數位瀏覽行為和信用卡消費行為相關聯的樣本資料集所訓練而成,因而所獲得的模型評分可真實地反映客戶對於基金商品所能承受的風險偏好。於是,將(客觀的)模型評分結合(主觀的)參考評分所獲得的風險評分,相對於現有風險評分方式所得的評分,能在有效地避免投資風險的前提下對客戶提供後 續的基金理財及/或投資規劃服務。因此,本發明投資風險評分系統100確實能達成本發明的目的。 To sum up, since the scoring model is trained using the sample data set associated with fund transaction behavior, fund profit and loss records, other asset status, digital browsing behavior and credit card consumption behavior, the obtained model score can be realistic. It reflects the client's risk appetite for fund commodities. Therefore, the risk score obtained by combining the (objective) model score with the (subjective) reference score, compared with the score obtained by the existing risk scoring method, can provide customers with after-sales service under the premise of effectively avoiding investment risks. Continued fund management and/or investment planning services. Therefore, the investment risk scoring system 100 of the present invention can indeed achieve the purpose of the present invention.

惟以上所述者,僅為本發明的實施例而已,當不能以此限定本發明實施的範圍,凡是依本發明申請專利範圍及專利說明書內容所作的簡單的等效變化與修飾,皆仍屬本發明專利涵蓋的範圍內。 However, the above are only examples of the present invention, and should not limit the scope of implementation of the present invention. Any simple equivalent changes and modifications made according to the scope of the patent application of the present invention and the contents of the patent specification are still included in the scope of the present invention. within the scope of the invention patent.

100:投資風險評分系統 1:資料伺服器 2:評分伺服器 21:建模模組 22:評分模組 23:輸出模組100: Investment Risk Scoring System 1: Data Server 2: Scoring Server 21: Modeling Module 22: Scoring Module 23: Output Module

Claims (10)

一種對於基金商品的投資風險評分方法,用於對一客戶在投資基金商品時的風險評估並利用一電腦系統來執行,該投資風險評分方法包含以下步驟:(A)蒐集該客戶的歷史金融行為資料與參考風險屬性資料以及用於模型訓練的多個樣本資料集,該歷史金融行為資料與基金交易行為、基金損益紀錄、其他資產狀況、數位瀏覽行為和信用卡消費行為其中至少一者相關聯,該參考風險屬性資料與個人基本資料、財務情況和投資經驗相關聯,每一樣本資料集包含多個分別對應於M個與一目標參數有關的特徵參數的參數值,該M個特徵參數與基金交易行為、基金損益紀錄、其他資產狀況、數位瀏覽行為和信用卡消費行為相關聯;(B)利用機器學習演算法分析並演算該等樣本資料集,以建立一對應於該目標參數的評分模型,其中該評分模型包含多個決策樹,每一決策樹是根據該M個特徵參數中隨機選取的m個特徵參數來分析由該等樣本資料集中一預定數量的樣本資料集構成的訓練資料集而訓練成,並可操作來回應於輸入資料而產生一分類結果;(C)將該歷史金融行為資料饋入該評分模型,並根據每一決策樹回應於該歷史金融行為資料所產生的分類結果估算出一模型評分;及(D)至少根據該模型評分、及對應於該參考風險屬性資料的參考評分,產生該客戶對應於該目標參數的風險 評分。 An investment risk scoring method for fund commodities, which is used to evaluate a client's risk when investing in fund commodities and is executed by a computer system. The investment risk scoring method comprises the following steps: (A) Collecting the client's historical financial behavior Data and reference risk attribute data and multiple sample data sets used for model training, the historical financial behavior data is associated with at least one of fund transaction behavior, fund profit and loss records, other asset status, digital browsing behavior and credit card consumption behavior, The reference risk attribute data is associated with personal basic data, financial situation and investment experience. Each sample data set includes a plurality of parameter values respectively corresponding to M characteristic parameters related to a target parameter. The M characteristic parameters are related to the fund Transaction behavior, fund profit and loss records, other asset status, digital browsing behavior and credit card consumption behavior are correlated; (B) use machine learning algorithms to analyze and calculate these sample data sets to establish a scoring model corresponding to the target parameters, The scoring model includes a plurality of decision trees, and each decision tree is obtained by analyzing a training data set composed of a predetermined number of sample data sets from the sample data sets according to m characteristic parameters randomly selected from the M characteristic parameters. trained and operable to generate a classification result in response to input data; (C) feeding the historical financial behavior data into the scoring model, and generating classification results in response to the historical financial behavior data according to each decision tree estimating a model score; and (D) generating a risk of the client corresponding to the target parameter based on at least the model score and the reference score corresponding to the reference risk attribute data score. 如請求項1所述對於基金商品的投資風險評分方法,還包含以下步驟:(E)輸出對應於該客戶且含有該風險評分的風險評估結果。 The investment risk scoring method for fund commodities as described in claim 1, further comprising the following steps: (E) outputting a risk assessment result corresponding to the client and containing the risk score. 如請求項1所述對於基金商品的投資風險評分方法,其中,在步驟(B)中,該機器學習演算法包含一隨機森林演算法。 The investment risk scoring method for fund commodities according to claim 1, wherein, in step (B), the machine learning algorithm includes a random forest algorithm. 如請求項3所述對於基金商品的投資風險評分方法,其中,在步驟(B)中,該目標參數為配置於基金商品所佔的資產比例。 The investment risk scoring method for fund commodities according to claim 3, wherein, in step (B), the target parameter is the proportion of assets allocated to fund commodities. 如請求項1所述對於基金商品的投資風險評分方法,其中,在步驟(D)中,該電腦系統還根據分別指派給該參考評分與該模型評分的一預定第一權重及一預定第二權重來產生該風險評分,以使該風險評分=(該參考評分×該預定第一權重)+(該模型評分×該預定第二權重)。 The investment risk scoring method for fund commodities as described in claim 1, wherein, in step (D), the computer system is further based on a predetermined first weight and a predetermined second weight respectively assigned to the reference score and the model score weights to generate the risk score such that the risk score=(the reference score×the predetermined first weight)+(the model score×the predetermined second weight). 一種對於基金商品的投資風險評分系統,用於對一客戶在投資基金商品時的風險評估,並包含:一資料伺服器,操作來蒐集該客戶的歷史金融行為資料與參考風險屬性資料,以及用於模型訓練的多個樣本資料集,該歷史金融行為資料與基金交易行為、基金損益紀錄、其他資產狀況、數位瀏覽行為和信用卡消費行為其中至少一者相關聯,該參考風險屬性資料與個人基本資料、財務情況和投資經驗相關聯,每一樣本資料集包含 多個分別對應於M個與一目標參數有關的特徵參數的參數值,該M個特徵參數與基金交易行為、基金損益紀錄、其他資產狀況、數位瀏覽行為和信用卡消費行為相關聯;及一評分伺服器,連接該資料伺服器以接收該歷史金融行為資料、該參考風險屬性資料和該等樣本資料集,並包括一建模模組,操作來利用機器學習演算法分析並演算該等樣本資料集,以建立一對應於該目標參數的評分模型,該評分模型包含多個決策樹,每一決策樹是根據該M個特徵參數中隨機選取的m個特徵參數來分析由預定數量的樣本資料集構成的訓練資料集而訓練成,並可操作來回應於輸入資料而產生一分類結果,及一評分模組,連接該建模模組且操作來將該歷史金融行為資料饋入該評分模型,並根據每一決策樹回應於該歷史金融行為資料所產生的分類結果估算出一模型評分,並且至少根據該模型評分、及對應於該參考風險屬性資料的參考評分,產生該客戶對應於該目標參數的風險評分。 An investment risk scoring system for fund commodities, which is used for risk assessment of a client when investing in fund commodities, and includes: a data server, which operates to collect the client's historical financial behavior data and reference risk attribute data, and uses Multiple sample data sets used in model training, the historical financial behavior data is associated with at least one of fund transaction behavior, fund profit and loss records, other asset status, digital browsing behavior and credit card consumption behavior, and the reference risk attribute data is related to personal basic Information, financial situation and investment experience are related, and each sample data set contains a plurality of parameter values respectively corresponding to M characteristic parameters related to a target parameter, the M characteristic parameters being associated with fund transaction behavior, fund profit and loss records, other asset status, digital browsing behavior and credit card consumption behavior; and a score a server, connected to the data server to receive the historical financial behavior data, the reference risk attribute data, and the sample data sets, and includes a modeling module operative to analyze and calculate the sample data using machine learning algorithms set, to establish a scoring model corresponding to the target parameter, the scoring model includes a plurality of decision trees, each decision tree is based on the m feature parameters randomly selected from the M feature parameters to analyze the predetermined number of sample data is trained and operable to generate a classification result in response to input data, and a scoring module connected to the modeling module and operative to feed the historical financial behavior data into the scoring model , and estimate a model score according to the classification result generated by each decision tree in response to the historical financial behavior data, and at least according to the model score and the reference score corresponding to the reference risk attribute data, generate the customer corresponding to the Risk score for the target parameter. 如請求項6所述對於基金商品的投資風險評分系統,其中該評分伺服器還包含:一輸出模組,連接該評分模組,並操作來輸出對應於該客戶且含有該風險評分的風險評估結果。 The investment risk scoring system for fund commodities according to claim 6, wherein the scoring server further comprises: an output module, connected to the scoring module, and operated to output a risk assessment corresponding to the client and containing the risk score result. 如請求項6所述對於基金商品的投資風險評分系統,其 中,該機器學習演算法包含一隨機森林演算法。 For the investment risk scoring system for fund commodities as described in claim 6, the , the machine learning algorithm includes a random forest algorithm. 如請求項8所述對於基金商品的投資風險評分系統,其中,該目標參數為配置於基金商品所佔的資產比例。 The investment risk scoring system for fund commodities according to claim 8, wherein the target parameter is the proportion of assets allocated to fund commodities. 如請求項6所述對於基金商品的投資風險評分系統,其中,該評分模組還根據分別指派給該參考評分與該模型評分的一預定第一權重及一預定第二權重來產生該風險評分,以使該風險評分=(該參考評分×該預定第一權重)+(該模型評分×該預定第二權重)。 The investment risk scoring system for fund commodities according to claim 6, wherein the scoring module further generates the risk score according to a predetermined first weight and a predetermined second weight assigned to the reference score and the model score respectively , so that the risk score=(the reference score×the predetermined first weight)+(the model score×the predetermined second weight).
TW110102713A 2021-01-25 2021-01-25 Investment risk scoring method and system for fund commodities TWI776370B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW110102713A TWI776370B (en) 2021-01-25 2021-01-25 Investment risk scoring method and system for fund commodities

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW110102713A TWI776370B (en) 2021-01-25 2021-01-25 Investment risk scoring method and system for fund commodities

Publications (2)

Publication Number Publication Date
TW202230232A TW202230232A (en) 2022-08-01
TWI776370B true TWI776370B (en) 2022-09-01

Family

ID=83782567

Family Applications (1)

Application Number Title Priority Date Filing Date
TW110102713A TWI776370B (en) 2021-01-25 2021-01-25 Investment risk scoring method and system for fund commodities

Country Status (1)

Country Link
TW (1) TWI776370B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150095261A1 (en) * 2013-09-30 2015-04-02 The Toronto-Dominion Bank Systems and methods for administering investment portfolios based on transaction data
CN108665159A (en) * 2018-05-09 2018-10-16 深圳壹账通智能科技有限公司 A kind of methods of risk assessment, device, terminal device and storage medium
CN108665366A (en) * 2018-04-27 2018-10-16 平安科技(深圳)有限公司 Determine method, terminal device and the computer readable storage medium of consumer's risk grade
CN111161054A (en) * 2020-01-02 2020-05-15 中国银行股份有限公司 Operation risk monitoring method and system
TWM597467U (en) * 2020-03-09 2020-06-21 兆豐國際商業銀行股份有限公司 System for recommending financial product
CN111583018A (en) * 2020-04-09 2020-08-25 上海淇毓信息科技有限公司 Credit granting strategy management method and device based on user financial performance analysis and electronic equipment
CN112017061A (en) * 2020-07-15 2020-12-01 北京淇瑀信息科技有限公司 Financial risk prediction method and device based on Bayesian deep learning and electronic equipment
TWM613536U (en) * 2021-01-25 2021-06-21 第一商業銀行股份有限公司 Investment risk scoring system for fund commodities

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150095261A1 (en) * 2013-09-30 2015-04-02 The Toronto-Dominion Bank Systems and methods for administering investment portfolios based on transaction data
CN108665366A (en) * 2018-04-27 2018-10-16 平安科技(深圳)有限公司 Determine method, terminal device and the computer readable storage medium of consumer's risk grade
CN108665159A (en) * 2018-05-09 2018-10-16 深圳壹账通智能科技有限公司 A kind of methods of risk assessment, device, terminal device and storage medium
CN111161054A (en) * 2020-01-02 2020-05-15 中国银行股份有限公司 Operation risk monitoring method and system
TWM597467U (en) * 2020-03-09 2020-06-21 兆豐國際商業銀行股份有限公司 System for recommending financial product
CN111583018A (en) * 2020-04-09 2020-08-25 上海淇毓信息科技有限公司 Credit granting strategy management method and device based on user financial performance analysis and electronic equipment
CN112017061A (en) * 2020-07-15 2020-12-01 北京淇瑀信息科技有限公司 Financial risk prediction method and device based on Bayesian deep learning and electronic equipment
TWM613536U (en) * 2021-01-25 2021-06-21 第一商業銀行股份有限公司 Investment risk scoring system for fund commodities

Also Published As

Publication number Publication date
TW202230232A (en) 2022-08-01

Similar Documents

Publication Publication Date Title
US8688557B2 (en) Systems and methods for customer value optimization involving relationship optimization
TW530235B (en) Valuation prediction models in situations with missing inputs
TW530234B (en) Methods and systems for efficiently sampling portfolios for optimal underwriting
TWI248001B (en) Methods and apparatus for automated underwriting of segmentable portfolio assets
US20070282728A1 (en) Consolidation, sharing and analysis of investment information
US20140067650A1 (en) Methods and systems for consumer lending
US20020052836A1 (en) Method and apparatus for determining a prepayment score for an individual applicant
US20090240574A1 (en) Systems and Methods for Ranking Investors and Rating Investment Positions
TW530236B (en) Cross correlation tool for automated portfolio descriptive statistics
US20100280976A1 (en) Systems and methods for recommending investment positions to investors
CN107798607A (en) Asset Allocation strategy acquisition methods, device, computer equipment and storage medium
JP2003530649A (en) Personalized investment advisory system and method embodied on a network
US20140258175A1 (en) Generating Personalized Investment Recommendations
US20090265283A1 (en) Systems and Methods for Ranking Investors and Rating Investment Positions
KR101955546B1 (en) Lending Meditation Platform System and Credit Estimating Apparatus and Method thereof
JP7311495B2 (en) Improved Mortgage Rate Determination
CN111383091A (en) Asset securitization pricing method and device
JP6709775B2 (en) Calculation device, calculation method, and calculation program
CN112074860A (en) Computer-implemented method for compiling an investment portfolio of assets
JP6411562B2 (en) Information processing apparatus, information processing method, and program
TWM613536U (en) Investment risk scoring system for fund commodities
Park et al. Recommendation of investment portfolio for peer-to-peer lending with additional consideration of bidding period
JP6267812B1 (en) Calculation device, calculation method, and calculation program
JP2018160258A (en) Information processor, method for processing information, and program
JP2003114977A (en) Method and system for calculating customer&#39;s lifelong value

Legal Events

Date Code Title Description
GD4A Issue of patent certificate for granted invention patent