WO2024040817A1 - Bond risk information processing method based on big data and related device - Google Patents

Bond risk information processing method based on big data and related device Download PDF

Info

Publication number
WO2024040817A1
WO2024040817A1 PCT/CN2022/140776 CN2022140776W WO2024040817A1 WO 2024040817 A1 WO2024040817 A1 WO 2024040817A1 CN 2022140776 W CN2022140776 W CN 2022140776W WO 2024040817 A1 WO2024040817 A1 WO 2024040817A1
Authority
WO
WIPO (PCT)
Prior art keywords
risk
target
bond
information
event
Prior art date
Application number
PCT/CN2022/140776
Other languages
French (fr)
Chinese (zh)
Inventor
刘蕾
Original Assignee
深圳市富途网络科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市富途网络科技有限公司 filed Critical 深圳市富途网络科技有限公司
Publication of WO2024040817A1 publication Critical patent/WO2024040817A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/06Asset management; Financial planning or analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Definitions

  • This application relates to the field of big data technology, specifically, to a bond risk information processing method and related equipment based on big data.
  • Bonds are an important investment tool. When customers invest in bonds, they may gain income or bear certain transaction risks. Among them, the risks of trading bonds include but are not limited to: credit risk, liquidity risk, currency risk, interest rate risk, market risk, etc. If customers choose to invest in high-interest bonds or certain bonds with unique characteristics and risks, they will bear higher risks. For example, if a bond defaults, customers may even lose their principal. Therefore, it is necessary for customers to understand the characteristics of bond products and the degree of risk they need to bear before they make transactions. The data used by related technologies to conduct risk assessment on bonds is too simple, and the risk value obtained through analysis has a large error, which in turn leads to errors in risk warnings, and in turn causes the problem of incomplete data display in the bond trading process.
  • embodiments of the present application provide a bond risk information processing method and related equipment based on big data.
  • a bond risk information processing method based on big data includes: performing word segmentation processing on the risk information of the issuance object corresponding to the target bond to obtain multiple corresponding content words.
  • the risk information includes issuance Financial information, credit information and historical default information corresponding to the object; determine the feature vector of the risk information based on the importance parameters of the content words matching the preset keyword table.
  • the keyword table includes multiple keys related to the preset risk events.
  • the feature vector is matched with the risk events in the preset risk event library to obtain the target risk event that matches the feature vector, and the risk event
  • the library includes multiple risk events; risk reminders are provided based on target risk events.
  • a bond risk information processing device based on big data
  • a word segmentation processing module used to perform word segmentation processing on the risk information of the issuance object corresponding to the target bond, and obtain multiple corresponding content words
  • the risk information includes financial information, credit information and historical default information corresponding to the issuance object
  • the determination module is used to determine the feature vector of the risk information based on the importance parameters of the content words that match the preset keyword table.
  • the keyword table includes multiple Keywords related to preset risk events, in which the importance parameter is used to represent the importance of the corresponding target content word to the risk information; the matching module is used to match the feature vector with the risk events in the preset risk event library , obtain the target risk event that matches the feature vector, and the risk event library includes multiple risk events; the risk prompt module is used to provide risk prompts based on the target risk event.
  • the determination module includes a matching unit, a determination unit and a feature vector acquisition unit, wherein the matching unit is used to match each content word with the keyword table respectively to obtain the target content word; the determination unit is used to determine each target Importance parameters of content words; the feature vector acquisition unit is used to use a vector composed of importance parameters of each target content word as a feature vector.
  • the big data-based bond risk information processing device provided in this embodiment also includes a calculation module, wherein the calculation module is used to use the target matching degree as the weight of the risk event score value corresponding to the target risk event, Calculate the weighted sum of the risk event score values of the target risk event, and use the weighted sum as the risk score value of the target bond.
  • the target matching degree is the matching degree between the target risk event and the feature vector.
  • the risk event library includes risk event score values corresponding to each risk event.
  • the risk prompt module includes a response unit and a loading unit, wherein the response unit is configured to obtain the target risk event and the risk score value in response to the first operation instruction for the target bond triggered in the bond trading page;
  • the loading unit is used to load target risk events and risk score values into the bond trading page.
  • the loading unit is also used to associate the risk level, target risk event, and risk score value of the target bond with the target bond, and load the associated risk level, target risk event, and risk score value into the bond.
  • the risk level is determined based on the risk score value.
  • the loading unit is also used to bring the risk level, target risk event, risk score value and target bond into the pre-generated risk prompt field template to obtain the corresponding risk prompt field.
  • the big data-based bond risk information processing device provided in this embodiment also includes a response module, configured to load bond information in response to the second operation instruction for the bond information link triggered in the bond transaction page.
  • the information page corresponding to the link is used to display risk information.
  • the target bond is associated with the bond information link for the target bond.
  • an electronic device including a processor and a memory.
  • Computer-readable instructions are stored on the memory.
  • the above big data-based bond risk information is implemented. Approach.
  • a computer-readable storage medium on which computer-readable instructions are stored.
  • the computer-readable instructions are executed by a processor of a computer, the computer is caused to execute the large-scale method as provided above. Data processing methods for bond risk information.
  • a computer program product or a computer program is provided, the computer program product or the computer program including computer instructions, and the computer instructions are stored in a computer-readable storage medium.
  • the processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, so that the computer device executes the big data-based bond risk information processing method provided in the above various optional embodiments.
  • the risk information of the issuance object corresponding to the target bond is segmented to obtain multiple corresponding content words.
  • the risk information includes the financial information, credit information and historical default information corresponding to the issuance object;
  • the feature vector of the risk information is determined based on the importance parameters of the content words that match the preset keyword table.
  • the keyword table includes multiple keywords related to the preset risk events, where the importance parameters are used to represent the corresponding target content words.
  • the risk event library includes multiple risk events; based on the target risk event risk warning.
  • This embodiment determines the feature vector of the risk information based on the importance parameters of the content words that match the preset keyword table.
  • the above method can fully extract the semantic information in the risk information, and thereby more accurately extract the features in the risk information, so that The feature vector corresponding to the obtained risk information is more accurate, and information related to risk events is extracted to avoid unnecessary waste of computing resources and improve the efficiency of big data risk assessment based on bond information; in addition, this embodiment starts from a smaller granularity Start by analyzing possible risk events in risk information, and provide risk reminders based on risk events, so that users can be more clear about the risks brought by bond products.
  • Figure 1 is a flow chart of a bond risk information processing method based on big data according to an exemplary embodiment of the present application
  • Figure 2 is a schematic diagram of jumping from the bond trading page to the information page according to an exemplary embodiment
  • Figure 3 is a flow chart of a bond risk warning method according to an exemplary embodiment
  • Figure 4 is a block diagram of a big data-based bond risk information processing device according to an exemplary embodiment of the present application
  • FIG. 5 shows a schematic structural diagram of a computer system suitable for implementing an electronic device according to an embodiment of the present application.
  • the network elements involved in the embodiments of this application can also be called functions or functional entities, which are not limited by this application.
  • the access and mobility management function network element may also be called the access and mobility management function or the access and mobility management function entity
  • the session management function network element may be called the session management function or session management function entity, etc.
  • the names of each network element are not limited in this application. Those skilled in the art can replace the names of the above network elements with other names to perform the same functions, which all fall within the scope of protection of this application.
  • big data refers to a collection of data that cannot be captured, managed and processed with conventional software tools within a certain time range. It requires new processing models to have stronger decision-making power, insight discovery and process optimization capabilities. Massive, high-growth and diversified information assets. With the advent of the cloud era, big data has also attracted more and more attention. Big data requires special technologies to effectively handle large amounts of data within a tolerable time. Technologies applicable to big data include massively parallel processing databases, data mining, distributed file systems, distributed databases, cloud computing platforms, the Internet, and scalable storage systems. Specifically, as people's economic level continues to improve, investment and financial management have become people's mainstream financial management methods. As one of the investment and financial management methods, bonds have also received considerable attention, especially those based on big data. However, investing in bonds based on big data may yield returns and may also bear certain transaction risks. Therefore, in order to enable customers to understand the characteristics of bond products and the risks they need to bear, it is particularly important to assess and prompt bond transaction risks based on big data.
  • Bond default refers to the failure of the bond issuer to perform its obligations in accordance with the bond agreement reached in advance.
  • the high incidence of bond defaults in recent years has sounded the alarm to individual and institutional investors. Therefore, it is extremely important to identify the risks that may cause bond defaults. important.
  • Traditional smart bond analysis tools often only provide financial data browsing and simple credit rating functions for bonds. The information content is single. Investors cannot obtain an intuitive evaluation of bonds from the provided financial data and credit ratings, and it is difficult to achieve accurate evaluation of bonds. Comprehensive tracking and control of bond default risk points.
  • FIG. 1 is a flow chart of a bond risk information processing method based on big data according to an exemplary embodiment of the present application.
  • the bond risk information processing method based on big data in this embodiment can be applied to the bond risk reminder device.
  • the bond risk reminder device of this application can be a server, a mobile device, or a system in which a server and a mobile device cooperate with each other.
  • various parts included in the mobile device such as each unit, sub-unit, module, and sub-module, may all be provided in the server, may all be provided in the mobile device, or may be provided in the server and the mobile device respectively.
  • the above-mentioned server may be hardware or software.
  • the server When the server is hardware, it can be implemented as a distributed server cluster composed of multiple servers or as a single server.
  • the server is software, it can be implemented as multiple software or software modules, such as software or software modules used to provide distributed servers, or it can be implemented as a single software or software module, which is not specifically limited here.
  • the bond trading risk warning method based on big data at least includes steps S101 to S104, which are described in detail as follows:
  • Step S101 Perform word segmentation processing on the risk information of the issuance object corresponding to the target bond to obtain multiple corresponding content words.
  • the risk information is information that may cause bond default.
  • the risk information includes financial information, credit information and historical default information corresponding to the issuing object. Since the risk source of a bond is usually closely related to the object to which it is issued, this embodiment performs a risk assessment on the target bond by analyzing the risk information of the object to be issued.
  • the number of default events that occur in the issuance object within a preset time period can be for the target bond or other bonds of the issuance object.
  • Risk information also includes the financial status of the issuer, credit information, third-party credit rating reports or research reports, etc.
  • Word segmentation is the process of recombining continuous word sequences into semantically independent word sequences according to certain specifications. Word segmentation is the basis of natural language processing. The accuracy of word segmentation directly determines the quality of subsequent part-of-speech tagging, syntactic analysis, word vectors and text analysis. This embodiment can segment the text through the Chinese word segmentation algorithm. The specific word segmentation process is here Don’t go into too much detail.
  • the risk information is segmented into words through steps S601 to S602.
  • the details are as follows:
  • Step S601 Split the document string corresponding to the risk information to obtain multiple strings of different lengths.
  • This embodiment divides the document string corresponding to the risk information according to the number of characters included in the string. For example, the risk information "The market sentiment faced by Evergrande, which is currently under pressure, has further deteriorated", the risk information corresponding to The document string is divided, and the obtained string including one character includes “objective”, “pressure”, “de”, “field”, “emotion”, “mood”, “hua”, “jin”, “hua”, etc., and the obtained string includes two characters.
  • the string includes "current”, “former”, “under pressure”, “pressured”, “ever”, “everda”, “big firm”, “face”, “market”, etc.
  • the string including three characters includes “currently” “Previously under pressure”, “under pressure”, “emotional”, “further”, “market situation”, etc., including four-character strings including “previously under pressure”, “faced”, etc., including five-character strings
  • the string includes "market sentiment", “further deterioration”, etc. Since the number of content words does not exceed five characters at most, this embodiment sets the characters included in the string when segmenting the document string corresponding to the risk information. No more than 5, which can save computing resources and speed up word segmentation efficiency.
  • Step S602 Match each character string with a preset dictionary respectively. If there is a match, determine that the corresponding character string is a content word.
  • this embodiment matches each character string with a general Chinese dictionary. If a match is found, the corresponding character string is determined to be a content word.
  • the General Chinese Dictionary contains the main vocabulary commonly used in the language and is suitable for readers from different walks of life to use in the language learning process, such as scholar, students at various stages of learning, language teachers or housewives, etc.
  • the General Chinese Dictionary collects representative commonly used vocabulary from all walks of life for the entire public readership.
  • the vocabulary included in the General Chinese Dictionary is more comprehensive than other special dictionaries.
  • the content words obtained after word segmentation processing are processed to remove stop words, and meaningless modal particles, adverbs, special symbols and punctuation marks are removed.
  • the content word " ⁇ " is deleted to save money in subsequent processing. human resources.
  • Step S102 Determine the feature vector of the risk information based on the importance parameters of the target content words that match the preset keyword table.
  • risk events are pre-configured, and risk events refer to possible default events of bond products.
  • This embodiment can determine multiple risk events based on historical default information of bond products, for example, determine risk events based on machine learning.
  • risk events include financial, legal, capital, and operating risk events.
  • financial risk events include labels such as changes in capital structure, poor liquidity, and performance losses.
  • This embodiment pre-constructs a keyword table based on risk events, and the keyword table includes a plurality of keywords related to preset risk events. Keywords related to risk events are characteristic identifiers corresponding to risk events. For example, keywords corresponding to the risk event “poor liquidity” include “liquidity”, “poor”, etc.
  • This embodiment considers that the content words obtained by word segmentation of risk information include those related to the corresponding risk event, such as "liquidity”, and also include those that are not related to the corresponding risk event, such as some punctuation marks, connectives, etc., and also include Including those related to risk events, this embodiment screens multiple content words obtained by word segmentation of risk events based on the keyword table to determine the target content words that can be used as feature vectors for constructing risk information, thereby speeding up the efficiency of risk assessment. Avoid unnecessary waste of computing resources.
  • this embodiment can be implemented based on steps S501 to S503. The details are described as follows:
  • Step S501 Match each content word with the keyword table to obtain the target content word matching the keyword table.
  • the content words that match the keyword table are first determined. For example, each content word can be directly compared with the keyword list. If the same keyword as the corresponding content word can be found in the keyword table, the corresponding content word is determined. Content words are matched against the keyword list.
  • this embodiment calculates the correlation between each content word and the keyword. If the correlation is greater than the preset threshold, it is determined that the corresponding content word matches the corresponding keyword. If the obtained correlation is less than the preset threshold, and if it is determined that the corresponding content words do not match the keyword list, it means that the corresponding content words do not contribute much to the user's characteristics. Therefore, when determining the feature vector, such content words are discarded.
  • the above method can fully extract the semantic information in the risk information, and then more accurately extract the features in the risk information, making the obtained feature vector corresponding to the risk information more accurate.
  • Step S502 Determine the importance parameters of each target content word.
  • the importance parameter of the target content word is used to represent the importance of the corresponding target content word to the risk information.
  • this embodiment can determine the importance parameter of the target content word through the following formula:
  • W x, y represents the corresponding importance parameter of the target content word x in the risk information y
  • tf x, y represents the frequency of the target content word x appearing in the risk information y
  • N represents the total number of texts included in the risk information y
  • df x represents the number of texts including the target content word x in the total number of texts N.
  • the inventor of the present application considers that since the importance of content words increases in direct proportion to the number of times they appear in the document, but at the same time decreases inversely proportional to the frequency of their appearance in the corpus, therefore, in this embodiment, tf x, y
  • This parameter is placed in the numerator position of the formula, and represents the importance of the target content word.
  • the parameter is proportional to the frequency of the target content word x appearing in the risk information y; placing the parameter df
  • the size of the importance parameter is inversely proportional to the number of texts including the target content word x in the total number of texts N.
  • the importance parameter of the content word obtained through the above formula is more accurate.
  • Step S503 Use a vector composed of importance parameters of each target content word as a feature vector.
  • the importance parameters of each target content word are arranged according to the order of keywords matching the target content word in the keyword table, thereby obtaining a feature vector.
  • the dimensions of the feature vectors corresponding to the risk information are unified.
  • the dimensions of the feature vectors corresponding to the risk information are preset to be equal to the number of keywords included in the keyword table.
  • the ranking order sorts the importance parameters corresponding to the target content words to obtain the feature vector. Among them, when the number of target content words corresponding to the risk information is less than the number of keywords included in the keyword table, there will be no target content words and their corresponding keywords. Set the element of the feature vector to 0 at the position of the word.
  • Step S103 Match the feature vector with the risk events in the preset risk event library to obtain the target risk event that matches the feature vector.
  • the risk event library includes risk events of different risk event types.
  • the server can take known defaulted bonds as examples in advance, analyze the public opinion information of defaulted bonds, and obtain different risk events.
  • the feature vector corresponding to each risk event is determined based on a text extraction algorithm. Specifically, the label risk information corresponding to multiple risk events is obtained, the label risk information is segmented, the importance parameters corresponding to the multiple content words are counted, and the feature vector corresponding to the risk event is constructed based on the importance parameters corresponding to the multiple content words. .
  • this embodiment matches the feature vector corresponding to the risk information with the feature vector corresponding to each risk event, and determines the target risk event matching the feature vector based on the obtained matching result.
  • this embodiment calculates the correlation between the feature vector corresponding to the risk information and the feature vector corresponding to each risk event, and uses the risk event corresponding to the correlation degree greater than the preset threshold as the target risk event. It is understandable that the target risk event can be one or more depending on the actual application scenario.
  • Step S104 Provide risk prompts based on target risk events.
  • the bond risk information processing method based on big data performs word segmentation processing on the risk information of the issuance object corresponding to the target bond to obtain multiple corresponding content words.
  • the risk information includes the financial information, credit information and history corresponding to the issuance object. Default information; determine the feature vector of risk information based on the importance parameters of the content words matching the preset keyword table.
  • the keyword table includes multiple keywords related to the preset risk events, where the importance parameter is used to represent Corresponds to the importance of the target content word to the risk information; matches the feature vector with the risk events in the preset risk event library to obtain the target risk event that matches the feature vector.
  • the risk event library includes multiple risk events; based on the target Provide risk reminders for risk events.
  • This embodiment determines the feature vector of the risk information based on the importance parameters of the content words that match the preset keyword table.
  • the above method can fully extract the semantic information in the risk information, and thereby more accurately extract the features in the risk information, so that The feature vector corresponding to the obtained risk information is more accurate, and information related to risk events is extracted to avoid unnecessary waste of computing resources and improve the efficiency of big data risk assessment based on bond information; in addition, this embodiment starts from a smaller granularity Start by analyzing possible risk events in risk information, and provide risk reminders based on risk events, so that users can be more clear about the risks brought by bond products.
  • the risk event library includes a risk event score value corresponding to each risk event.
  • count the number of occurrences of risk events for each historical default bond of the issuer For example, count the number of occurrences of risk events for each historical default bond of the issuer, generate a risk event matrix, calculate the probability value of each risk event based on the risk event matrix, and determine the risk of each risk event based on the probability value of each risk event. Event rating value.
  • a risk event matrix is generated. After obtaining the risk event matrix, calculate the probability value of each risk event when the bond is a default bond, and quantify the obtained probability value into each The risk event score value corresponding to the risk event. For example, if the probability value of the risk event "poor liquidity" is 80% to 89%, then the risk event score value corresponding to the risk event is set to 8.
  • the target matching degree is used as the weight of the risk event score value corresponding to the target risk event, the weighted sum of the risk event score values of the target risk event is calculated, and the weighted sum is used as the risk score value of the target bond. It can be understood that if there is one target risk event, the risk event score value corresponding to the target risk event and the target matching degree are directly multiplied to obtain the risk score value. If there are multiple target risk events, multiple target risks are calculated. The weighted sum of the risk event score values corresponding to the event.
  • the target matching degree is the matching degree between the target risk event and the feature vector.
  • the matching degree between the target risk event and the feature vector can be determined in a variety of ways. For example, the Euclidean distance or cosine value between the target risk event and the feature vector is used as the matching between the target risk event and the feature vector. Spend.
  • This embodiment uses the matching degree between the target risk event and the feature vector as the weight of the risk event score value corresponding to the target risk event to determine the contribution of the corresponding target risk event to the risk assessment of the target bond, which can more accurately determine the risk assessment of the target bond. Risk score value.
  • the bond trading platform has a function to prompt bond risks
  • the description of the bond risks is cumbersome and there are no key reminders throughout the article, which is not conducive to customers quickly understanding the characteristics of the corresponding bond products and the risks they need to bear, resulting in information utilization The problem of low rates.
  • step S104 includes steps S201-step S202, which are described in detail as follows:
  • Step S201 In response to the first operation instruction for the target bond triggered in the bond trading page, obtain the target risk event and risk score value.
  • the bond trading page refers to the interactive interface used to display or display bonds and bond-related information in the bond risk warning device. For example, it can be a touch screen interface of a mobile terminal, etc.
  • the bond trading page displays bond information.
  • Customers can trigger the first operation instruction for the bond that the user wants to know about based on their own needs.
  • the first operation instruction includes but is not limited to long press, click, double click, drag.
  • Step S202 Load the target risk event and risk score value on the bond trading page.
  • the risk warning method provided by this embodiment also includes steps S301 to S302, which are described in detail as follows:
  • Step S301 Determine the risk level corresponding to the target bond based on the risk score value.
  • the risk level includes low risk level, medium risk level, high risk level and the highest risk level.
  • the risk assessment interval corresponding to the low risk level is [ 0, 0.2)
  • the risk assessment interval corresponding to the medium risk level is [0.2, 0.5)
  • the risk assessment interval corresponding to the high risk level is [0.5, 0.8)
  • the risk assessment interval corresponding to the highest risk level is [0.8, 1].
  • This embodiment matches the risk score value of the target bond with the risk score interval corresponding to each risk level to determine the risk level of the target bond.
  • Step S302 Associate the risk level, target risk event, and risk score value with the target bond, and load the associated risk level, target risk event, and risk score value onto the bond transaction page.
  • This embodiment takes into account that if the risk level, target risk event, risk score value and target bond are loaded separately on the bond trading page, it may not be possible to clearly indicate to the user the risks that the target bond may bring. For example, if the risk level, target risk The events and risk score values are far apart on the bond trading page, and there are no corresponding explanation fields to explain the relationship between them. It is difficult for users to see the risk events, risk score values and risk levels of the target bonds. Based on this, this embodiment associates the risk level with the target risk event and the risk score value with the target bond, so that the user can see the risks brought by the target bond at a glance, avoid misleading the user's bond transaction process, and improve the user experience.
  • a risk association table is constructed.
  • the risk association table has bond type as the title column.
  • the risk association table includes multiple bond products displayed on the bond transaction page, as well as the risk events, risk score values and risks corresponding to each bond product. grade. Among them, the risk events, risk score values and risk levels corresponding to multiple bond products each occupy one column of the risk association table, and the risk events, risk score values and risk levels corresponding to each bond product occupy one row of the risk association table.
  • the risk prompt field template is pre-generated, where the risk prompt field template includes four parameters to be brought in, namely, target bond, risk event, risk score value and risk level.
  • the risk prompt field template is " The risk events that may occur in the target bond include the target risk event, the probability of occurrence is the risk score value, and the risk level is the target risk level.”
  • the code or name of the target bond, the specific risk event, and the specific risk score value , the specific risk level is brought into the above risk prompt field template, and the risk prompt field can be obtained.
  • the target risk event is "poor liquidity”
  • the risk score value is 80%
  • the risk level is the highest risk level
  • the risk prompt field "Target Bond”
  • “Poor liquidity, risk score value is 80%
  • risk level is the highest risk level”
  • risk score value is 80%
  • risk level is the highest risk level” on the bond trading page.
  • the obtained risk prompt field is loaded into the bond trading page.
  • the target risk event corresponding to the target bond is displayed on the bond transaction page, so that the user can clearly understand the risk events that may occur in the target bond in the future, as well as the possibility of the risk event occurring, based on the characteristics of the target bond and the responsibilities that need to be borne. The risks are clear at a glance.
  • the information page corresponding to the bond information link is loaded.
  • this embodiment is a bond information link corresponding to the bond product displayed on the bond transaction page.
  • the bond information link corresponding to the bond information link is loaded. information page, and then display the risk information corresponding to the target bond to users.
  • FIG. 2 is a schematic diagram of jumping from the bond transaction page to the information page in an exemplary embodiment.
  • the bond transaction page includes multiple bond products, specifically including bbbb, aaaa, cccc, The numbers of bond products such as eeeee, fffff, ggggg, etc., wherein a corresponding interaction button is configured for each bond product.
  • the interaction button is the "+" symbol on the right side of the number of each bond product. It can be understood that Yes, you can also configure corresponding interaction buttons for bond products by setting other identifiers, which are not specifically limited here.
  • the user can trigger the second operation instruction for the bond information link, and then load the information page corresponding to the bond information link to display relevant risk warning information.
  • the information page corresponding to the bond number aaaaa includes the risk information of the bond product. The user can understand the risk status of the corresponding bond product through the risk information displayed on the information page.
  • FIG 3 is a flow chart of a bond risk warning method according to an exemplary embodiment.
  • the bond risk warning method involved in this embodiment is suitable for a risk warning system.
  • the risk warning system includes users, clients, products, and servers.
  • the bond risk warning method provided in this embodiment includes steps 1 to 5, which are described in detail as follows;
  • Step 1 Allocate high-risk bonds.
  • the product is a bond trading platform, and users can conduct bond transactions through the bond trading platform.
  • the product evaluates high-risk bonds through multiple risk dimensions such as credit risk, default risk, and liquidation risk of the bond, and configures them on the server side.
  • high-risk bonds are bonds whose risk score value is greater than the preset threshold among the bonds displayed on the client.
  • the product determines high-risk bonds and low-risk bonds by evaluating the risk score value of each bond product.
  • Step 2 View bond information.
  • the user requests to view bond information from the client.
  • the bond information represents whether the corresponding target bond is a high-risk bond.
  • the user sends a viewing request to the client, where the viewing request includes the target bond and corresponding bond information.
  • Step 3 Request bond information.
  • the client requests bond information from the server, and the client requests bond information from the server.
  • the client forwards the view request sent by the user to the server, so that the server responds to the view request target. Bond bond information is played to the client.
  • Step 4 Return bond information.
  • the server returns bond information to the client.
  • Step 5 Display bond information.
  • the client displays bond information to the user. For example, if the bond information indicates that the target bond is a high-risk bond, the client will display a high-risk warning mark or a high-risk warning bar.
  • the client displays bond information to the user so that the user can intuitively understand the bond information of the target bond, that is, can intuitively understand whether the target bond is a high-risk bond.
  • the client can avoid the user from blindly conducting bond transactions. Transactions can lead to unnecessary losses.
  • it can also prevent users from blaming the platform for losses after trading high-risk products, which helps reduce unnecessary risks.
  • FIG 4 is a block diagram of a big data-based bond risk information processing device illustrating an exemplary embodiment of the present application.
  • the big data-based bond risk information processing device 400 includes a word segmentation processing module 401, Determination module 402, matching module 403 and risk prompt module 404.
  • the word segmentation processing module 401 is used to segment the risk information of the issuance object corresponding to the target bond to obtain multiple corresponding content words.
  • the risk information includes the financial information, credit information and historical default information corresponding to the issuance object;
  • the determination module 402 uses The feature vector of the risk information is determined based on the importance parameters of the content words that match the preset keyword table.
  • the keyword table includes a plurality of keywords related to the preset risk event; the matching module 403 is used to match the feature vector with the preset risk event.
  • the risk events in the set risk event library are matched to obtain the target risk event that matches the feature vector.
  • the risk event library includes multiple risk events; the risk prompt module 404 is used to provide risk prompts based on the target risk event.
  • the determination module 402 includes a matching unit, a determination unit and a feature vector acquisition unit, wherein the matching unit is used to match each content word with the keyword table respectively to obtain the target content word; the determination unit is used to determine The importance parameters of each target content word; the feature vector acquisition unit is used to use a vector composed of the importance parameters of each target content word as a feature vector.
  • the big data-based bond risk information processing device 400 provided in this embodiment also includes a calculation module, wherein the calculation module is used to use the target matching degree as the risk event score value corresponding to the target risk event.
  • Weight calculate the weighted sum of the risk event score values of the target risk event, and use the weighted sum as the risk score value of the target bond.
  • the target matching degree is the matching degree between the target risk event and the feature vector.
  • the risk event library includes risk event score values corresponding to each risk event.
  • the risk prompt module 404 includes a response unit and a loading unit, wherein the response unit is used to obtain the target risk event and risk score in response to the first operation instruction for the target bond triggered in the bond trading page. value; the loading unit is used to load target risk events and risk score values into the bond trading page.
  • the loading unit is also used to associate the risk level, target risk event, and risk score value of the target bond with the target bond, and load the associated risk level, target risk event, and risk score value into On the bond trading page, the risk level is determined based on the risk score value.
  • the loading unit is also used to bring the risk level, target risk event, risk score value and target bond into the pre-generated risk prompt field template to obtain the corresponding risk prompt field.
  • the big data-based bond risk information processing device 400 provided in this embodiment also includes an identification adding module, configured to display the target bond on the bond transaction page if the risk level is determined to be the highest risk level. , add an exclamation point mark at the default position on the bond trading page.
  • the big data-based bond risk information processing device 400 provided in this embodiment also includes a response module, configured to load, in response to the second operation instruction for the bond information link triggered in the bond transaction page, The information page corresponding to the bond information link.
  • the information page is used to display risk information.
  • the target bond is associated with the bond information link for the target bond.
  • the present application provides an electronic device, including a processor and a memory, wherein computer readable instructions are stored on the memory, and when the computer readable instructions are executed by the processor, the above large-scale based on Data processing methods for bond risk information.
  • FIG. 5 shows a schematic structural diagram of a computer system suitable for implementing an electronic device according to an embodiment of the present application.
  • the computer system 1000 includes a central processing unit (Central Processing Unit, CPU) 1001, which can be loaded into a random computer according to a program stored in a read-only memory (Read-Only Memory, ROM) 1002 or from a storage part 1008. Access the program in the memory (Random Access Memory, RAM) 1003 to perform various appropriate actions and processing, such as performing the information recommendation method in the above embodiment. In RAM 1003, various programs and data required for system operation are also stored.
  • CPU 1001, ROM 1002 and RAM 1003 are connected to each other through bus 1004.
  • An input/output (I/O) interface 1005 is also connected to bus 1004.
  • the following components are connected to the I/O interface 1005: an input part 1006 including a keyboard, a mouse, etc.; an output part 1007 including a cathode ray tube (Cathode Ray Tube, CRT), a liquid crystal display (Liquid Crystal Display, LCD), etc., and a speaker, etc. ; a storage part 1008 including a hard disk, etc.; and a communication part 1009 including a network interface card such as a LAN (Local Area Network) card, a modem, etc.
  • the communication section 1009 performs communication processing via a network such as the Internet.
  • Driver 1010 is also connected to I/O interface 1005 as needed.
  • Removable media 1011 such as magnetic disks, optical disks, magneto-optical disks, semiconductor memories, etc., are installed on the drive 1010 as needed, so that a computer program read therefrom is installed into the storage portion 1008 as needed.
  • the process described above with reference to the flowchart may be implemented as a computer software program.
  • embodiments of the present application include a computer program product including a computer program carried on a computer-readable medium, the computer program including a computer program for performing the method shown in the flowchart.
  • the computer program may be downloaded and installed from the network via communication portion 1009 and/or installed from removable media 1011.
  • CPU central processing unit
  • the computer-readable medium shown in the embodiments of the present application may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two.
  • the computer-readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, device or device, or any combination thereof.
  • Computer readable storage media may include, but are not limited to: an electrical connection having one or more wires, a portable computer disk, a hard drive, random access memory (RAM), read only memory (ROM), removable Programmable Read-Only Memory (Erasable Programmable Read Only Memory, EPROM), flash memory, optical fiber, portable compact disk read-only memory (Compact Disc Read-Only Memory, CD-ROM), optical storage device, magnetic storage device, or any of the above suitable The combination.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program for use by or in connection with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, in which a computer-readable computer program is carried. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above.
  • a computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device .
  • Computer programs embodied on computer-readable media may be transmitted using any suitable medium, including but not limited to: wireless, wired, etc., or any suitable combination of the above.
  • each block in the flow chart or block diagram may represent a module, program segment, or part of the code.
  • the above-mentioned module, program segment, or part of the code includes one or more executable components for implementing the specified logical function. instruction.
  • the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown one after another may actually execute substantially in parallel, or they may sometimes execute in the reverse order, depending on the functionality involved.
  • each block in the block diagram or flowchart illustration, and combinations of blocks in the block diagram or flowchart illustration can be implemented by special purpose hardware-based systems that perform the specified functions or operations, or may be implemented by special purpose hardware-based systems that perform the specified functions or operations. Achieved by a combination of specialized hardware and computer instructions.
  • the units involved in the embodiments of this application can be implemented in software or hardware, and the described units can also be provided in a processor. Among them, the names of these units do not constitute a limitation on the unit itself under certain circumstances.
  • Another aspect of the present application also provides a computer-readable storage medium on which computer-readable instructions are stored.
  • the computer-readable instructions are executed by a processor, the big data-based method in any one of the previous embodiments is implemented. Bond risk information processing methods.
  • Another aspect of the present application also provides a computer program product or computer program, which includes computer instructions stored in a computer-readable storage medium.
  • the processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, so that the computer device executes the big data-based bond risk information processing method provided in the above embodiments.
  • the computer-readable medium shown in the embodiments of the present application may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two.
  • the computer-readable storage medium may be, for example, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, device or device, or any combination thereof.
  • Computer readable storage media may include, but are not limited to: an electrical connection having one or more wires, a portable computer disk, a hard drive, random access memory (RAM), read only memory (ROM), removable Programmable Read-Only Memory (Erasable Programmable Read Only Memory, EPROM), flash memory, optical fiber, portable compact disk read-only memory (Compact Disc Read-Only Memory, CD-ROM), optical storage device, magnetic storage device, or any of the above suitable The combination.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program for use by or in connection with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, in which a computer-readable computer program is carried. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above.
  • a computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device .
  • Computer programs embodied on computer-readable media may be transmitted using any suitable medium, including but not limited to: wireless, wired, etc., or any suitable combination of the above.
  • each block in the flow chart or block diagram may represent a module, program segment, or part of the code.
  • the above-mentioned module, program segment, or part of the code includes one or more executable components for implementing the specified logical function. instruction.
  • the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown one after another may actually execute substantially in parallel, or they may sometimes execute in the reverse order, depending on the functionality involved.
  • each block in the block diagram or flowchart illustration, and combinations of blocks in the block diagram or flowchart illustration can be implemented by special purpose hardware-based systems that perform the specified functions or operations, or may be implemented by special purpose hardware-based systems that perform the specified functions or operations. Achieved by a combination of specialized hardware and computer instructions.
  • the units involved in the embodiments of this application can be implemented in software or hardware, and the described units can also be provided in a processor. Among them, the names of these units do not constitute a limitation on the unit itself under certain circumstances.

Abstract

The present application discloses a bond risk information processing method based on big data and a related device. The method comprises: performing word segmentation processing on risk information of an issuing object corresponding to a target bond to obtain a plurality of corresponding content words, the risk information comprising financial information, credit information, and historical default information that correspond to the issuing object; determining a feature vector of the risk information on the basis of an importance parameter of a content word matched with a preset keyword table, the keyword table comprising a plurality of keywords related to a preset risk event; matching the feature vector with risk events in a preset risk event library to obtain a target risk event matched with the feature vector, the risk event library comprising a plurality of risk events; and performing risk warning on the basis of the target risk event.

Description

基于大数据的债券风险信息处理方法及相关设备Bond risk information processing methods and related equipment based on big data
相关申请的交叉引用Cross-references to related applications
本申请要求于2022年8月25日提交的,申请名称为“基于大数据的债券风险信息处理方法及相关设备”的、中国专利申请号为“202211032661.3”的优先权,该中国专利申请的全部内容通过引用结合在本申请中。This application requests the priority of the Chinese patent application number "202211032661.3", which was submitted on August 25, 2022 and is titled "Bond risk information processing method and related equipment based on big data". All the Chinese patent applications The contents are incorporated into this application by reference.
技术领域Technical field
本申请涉及大数据技术领域,具体而言,涉及一种基于大数据的债券风险信息处理方法及相关设备。This application relates to the field of big data technology, specifically, to a bond risk information processing method and related equipment based on big data.
背景技术Background technique
债券作为一类重要的投资工具。客户在投资债券时,可能会获得收益,也可能会承担一定交易风险。其中,交易债券的风险包括但不限于:信贷风险、流动性风险、货币风险、利率风险以及市场风险等。若客户选择投资高息债券或某些别具特点及风险的债券时,会承受更高的风险。例如,若债券发生违约,客户甚至可能会损失本金。因此有必要在客户进行交易前,让其了解债券产品的特点和需要承担的风险程度。相关技术对债券进行风险评估所用到的数据过于单一,分析得到的风险值误差较大,进而导致风险提示误差,进而造成债券交易过程中的数据展示不全面的问题。Bonds are an important investment tool. When customers invest in bonds, they may gain income or bear certain transaction risks. Among them, the risks of trading bonds include but are not limited to: credit risk, liquidity risk, currency risk, interest rate risk, market risk, etc. If customers choose to invest in high-interest bonds or certain bonds with unique characteristics and risks, they will bear higher risks. For example, if a bond defaults, customers may even lose their principal. Therefore, it is necessary for customers to understand the characteristics of bond products and the degree of risk they need to bear before they make transactions. The data used by related technologies to conduct risk assessment on bonds is too simple, and the risk value obtained through analysis has a large error, which in turn leads to errors in risk warnings, and in turn causes the problem of incomplete data display in the bond trading process.
技术解决方案Technical solutions
为解决上述技术问题,本申请的实施例提供了一种基于大数据的债券风险信息处理方法及相关设备。In order to solve the above technical problems, embodiments of the present application provide a bond risk information processing method and related equipment based on big data.
根据本申请实施例的一个方面,提供了一种基于大数据的债券风险信息处理方法,包括:对目标债券对应的发行对象的风险信息进行分词处理,得到对应的多个实词,风险信息包括发行对象对应的财务信息、信用信息和历史违约信息;基于与预设的关键词表匹配的实词的重要程度参数确定风险信息的特征向量,关键词表包括多个与预设的风险事件相关的关键词,其中,重要程度参数用于表示对应目标实词对于风险信息的重要程度;将特征向量与预设的风险事件库中的风险事件进行匹配,得到与特征向量相匹配的目标风险事件,风险事件库中包括多个风险事件;基于目标风险事件进行风险提示。According to one aspect of the embodiment of the present application, a bond risk information processing method based on big data is provided, which includes: performing word segmentation processing on the risk information of the issuance object corresponding to the target bond to obtain multiple corresponding content words. The risk information includes issuance Financial information, credit information and historical default information corresponding to the object; determine the feature vector of the risk information based on the importance parameters of the content words matching the preset keyword table. The keyword table includes multiple keys related to the preset risk events. word, where the importance parameter is used to represent the importance of the corresponding target content word to the risk information; the feature vector is matched with the risk events in the preset risk event library to obtain the target risk event that matches the feature vector, and the risk event The library includes multiple risk events; risk reminders are provided based on target risk events.
根据本申请实施例的一个方面,提供了基于大数据的债券风险信息处理装置,包括:分词处理模块,用于对目标债券对应的发行对象的风险信息进行分词处理,得到对应的多个实词,风险信息包括发行对象对应的财务信息、信用信息和历史违约信息;确定模块,用于基于与预设的关键词表匹配的实词的重要程度参数确定风险信息的特征向量,关键词表包括多个与预设的风险事件相关的关键词,其中,重要程度参数用于表示对应目标实词对于风险信息的重要程度;匹配模块,用于将特征向量与预设的风险事件库中的风险事件 进行匹配,得到与特征向量相匹配的目标风险事件,风险事件库中包括多个风险事件;风险提示模块,用于基于目标风险事件进行风险提示。According to one aspect of the embodiment of the present application, a bond risk information processing device based on big data is provided, including: a word segmentation processing module, used to perform word segmentation processing on the risk information of the issuance object corresponding to the target bond, and obtain multiple corresponding content words, The risk information includes financial information, credit information and historical default information corresponding to the issuance object; the determination module is used to determine the feature vector of the risk information based on the importance parameters of the content words that match the preset keyword table. The keyword table includes multiple Keywords related to preset risk events, in which the importance parameter is used to represent the importance of the corresponding target content word to the risk information; the matching module is used to match the feature vector with the risk events in the preset risk event library , obtain the target risk event that matches the feature vector, and the risk event library includes multiple risk events; the risk prompt module is used to provide risk prompts based on the target risk event.
在一个示例性实施例中,确定模块包括匹配单元、确定单元以及特征向量获取单元,其中,匹配单元用于将各个实词分别与关键词表进行匹配,得到目标实词;确定单元用于确定各个目标实词的重要程度参数;特征向量获取单元用于将由各个目标实词的重要程度参数组成的向量作为特征向量。In an exemplary embodiment, the determination module includes a matching unit, a determination unit and a feature vector acquisition unit, wherein the matching unit is used to match each content word with the keyword table respectively to obtain the target content word; the determination unit is used to determine each target Importance parameters of content words; the feature vector acquisition unit is used to use a vector composed of importance parameters of each target content word as a feature vector.
在一个示例性实施例中,本实施例提供的基于大数据的债券风险信息处理装置还包括计算模块,其中,计算模块用于将目标匹配度作为目标风险事件对应的风险事件评分值的权重,计算目标风险事件的风险事件评分值的加权和,将加权和作为目标债券的风险评分值,目标匹配度为目标风险事件与特征向量之间的匹配度。风险事件库中包括每个风险事件对应的风险事件评分值。In an exemplary embodiment, the big data-based bond risk information processing device provided in this embodiment also includes a calculation module, wherein the calculation module is used to use the target matching degree as the weight of the risk event score value corresponding to the target risk event, Calculate the weighted sum of the risk event score values of the target risk event, and use the weighted sum as the risk score value of the target bond. The target matching degree is the matching degree between the target risk event and the feature vector. The risk event library includes risk event score values corresponding to each risk event.
在一个示例性实施例中,风险提示模块包括响应单元和加载单元,其中,响应单元用于响应于债券交易页面中触发的针对目标债券的第一操作指令,获取目标风险事件以及风险评分值;加载单元用于将目标风险事件以及风险评分值加载于债券交易页面。In an exemplary embodiment, the risk prompt module includes a response unit and a loading unit, wherein the response unit is configured to obtain the target risk event and the risk score value in response to the first operation instruction for the target bond triggered in the bond trading page; The loading unit is used to load target risk events and risk score values into the bond trading page.
在一个示例性实施例中,加载单元还用于将目标债券的风险等级、目标风险事件、风险评分值与目标债券进行关联,将关联之后的风险等级、目标风险事件、风险评分值加载于债券交易页面,风险等级基于风险评分值确定。In an exemplary embodiment, the loading unit is also used to associate the risk level, target risk event, and risk score value of the target bond with the target bond, and load the associated risk level, target risk event, and risk score value into the bond. On the transaction page, the risk level is determined based on the risk score value.
在一个示例性实施例中,加载单元还用于将风险等级、目标风险事件、风险评分值与目标债券带入预先生成的风险提示字段模板,得到对应的风险提示字段。In an exemplary embodiment, the loading unit is also used to bring the risk level, target risk event, risk score value and target bond into the pre-generated risk prompt field template to obtain the corresponding risk prompt field.
在一个示例性实施例中,本实施例提供的基于大数据的债券风险信息处理装置还包括响应模块,用于响应于债券交易页面中触发的针对债券资讯链接的第二操作指令,加载债券资讯链接所对应的资讯页面,资讯页面用于展示风险信息,其中,目标债券关联针对目标债券的债券资讯链接。In an exemplary embodiment, the big data-based bond risk information processing device provided in this embodiment also includes a response module, configured to load bond information in response to the second operation instruction for the bond information link triggered in the bond transaction page. The information page corresponding to the link is used to display risk information. The target bond is associated with the bond information link for the target bond.
根据本申请实施例的一个方面,提供了一种电子设备,包括处理器及存储器,存储器上存储有计算机可读指令,计算机可读指令被处理器执行时实现如上的基于大数据的债券风险信息处理方法。According to one aspect of the embodiment of the present application, an electronic device is provided, including a processor and a memory. Computer-readable instructions are stored on the memory. When the computer-readable instructions are executed by the processor, the above big data-based bond risk information is implemented. Approach.
根据本申请实施例的一个方面,提供了一种计算机可读存储介质,其上存储有计算机可读指令,当计算机可读指令被计算机的处理器执行时,使计算机执行如前提供的基于大数据的债券风险信息处理方法。According to an aspect of an embodiment of the present application, a computer-readable storage medium is provided, on which computer-readable instructions are stored. When the computer-readable instructions are executed by a processor of a computer, the computer is caused to execute the large-scale method as provided above. Data processing methods for bond risk information.
根据本申请实施例的一个方面,提供了一种计算机程序产品或计算机程序,该计算机程序产品或计算机程序包括计算机指令,该计算机指令存储在计算机可读存储介质中。计算机设备的处理器从计算机可读存储介质读取该计算机指令,处理器执行该计算机指令,使得该计算机设备执行上述各种可选实施例中提供的基于大数据的债券风险信息处理方法。According to an aspect of an embodiment of the present application, a computer program product or a computer program is provided, the computer program product or the computer program including computer instructions, and the computer instructions are stored in a computer-readable storage medium. The processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, so that the computer device executes the big data-based bond risk information processing method provided in the above various optional embodiments.
在本申请的实施例提供的技术方案中,对目标债券对应的发行对象的风险信息进行分 词处理,得到对应的多个实词,风险信息包括发行对象对应的财务信息、信用信息和历史违约信息;基于与预设的关键词表匹配的实词的重要程度参数确定风险信息的特征向量,关键词表包括多个与预设的风险事件相关的关键词,其中,重要程度参数用于表示对应目标实词对于风险信息的重要程度;将特征向量与预设的风险事件库中的风险事件进行匹配,得到与特征向量相匹配的目标风险事件,风险事件库中包括多个风险事件;基于目标风险事件进行风险提示。本实施例基于与预设的关键词表匹配的实词的重要程度参数确定风险信息的特征向量,上述方式可以充分提取风险信息中的语义信息,进而能够更加准确地提取风险信息中的特征,使得到的风险信息对应的特征向量更加准确,并提取与风险事件相关的信息,避免浪费不必要的算力资源,提高基于债券信息进行大数据风险评估的效率;另外,本实施例从更小粒度出发分析风险信息可能存在的风险事件,基于风险事件进行风险提示,使得用户能够更加明确债券产品带来的风险。In the technical solution provided by the embodiment of this application, the risk information of the issuance object corresponding to the target bond is segmented to obtain multiple corresponding content words. The risk information includes the financial information, credit information and historical default information corresponding to the issuance object; The feature vector of the risk information is determined based on the importance parameters of the content words that match the preset keyword table. The keyword table includes multiple keywords related to the preset risk events, where the importance parameters are used to represent the corresponding target content words. Regarding the importance of risk information; match the feature vector with the risk events in the preset risk event library to obtain the target risk event that matches the feature vector. The risk event library includes multiple risk events; based on the target risk event risk warning. This embodiment determines the feature vector of the risk information based on the importance parameters of the content words that match the preset keyword table. The above method can fully extract the semantic information in the risk information, and thereby more accurately extract the features in the risk information, so that The feature vector corresponding to the obtained risk information is more accurate, and information related to risk events is extracted to avoid unnecessary waste of computing resources and improve the efficiency of big data risk assessment based on bond information; in addition, this embodiment starts from a smaller granularity Start by analyzing possible risk events in risk information, and provide risk reminders based on risk events, so that users can be more clear about the risks brought by bond products.
应当理解的是,以上的一般描述和后文的细节描述仅是示例性和解释性的,并不能限制本申请。It should be understood that the above general description and the following detailed description are only exemplary and explanatory, and do not limit the present application.
附图说明Description of drawings
此处的附图被并入说明书中并构成本说明书的一部分,示出了符合本申请的实施例,并与说明书一起用于解释本申请的原理。显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术者来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。在附图中:The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the application and together with the description, serve to explain the principles of the application. Obviously, the drawings in the following description are only some embodiments of the present application. For those of ordinary skill in the art, other drawings can be obtained based on these drawings without exerting creative efforts. In the attached picture:
图1是本申请的一示例性实施例示出的基于大数据的债券风险信息处理方法的流程图;Figure 1 is a flow chart of a bond risk information processing method based on big data according to an exemplary embodiment of the present application;
图2是一示例性实施例示出的由债券交易页面跳转至资讯页面的示意图;Figure 2 is a schematic diagram of jumping from the bond trading page to the information page according to an exemplary embodiment;
图3是一示例性实施例示出的债券的风险提示方法的流程图;Figure 3 is a flow chart of a bond risk warning method according to an exemplary embodiment;
图4是本申请一示例性实施例示出的基于大数据的债券风险信息处理装置的框图;Figure 4 is a block diagram of a big data-based bond risk information processing device according to an exemplary embodiment of the present application;
图5示出了适于用来实现本申请实施例的电子设备的计算机系统的结构示意图。FIG. 5 shows a schematic structural diagram of a computer system suitable for implementing an electronic device according to an embodiment of the present application.
具体实施方式Detailed ways
这里将详细地对示例性实施例执行说明,其示例表示在附图中。下面的描述涉及附图时,除非另有表示,不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本申请相一致的所有实施方式。相反,它们仅是与如所附权利要求书中所详述的、本申请的一些方面相一致的装置和方法的例子。Reference will now be made in detail to exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the drawings, the same numbers in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with this application. Rather, they are merely examples of apparatus and methods consistent with aspects of the application as detailed in the appended claims.
附图中所示的方框图仅仅是功能实体,不一定必须与物理上独立的实体相对应。即,可以采用软件形式来实现这些功能实体,或在一个或多个硬件模块或集成电路中实现这些功能实体,或在不同网络和/或处理器装置和/或微控制器装置中实现这些功能实体。The block diagrams shown in the figures are functional entities only and do not necessarily correspond to physically separate entities. That is, these functional entities may be implemented in software form, or implemented in one or more hardware modules or integrated circuits, or implemented in different networks and/or processor devices and/or microcontroller devices. entity.
附图中所示的流程图仅是示例性说明,不是必须包括所有的内容和操作/步骤,也不是必须按所描述的顺序执行。例如,有的操作/步骤还可以分解,而有的操作/步骤可以合并或部分合并,因此实际执行的顺序有可能根据实际情况改变。The flowcharts shown in the drawings are only illustrative, and do not necessarily include all contents and operations/steps, nor must they be performed in the order described. For example, some operations/steps can be decomposed, and some operations/steps can be merged or partially merged, so the actual order of execution may change according to the actual situation.
还需要说明的是:在本申请中提及的“多个”是指两个或者两个以上。“和/或”描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。字符“/”一般表示前后关联对象是一种“或”的关系。It should also be noted that the “multiple” mentioned in this application refers to two or more. "And/or" describes the relationship between related objects, indicating that there can be three relationships. For example, A and/or B can mean: A exists alone, A and B exist simultaneously, and B exists alone. The character "/" generally indicates that the related objects are in an "or" relationship.
需要说明的是,本申请实施例中所涉及的网元还可以称为功能或功能实体,本申请不做限制。例如,接入与移动性管理功能网元还可以称为接入与移动性管理功能或接入与移动性管理功能实体,会话管理功能网元可以称为会话管理功能或会话管理功能实体等。各个网元的名称在本申请中不做限定,本领域技术人员可以将上述网元的名称更换为其它名称而执行相同的功能,均属于本申请保护的范围。It should be noted that the network elements involved in the embodiments of this application can also be called functions or functional entities, which are not limited by this application. For example, the access and mobility management function network element may also be called the access and mobility management function or the access and mobility management function entity, and the session management function network element may be called the session management function or session management function entity, etc. The names of each network element are not limited in this application. Those skilled in the art can replace the names of the above network elements with other names to perform the same functions, which all fall within the scope of protection of this application.
为了更好的理解及说明本申请实施例的方案,下面对本申请实施例中所涉及到的技术用语进行简单说明。In order to better understand and explain the solutions of the embodiments of the present application, the technical terms involved in the embodiments of the present application are briefly described below.
首先说明的是,大数据是指无法在一定时间范围内用常规软件工具进行捕捉、管理和处理的数据集合,是需要新处理模式才能具有更强的决策力、洞察发现力和流程优化能力的海量、高增长率和多样化的信息资产。随着云时代的来临,大数据也吸引了越来越多的关注,大数据需要特殊的技术,以有效地处理大量的容忍经过时间内的数据。适用于大数据的技术,包括大规模并行处理数据库、数据挖掘、分布式文件系统、分布式数据库、云计算平台、互联网和可扩展的存储系统。具体的,随着人们的经济水平不断提高,投资理财成为了人们的主流理财方式。而债券作为其中的一种投资理财方式,也得到了相当多的关注,特别是基于大数据的投资理财方式。但基于大数据的投资债券可能会获得收益,也可能会承担一定交易风险,因此,为了使客户了解债券产品的特点和需要承担的风险,基于大数据的债券交易风险的评估与提示尤为重要。First of all, big data refers to a collection of data that cannot be captured, managed and processed with conventional software tools within a certain time range. It requires new processing models to have stronger decision-making power, insight discovery and process optimization capabilities. Massive, high-growth and diversified information assets. With the advent of the cloud era, big data has also attracted more and more attention. Big data requires special technologies to effectively handle large amounts of data within a tolerable time. Technologies applicable to big data include massively parallel processing databases, data mining, distributed file systems, distributed databases, cloud computing platforms, the Internet, and scalable storage systems. Specifically, as people's economic level continues to improve, investment and financial management have become people's mainstream financial management methods. As one of the investment and financial management methods, bonds have also received considerable attention, especially those based on big data. However, investing in bonds based on big data may yield returns and may also bear certain transaction risks. Therefore, in order to enable customers to understand the characteristics of bond products and the risks they need to bear, it is particularly important to assess and prompt bond transaction risks based on big data.
债券违约是指债券发行主体不能按照事先达成的债券协议履行其义务的行为,近年来高发的债券违约现象给个人和机构投资者敲响了警钟,因此针对可能会造成债券违约的风险识别显得极为重要。传统的智能债券分析工具,往往只能提供债券的财务数据浏览和简单的信用评级功能,信息内容单一,投资者无法从所提供的财务数据以及信用评级中获取债券的直观评价,且难以实现对债券违约风险点的全面跟踪把控。Bond default refers to the failure of the bond issuer to perform its obligations in accordance with the bond agreement reached in advance. The high incidence of bond defaults in recent years has sounded the alarm to individual and institutional investors. Therefore, it is extremely important to identify the risks that may cause bond defaults. important. Traditional smart bond analysis tools often only provide financial data browsing and simple credit rating functions for bonds. The information content is single. Investors cannot obtain an intuitive evaluation of bonds from the provided financial data and credit ratings, and it is difficult to achieve accurate evaluation of bonds. Comprehensive tracking and control of bond default risk points.
请参阅图1,图1是本申请的一示例性实施例示出的基于大数据的债券风险信息处理方法的流程图。本实施例中基于大数据的债券风险信息处理方法可以应用于债券风险提示装置,本申请的债券风险提示装置可以为服务器,也可以为移动设备,还可以为由服务器和移动设备相互配合的系统。相应地,移动设备包括的各个部分,例如各个单元、子单元、模块、子模块可以全部设置于服务器中,也可以全部设置于移动设备中,还可以分别设置于服务器和移动设备中。Please refer to Figure 1, which is a flow chart of a bond risk information processing method based on big data according to an exemplary embodiment of the present application. The bond risk information processing method based on big data in this embodiment can be applied to the bond risk reminder device. The bond risk reminder device of this application can be a server, a mobile device, or a system in which a server and a mobile device cooperate with each other. . Correspondingly, various parts included in the mobile device, such as each unit, sub-unit, module, and sub-module, may all be provided in the server, may all be provided in the mobile device, or may be provided in the server and the mobile device respectively.
进一步地,上述服务器可以是硬件,也可以是软件。当服务器为硬件时,可以实现成多个服务器组成的分布式服务器集群,也可以实现成单个服务器。当服务器为软件时,可以实现成多个软件或软件模块,例如用来提供分布式服务器的软件或软件模块,也可以实现成单个软件或软件模块,在此不做具体限定。Further, the above-mentioned server may be hardware or software. When the server is hardware, it can be implemented as a distributed server cluster composed of multiple servers or as a single server. When the server is software, it can be implemented as multiple software or software modules, such as software or software modules used to provide distributed servers, or it can be implemented as a single software or software module, which is not specifically limited here.
可以理解的是,在本申请的具体实施方式中,涉及到债券交易、用户数据等相关的数据,当本申请以上实施例运用到具体产品或技术中时,需要获得用户许可同意或者授权,且相关数据的收集、使用和处理需要遵守相关国家和地区的相关法律法规和标准。It can be understood that in the specific implementation of this application, related data such as bond transactions and user data are involved. When the above embodiments of this application are applied to specific products or technologies, user consent or authorization is required, and The collection, use and processing of relevant data need to comply with relevant laws, regulations and standards of relevant countries and regions.
如图1所示,在一示例性的实施例中,基于大数据的债券交易风险提示方法至少包括步骤S101-步骤S104,详细介绍如下:As shown in Figure 1, in an exemplary embodiment, the bond trading risk warning method based on big data at least includes steps S101 to S104, which are described in detail as follows:
步骤S101:对目标债券对应的发行对象的风险信息进行分词处理,得到对应的多个实词。Step S101: Perform word segmentation processing on the risk information of the issuance object corresponding to the target bond to obtain multiple corresponding content words.
在本实施例中,风险信息为可能造成债券违约的信息,风险信息包括发行对象对应的财务信息、信用信息和历史违约信息。由于债券的风险来源通常与其发行对象息息相关,因此,本实施例通过分析发行对象的风险信息对目标债券进行风险评估。In this embodiment, the risk information is information that may cause bond default. The risk information includes financial information, credit information and historical default information corresponding to the issuing object. Since the risk source of a bond is usually closely related to the object to which it is issued, this embodiment performs a risk assessment on the target bond by analyzing the risk information of the object to be issued.
例如,发行对象在预设时间段内的发生的违约事件的次数,在本实施例中,违约事件可以是针对目标债券的,也可以是针对发行对象的其他债券的,在此不做具体限定。风险信息还包括发行对象的财务状况、征信信息、第三方信用评级报告或者研究报告等。For example, the number of default events that occur in the issuance object within a preset time period. In this embodiment, the default event can be for the target bond or other bonds of the issuance object. There is no specific limit here. . Risk information also includes the financial status of the issuer, credit information, third-party credit rating reports or research reports, etc.
示例性地,本实施例可以从公开渠道获得企业的财务数据和评级机构评级数据,并进行储存,如果存在空值,则也将其储存为空值;其中,财务信息包括企业经营情况、企业偿债能力、企业负债情况,企业经营情况进一步包括销售毛利率、营业净利率,企业偿债能力进一步包括短期负债、速动比率、经营性现金流/有息负债,企业负债情况进一步包括资产负债率、合并报表经调整的资产负债率、母公司经调整的资产负债率;评级机构数据包括评级机构当期评级、评级机构评级展望;合并报表经调整的资产负债率=合并报表中的经调整的负债合计/经调整的资产合计;母公司经调整的资产负债率=母公司报表中的经调整的负债合计/经调整的资产合计。Illustratively, this embodiment can obtain an enterprise's financial data and rating agency rating data from public channels and store them. If there is a null value, it will also be stored as a null value; where the financial information includes the enterprise's operating conditions, the enterprise's Solvency, corporate liabilities, corporate operating conditions further include gross sales profit margin, operating net profit margin, corporate solvency further includes short-term liabilities, quick ratio, operating cash flow/interest-bearing liabilities, corporate liabilities further include assets and liabilities ratio, the adjusted asset-liability ratio of the consolidated statement, and the adjusted asset-liability ratio of the parent company; the rating agency data includes the current rating of the rating agency and the rating outlook of the rating agency; the adjusted asset-liability ratio of the consolidated statement = the adjusted asset-liability ratio of the consolidated statement Total liabilities/total adjusted assets; the parent company’s adjusted asset-liability ratio = total adjusted liabilities/total adjusted assets in the parent company’s statement.
分词就是将连续的字序列按照一定的规范重新组合成语义独立词序列的过程。分词是自然语言处理的基础,分词准确度直接决定了后面的词性标注、句法分析、词向量以及文本分析的质量,本实施例可以通过中文分词算法对文本进行分词,对于具体的分词过程在此不做过多叙述。Word segmentation is the process of recombining continuous word sequences into semantically independent word sequences according to certain specifications. Word segmentation is the basis of natural language processing. The accuracy of word segmentation directly determines the quality of subsequent part-of-speech tagging, syntactic analysis, word vectors and text analysis. This embodiment can segment the text through the Chinese word segmentation algorithm. The specific word segmentation process is here Don’t go into too much detail.
示例性地,通过步骤S601-步骤S602对风险信息进行分词处理,详细叙述如下:Illustratively, the risk information is segmented into words through steps S601 to S602. The details are as follows:
步骤S601:对风险信息对应的文档字符串进行分割,得到多个长度不一的字符串。Step S601: Split the document string corresponding to the risk information to obtain multiple strings of different lengths.
本实施例根据字符串中包括的字符个数对风险信息对应的文档字符串进行分割,例如,风险信息“目前承压的恒大所面临的市场情绪化进一步恶化”,对该风险信息对应的文档字符串进行分割,得到的包括一个字符的字符串包括“目”“压”“的”“场”“情”“绪”“化”“进”“化”等,包括两个字符的字符串包括“目前”“前承”“承压”“压的”“的恒”“恒大”“大所”“所面”“市场”等,包括三个字符的字符串包括“目前承”“前承压”“承压的”“情绪化”“进一步”“市场情”等,包括四个字符的字符串包括“前承压的”“所面临的”等,包括五个字符的字符串包括“市场情绪化”“进一步恶化”……,由于实词的个数最多不超过五个字符,因此,本实施例在对风险信息对应 的文档字符串进行分割时,设置字符串包括的字符不超过5,能够节省算力资源和加快分词效率。This embodiment divides the document string corresponding to the risk information according to the number of characters included in the string. For example, the risk information "The market sentiment faced by Evergrande, which is currently under pressure, has further deteriorated", the risk information corresponding to The document string is divided, and the obtained string including one character includes "objective", "pressure", "de", "field", "emotion", "mood", "hua", "jin", "hua", etc., and the obtained string includes two characters. The string includes "current", "former", "under pressure", "pressured", "ever", "everda", "big firm", "face", "market", etc. The string including three characters includes "currently" "Previously under pressure", "under pressure", "emotional", "further", "market situation", etc., including four-character strings including "previously under pressure", "faced", etc., including five-character strings The string includes "market sentiment", "further deterioration", etc. Since the number of content words does not exceed five characters at most, this embodiment sets the characters included in the string when segmenting the document string corresponding to the risk information. No more than 5, which can save computing resources and speed up word segmentation efficiency.
步骤S602:将各个字符串分别与预设的词典进行匹配,若能匹配上,确定对应字符串为一个实词。Step S602: Match each character string with a preset dictionary respectively. If there is a match, determine that the corresponding character string is a content word.
示例性地,本实施例将各个字符串分别与通用语文词典进行匹配,若能匹配上,确定对应字符串为一个实词。Illustratively, this embodiment matches each character string with a general Chinese dictionary. If a match is found, the corresponding character string is determined to be a content word.
通用语文词典收录语言中常用的主要词汇,适合于不同阶层的读者在语言学习过程中使用,例如学者、处于各个学习阶段的学生、语言教师或者家庭主妇等。通用语文词典面向整个大众读者群收录各行各业有代表性的常用词汇,通用语文词典所收录词汇相较其他专用词典更全更多。The General Chinese Dictionary contains the main vocabulary commonly used in the language and is suitable for readers from different walks of life to use in the language learning process, such as scholars, students at various stages of learning, language teachers or housewives, etc. The General Chinese Dictionary collects representative commonly used vocabulary from all walks of life for the entire public readership. The vocabulary included in the General Chinese Dictionary is more comprehensive than other special dictionaries.
示例性地,对分词处理之后得到的实词进行去停用词处理,去除无含义的语气词、副词、特殊符号和标点符号,例如,将“的”这一实词删除,以为后续处理过程节约算力资源。For example, the content words obtained after word segmentation processing are processed to remove stop words, and meaningless modal particles, adverbs, special symbols and punctuation marks are removed. For example, the content word "的" is deleted to save money in subsequent processing. human resources.
步骤S102:基于与预设的关键词表匹配的目标实词的重要程度参数确定风险信息的特征向量。Step S102: Determine the feature vector of the risk information based on the importance parameters of the target content words that match the preset keyword table.
在本实施例中,预先配置多个风险事件,风险事件是指债券产品可能发生的违约事件。本实施例可以基于债券产品的历史违约信息确定多个风险事件,例如,基于机器学习的方式确定风险事件。在本实施例中,风险事件包括财务类、法律类、资本类以及经营类等风险事件,例如,财务类风险事件包括资本结构变动、流动性差以及业绩亏损等标签。In this embodiment, multiple risk events are pre-configured, and risk events refer to possible default events of bond products. This embodiment can determine multiple risk events based on historical default information of bond products, for example, determine risk events based on machine learning. In this embodiment, risk events include financial, legal, capital, and operating risk events. For example, financial risk events include labels such as changes in capital structure, poor liquidity, and performance losses.
本实施例基于风险事件预先构建关键词表,关键词表中包括多个与预设的风险事件相关的关键词。与风险事件相关的关键词是对应风险事件的特征标识,比如风险事件“流动性差”对应的关键词包括“流动性”、“差”等等。This embodiment pre-constructs a keyword table based on risk events, and the keyword table includes a plurality of keywords related to preset risk events. Keywords related to risk events are characteristic identifiers corresponding to risk events. For example, keywords corresponding to the risk event "poor liquidity" include "liquidity", "poor", etc.
本实施例考虑到由风险信息分词得来的实词包括与对应风险事件相关的,例如,“流动性”,也包括与对应风险事件不相关的,例如,一些标点符号、连接词等等,还包括与风险事件部分相关的,本实施例基于关键词表对由风险事件分词得来的多个实词进行筛选以确定可以作为用于构建风险信息的特征向量的目标实词,进而加快风险评估效率,避免浪费不必要的算力资源。This embodiment considers that the content words obtained by word segmentation of risk information include those related to the corresponding risk event, such as "liquidity", and also include those that are not related to the corresponding risk event, such as some punctuation marks, connectives, etc., and also include Including those related to risk events, this embodiment screens multiple content words obtained by word segmentation of risk events based on the keyword table to determine the target content words that can be used as feature vectors for constructing risk information, thereby speeding up the efficiency of risk assessment. Avoid unnecessary waste of computing resources.
示例性地,本实施例可以基于步骤S501-步骤S503实现,详细叙述如下:Illustratively, this embodiment can be implemented based on steps S501 to S503. The details are described as follows:
步骤S501:将各个实词分别与关键词表进行匹配,得到与关键词表匹配的目标实词。Step S501: Match each content word with the keyword table to obtain the target content word matching the keyword table.
本实施例首先确定与关键词表匹配的实词,示例性地,可以直接将每个实词与关键词表进行对比,若在关键词表中能找出与对应实词相同的关键词,则确定对应实词与关键词表匹配。In this embodiment, the content words that match the keyword table are first determined. For example, each content word can be directly compared with the keyword list. If the same keyword as the corresponding content word can be found in the keyword table, the corresponding content word is determined. Content words are matched against the keyword list.
考虑到实词与关键词表中的关键词在不完全相同的情况下,也可能存在比较强的语义相关关系,例如“流动性”和“流通性”。示例性地,本实施例通过计算每个实词与关键词之间的相关度,若相关度大于预设阈值,则确定对应实词和对应关键词匹配。若得到的 相关度小于预设阈值,若确定对应实词与关键词表不匹配,说明对应实词对用户的特征贡献不大,因此在确定特征向量时,将这类实词舍去,本实施例通过上述方式,能够充分提取风险信息中的语义信息,进而能够更加准确地提取风险信息中的特征,使得到的风险信息对应的特征向量更加准确。Considering that the content words and the keywords in the keyword list are not exactly the same, there may be relatively strong semantic correlations, such as "liquidity" and "circulation". Illustratively, this embodiment calculates the correlation between each content word and the keyword. If the correlation is greater than the preset threshold, it is determined that the corresponding content word matches the corresponding keyword. If the obtained correlation is less than the preset threshold, and if it is determined that the corresponding content words do not match the keyword list, it means that the corresponding content words do not contribute much to the user's characteristics. Therefore, when determining the feature vector, such content words are discarded. In this embodiment, The above method can fully extract the semantic information in the risk information, and then more accurately extract the features in the risk information, making the obtained feature vector corresponding to the risk information more accurate.
步骤S502:确定各个目标实词的重要程度参数。Step S502: Determine the importance parameters of each target content word.
在本实施例中,目标实词的重要程度参数用于表示对应目标实词对于风险信息的重要程度。In this embodiment, the importance parameter of the target content word is used to represent the importance of the corresponding target content word to the risk information.
示例性地,本实施例可以通过以下公式确定目标实词的重要程度参数:For example, this embodiment can determine the importance parameter of the target content word through the following formula:
Figure PCTCN2022140776-appb-000001
Figure PCTCN2022140776-appb-000001
其中,W x,y表示目标实词x在风险信息y中对应的重要程度参数,tf x,y表示目标实词x在风险信息y中出现的频率,N表示风险信息y中包括的文本总数,df x表示文本总数N中包括目标实词x的文本数目。 Among them, W x, y represents the corresponding importance parameter of the target content word x in the risk information y, tf x, y represents the frequency of the target content word x appearing in the risk information y, N represents the total number of texts included in the risk information y, and df x represents the number of texts including the target content word x in the total number of texts N.
本申请发明人考虑到,由于实词的重要性随其在文件中出现的次数成正比增加,但同时会随着它在语料库中出现的频率成反比下降,因此,本实施例将tf x,y这一参数置于公式的分子位置,表征目标实词的重要程度参数与目标实词x在风险信息y中出现的频率成正比;将df x这一参数置于公式的分母为孩子自,表征目标实词的重要程度参数的大小与文本总数N中包括目标实词x的文本数目成反比,通过上述公式求得的实词的重要程度参数更加准确。 The inventor of the present application considers that since the importance of content words increases in direct proportion to the number of times they appear in the document, but at the same time decreases inversely proportional to the frequency of their appearance in the corpus, therefore, in this embodiment, tf x, y This parameter is placed in the numerator position of the formula, and represents the importance of the target content word. The parameter is proportional to the frequency of the target content word x appearing in the risk information y; placing the parameter df The size of the importance parameter is inversely proportional to the number of texts including the target content word x in the total number of texts N. The importance parameter of the content word obtained through the above formula is more accurate.
步骤S503:将由各个目标实词的重要程度参数组成的向量作为特征向量。Step S503: Use a vector composed of importance parameters of each target content word as a feature vector.
示例性地,按照与目标实词相匹配的关键词在关键词表中的顺序排列各个目标实词的重要程度参数,从而得到特征向量。For example, the importance parameters of each target content word are arranged according to the order of keywords matching the target content word in the keyword table, thereby obtaining a feature vector.
示例性地,统一风险信息对应的特征向量的维度,示例性地,预先设置风险信息对应的特征向量的维度等于关键词表中包括的关键词的数目,按照关键词表中包括的关键词的排列顺序对目标实词对应的重要程度参数进行排序,从而得到特征向量,其中,在风险信息对应的目标实词的数目小于关键词表中包括的关键词的数目时,将没有目标实词与其对应的关键词的位置处设置特征向量的元素为0。Exemplarily, the dimensions of the feature vectors corresponding to the risk information are unified. Exemplarily, the dimensions of the feature vectors corresponding to the risk information are preset to be equal to the number of keywords included in the keyword table. According to the number of keywords included in the keyword table, The ranking order sorts the importance parameters corresponding to the target content words to obtain the feature vector. Among them, when the number of target content words corresponding to the risk information is less than the number of keywords included in the keyword table, there will be no target content words and their corresponding keywords. Set the element of the feature vector to 0 at the position of the word.
步骤S103:将特征向量与预设的风险事件库中的风险事件进行匹配,得到与特征向量相匹配的目标风险事件。Step S103: Match the feature vector with the risk events in the preset risk event library to obtain the target risk event that matches the feature vector.
在本实施例中,风险事件库中包括不同风险事件类型的风险事件。本步骤中,服务器可以预先将已知的违约债券作为例子,对违约债券的舆情信息进行分析,得到不同风险事件。In this embodiment, the risk event library includes risk events of different risk event types. In this step, the server can take known defaulted bonds as examples in advance, analyze the public opinion information of defaulted bonds, and obtain different risk events.
示例性地,基于文本提取算法确定每个风险事件对应的特征向量。具体地,获取多个风险事件对应的标签风险信息,对标签风险信息进行分词处理,统计得到的多个实词对应的重要程度参数,基于多个实词对应的重要程度参数构建风险事件对应的特征向量。For example, the feature vector corresponding to each risk event is determined based on a text extraction algorithm. Specifically, the label risk information corresponding to multiple risk events is obtained, the label risk information is segmented, the importance parameters corresponding to the multiple content words are counted, and the feature vector corresponding to the risk event is constructed based on the importance parameters corresponding to the multiple content words. .
示例性地,本实施例将风险信息对应的特征向量分别与每个风险事件对应的特征向量进行匹配,基于得到的匹配结果确定与特征向量相匹配的目标风险事件。示例性地,本实施例计算风险信息对应的特征向量分别与每个风险事件对应的特征向量之间的相关度,将相关度大于预设阈值对应的风险事件作为目标风险事件。可以理解的是,目标风险事件可以根据实际应用场景的不同为一个或者多个。Illustratively, this embodiment matches the feature vector corresponding to the risk information with the feature vector corresponding to each risk event, and determines the target risk event matching the feature vector based on the obtained matching result. Illustratively, this embodiment calculates the correlation between the feature vector corresponding to the risk information and the feature vector corresponding to each risk event, and uses the risk event corresponding to the correlation degree greater than the preset threshold as the target risk event. It is understandable that the target risk event can be one or more depending on the actual application scenario.
步骤S104:基于目标风险事件进行风险提示。Step S104: Provide risk prompts based on target risk events.
本实施例提供的基于大数据的债券风险信息处理方法,对目标债券对应的发行对象的风险信息进行分词处理,得到对应的多个实词,风险信息包括发行对象对应的财务信息、信用信息和历史违约信息;基于与预设的关键词表匹配的实词的重要程度参数确定风险信息的特征向量,关键词表包括多个与预设的风险事件相关的关键词,其中,重要程度参数用于表示对应目标实词对于风险信息的重要程度;将特征向量与预设的风险事件库中的风险事件进行匹配,得到与特征向量相匹配的目标风险事件,风险事件库中包括多个风险事件;基于目标风险事件进行风险提示。本实施例基于与预设的关键词表匹配的实词的重要程度参数确定风险信息的特征向量,上述方式可以充分提取风险信息中的语义信息,进而能够更加准确地提取风险信息中的特征,使得到的风险信息对应的特征向量更加准确,并提取与风险事件相关的信息,避免浪费不必要的算力资源,提高基于债券信息进行大数据风险评估的效率;另外,本实施例从更小粒度出发分析风险信息可能存在的风险事件,基于风险事件进行风险提示,使得用户能够更加明确债券产品带来的风险。The bond risk information processing method based on big data provided in this embodiment performs word segmentation processing on the risk information of the issuance object corresponding to the target bond to obtain multiple corresponding content words. The risk information includes the financial information, credit information and history corresponding to the issuance object. Default information; determine the feature vector of risk information based on the importance parameters of the content words matching the preset keyword table. The keyword table includes multiple keywords related to the preset risk events, where the importance parameter is used to represent Corresponds to the importance of the target content word to the risk information; matches the feature vector with the risk events in the preset risk event library to obtain the target risk event that matches the feature vector. The risk event library includes multiple risk events; based on the target Provide risk reminders for risk events. This embodiment determines the feature vector of the risk information based on the importance parameters of the content words that match the preset keyword table. The above method can fully extract the semantic information in the risk information, and thereby more accurately extract the features in the risk information, so that The feature vector corresponding to the obtained risk information is more accurate, and information related to risk events is extracted to avoid unnecessary waste of computing resources and improve the efficiency of big data risk assessment based on bond information; in addition, this embodiment starts from a smaller granularity Start by analyzing possible risk events in risk information, and provide risk reminders based on risk events, so that users can be more clear about the risks brought by bond products.
示例性地,风险事件库中包括每个风险事件对应的风险事件评分值。For example, the risk event library includes a risk event score value corresponding to each risk event.
示例性地,统计发行对象的各历史违约债券的风险事件出现的次数,生成风险事件矩阵,根据风险事件矩阵计算各风险事件出现的概率值,根据各风险事件的概率值确定各风险事件的风险事件评分值。For example, count the number of occurrences of risk events for each historical default bond of the issuer, generate a risk event matrix, calculate the probability value of each risk event based on the risk event matrix, and determine the risk of each risk event based on the probability value of each risk event. Event rating value.
例如,从各个历史违约债券的新闻语料数据中获得风险事件后,获取所有历史违约债券对应的风险事件,去除重复的风险事件,得到风险事件表,统计每个历史违约债券对应的风险事件中,在风险事件表中对应风险事件出现的次数,生成风险事件矩阵,在获得风险事件矩阵后,计算在债券为违约债券的情况下,各风险事件出现的概率值,将获得的概率值量化为各个风险事件对应的风险事件评分值,例如,风险事件“流动性差”出现的概率值为80%至89%,则该风险事件对应的风险事件评分值设置为8。通过统计历史违约债券的风险事件出现的次数,根据风险事件出现的次数计算债券违约事件发生的情况下,风险事件出现的概率,进而确定各风险事件的风险事件评分值,实现根据风险事件与违约债券的关联程度设置风险事件的风险事件评分值,提高风险事件评分值的准确性,在后续进行目标债券的违约风险识别的过程中,提高了目标债券违约风险等级的准确性。For example, after obtaining risk events from the news corpus data of each historical default bond, obtain the risk events corresponding to all historical default bonds, remove duplicate risk events, obtain a risk event table, and count the risk events corresponding to each historical default bond, Corresponding to the number of occurrences of risk events in the risk event table, a risk event matrix is generated. After obtaining the risk event matrix, calculate the probability value of each risk event when the bond is a default bond, and quantify the obtained probability value into each The risk event score value corresponding to the risk event. For example, if the probability value of the risk event "poor liquidity" is 80% to 89%, then the risk event score value corresponding to the risk event is set to 8. By counting the number of risk events in historical default bonds, we can calculate the probability of risk events in the event of a bond default event based on the number of risk events, and then determine the risk event score value of each risk event to achieve the goal of determining the risk event score based on risk events and defaults. The correlation degree of the bond sets the risk event score value of the risk event, improves the accuracy of the risk event score value, and improves the accuracy of the target bond default risk level in the subsequent process of identifying the default risk of the target bond.
示例性地,将目标匹配度作为目标风险事件对应的风险事件评分值的权重,计算目标风险事件的风险事件评分值的加权和,将加权和作为目标债券的风险评分值。可以理解的是,若目标风险事件为一个,则直接将目标风险事件对应的风险事件评分值和目标匹配度 相乘,得到风险评分值,若目标风险事件为多个,则计算多个目标风险事件对应的风险事件评分值的加权和。For example, the target matching degree is used as the weight of the risk event score value corresponding to the target risk event, the weighted sum of the risk event score values of the target risk event is calculated, and the weighted sum is used as the risk score value of the target bond. It can be understood that if there is one target risk event, the risk event score value corresponding to the target risk event and the target matching degree are directly multiplied to obtain the risk score value. If there are multiple target risk events, multiple target risks are calculated. The weighted sum of the risk event score values corresponding to the event.
在本实施例中,目标匹配度为目标风险事件与特征向量之间的匹配度。本实施例可以通过多种方式确定目标风险事件与特征向量之间的匹配度,例如,将目标风险事件与特征向量之间的欧氏距离或者余弦值作为目标风险事件与特征向量之间的匹配度。In this embodiment, the target matching degree is the matching degree between the target risk event and the feature vector. In this embodiment, the matching degree between the target risk event and the feature vector can be determined in a variety of ways. For example, the Euclidean distance or cosine value between the target risk event and the feature vector is used as the matching between the target risk event and the feature vector. Spend.
本实施例将目标风险事件与特征向量之间的匹配度作为目标风险事件对应的风险事件评分值的权重,确定对应目标风险事件对目标债券的风险评估的贡献,能够更加准确地确定目标债券的风险评分值。This embodiment uses the matching degree between the target risk event and the feature vector as the weight of the risk event score value corresponding to the target risk event to determine the contribution of the corresponding target risk event to the risk assessment of the target bond, which can more accurately determine the risk assessment of the target bond. Risk score value.
相关技术中,债券交易平台上虽然有提示债券风险的功能,但其罗列债券风险的描述繁琐,通篇没有重点提示,不利于客户快速了解对应债券产品的特点和需要承担的风险,造成信息利用率低的问题。Among related technologies, although the bond trading platform has a function to prompt bond risks, the description of the bond risks is cumbersome and there are no key reminders throughout the article, which is not conducive to customers quickly understanding the characteristics of the corresponding bond products and the risks they need to bear, resulting in information utilization The problem of low rates.
在一个示例性实施例中,步骤S104包括步骤S201-步骤S202,详细叙述如下:In an exemplary embodiment, step S104 includes steps S201-step S202, which are described in detail as follows:
步骤S201:响应于债券交易页面中触发的针对目标债券的第一操作指令,获取目标风险事件以及风险评分值。Step S201: In response to the first operation instruction for the target bond triggered in the bond trading page, obtain the target risk event and risk score value.
债券交易页面是指债券风险提示装置中用于显示或展现债券及债券相关信息的交互界面。例如,可以是移动终端的触控屏界面等。债券交易页面显示有个债券信息,客户根据自身的需要从债券交易页面上触发针对用户想要了解的债券的第一操作指令,其中,第一操作指令包括但不限于长按、点击、双击、拖动。The bond trading page refers to the interactive interface used to display or display bonds and bond-related information in the bond risk warning device. For example, it can be a touch screen interface of a mobile terminal, etc. The bond trading page displays bond information. Customers can trigger the first operation instruction for the bond that the user wants to know about based on their own needs. The first operation instruction includes but is not limited to long press, click, double click, drag.
步骤S202:将目标风险事件以及风险评分值加载于债券交易页面。Step S202: Load the target risk event and risk score value on the bond trading page.
示例性地,本实施例提供的风险提示方法还包括步骤S301至步骤S302,详细叙述如下:Illustratively, the risk warning method provided by this embodiment also includes steps S301 to S302, which are described in detail as follows:
步骤S301:基于风险评分值确定目标债券对应的风险等级。Step S301: Determine the risk level corresponding to the target bond based on the risk score value.
预先设置多个风险等级以及每个风险等级对应的风险评分区间,例如,风险等级包括低风险等级、中风险等级、高风险等级以及最高风险等级,其中,低风险等级对应的风险评估区间为[0,0.2),中风险等级对应的风险评估区间为[0.2,0.5),高风险等级对应的风险评估区间为[0.5,0.8),最高风险等级对应的风险评估区间为[0.8,1]。Preset multiple risk levels and the risk score interval corresponding to each risk level. For example, the risk level includes low risk level, medium risk level, high risk level and the highest risk level. Among them, the risk assessment interval corresponding to the low risk level is [ 0, 0.2), the risk assessment interval corresponding to the medium risk level is [0.2, 0.5), the risk assessment interval corresponding to the high risk level is [0.5, 0.8), and the risk assessment interval corresponding to the highest risk level is [0.8, 1].
本实施例将目标债券的风险评分值和每个风险等级对应的风险评分区间进行匹配,即可确定目标债券的风险等级。This embodiment matches the risk score value of the target bond with the risk score interval corresponding to each risk level to determine the risk level of the target bond.
步骤S302:将风险等级、目标风险事件、风险评分值与目标债券进行关联,将关联之后的风险等级、目标风险事件、风险评分值加载于债券交易页面。Step S302: Associate the risk level, target risk event, and risk score value with the target bond, and load the associated risk level, target risk event, and risk score value onto the bond transaction page.
本实施例考虑到若将风险等级、目标风险事件、风险评分值与目标债券单独加载于债券交易页面,则可能无法明确为用户指示目标债券可能带来的风险,例如,若风险等级、目标风险事件、风险评分值在债券交易页面的位置相距较远,也没有相应的解释字段用于解释它们之间的关系,则用户不易看出目标债券的风险事件、风险评分值以及风险等级。基于此,本实施例将风险等级与目标风险事件以及风险评分值与目标债券进行关联,使得 用户对目标债券带来的风险一目了然,避免给用户的债券交易过程带来误导,提高用户体验。This embodiment takes into account that if the risk level, target risk event, risk score value and target bond are loaded separately on the bond trading page, it may not be possible to clearly indicate to the user the risks that the target bond may bring. For example, if the risk level, target risk The events and risk score values are far apart on the bond trading page, and there are no corresponding explanation fields to explain the relationship between them. It is difficult for users to see the risk events, risk score values and risk levels of the target bonds. Based on this, this embodiment associates the risk level with the target risk event and the risk score value with the target bond, so that the user can see the risks brought by the target bond at a glance, avoid misleading the user's bond transaction process, and improve the user experience.
示例性地,构建风险关联表格,风险关联表格以债券种类为标题列,风险关联表格中包括债券交易页面中展示的多个债券产品,以及每个债券产品对应的风险事件、风险评分值以及风险等级。其中,多个债券产品对应的风险事件、风险评分值以及风险等级分别占风险关联表格的一列,每个债券产品对应的风险事件、风险评分值以及风险等级占据风险关联表格的一行。For example, a risk association table is constructed. The risk association table has bond type as the title column. The risk association table includes multiple bond products displayed on the bond transaction page, as well as the risk events, risk score values and risks corresponding to each bond product. grade. Among them, the risk events, risk score values and risk levels corresponding to multiple bond products each occupy one column of the risk association table, and the risk events, risk score values and risk levels corresponding to each bond product occupy one row of the risk association table.
示例性地,预先生成的风险提示字段模板,其中,风险提示字段模板包括四个待带入的参量,即,目标债券、风险事件、风险评分值以及风险等级,例如,风险提示字段模板为“目标债券可能发生的风险事件包括目标风险事件,发生的概率为风险评分值,风险等级为目标风险等级。”在实际应用场景中,将目标债券的代号或者名称、具体风险事件、具体风险评分值、具体风险等级带入上述风险提示字段模板,即可得到风险提示字段。例如,若目标风险事件为“流动性差”,风险评分值为80%,风险等级为最高风险等级,将目标风险事件、风险评分值以及风险等级与目标债券进行关联之后得到风险提示字段“目标债券流动性差,风险评分值为80%,风险等级为最高风险等级”,将“目标债券流动性差,风险评分值为80%,风险等级为最高风险等级”这一字段加载于债券交易页面。响应于债券交易页面中触发的针对目标债券的第一操作指令,将得到的风险提示字段加载与债券交易页面。Illustratively, the risk prompt field template is pre-generated, where the risk prompt field template includes four parameters to be brought in, namely, target bond, risk event, risk score value and risk level. For example, the risk prompt field template is " The risk events that may occur in the target bond include the target risk event, the probability of occurrence is the risk score value, and the risk level is the target risk level." In the actual application scenario, the code or name of the target bond, the specific risk event, and the specific risk score value , the specific risk level is brought into the above risk prompt field template, and the risk prompt field can be obtained. For example, if the target risk event is "poor liquidity", the risk score value is 80%, and the risk level is the highest risk level, after associating the target risk event, risk score value and risk level with the target bond, the risk prompt field "Target Bond" will be obtained. "Poor liquidity, risk score value is 80%, risk level is the highest risk level", load the field "Target bond has poor liquidity, risk score value is 80%, risk level is the highest risk level" on the bond trading page. In response to the first operation instruction for the target bond triggered in the bond trading page, the obtained risk prompt field is loaded into the bond trading page.
在本实施例中,将目标债券对应的目标风险事件展示于债券交易页面,使得用户明确目标债券未来可能会发生的风险事件,以及发生该风险事件的可能性,针对目标债券的特点和需要承担的风险一目了然。In this embodiment, the target risk event corresponding to the target bond is displayed on the bond transaction page, so that the user can clearly understand the risk events that may occur in the target bond in the future, as well as the possibility of the risk event occurring, based on the characteristics of the target bond and the responsibilities that need to be borne. The risks are clear at a glance.
示例性地,响应于债券交易页面中触发的针对债券资讯链接的第二操作指令,加载债券资讯链接所对应的资讯页面。考虑到部分用户存在进一步了解债券产品的风险信息的需求,本实施例为债券交易页面中展示的债券产品关联对应的债券资讯链接,在用户触发该债券资讯链接时,加载债券资讯链接所对应的资讯页面,进而为用户展示目标债券对应的风险信息。For example, in response to the second operation instruction for the bond information link triggered in the bond transaction page, the information page corresponding to the bond information link is loaded. Considering that some users have a need to further understand the risk information of bond products, this embodiment is a bond information link corresponding to the bond product displayed on the bond transaction page. When the user triggers the bond information link, the bond information link corresponding to the bond information link is loaded. information page, and then display the risk information corresponding to the target bond to users.
参阅图2,图2是一示例性实施例示出的由债券交易页面跳转至资讯页面的示意图,如图2所示,债券交易页面中包括多个债券产品,具体包括bbbb、aaaa、cccc、eeeee、fffff、ggggg等债券产品的编号,其中,为每个债券产品配置对应的交互按钮,在本实施例中,交互按钮为每个债券产品的编号右侧的“+”符号,可以理解的是,还可以通过设置其他标识来为债券产品配置对应的交互按钮,在此不做具体限定。用户通过触发债券产品对应的交互按钮,即可触发针对债券资讯链接的第二操作指令,进而加载债券资讯链接所对应的资讯页面,以展示相关的风险提示信息。如图2所示,债券编号aaaaa对应的资讯页面中包括该债券产品的风险信息,用户通过资讯页面中显示的风险信息了解对应债券产品的风险情况。Refer to Figure 2. Figure 2 is a schematic diagram of jumping from the bond transaction page to the information page in an exemplary embodiment. As shown in Figure 2, the bond transaction page includes multiple bond products, specifically including bbbb, aaaa, cccc, The numbers of bond products such as eeeee, fffff, ggggg, etc., wherein a corresponding interaction button is configured for each bond product. In this embodiment, the interaction button is the "+" symbol on the right side of the number of each bond product. It can be understood that Yes, you can also configure corresponding interaction buttons for bond products by setting other identifiers, which are not specifically limited here. By triggering the interactive button corresponding to the bond product, the user can trigger the second operation instruction for the bond information link, and then load the information page corresponding to the bond information link to display relevant risk warning information. As shown in Figure 2, the information page corresponding to the bond number aaaaa includes the risk information of the bond product. The user can understand the risk status of the corresponding bond product through the risk information displayed on the information page.
参阅图3,图3是一示例性实施例示出的债券的风险提示方法的流程图,如图3所示,本实施例涉及的债券的风险提示方法适用于风险提示系统,在本实施例中,风险提示系统包括用户、客户端、产品以及服务端,其中,本实施例提供的债券的风险提示方法包括步骤1-步骤5,详细叙述如下;Referring to Figure 3, Figure 3 is a flow chart of a bond risk warning method according to an exemplary embodiment. As shown in Figure 3, the bond risk warning method involved in this embodiment is suitable for a risk warning system. In this embodiment , the risk warning system includes users, clients, products, and servers. The bond risk warning method provided in this embodiment includes steps 1 to 5, which are described in detail as follows;
步骤1:配置高风险债券。Step 1: Allocate high-risk bonds.
在本实施例中,产品为债券交易平台,用户可以通过债券交易平台进行债券交易。示例性地,产品通过债券的信用风险、违约风险、清算风险等多种风险维度评估出高风险债券,并在服务端进行配置。In this embodiment, the product is a bond trading platform, and users can conduct bond transactions through the bond trading platform. For example, the product evaluates high-risk bonds through multiple risk dimensions such as credit risk, default risk, and liquidation risk of the bond, and configures them on the server side.
在本实施例中,高风险债券是客户端显示的债券中,风险评分值大于预设阈值的债券,产品通过评估各个债券产品的风险评分值,确定高风险债券和低风险债券。In this embodiment, high-risk bonds are bonds whose risk score value is greater than the preset threshold among the bonds displayed on the client. The product determines high-risk bonds and low-risk bonds by evaluating the risk score value of each bond product.
步骤2:查看债券信息。Step 2: View bond information.
在本实施例中,用户向客户端请求查看债券信息。债券信息表征对应目标债券是否为高风险债券。示例性地,用户向客户端发送查看请求,其中查看请求包括目标债券以及对应的债券信息。In this embodiment, the user requests to view bond information from the client. The bond information represents whether the corresponding target bond is a high-risk bond. For example, the user sends a viewing request to the client, where the viewing request includes the target bond and corresponding bond information.
步骤3:请求债券信息。Step 3: Request bond information.
在本实施例中,客户端向服务端请求债券信息,客户端向服务端请求债券信息,示例性地,客户端将用户发送的查看请求转发至服务端,以使得服务端响应于查看请求目标债券的债券信息发挥至客户端。In this embodiment, the client requests bond information from the server, and the client requests bond information from the server. For example, the client forwards the view request sent by the user to the server, so that the server responds to the view request target. Bond bond information is played to the client.
步骤4:返回债券信息。Step 4: Return bond information.
在本实施例中,服务端返回债券信息至客户端。In this embodiment, the server returns bond information to the client.
步骤5:展示债券信息。Step 5: Display bond information.
在本实施例中,客户端向用户展示债券信息。示例性地,如果债券信息指示目标债券是高风险债券,客户端将会展示高风险提示标识或者高风险提示条。In this embodiment, the client displays bond information to the user. For example, if the bond information indicates that the target bond is a high-risk bond, the client will display a high-risk warning mark or a high-risk warning bar.
在本实施例中,客户端通过向用户展示债券信息,使得用户能够直观了解到目标债券的债券信息,也即,能够直观了解到目标债券是否为高风险债券,一方面能够避免用户盲目进行债券交易,导致不必要的亏损,另外,也能够避免用户在交易高风险产品后产生损失后,怪责平台方,有助于降低不必要的风险。In this embodiment, the client displays bond information to the user so that the user can intuitively understand the bond information of the target bond, that is, can intuitively understand whether the target bond is a high-risk bond. On the one hand, it can avoid the user from blindly conducting bond transactions. Transactions can lead to unnecessary losses. In addition, it can also prevent users from blaming the platform for losses after trading high-risk products, which helps reduce unnecessary risks.
参阅图4,图4是本申请一示例性实施例示出的基于大数据的债券风险信息处理装置的框图,如图4所示,基于大数据的债券风险信息处理装置400包括分词处理模块401、确定模块402、匹配模块403以及风险提示模块404。Referring to Figure 4, Figure 4 is a block diagram of a big data-based bond risk information processing device illustrating an exemplary embodiment of the present application. As shown in Figure 4, the big data-based bond risk information processing device 400 includes a word segmentation processing module 401, Determination module 402, matching module 403 and risk prompt module 404.
其中,分词处理模块401用于对目标债券对应的发行对象的风险信息进行分词处理,得到对应的多个实词,风险信息包括发行对象对应的财务信息、信用信息和历史违约信息;确定模块402用于基于与预设的关键词表匹配的实词的重要程度参数确定风险信息的特征向量,关键词表包括多个与预设的风险事件相关的关键词;匹配模块403用于将特征向量与预设的风险事件库中的风险事件进行匹配,得到与特征向量相匹配的目标风险事件,风 险事件库中包括多个风险事件;风险提示模块404用于基于目标风险事件进行风险提示。Among them, the word segmentation processing module 401 is used to segment the risk information of the issuance object corresponding to the target bond to obtain multiple corresponding content words. The risk information includes the financial information, credit information and historical default information corresponding to the issuance object; the determination module 402 uses The feature vector of the risk information is determined based on the importance parameters of the content words that match the preset keyword table. The keyword table includes a plurality of keywords related to the preset risk event; the matching module 403 is used to match the feature vector with the preset risk event. The risk events in the set risk event library are matched to obtain the target risk event that matches the feature vector. The risk event library includes multiple risk events; the risk prompt module 404 is used to provide risk prompts based on the target risk event.
在另一示例性实施例中,确定模块402包括匹配单元、确定单元以及特征向量获取单元,其中,匹配单元用于将各个实词分别与关键词表进行匹配,得到目标实词;确定单元用于确定各个目标实词的重要程度参数;特征向量获取单元用于将由各个目标实词的重要程度参数组成的向量作为特征向量。In another exemplary embodiment, the determination module 402 includes a matching unit, a determination unit and a feature vector acquisition unit, wherein the matching unit is used to match each content word with the keyword table respectively to obtain the target content word; the determination unit is used to determine The importance parameters of each target content word; the feature vector acquisition unit is used to use a vector composed of the importance parameters of each target content word as a feature vector.
在另一示例性实施例中,本实施例提供的基于大数据的债券风险信息处理装置400还包括计算模块,其中,计算模块用于将目标匹配度作为目标风险事件对应的风险事件评分值的权重,计算目标风险事件的风险事件评分值的加权和,将加权和作为目标债券的风险评分值,目标匹配度为目标风险事件与特征向量之间的匹配度。风险事件库中包括每个风险事件对应的风险事件评分值。In another exemplary embodiment, the big data-based bond risk information processing device 400 provided in this embodiment also includes a calculation module, wherein the calculation module is used to use the target matching degree as the risk event score value corresponding to the target risk event. Weight, calculate the weighted sum of the risk event score values of the target risk event, and use the weighted sum as the risk score value of the target bond. The target matching degree is the matching degree between the target risk event and the feature vector. The risk event library includes risk event score values corresponding to each risk event.
在另一示例性实施例中,风险提示模块404包括响应单元和加载单元,其中,响应单元用于响应于债券交易页面中触发的针对目标债券的第一操作指令,获取目标风险事件以及风险评分值;加载单元用于将目标风险事件以及风险评分值加载于债券交易页面。In another exemplary embodiment, the risk prompt module 404 includes a response unit and a loading unit, wherein the response unit is used to obtain the target risk event and risk score in response to the first operation instruction for the target bond triggered in the bond trading page. value; the loading unit is used to load target risk events and risk score values into the bond trading page.
在另一示例性实施例中,加载单元还用于将目标债券的风险等级、目标风险事件、风险评分值与目标债券进行关联,将关联之后的风险等级、目标风险事件、风险评分值加载于债券交易页面,风险等级基于风险评分值确定。In another exemplary embodiment, the loading unit is also used to associate the risk level, target risk event, and risk score value of the target bond with the target bond, and load the associated risk level, target risk event, and risk score value into On the bond trading page, the risk level is determined based on the risk score value.
在另一示例性实施例中,加载单元还用于将风险等级、目标风险事件、风险评分值与目标债券带入预先生成的风险提示字段模板,得到对应的风险提示字段。In another exemplary embodiment, the loading unit is also used to bring the risk level, target risk event, risk score value and target bond into the pre-generated risk prompt field template to obtain the corresponding risk prompt field.
在另一示例性实施例中,本实施例提供的基于大数据的债券风险信息处理装置400还包括标识增设模块,用于若确定风险等级为最高风险等级,则在债券交易页面显示目标债券时,在债券交易页面的预设位置处增加感叹号标识。In another exemplary embodiment, the big data-based bond risk information processing device 400 provided in this embodiment also includes an identification adding module, configured to display the target bond on the bond transaction page if the risk level is determined to be the highest risk level. , add an exclamation point mark at the default position on the bond trading page.
在另一示例性实施例中,本实施例提供的基于大数据的债券风险信息处理装置400还包括响应模块,用于响应于债券交易页面中触发的针对债券资讯链接的第二操作指令,加载债券资讯链接所对应的资讯页面,资讯页面用于展示风险信息,其中,目标债券关联针对目标债券的债券资讯链接。In another exemplary embodiment, the big data-based bond risk information processing device 400 provided in this embodiment also includes a response module, configured to load, in response to the second operation instruction for the bond information link triggered in the bond transaction page, The information page corresponding to the bond information link. The information page is used to display risk information. The target bond is associated with the bond information link for the target bond.
需要说明的是,上述实施例所提供的装置与上述实施例所提供的方法属于同一构思,其中各个模块和单元执行操作的具体方式已经在方法实施例中进行了详细描述,此处不再赘述。It should be noted that the device provided by the above embodiments and the method provided by the above embodiments belong to the same concept. The specific manner in which each module and unit performs operations has been described in detail in the method embodiments and will not be described again here. .
在另一示例性实施例中,本申请提供一种电子设备,包括处理器和存储器,其中,存储器上存储有计算机可读指令,该计算机可读指令被处理器执行时实现如前的基于大数据的债券风险信息处理方法。In another exemplary embodiment, the present application provides an electronic device, including a processor and a memory, wherein computer readable instructions are stored on the memory, and when the computer readable instructions are executed by the processor, the above large-scale based on Data processing methods for bond risk information.
图5示出了适于用来实现本申请实施例的电子设备的计算机系统的结构示意图。FIG. 5 shows a schematic structural diagram of a computer system suitable for implementing an electronic device according to an embodiment of the present application.
需要说明的是,图5示出的电子设备的计算机系统1000仅是一个示例,不应对本申请实施例的功能和使用范围带来任何限制。It should be noted that the computer system 1000 of the electronic device shown in FIG. 5 is only an example, and should not impose any restrictions on the functions and scope of use of the embodiments of the present application.
如图5所示,计算机系统1000包括中央处理单元(Central Processing Unit,CPU)1001, 其可以根据存储在只读存储器(Read-Only Memory,ROM)1002中的程序或者从存储部分1008加载到随机访问存储器(Random Access Memory,RAM)1003中的程序而执行各种适当的动作和处理,例如执行上述实施例中的信息推荐方法。在RAM 1003中,还存储有系统操作所需的各种程序和数据。CPU 1001、ROM 1002以及RAM 1003通过总线1004彼此相连。输入/输出(Input/Output,I/O)接口1005也连接至总线1004。As shown in Figure 5, the computer system 1000 includes a central processing unit (Central Processing Unit, CPU) 1001, which can be loaded into a random computer according to a program stored in a read-only memory (Read-Only Memory, ROM) 1002 or from a storage part 1008. Access the program in the memory (Random Access Memory, RAM) 1003 to perform various appropriate actions and processing, such as performing the information recommendation method in the above embodiment. In RAM 1003, various programs and data required for system operation are also stored. CPU 1001, ROM 1002 and RAM 1003 are connected to each other through bus 1004. An input/output (I/O) interface 1005 is also connected to bus 1004.
以下部件连接至I/O接口1005:包括键盘、鼠标等的输入部分1006;包括诸如阴极射线管(Cathode Ray Tube,CRT)、液晶显示器(Liquid Crystal Display,LCD)等以及扬声器等的输出部分1007;包括硬盘等的存储部分1008;以及包括诸如LAN(Local Area Network,局域网)卡、调制解调器等的网络接口卡的通信部分1009。通信部分1009经由诸如因特网的网络执行通信处理。驱动器1010也根据需要连接至I/O接口1005。可拆卸介质1011,诸如磁盘、光盘、磁光盘、半导体存储器等等,根据需要安装在驱动器1010上,以便于从其上读出的计算机程序根据需要被安装入存储部分1008。The following components are connected to the I/O interface 1005: an input part 1006 including a keyboard, a mouse, etc.; an output part 1007 including a cathode ray tube (Cathode Ray Tube, CRT), a liquid crystal display (Liquid Crystal Display, LCD), etc., and a speaker, etc. ; a storage part 1008 including a hard disk, etc.; and a communication part 1009 including a network interface card such as a LAN (Local Area Network) card, a modem, etc. The communication section 1009 performs communication processing via a network such as the Internet. Driver 1010 is also connected to I/O interface 1005 as needed. Removable media 1011, such as magnetic disks, optical disks, magneto-optical disks, semiconductor memories, etc., are installed on the drive 1010 as needed, so that a computer program read therefrom is installed into the storage portion 1008 as needed.
特别地,根据本申请的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本申请的实施例包括一种计算机程序产品,其包括承载在计算机可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的计算机程序。在这样的实施例中,该计算机程序可以通过通信部分1009从网络上被下载和安装,和/或从可拆卸介质1011被安装。在该计算机程序被中央处理单元(CPU)1001执行时,执行本申请的系统中限定的各种功能。In particular, according to embodiments of the present application, the process described above with reference to the flowchart may be implemented as a computer software program. For example, embodiments of the present application include a computer program product including a computer program carried on a computer-readable medium, the computer program including a computer program for performing the method shown in the flowchart. In such embodiments, the computer program may be downloaded and installed from the network via communication portion 1009 and/or installed from removable media 1011. When the computer program is executed by the central processing unit (CPU) 1001, various functions defined in the system of the present application are executed.
需要说明的是,本申请实施例所示的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是但不限于电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(Erasable Programmable Read Only Memory,EPROM)、闪存、光纤、便携式紧凑磁盘只读存储器(Compact Disc Read-Only Memory,CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本申请中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本申请中,计算机可读的信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的计算机程序。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读的信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的计算机程序可以用任何适当的介质传输,包括但不限于:无线、有线等等,或者上述的任意合适的组合。It should be noted that the computer-readable medium shown in the embodiments of the present application may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two. The computer-readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, device or device, or any combination thereof. More specific examples of computer readable storage media may include, but are not limited to: an electrical connection having one or more wires, a portable computer disk, a hard drive, random access memory (RAM), read only memory (ROM), removable Programmable Read-Only Memory (Erasable Programmable Read Only Memory, EPROM), flash memory, optical fiber, portable compact disk read-only memory (Compact Disc Read-Only Memory, CD-ROM), optical storage device, magnetic storage device, or any of the above suitable The combination. As used herein, a computer-readable storage medium may be any tangible medium that contains or stores a program for use by or in connection with an instruction execution system, apparatus, or device. In this application, a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, in which a computer-readable computer program is carried. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device . Computer programs embodied on computer-readable media may be transmitted using any suitable medium, including but not limited to: wireless, wired, etc., or any suitable combination of the above.
附图中的流程图和框图,图示了按照本申请各种实施例的系统、方法和计算机程序产 品的可能实现的体系架构、功能和操作。其中,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,上述模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图或流程图中的每个方框、以及框图或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowcharts and block diagrams in the accompanying drawings illustrate the possible implementation architecture, functions and operations of systems, methods and computer program products according to various embodiments of the present application. Each block in the flow chart or block diagram may represent a module, program segment, or part of the code. The above-mentioned module, program segment, or part of the code includes one or more executable components for implementing the specified logical function. instruction. It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown one after another may actually execute substantially in parallel, or they may sometimes execute in the reverse order, depending on the functionality involved. It will also be noted that each block in the block diagram or flowchart illustration, and combinations of blocks in the block diagram or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or operations, or may be implemented by special purpose hardware-based systems that perform the specified functions or operations. Achieved by a combination of specialized hardware and computer instructions.
描述于本申请实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现,所描述的单元也可以设置在处理器中。其中,这些单元的名称在某种情况下并不构成对该单元本身的限定。The units involved in the embodiments of this application can be implemented in software or hardware, and the described units can also be provided in a processor. Among them, the names of these units do not constitute a limitation on the unit itself under certain circumstances.
本申请的另一方面还提供了一种计算机可读存储介质,其上存储有计算机可读指令,该计算机可读指令被处理器执行时实现如前实施例中任一项的基于大数据的债券风险信息处理方法。Another aspect of the present application also provides a computer-readable storage medium on which computer-readable instructions are stored. When the computer-readable instructions are executed by a processor, the big data-based method in any one of the previous embodiments is implemented. Bond risk information processing methods.
本申请的另一方面还提供了一种计算机程序产品或计算机程序,该计算机程序产品或计算机程序包括计算机指令,该计算机指令存储在计算机可读存储介质中。计算机设备的处理器从计算机可读存储介质读取该计算机指令,处理器执行该计算机指令,使得该计算机设备执行上述各个实施例中提供的基于大数据的债券风险信息处理方法。Another aspect of the present application also provides a computer program product or computer program, which includes computer instructions stored in a computer-readable storage medium. The processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, so that the computer device executes the big data-based bond risk information processing method provided in the above embodiments.
需要说明的是,本申请实施例所示的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(Erasable Programmable Read Only Memory,EPROM)、闪存、光纤、便携式紧凑磁盘只读存储器(Compact Disc Read-Only Memory,CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本申请中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本申请中,计算机可读的信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的计算机程序。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读的信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的计算机程序可以用任何适当的介质传输,包括但不限于:无线、有线等等,或者上述的任意合适的组合。It should be noted that the computer-readable medium shown in the embodiments of the present application may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two. The computer-readable storage medium may be, for example, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, device or device, or any combination thereof. More specific examples of computer readable storage media may include, but are not limited to: an electrical connection having one or more wires, a portable computer disk, a hard drive, random access memory (RAM), read only memory (ROM), removable Programmable Read-Only Memory (Erasable Programmable Read Only Memory, EPROM), flash memory, optical fiber, portable compact disk read-only memory (Compact Disc Read-Only Memory, CD-ROM), optical storage device, magnetic storage device, or any of the above suitable The combination. As used herein, a computer-readable storage medium may be any tangible medium that contains or stores a program for use by or in connection with an instruction execution system, apparatus, or device. In this application, a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, in which a computer-readable computer program is carried. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device . Computer programs embodied on computer-readable media may be transmitted using any suitable medium, including but not limited to: wireless, wired, etc., or any suitable combination of the above.
附图中的流程图和框图,图示了按照本申请各种实施例的系统、方法和计算机程序产 品的可能实现的体系架构、功能和操作。其中,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,上述模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图或流程图中的每个方框、以及框图或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowcharts and block diagrams in the accompanying drawings illustrate the possible implementation architecture, functions and operations of systems, methods and computer program products according to various embodiments of the present application. Each block in the flow chart or block diagram may represent a module, program segment, or part of the code. The above-mentioned module, program segment, or part of the code includes one or more executable components for implementing the specified logical function. instruction. It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown one after another may actually execute substantially in parallel, or they may sometimes execute in the reverse order, depending on the functionality involved. It will also be noted that each block in the block diagram or flowchart illustration, and combinations of blocks in the block diagram or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or operations, or may be implemented by special purpose hardware-based systems that perform the specified functions or operations. Achieved by a combination of specialized hardware and computer instructions.
描述于本申请实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现,所描述的单元也可以设置在处理器中。其中,这些单元的名称在某种情况下并不构成对该单元本身的限定。The units involved in the embodiments of this application can be implemented in software or hardware, and the described units can also be provided in a processor. Among them, the names of these units do not constitute a limitation on the unit itself under certain circumstances.
上述内容,仅为本申请的较佳示例性实施例,并非用于限制本申请的实施方案,本领域普通技术人员根据本申请的主要构思和精神,可以十分方便地进行相应的变通或修改,故本申请的保护范围应以权利要求书所要求的保护范围为准。The above content is only a preferred exemplary embodiment of the present application and is not intended to limit the implementation of the present application. Those of ordinary skill in the art can easily make corresponding modifications or modifications based on the main concept and spirit of the present application. Therefore, the protection scope of this application should be subject to the protection scope required by the claims.

Claims (11)

  1. 一种基于大数据的债券风险信息处理方法,包括:A bond risk information processing method based on big data, including:
    对目标债券对应的发行对象的风险信息进行分词处理,得到对应的多个实词,所述风险信息包括所述发行对象对应的财务信息、信用信息和历史违约信息;Perform word segmentation processing on the risk information of the issuance object corresponding to the target bond to obtain multiple corresponding content words. The risk information includes the financial information, credit information and historical default information corresponding to the issuance object;
    基于与预设的关键词表匹配的目标实词的重要程度参数确定所述风险信息的特征向量,所述关键词表包括多个与预设的风险事件相关的关键词,其中,所述重要程度参数用于表示对应目标实词对于所述风险信息的重要程度;The feature vector of the risk information is determined based on the importance parameter of the target content word that matches a preset keyword table. The keyword table includes a plurality of keywords related to the preset risk event, wherein the importance level Parameters are used to represent the importance of the corresponding target content words to the risk information;
    将所述特征向量与预设的风险事件库中的风险事件进行匹配,得到与所述特征向量相匹配的目标风险事件,所述风险事件库中包括多个风险事件;Match the feature vector with risk events in a preset risk event library to obtain a target risk event that matches the feature vector, where the risk event library includes multiple risk events;
    基于所述目标风险事件进行风险提示。Risk prompts are made based on the target risk events.
  2. 根据权利要求1所述的方法,其中,所述基于与预设的关键词表匹配的目标实词的重要程度参数确定所述风险信息的特征向量包括:The method according to claim 1, wherein determining the feature vector of the risk information based on the importance parameter of the target content word matching the preset keyword table includes:
    将各个实词分别与所述关键词表进行匹配,得到所述目标实词;Match each content word with the keyword list to obtain the target content word;
    确定各个目标实词的重要程度参数;Determine the importance parameters of each target content word;
    将由各个目标实词的重要程度参数组成的向量作为所述特征向量。A vector composed of importance parameters of each target content word is used as the feature vector.
  3. 根据权利要求1所述的方法,其中,所述风险事件库中包括每个风险事件对应的风险事件评分值;所述方法还包括:The method according to claim 1, wherein the risk event library includes a risk event score value corresponding to each risk event; the method further includes:
    将目标匹配度作为所述目标风险事件对应的风险事件评分值的权重,计算所述目标风险事件的风险事件评分值的加权和,将所述加权和作为所述目标债券的风险评分值,所述目标匹配度为所述目标风险事件与所述特征向量之间的匹配度。The target matching degree is used as the weight of the risk event score value corresponding to the target risk event, the weighted sum of the risk event score values of the target risk event is calculated, and the weighted sum is used as the risk score value of the target bond, so The target matching degree is the matching degree between the target risk event and the feature vector.
  4. 根据权利要求3所述的方法,其中,所述基于所述目标风险事件进行风险提示包括:The method according to claim 3, wherein the risk prompt based on the target risk event includes:
    响应于债券交易页面中触发的针对所述目标债券的第一操作指令,获取所述目标风险事件以及所述风险评分值;In response to the first operation instruction for the target bond triggered in the bond trading page, obtain the target risk event and the risk score value;
    将所述目标风险事件以及所述风险评分值加载于所述债券交易页面。Load the target risk event and the risk score value into the bond trading page.
  5. 根据权利要求4所述的方法,其中,所述将所述目标风险事件以及所述风险评分值加载于所述债券交易页面包括:The method according to claim 4, wherein loading the target risk event and the risk score value onto the bond trading page includes:
    将所述目标债券的风险等级、所述目标风险事件、所述风险评分值与目标债券进行关联;Associating the risk level of the target bond, the target risk event, and the risk score value with the target bond;
    将关联之后的所述风险等级、所述目标风险事件、所述风险评分值加载于所述债券交易页面,所述风险等级基于所述风险评分值确定。The associated risk level, the target risk event, and the risk score value are loaded onto the bond transaction page, and the risk level is determined based on the risk score value.
  6. 根据权利要求5所述的方法,其中,所述将所述风险等级、所述目标风险事件、所述风险评分值与目标债券进行关联包括:The method of claim 5, wherein associating the risk level, the target risk event, the risk score value with the target bond includes:
    将所述风险等级、所述目标风险事件、所述风险评分值与目标债券带入预先生成的风 险提示字段模板,得到对应的风险提示字段。The risk level, the target risk event, the risk score value and the target bond are brought into the pre-generated risk prompt field template to obtain the corresponding risk prompt field.
  7. 根据权利要求3所述的方法,其中,所述目标债券关联针对所述目标债券的债券资讯链接;所述方法还包括:The method according to claim 3, wherein the target bond is associated with a bond information link for the target bond; the method further includes:
    响应于所述债券交易页面中触发的针对所述债券资讯链接的第二操作指令,加载所述债券资讯链接所对应的资讯页面,所述资讯页面用于展示所述风险信息。In response to the second operation instruction for the bond information link triggered in the bond transaction page, the information page corresponding to the bond information link is loaded, and the information page is used to display the risk information.
  8. 一种基于大数据的债券风险信息处理装置,包括:A bond risk information processing device based on big data, including:
    分词处理模块,用于对目标债券对应的发行对象的风险信息进行分词处理,得到对应的多个实词,所述风险信息包括所述发行对象对应的财务信息、信用信息和历史违约信息;The word segmentation processing module is used to segment the risk information of the issuance object corresponding to the target bond to obtain multiple corresponding content words. The risk information includes the financial information, credit information and historical default information corresponding to the issuance object;
    确定模块,用于基于与预设的关键词表匹配的实词的重要程度参数确定所述风险信息的特征向量,所述关键词表包括多个与预设的风险事件相关的关键词,其中,所述重要程度参数用于表示对应目标实词对于所述风险信息的重要程度;A determination module configured to determine the feature vector of the risk information based on the importance parameters of content words that match a preset keyword table, where the keyword table includes a plurality of keywords related to preset risk events, wherein, The importance parameter is used to represent the importance of the corresponding target content word to the risk information;
    匹配模块,用于将所述特征向量与预设的风险事件库中的风险事件进行匹配,得到与所述特征向量相匹配的目标风险事件,所述风险事件库中包括多个风险事件;A matching module, configured to match the feature vector with risk events in a preset risk event library to obtain a target risk event that matches the feature vector, where the risk event library includes multiple risk events;
    风险提示模块,用于基于所述目标风险事件进行风险提示。A risk warning module is used to provide risk warning based on the target risk event.
  9. 一种电子设备,包括:An electronic device including:
    存储器,存储有计算机可读指令;A memory storing computer-readable instructions;
    处理器,读取存储器存储的计算机可读指令,以执行权利要求1-7中的任一项所述的方法。The processor reads the computer-readable instructions stored in the memory to execute the method according to any one of claims 1-7.
  10. 一种计算机可读存储介质,其上存储有计算机可读指令,当所述计算机可读指令被计算机的处理器执行时,使计算机执行权利要求1-7中的任一项所述的方法。A computer-readable storage medium having computer-readable instructions stored thereon. When the computer-readable instructions are executed by a processor of a computer, the computer is caused to perform the method described in any one of claims 1-7.
  11. 一种计算机程序产品,所述计算机程序产品包括计算机指令,计算机设备的处理器执行所述计算机指令时,使得所述计算机设备执行权利要求1-7中的任一项所述的方法。A computer program product. The computer program product includes computer instructions. When a processor of a computer device executes the computer instructions, the computer device performs the method described in any one of claims 1-7.
PCT/CN2022/140776 2022-08-25 2022-12-21 Bond risk information processing method based on big data and related device WO2024040817A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202211032661.3 2022-08-25
CN202211032661.3A CN115470785A (en) 2022-08-25 2022-08-25 Bond risk information processing method based on big data and related equipment

Publications (1)

Publication Number Publication Date
WO2024040817A1 true WO2024040817A1 (en) 2024-02-29

Family

ID=84369613

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/140776 WO2024040817A1 (en) 2022-08-25 2022-12-21 Bond risk information processing method based on big data and related device

Country Status (2)

Country Link
CN (1) CN115470785A (en)
WO (1) WO2024040817A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115470785A (en) * 2022-08-25 2022-12-13 深圳市富途网络科技有限公司 Bond risk information processing method based on big data and related equipment
CN117078025A (en) * 2023-10-17 2023-11-17 金华市交通投资集团有限公司 Risk early warning method and device, electronic equipment and readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109670837A (en) * 2018-11-30 2019-04-23 平安科技(深圳)有限公司 Recognition methods, device, computer equipment and the storage medium of bond default risk
WO2021172775A1 (en) * 2020-02-28 2021-09-02 Shinhanaitas Co.,Ltd. Method and apparatus for providing service for determining risk of derivatives trading of funds in real time
CN114240656A (en) * 2021-12-15 2022-03-25 平安科技(深圳)有限公司 Risk prediction method, device and equipment based on reinforcement learning and storage medium
CN114913016A (en) * 2022-05-25 2022-08-16 深圳市富途网络科技有限公司 Bond transaction risk prompting method, device, equipment and medium based on big data
CN115470785A (en) * 2022-08-25 2022-12-13 深圳市富途网络科技有限公司 Bond risk information processing method based on big data and related equipment

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109670837A (en) * 2018-11-30 2019-04-23 平安科技(深圳)有限公司 Recognition methods, device, computer equipment and the storage medium of bond default risk
WO2021172775A1 (en) * 2020-02-28 2021-09-02 Shinhanaitas Co.,Ltd. Method and apparatus for providing service for determining risk of derivatives trading of funds in real time
CN114240656A (en) * 2021-12-15 2022-03-25 平安科技(深圳)有限公司 Risk prediction method, device and equipment based on reinforcement learning and storage medium
CN114913016A (en) * 2022-05-25 2022-08-16 深圳市富途网络科技有限公司 Bond transaction risk prompting method, device, equipment and medium based on big data
CN115470785A (en) * 2022-08-25 2022-12-13 深圳市富途网络科技有限公司 Bond risk information processing method based on big data and related equipment

Also Published As

Publication number Publication date
CN115470785A (en) 2022-12-13

Similar Documents

Publication Publication Date Title
US11334635B2 (en) Domain specific natural language understanding of customer intent in self-help
US10896392B2 (en) Methods and systems for generating supply chain representations
WO2024040817A1 (en) Bond risk information processing method based on big data and related device
US8484228B2 (en) Extraction and grouping of feature words
TWI643076B (en) Financial analysis system and method for unstructured text data
US11853684B2 (en) Synchronization and tagging of image and text data
CN116235144A (en) Domain specific language interpreter and interactive visual interface for rapid screening
CN113722433A (en) Information pushing method and device, electronic equipment and computer readable medium
CN111179051A (en) Financial target customer determination method and device and electronic equipment
CN111126073B (en) Semantic retrieval method and device
CN114880498B (en) Event information display method and device, equipment and medium
CN115827994A (en) Data processing method, device, equipment and storage medium
US11593740B1 (en) Computing system for automated evaluation of process workflows
CN114913016A (en) Bond transaction risk prompting method, device, equipment and medium based on big data
CN113095078A (en) Associated asset determination method and device and electronic equipment
CN113722593A (en) Event data processing method and device, electronic equipment and medium
CN117390170B (en) Method and device for matching data standards, electronic equipment and readable storage medium
US20240086815A1 (en) Systems and methods for risk factor predictive modeling with document summarization
US20240086816A1 (en) Systems and methods for risk factor predictive modeling with document summarization
CN115600600B (en) Label naming method, device, electronic equipment and medium of multi-object label system
US20220398660A1 (en) System and method for computational shelf forecasting
CN117036004A (en) Personal credit risk prediction method and device based on LightGBM
CN117453973A (en) Information recommendation method, device, equipment and medium
CN114584616A (en) Message pushing method and device, electronic equipment and storage medium
CN116226173A (en) Data query method, device, storage medium and electronic equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22956350

Country of ref document: EP

Kind code of ref document: A1