WO2020048247A1 - Settlement data processing method and apparatus, and computer device and storage medium - Google Patents

Settlement data processing method and apparatus, and computer device and storage medium Download PDF

Info

Publication number
WO2020048247A1
WO2020048247A1 PCT/CN2019/096961 CN2019096961W WO2020048247A1 WO 2020048247 A1 WO2020048247 A1 WO 2020048247A1 CN 2019096961 W CN2019096961 W CN 2019096961W WO 2020048247 A1 WO2020048247 A1 WO 2020048247A1
Authority
WO
WIPO (PCT)
Prior art keywords
settlement
data
field
settled
settlement data
Prior art date
Application number
PCT/CN2019/096961
Other languages
French (fr)
Chinese (zh)
Inventor
夏雷
Original Assignee
平安医疗健康管理股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安医疗健康管理股份有限公司 filed Critical 平安医疗健康管理股份有限公司
Publication of WO2020048247A1 publication Critical patent/WO2020048247A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/08Insurance

Definitions

  • the present application relates to a method, an apparatus, a computer device, and a storage medium for processing settlement data.
  • a method, an apparatus, a computer device, and a storage medium for processing settlement data are provided.
  • a method for processing settlement data including:
  • the authority data table includes authority settlement data and corresponding authority fields, and the authority settlement data is set with settlement attributes and categories;
  • the data to be settled corresponding to the field to be settled is non-standardized settlement data, and to-be-determined settlement data is generated based on the non-standardized to-be-settled data, and a decision model is obtained according to The decision model makes a decision on the settlement data to be decided, outputs settlement attributes and categories corresponding to the settlement data to be decided, and adds to the settlement data to be decided; and according to a plurality of added settlement attributes and categories,
  • the settlement data generates a project settlement directory according to the category, so that the server performs project settlement processing according to the project settlement directory.
  • a settlement data processing device includes:
  • An obtaining module configured to obtain multiple initial settlement directories sent by the terminal, the initial settlement directories including multiple pending settlement data and corresponding multiple pending settlement fields; obtaining a settlement data table, where the settlement data table includes standardized settlement data And corresponding multiple specification fields, the specification settlement data is set with settlement attributes and categories;
  • a matching module configured to match the field to be settled with a specification field in the specification data table, and calculate a degree of matching between the field to be settled and the plurality of specification fields; if the degree of matching reaches a preset value A threshold to-be-settled field, determining that the to-be-settled data corresponding to the to-be-settled field is standard settlement data, and adding the settlement attribute and category set by the standard-settlement data corresponding to the specification field to the to-be-settled data;
  • a decision-making module configured to output to-be-settled data corresponding to the to-be-settled fields as non-standard settlement data if there is a to-be-settled field with a matching degree that does not reach a preset threshold value, and generate to-be-determined settlement data based on the non-standard to-be-settled data, Inputting the settlement data to be decided into a trained decision model for decision making, obtaining settlement attributes and categories corresponding to the settlement data to be decided, and adding to the settlement data to be decided; and
  • the directory generating module is configured to generate a project settlement directory according to the category based on a plurality of settlement data after adding settlement attributes and categories, so that the server performs project settlement processing according to the project settlement directory.
  • a computer device includes a memory and one or more processors.
  • the memory stores computer-readable instructions.
  • the one or more processors execute the following: step:
  • the authority data table includes authority settlement data and corresponding authority fields, and the authority settlement data is set with settlement attributes and categories;
  • the data to be settled corresponding to the field to be settled is non-standardized settlement data, and to-be-determined settlement data is generated based on the non-standardized to-be-settled data.
  • Settlement data is input into a trained decision model for decision making, and outputs settlement attributes and categories corresponding to the settlement data to be determined, and is added to the settlement data to be determined; and settlement is added based on multiple settlement attributes and categories added
  • the data generates a project settlement directory according to the category, so that the server performs project settlement processing according to the project settlement directory.
  • One or more non-volatile computer-readable storage media storing computer-readable instructions.
  • the one or more processors execute the following steps: obtaining multiple initial A settlement directory, where the initial settlement directory includes multiple data to be settled and corresponding multiple fields to be settled;
  • the authority data table includes authority settlement data and corresponding authority fields, and the authority settlement data is set with a settlement attribute and category;
  • FIG. 1 is an application scenario diagram of a settlement data processing method according to one or more embodiments.
  • FIG. 2 is a schematic flowchart of a settlement data processing method according to one or more embodiments.
  • FIG. 3 is a schematic flowchart of a decision-making process for decision-making settlement data according to one or more embodiments.
  • FIG. 4 is a schematic flowchart of steps of constructing a decision model according to one or more embodiments.
  • FIG. 5 is a structural block diagram of a settlement data processing apparatus according to one or more embodiments.
  • FIG. 6 is an internal structural diagram of a computer device according to one or more embodiments.
  • the settlement data processing method provided in this application can be applied to the application environment shown in FIG. 1.
  • the terminal 102 communicates with the server 104 through the network through the network.
  • the terminal 102 may be, but is not limited to, various personal computers, notebook computers, smart phones, tablet computers, and portable wearable devices.
  • the server 104 may be implemented by an independent server or a server cluster composed of multiple servers.
  • the server 104 may obtain multiple initial settlement directories sent by multiple terminals 102.
  • the initial settlement directory includes multiple data to be settled and corresponding multiple fields to be settled.
  • the server 104 further obtains a authority data table, and the authority data table includes authority settlement data and corresponding authority fields.
  • the authority settlement data is set with a settlement attribute and a category. Match the fields to be settled with the canonical settlement fields.
  • the server 104 inputs the settlement data to be decided into the trained decision model for decision making, outputs the settlement attributes and categories corresponding to the settlement data to be decided, and adds it to the settlement data to be decided.
  • the server 104 generates a project settlement directory according to a corresponding category based on a plurality of settlement data to which settlement attributes and categories are added, and performs project settlement processing according to the project settlement directory.
  • a method for processing settlement data is provided.
  • the method is applied to the server in FIG. 1 as an example, and includes the following steps:
  • Step 202 Obtain multiple initial settlement directories sent by the terminal.
  • the initial settlement directory includes multiple data to be settled and corresponding multiple fields to be settled.
  • the initial settlement directory may be a settlement directory released in various places, such as a social insurance settlement directory, a medical insurance settlement directory, and a drug settlement directory.
  • the data to be settled refers to the settlement information of the items to be settled.
  • the server can obtain multiple initial settlement directories sent by multiple terminals.
  • the initial settlement directory includes multiple data to be settled, and each piece of data to be settled includes corresponding multiple fields to be settled.
  • the drug settlement directory may include multiple drug settlement data, and each drug settlement data may include multiple field information such as "general name”, “dosage form”, “specification”, and "manufacturer”.
  • Step 204 Obtain a normative data table.
  • the normative data table includes normative settlement data and corresponding normative fields.
  • the normative settlement data is set with settlement attributes and categories.
  • the server After the server obtains multiple initial settlement directories, it further obtains the specification data table.
  • the authority data table may be authority settlement data defined in advance according to a preset rule. It may also be that after the server obtains a large amount of settlement data, it performs a big data analysis on the large amount of settlement data, and according to the analysis result and a plurality of standardized settlement data defined by preset rules.
  • the authority data table includes a plurality of authority fields corresponding to authority settlement data.
  • the authority settlement data also includes corresponding settlement attributes and categories.
  • Step 206 Match the fields to be settled with the specification fields in the specification data table, and calculate the degree of matching between the field to be settled and the plurality of specification fields.
  • step 208 if there is a field to be settled whose matching degree reaches a preset threshold, it is determined that the data to be settled corresponding to the field to be settled is standardized settlement data, and the settlement attributes and categories set by the standard settlement data corresponding to the specification field are added to the data to be settled. in.
  • the server After the server obtains the specification data table, it matches the to-be-settled fields in the to-be-settled data with the specification fields in the specification-settlement data, and calculates the degree of matching between the to-be-settled fields and multiple specification fields.
  • the degree of matching between the to-be-settled field in the to-be-settled data and the canonical field in the to-be-settled data reaches a preset threshold, it indicates that the to-be-settled data is normative settlement data, and it is determined that the to-be-settled data corresponding to the to-be-settled field is normative.
  • Settle the data Settle the data, and add the settlement attributes and categories set by the standard settlement data corresponding to the specification field to the data to be settled. In this way, the standardized settlement data in the data to be settled can be directly identified, and corresponding settlement attributes and categories are added.
  • step 210 if there is a field to be settled whose matching degree does not reach a preset threshold, it is determined that the data to be settled corresponding to the field to be settled is non-standardized settlement data, and to-be-determined settlement data is generated based on the non-standardized to-be-settled data. Make a decision in the trained decision model, output settlement attributes and categories corresponding to the settlement data to be decided, and add it to the settlement data to be decided.
  • the server determines that the data to be settled corresponding to the field to be settled is irregular settlement data, and generates the settlement data to be decided according to the non-standardized settlement data. Specifically, when the server matches the field to be settled in the data to be settled with the specification field in the standard settlement data, the server extracts data to be settled that does not reach the preset matching degree, and the data to be settled that does not reach the preset matching degree is Data to be settled that are inconsistent with the standard settlement data.
  • the server generates the settlement data for decision-making based on the non-standard data to be settled, and then obtains a decision model that has been trained and constructed.
  • the decision model includes multiple nodes, and the data for decision-making is input into the decision model, and the node order of the decision model is followed. Iterate until the settlement attributes and categories corresponding to the settlement data to be determined are obtained.
  • the settlement attributes and categories corresponding to the settlement data to be decided can be accurately determined, and the settlement attributes and categories decided on are added to the settlement data to be decided.
  • Step 212 Generate a project settlement directory according to the category according to a plurality of settlement data after adding settlement attributes and categories, so that the server performs project settlement processing according to the project settlement directory.
  • the server After the server determines the corresponding settlement attribute and category for the decision settlement data according to the decision model, it generates a project settlement directory according to the category based on multiple settlement data after adding settlement attributes and categories.
  • the settlement data after adding the settlement attribute and category includes the pending settlement data with the settlement attribute and category added, and the pending decision settlement data with the settlement attribute and category added.
  • the server After the server generates the project settlement directory and forms a standardized project settlement directory, the corresponding project can be settled according to the project settlement directory.
  • the server obtains multiple initial settlement directories sent by the terminal.
  • the initial settlement directory includes multiple data to be settled, and the data to be settled includes corresponding multiple fields to be settled.
  • the server obtains the authority data table, and the authority data table includes a plurality of authority fields corresponding to the authority settlement data.
  • the authority settlement data is set with a settlement attribute and a category.
  • the server matches the field to be settled with the specification field in the specification data table, and calculates a degree of matching between the field to be settled and the plurality of specification fields.
  • the server inputs the settlement data to be decided into a trained decision model for decision making, outputs the settlement attributes and categories corresponding to the settlement data to be decided, and adds it to the settlement data to be decided.
  • the server generates a project settlement directory according to the category based on a plurality of settlement data after adding settlement attributes and categories, so that the project settlement processing can be performed according to the project settlement directory.
  • the settlement attributes and categories corresponding to the data to be settled can be accurately obtained, and a standardized project settlement directory can be formed, which can effectively improve the processing efficiency of settlement data.
  • matching the fields to be settled with the canonical fields in the canonical data table includes: obtaining a semantic matching model, which includes multiple canonical settlement data and corresponding multiple canonical field vectors; through semantic matching The model extracts field vectors corresponding to a plurality of fields to be settled in the data to be settled; and calculates a degree of matching between the field vectors corresponding to the data to be settled and a plurality of canonical field vectors.
  • the server obtains multiple initial settlement directories.
  • the initial settlement directory includes multiple pending settlement data, and each piece of pending settlement data includes a corresponding multiple pending settlement fields.
  • the drug settlement directory may include multiple drug settlement data, and each drug settlement data may include multiple field information such as "general name”, “dosage form”, “specification”, and "manufacturer”.
  • the server After the server obtains a plurality of initial settlement directories, it further obtains a specification data table.
  • the specification data table includes a plurality of specification fields corresponding to the specification settlement data, and the specification settlement data is set with settlement attributes and categories.
  • the server obtains the corresponding semantic matching model according to the official data table.
  • the semantic matching model includes a plurality of canonical field vectors corresponding to a plurality of canonical settlement data.
  • the server extracts the field vectors corresponding to the plurality of fields to be settled in the data to be settled, and matches the field vectors corresponding to the data to be settled with the plurality of standard field vectors corresponding to the plurality of standard settlement data to calculate the corresponding The degree of match between the field vector and multiple canonical field vectors.
  • the drug settlement directory includes multiple drug data, and each drug data includes the corresponding multiple to-be-cleared fields such as "general name”, “dosage form”, “specification”, and “manufacturer”. Content, and extract field vectors corresponding to the contents of multiple fields to be settled.
  • the server obtains the corresponding canonical data table, it further obtains the corresponding semantic matching model.
  • the semantic matching model includes multiple canonical field vectors corresponding to multiple canonical settlement data.
  • the server matches the multiple field vectors corresponding to the drug data with the multiple standard field vectors corresponding to multiple regulatory settlement data, that is, the corresponding "common name”, “dosage form”, “specification”, and “manufacturer” of the pharmaceutical data.
  • the contents of multiple to-be-settled fields are matched with multiple canonical field vectors corresponding to multiple canonical settlement data.
  • the decision model includes a plurality of nodes, and the steps of inputting settlement data to be decided into the trained decision model for decision specifically include the following:
  • Step 302 Use a decision model to extract multiple field vectors corresponding to the settlement data to be decided.
  • step 304 the multiple field vectors corresponding to the settlement data to be decided are traversed and matched according to the node order of the decision model, and the matching degree between the field vector and the multiple nodes is calculated.
  • Step 306 Until a plurality of field vectors are matched to obtain the corresponding target settlement attribute and target category, the decision model outputs the settlement attribute and category corresponding to the settlement data to be decided.
  • the server After the server obtains multiple initial settlement directories, it obtains a specification data table.
  • the initial settlement directory includes multiple data to be settled, and the data to be settled includes corresponding multiple fields to be settled.
  • the authority data table includes a plurality of authority fields corresponding to authority settlement data, and authority data includes corresponding settlement attributes and categories.
  • the server matches the field to be settled with the specification field, and calculates the matching degree of the field to be settled with multiple specification fields. If there is a field to be settled when the matching degree reaches a preset threshold, it means that the field to be settled is consistent with the specification field, then the data to be settled corresponding to the field to be settled is the standard settlement data, and the standard settlement data corresponding to the specification field is set Added to the pending data.
  • the server further extracts multiple field vectors corresponding to the settlement data to be decided, and obtains a decision model.
  • the decision model includes a decision tree, and a plurality of nodes are set in the decision tree in advance. Multiple field vectors corresponding to the settlement data to be decided are input to the decision model, and the traversal matching is performed according to the order of the nodes in the decision model to calculate the matching degree of the field vector with multiple nodes.
  • the target settlement attribute and the corresponding target category of each field vector are determined according to the matching degree. Until multiple field vectors are matched to obtain the corresponding target settlement attributes and target categories, the server outputs the settlement attributes and categories corresponding to the settlement data to be decided according to the decision model, and adds the corresponding settlement attributes and categories to the settlement data to be decided .
  • the efficiency of the decision can be improved, and the settlement attributes and categories corresponding to the settlement data to be determined can be effectively and accurately obtained.
  • a step of constructing a decision model is further included.
  • the step specifically includes the following content:
  • Step 402 Obtain multiple settlement data in multiple databases.
  • the settlement data includes multiple field names and corresponding field values.
  • Step 404 Perform cluster analysis on the field names of the settlement data to obtain priority parameters for each field name.
  • Step 406 Calculate the weights of multiple field names according to the priority parameters of the field names.
  • Step 408 Train the association relationship between the multiple settlement data and the settlement attribute and the corresponding category according to the field names and corresponding field values of the multiple settlement data.
  • Step 410 Construct a decision tree according to the weights of multiple field names and the correlation between the training settlement data, settlement attributes, and categories, and generate a decision model according to the decision tree.
  • the server Before the server obtains the decision model, it needs to build a decision model in advance. Specifically, the server may obtain multiple settlement data from databases of multiple websites, and the obtained settlement data may be an initial settlement directory issued by each place or institution.
  • the settlement data includes multiple field names and corresponding field values.
  • the server first performs cluster analysis on multiple field names in the settlement data. Specifically, the server analyzes multiple field names in the settlement data to analyze the probability of each field name being used in the settlement field. The more frequently used field names are more important, and according to the field names, Probability gets the priority parameter for each field name. The server further sorts the field names according to the priority probability of each field name. By prioritizing the field names and calculating the weights of multiple field names, the efficiency of decision-making on settlement data can be effectively improved.
  • the server further trains association relationships between the multiple settlement data and settlement attributes and categories according to the field names and corresponding field values of the multiple settlement data.
  • the server may take each settlement data as a sample, take the field name and corresponding field value in each settlement data as one dimension, and each settlement data has multiple dimensions.
  • the server further performs cluster analysis on each dimension of each settlement data.
  • the server can use a cluster analysis algorithm, such as the K-means algorithm, and iteratively calculates multiple samples using multiple dimensions of each settlement data as data objects in order to calculate the clustering result corresponding to each settlement data.
  • the server can obtain multiple dimensional variables corresponding to multiple field names and corresponding field values in the settlement data, and then perform cluster analysis on multiple settlement data samples, thereby analyzing the relationship between settlement data and settlement attributes and categories. Relationship.
  • the server can construct a decision tree based on the weights of multiple field names and the association between the training settlement data and settlement attributes and categories, and generate a decision model based on the decision tree.
  • a decision model based on the relationship between the training data of the sorted field names and the settlement attributes and categories, it is possible to effectively improve the efficiency and accuracy of decision-making based on the decision model.
  • the method further includes: obtaining a plurality of update settlement data, the update settlement data is set with a settlement attribute and a category; performing cluster analysis on the plurality of update settlement data to obtain the update weights and Update the association relationship parameters; adjust the parameters of the decision model according to the update weight and the update association parameter to optimize the decision model.
  • the decision model also needs to be adjusted with the changes of various factors and time to improve the stability of the model.
  • the server constructs a decision model according to the priority probability of the field names of the plurality of settlement data and the training relationship between the settlement data and the settlement attributes and categories, it can further optimize the decision model to make the decision model accurate. higher.
  • the server may obtain the updated settlement data within a preset time period from the database.
  • the preset time may be one year, or half a year, one quarter, or one month.
  • the updated settlement data may be updated settlement data after updating relevant rules of the settlement data, or may be updated normative settlement data defined in accordance with a preset rule.
  • Settlement attributes and categories are set in the update settlement data.
  • the server further performs cluster analysis on multiple update settlement data to obtain update weights and update association parameters for multiple field names.
  • the server further adjusts the relevant commitments of the decision model according to the update weight and the update relationship parameter to optimize the decision model, which can effectively ensure the stability and newness of the decision model, and can effectively improve the accuracy of classification of settlement data according to the decision model. Sex.
  • the category includes multiple hierarchical categories.
  • the method further includes: according to the settlement attributes of the settlement data and multiple hierarchical category pairs. Sorting settlement data; encoding settlement data of multiple hierarchical categories according to a preset manner; storing the encoded settlement directory.
  • the server sorts the settlement data according to the settlement attribute and category of the settlement data.
  • the category of settlement data may also include multiple hierarchical categories, and the server sorts the settlement data in the project settlement directory according to the corresponding category or hierarchical category.
  • the server sorts the project settlement directory, it encodes the settlement data of each hierarchical category according to a preset method, and stores the encoded project settlement directory.
  • the coded characters include at least two types. Coded characters can be letters and numbers, such as ABS10001, ABS100012. Sub-characters are added in sequence to the settlement data under each hierarchical category in accordance with the corresponding encoded characters. For example ABS10001001, ABS10001002. By encoding the detailed classified item settlement catalog data, the management efficiency of settlement data is effectively improved.
  • steps in the flowcharts of FIGS. 2-4 are sequentially displayed in accordance with the directions of the arrows, these steps are not necessarily performed in the order indicated by the arrows. Unless explicitly stated in this document, the execution of these steps is not strictly limited, and these steps can be performed in other orders. Moreover, at least a part of the steps in Figure 2-4 may include multiple sub-steps or multiple stages. These sub-steps or stages are not necessarily performed at the same time, but may be performed at different times. These sub-steps or stages The execution order of is not necessarily performed sequentially, but may be performed in turn or alternately with at least a part of another step or a sub-step or stage of another step.
  • a settlement data processing device which includes: an acquisition module 502, a matching module 504, a decision module 506, and a directory generation module 508, where:
  • the obtaining module 502 is configured to obtain multiple initial settlement directories sent by the terminal.
  • the initial settlement directory includes multiple pending settlement data and corresponding multiple pending settlement fields; and obtains a settlement data table, where the settlement data table includes standardized settlement data and corresponding Multiple specification fields. Settlement attributes and categories are set in the standard settlement data;
  • a matching module 504 is configured to match a field to be settled with a standard field in a specification data table, and calculate a degree of matching between the field to be settled and a plurality of standard fields; if there is a field to be settled with a matching degree that reaches a preset threshold, determine a field to be settled
  • the data to be settled corresponding to the field is standard settlement data, and the settlement attributes and categories set by the standard settlement data corresponding to the standard field are added to the data to be settled;
  • a decision module 506 configured to determine that the pending data corresponding to the pending settlement field is non-standard settlement data if there is a pending field that does not reach a preset threshold, and generate pending decision settlement data based on the non-standard pending settlement data, and Settlement data is input into the trained decision model for decision-making, output settlement attributes and categories corresponding to the settlement data to be decided, and added to the settlement data to be decided;
  • the directory generation module 508 is configured to generate a project settlement directory according to categories based on a plurality of settlement data after adding settlement attributes and categories, so that the server performs project settlement processing according to the project settlement directory.
  • the matching module 504 is further configured to obtain a semantic matching model.
  • the semantic matching model includes multiple canonical settlement data and corresponding multiple canonical field vectors.
  • a plurality of to-be-settled data in the to-be-settled data are extracted through the semantic matching model.
  • the decision model includes multiple nodes, and the decision module 506 is further configured to use the decision model to extract multiple field vectors corresponding to the settlement data to be decided; Nodes are traversed and matched sequentially to calculate the matching degree between the field vector and multiple nodes; and until the multiple field vectors are matched to obtain the corresponding target settlement attributes and target categories, the decision model outputs the settlement attributes and categories corresponding to the settlement data to be decided.
  • the device further includes a model building module for obtaining multiple settlement data in multiple databases, the settlement data includes multiple field names and corresponding field values; and performs cluster analysis on the field names of the settlement data To obtain the priority parameters of each field name; calculate the weight of multiple field names according to the priority parameters of the field names; train multiple settlement data and settlement attributes and corresponding categories based on the field names and corresponding field values of multiple settlement data The association relationship between them; and constructing a decision tree based on the weights of multiple field names and the association relationship between the training settlement data and settlement attributes and categories, and generating a decision model based on the decision tree.
  • a model building module for obtaining multiple settlement data in multiple databases, the settlement data includes multiple field names and corresponding field values; and performs cluster analysis on the field names of the settlement data To obtain the priority parameters of each field name; calculate the weight of multiple field names according to the priority parameters of the field names; train multiple settlement data and settlement attributes and corresponding categories based on the field names and corresponding field values of multiple settlement data The association relationship between them; and
  • the device further includes a model optimization module for obtaining and acquiring multiple updated settlement data, the updated settlement data is set with a settlement attribute and category; cluster analysis is performed on the multiple updated settlement data to obtain multiple Update weights of field names and update association relationship parameters; and adjust parameters of the decision model according to the update weights and update association relationship parameters to optimize the decision model.
  • a model optimization module for obtaining and acquiring multiple updated settlement data, the updated settlement data is set with a settlement attribute and category; cluster analysis is performed on the multiple updated settlement data to obtain multiple Update weights of field names and update association relationship parameters; and adjust parameters of the decision model according to the update weights and update association relationship parameters to optimize the decision model.
  • the category includes multiple hierarchical categories
  • the directory generation module 508 is further configured to sort the settlement data according to the settlement attributes and multiple hierarchical categories of the settlement data; and to settle the settlement data of the multiple hierarchical categories in a preset manner Encode; and store the encoded settlement directory.
  • Each module in the above-mentioned settlement data processing device may be implemented in whole or in part by software, hardware, and a combination thereof.
  • the above-mentioned modules may be embedded in the hardware form or independent of the processor in the computer device, or may be stored in the memory of the computer device in the form of software, so that the processor calls and performs the operations corresponding to the above modules.
  • a computer device is provided.
  • the computer device may be a server, and its internal structure diagram may be as shown in FIG. 6.
  • the computer device includes a processor, a memory, a network interface, and a database connected through a system bus.
  • the processor of the computer device is used to provide computing and control capabilities.
  • the memory of the computer device includes a non-volatile storage medium and an internal memory.
  • the non-volatile storage medium stores an operating system, computer-readable instructions, and a database.
  • the internal memory provides an environment for operating the operating system and computer-readable instructions in a non-volatile storage medium.
  • the computer equipment database is used to store the initial settlement data directory, the standard settlement data table, and the project settlement data directory.
  • the network interface of the computer device is used to communicate with an external terminal through a network connection.
  • the computer-readable instructions are executed by a processor to implement a settlement data processing method.
  • FIG. 6 is only a block diagram of a part of the structure related to the scheme of the present application, and does not constitute a limitation on the computer equipment to which the scheme of the present application is applied.
  • the specific computer equipment may be Include more or fewer parts than shown in the figure, or combine certain parts, or have a different arrangement of parts.
  • a computer device includes a memory and one or more processors.
  • Computer-readable instructions are stored in the memory.
  • the one or more processors execute the following steps:
  • the initial settlement directory includes multiple data to be settled and corresponding multiple fields to be settled;
  • Obtaining the authority data table which includes authority settlement data and corresponding authority fields, and authority properties are set in the authority settlement data;
  • the processor when the processor executes the computer-readable instructions, the processor further implements the following steps: obtaining a semantic matching model, the semantic matching model including multiple canonical settlement data and corresponding multiple canonical field vectors; and extracting the data through the semantic matching model Field vectors corresponding to a plurality of fields to be settled in the data to be settled; and a degree of matching between a field vector corresponding to the data to be settled and a plurality of canonical field vectors.
  • the processor when the processor executes the computer-readable instructions, the processor further implements the following steps: using a decision model to extract multiple field vectors corresponding to the settlement data to be decided; and dividing the multiple field vectors corresponding to the settlement data to be decided according to the decision model. Nodes are traversed and matched sequentially to calculate the matching degree between the field vector and multiple nodes; and until the multiple field vectors are matched to obtain the corresponding target settlement attributes and target categories, the decision model outputs the settlement attributes and categories corresponding to the settlement data to be decided.
  • the processor when the processor executes the computer-readable instructions, the processor further implements the following steps: obtaining multiple settlement data in multiple databases, the settlement data including multiple field names and corresponding field values; performing field names on the settlement data Cluster analysis to obtain the priority parameters of each field name; calculate the weight of multiple field names based on the priority parameters of the field names; train multiple settlement data and settlement attributes based on the field names and corresponding field values of multiple settlement data Association with corresponding categories; and constructing a decision tree based on the weights of multiple field names and the association between training data and settlement attributes and categories, and generating a decision model based on the decision tree.
  • the processor when the processor executes the computer-readable instructions, the processor further implements the following steps: acquiring multiple updated settlement data, the updated settlement data is set with a settlement attribute and category; and performing cluster analysis on the multiple updated settlement data to obtain Update weights and update association relationship parameters of multiple field names; and adjust parameters of the decision model according to the update weights and update association relationship parameters to optimize the decision model.
  • the category includes multiple hierarchical categories.
  • the processor executes the computer-readable instructions, the following steps are further implemented: sorting the settlement data according to the settlement attributes of the settlement data and multiple hierarchical categories; Encoding settlement data of each hierarchical category; and storing the encoded settlement directory.
  • One or more non-volatile computer-readable storage media storing computer-readable instructions.
  • the one or more processors execute the following steps:
  • the initial settlement directory includes multiple data to be settled and corresponding multiple fields to be settled;
  • Obtaining the authority data table which includes authority settlement data and corresponding authority fields, and authority properties are set in the authority settlement data;
  • the following steps are further implemented: obtaining a semantic matching model, the semantic matching model including multiple canonical settlement data and corresponding multiple canonical field vectors; and extracting through the semantic matching model A field vector corresponding to a plurality of fields to be settled in the data to be settled is calculated; and a matching degree between a field vector corresponding to the data to be settled and a plurality of canonical field vectors is calculated.
  • the following steps are further implemented: using a decision model to extract multiple field vectors corresponding to the settlement data to be decided; and using the multiple field vectors corresponding to the settlement data to be decided according to the decision model
  • the nodes are traversed and matched sequentially to calculate the matching degree between the field vector and multiple nodes; and until the multiple field vectors are matched to obtain the corresponding target settlement attributes and target categories, the settlement attributes and categories corresponding to the settlement data to be decided are output through the decision model.
  • the following steps are further implemented: obtaining multiple settlement data in multiple databases, the settlement data including multiple field names and corresponding field values; and field names of the settlement data Perform cluster analysis to obtain the priority parameters of each field name; calculate the weight of multiple field names based on the priority parameters of the field names; train multiple settlement data and settlements based on the field names and corresponding field values of multiple settlement data Association between attributes and corresponding categories; and constructing a decision tree based on the weights of multiple field names and the association between training data and settlement attributes and categories, and generating a decision model based on the decision tree.
  • the following steps are also implemented: obtaining multiple updated settlement data, the updated settlement data is set with a settlement attribute and category; performing cluster analysis on the multiple updated settlement data, Obtain the update weights and update association relationship parameters of multiple field names; and adjust the parameters of the decision model according to the update weights and update association relationship parameters to optimize the decision model.
  • the category includes multiple hierarchical categories, and when the computer-readable instructions are executed by the processor, the following steps are further implemented: sorting the settlement data according to the settlement attributes of the settlement data and multiple hierarchical categories; Encoding settlement data for multiple hierarchical categories; and storing the encoded settlement directory.
  • Non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory.
  • Volatile memory can include random access memory (RAM) or external cache memory.
  • RAM is available in various forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), dual data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Synchlink DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
  • SRAM static RAM
  • DRAM dynamic RAM
  • SDRAM synchronous DRAM
  • DDRSDRAM dual data rate SDRAM
  • ESDRAM enhanced SDRAM
  • SLDRAM synchronous chain Synchlink DRAM
  • Rambus direct RAM
  • DRAM direct memory bus dynamic RAM
  • RDRAM memory bus dynamic RAM

Landscapes

  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Engineering & Computer Science (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Technology Law (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Disclosed is a settlement data processing method, comprising: acquiring multiple initial settlement directories sent by a terminal, wherein the initial settlement directories comprise multiple pieces of data to be settled and multiple corresponding fields to be settled; acquiring a standard data table, matching the fields to be settled with standard fields corresponding to standard settlement data in the standard data table, and calculating the degree of matching between the fields to be settled and multiple standard fields; if there is a field to be settled with the degree of matching reaching a pre-set threshold value, adding a settlement attribute and category corresponding to the standard fields to the data to be settled; if there is a field to be settled with the degree of matching not reaching the pre-set threshold value, generating settlement data to be decided according to unmatched data to be settled; inputting the settlement data to be decided into a trained decision model for making a decision, and adding the obtained settlement attribute and category to the settlement data to be decided; and generating an item settlement directory according to the category, so that a server performs item settlement processing according to the item settlement directory.

Description

结算数据处理方法、装置、计算机设备和存储介质Settlement data processing method, device, computer equipment and storage medium
相关申请的交叉引用Cross-reference to related applications
本申请要求于2018年9月3日提交中国专利局,申请号为2018110201698,申请名称为“结算数据处理方法、装置、计算机设备和存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed on September 3, 2018 with the Chinese Patent Office under the application number 2018110201698 and the application name is "Settlement Data Processing Method, Device, Computer Equipment and Storage Medium", the entire contents of which are hereby incorporated by reference Incorporated in this application.
技术领域Technical field
本申请涉及一种结算数据处理方法、装置、计算机设备和存储介质。The present application relates to a method, an apparatus, a computer device, and a storage medium for processing settlement data.
背景技术Background technique
随着互联网技术的迅速发展,利用互联网技术对各种结算项目进行结算也越来越便捷。例如对社会保险数据和医疗保险数据进行结算十分便捷。利用互联网技术处理各种结算数据能够有效地提高结算数据的处理效率。With the rapid development of Internet technology, it is becoming more and more convenient to use Internet technology to settle various settlement items. For example, settlement of social insurance data and medical insurance data is very convenient. Using Internet technology to process various settlement data can effectively improve the processing efficiency of settlement data.
然而,发明人意识到,传统的保险结算方式,通常只是根据传统规定的社会保险结算目录进行结算。而传统的结算目录范围比较限定,各个地方的结算目录中的结算数据参差不齐,并不统一。导致在利用结算目录与各机构的结算项目进行对接和结算时,可能会存在一些例如计算量复杂、结算项目之间转换比较耗时等问题,进而导致根据结算目录进行处理的效率和准确率比较低。因此,如何有效提高准确有效地识别和匹配规范化结算数据,以提高对结算数据的处理效率成为目前需要解决的技术问题。However, the inventors realized that traditional insurance settlement methods usually only settle according to the traditional social insurance settlement catalog. However, the scope of the traditional settlement directory is relatively limited. The settlement data in the settlement directories in various places is uneven and not uniform. As a result, when using the settlement catalog to connect and settle with the settlement items of various institutions, there may be problems such as complex calculations and the time-consuming conversion between settlement items, which leads to comparison of the efficiency and accuracy of processing according to the settlement catalog low. Therefore, how to effectively and accurately identify and match standardized settlement data in order to improve the processing efficiency of settlement data has become a technical problem that needs to be solved at present.
发明内容Summary of the Invention
根据本申请公开的各种实施例,提供一种结算数据处理方法、装置、计算机设备和存储介质。According to various embodiments disclosed in the present application, a method, an apparatus, a computer device, and a storage medium for processing settlement data are provided.
一种结算数据处理方法,包括:A method for processing settlement data, including:
获取终端发送的多个初始结算目录,所述初始结算目录包括多个待结算数据和对应的多个待结算字段;Acquiring multiple initial settlement directories sent by the terminal, where the initial settlement directory includes multiple data to be settled and corresponding multiple fields to be settled;
获取规范数据表,所述规范数据表中包括规范结算数据和对应的多个规范字段,所述规范结算数据设定有结算属性和类别;Obtaining a authority data table, where the authority data table includes authority settlement data and corresponding authority fields, and the authority settlement data is set with settlement attributes and categories;
将所述待结算字段与所述规范数据表中的规范字段进行匹配,计算所述待结算字段与所述多个规范字段的匹配度;Matching the field to be settled with a specification field in the specification data table, and calculating a degree of matching between the field to be settled and the plurality of specification fields;
若存在所述匹配度达到预设阈值的待结算字段,确定所述待结算字段对应的待结算数据为规范结算数据,将所述规范字段相对应的规范结算数据设定的结算属性和类 别添加至所述待结算数据中;If there is a field to be settled where the matching degree reaches a preset threshold, determine that the data to be settled corresponding to the field to be settled is standard settlement data, and add the settlement attributes and categories set by the standard settlement data corresponding to the standard field To the data to be settled;
若存在所述匹配度未达到预设阈值的待结算字段,确定所述待结算字段对应的待结算数据为不规范结算数据,根据不规范待结算数据生成待决策结算数据,获取决策模型,根据所述决策模型对所述待决策结算数据进行决策,输出所述待决策结算数据对应的结算属性和类别,并添加至所述待决策结算数据中;及根据多个添加结算属性和类别后的结算数据按照所述类别生成项目结算目录,使得服务器根据所述项目结算目录进行项目结算处理。If there is a field to be settled where the matching degree does not reach a preset threshold, it is determined that the data to be settled corresponding to the field to be settled is non-standardized settlement data, and to-be-determined settlement data is generated based on the non-standardized to-be-settled data, and a decision model is obtained according to The decision model makes a decision on the settlement data to be decided, outputs settlement attributes and categories corresponding to the settlement data to be decided, and adds to the settlement data to be decided; and according to a plurality of added settlement attributes and categories, The settlement data generates a project settlement directory according to the category, so that the server performs project settlement processing according to the project settlement directory.
一种结算数据处理装置,包括:A settlement data processing device includes:
获取模块,用于获取终端发送的多个初始结算目录,所述初始结算目录包括多个待结算数据和对应的多个待结算字段;获取结算数据表,所述结算数据表中包括规范结算数据和对应的多个规范字段,所述规范结算数据设定有结算属性和类别;An obtaining module, configured to obtain multiple initial settlement directories sent by the terminal, the initial settlement directories including multiple pending settlement data and corresponding multiple pending settlement fields; obtaining a settlement data table, where the settlement data table includes standardized settlement data And corresponding multiple specification fields, the specification settlement data is set with settlement attributes and categories;
匹配模块,用于将所述待结算字段与所述规范数据表中的规范字段进行匹配,计算所述待结算字段与所述多个规范字段的匹配度;若存在所述匹配度达到预设阈值的待结算字段,确定所述待结算字段对应的待结算数据为规范结算数据,将所述规范字段相对应的规范结算数据设定的结算属性和类别添加至所述待结算数据中;A matching module, configured to match the field to be settled with a specification field in the specification data table, and calculate a degree of matching between the field to be settled and the plurality of specification fields; if the degree of matching reaches a preset value A threshold to-be-settled field, determining that the to-be-settled data corresponding to the to-be-settled field is standard settlement data, and adding the settlement attribute and category set by the standard-settlement data corresponding to the specification field to the to-be-settled data;
决策模块,用于若存在所述匹配度未达到预设阈值的待结算字段,输出所述待结算字段对应的待结算数据为不规范结算数据,根据不规范待结算数据生成待决策结算数据,将所述待决策结算数据输入至已训练的决策模型中进行决策,得到所述待决策结算数据对应的结算属性和类别,并添加至所述待决策结算数据中;及A decision-making module, configured to output to-be-settled data corresponding to the to-be-settled fields as non-standard settlement data if there is a to-be-settled field with a matching degree that does not reach a preset threshold value, and generate to-be-determined settlement data based on the non-standard to-be-settled data, Inputting the settlement data to be decided into a trained decision model for decision making, obtaining settlement attributes and categories corresponding to the settlement data to be decided, and adding to the settlement data to be decided; and
目录生成模块,用于根据多个添加结算属性和类别后的结算数据按照所述类别生成项目结算目录,使得服务器根据所述项目结算目录进行项目结算处理。The directory generating module is configured to generate a project settlement directory according to the category based on a plurality of settlement data after adding settlement attributes and categories, so that the server performs project settlement processing according to the project settlement directory.
一种计算机设备,包括存储器和一个或多个处理器,所述存储器存储有计算机可读指令,所述计算机可读指令被所述处理器执行时,使得所述一个或多个处理器执行以下步骤:A computer device includes a memory and one or more processors. The memory stores computer-readable instructions. When the computer-readable instructions are executed by the processor, the one or more processors execute the following: step:
获取终端发送的多个初始结算目录,所述初始结算目录包括多个待结算数据和对应的多个待结算字段;Acquiring multiple initial settlement directories sent by the terminal, where the initial settlement directory includes multiple data to be settled and corresponding multiple fields to be settled;
获取规范数据表,所述规范数据表中包括规范结算数据和对应的多个规范字段,所述规范结算数据设定有结算属性和类别;Obtaining a authority data table, where the authority data table includes authority settlement data and corresponding authority fields, and the authority settlement data is set with settlement attributes and categories;
将所述待结算字段与所述规范数据表中的规范字段进行匹配,计算所述待结算字段与所述多个规范字段的匹配度;Matching the field to be settled with a specification field in the specification data table, and calculating a degree of matching between the field to be settled and the plurality of specification fields;
若存在所述匹配度达到预设阈值的待结算字段,确定所述待结算字段对应的待结算数据为规范结算数据,将所述规范字段相对应的规范结算数据设定的结算属性和类别添加至所述待结算数据中;If there is a field to be settled where the matching degree reaches a preset threshold, determine that the data to be settled corresponding to the field to be settled is standard settlement data, and add the settlement attributes and categories set by the standard settlement data corresponding to the standard field To the data to be settled;
若存在所述匹配度未达到预设阈值的待结算字段,确定所述待结算字段对应的待结算数据为不规范结算数据,根据不规范待结算数据生成待决策结算数据,将所述待决策结算数据输入至已训练的决策模型中进行决策,输出所述待决策结算数据对应的结算属性和类别,并添加至所述待决策结算数据中;及根据多个添加结算属性和类别后的结算数据按照所述类别生成项目结算目录,使得服务器根据所述项目结算目录进行项目结算处理。If there is a field to be settled where the matching degree does not reach a preset threshold, it is determined that the data to be settled corresponding to the field to be settled is non-standardized settlement data, and to-be-determined settlement data is generated based on the non-standardized to-be-settled data. Settlement data is input into a trained decision model for decision making, and outputs settlement attributes and categories corresponding to the settlement data to be determined, and is added to the settlement data to be determined; and settlement is added based on multiple settlement attributes and categories added The data generates a project settlement directory according to the category, so that the server performs project settlement processing according to the project settlement directory.
一个或多个存储有计算机可读指令的非易失性计算机可读存储介质,计算机可读指令被一个或多个处理器执行时,使得一个或多个处理器执行以下步骤:获取多个初始结算目录,所述初始结算目录包括多个待结算数据和对应的多个待结算字段;One or more non-volatile computer-readable storage media storing computer-readable instructions. When the computer-readable instructions are executed by one or more processors, the one or more processors execute the following steps: obtaining multiple initial A settlement directory, where the initial settlement directory includes multiple data to be settled and corresponding multiple fields to be settled;
获取终端发送的规范数据表,所述规范数据表中包括规范结算数据和对应的多个规范字段,所述规范结算数据设定有结算属性和类别;Obtaining a authority data table sent by the terminal, where the authority data table includes authority settlement data and corresponding authority fields, and the authority settlement data is set with a settlement attribute and category;
将所述待结算字段与所述规范数据表中的规范字段进行匹配,计算所述待结算字段与所述多个规范字段的匹配度;Matching the field to be settled with a specification field in the specification data table, and calculating a degree of matching between the field to be settled and the plurality of specification fields;
若存在所述匹配度达到预设阈值的待结算字段,确定所述待结算字段对应的待结算数据为规范结算数据,将所述规范字段相对应的规范结算数据设定的结算属性和类别添加至所述待结算数据中;If there is a field to be settled where the matching degree reaches a preset threshold, determine that the data to be settled corresponding to the field to be settled is standard settlement data, and add the settlement attributes and categories set by the standard settlement data corresponding to the standard field To the data to be settled;
若存在所述匹配度未达到预设阈值的待结算字段,确定所述待结算字段对应的待结算数据为不规范结算数据,根据不规范待结算数据生成待决策结算数据,将所述待决策结算数据输入至已训练的决策模型中进行决策,输出所述待决策结算数据对应的结算属性和类别,并添加至所述待决策结算数据中;及If there is a field to be settled where the matching degree does not reach a preset threshold, it is determined that the data to be settled corresponding to the field to be settled is irregular settlement data, and to-be-determined settlement data is generated based on the non-standard to-be-settled data, and Settlement data is input into a trained decision model for decision making, outputting settlement attributes and categories corresponding to the pending decision settlement data, and added to the pending decision settlement data; and
根据多个添加结算属性和类别后的结算数据按照所述类别生成项目结算目录,使得服务器根据所述项目结算目录进行项目结算处理。Generate a project settlement directory according to the category based on multiple settlement data after adding settlement attributes and categories, so that the server performs project settlement processing according to the project settlement directory.
本申请的一个或多个实施例的细节在下面的附图和描述中提出。本申请的其它特征和优点将从说明书、附图以及权利要求书变得明显。Details of one or more embodiments of the present application are set forth in the accompanying drawings and description below. Other features and advantages of the application will become apparent from the description, the drawings, and the claims.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其它的附图。In order to explain the technical solutions in the embodiments of the present application more clearly, the drawings used in the embodiments will be briefly introduced below. Obviously, the drawings in the following description are only some embodiments of the present application. Those of ordinary skill in the art can obtain other drawings according to the drawings without paying creative labor.
图1为根据一个或多个实施例中结算数据处理方法的应用场景图。FIG. 1 is an application scenario diagram of a settlement data processing method according to one or more embodiments.
图2为根据一个或多个实施例中结算数据处理方法的流程示意图。FIG. 2 is a schematic flowchart of a settlement data processing method according to one or more embodiments.
图3为根据一个或多个实施例中对待决策结算数据进行决策步骤的流程示意图。FIG. 3 is a schematic flowchart of a decision-making process for decision-making settlement data according to one or more embodiments.
图4为根据一个或多个实施例中构建决策模型步骤的流程示意图。FIG. 4 is a schematic flowchart of steps of constructing a decision model according to one or more embodiments.
图5为根据一个或多个实施例中结算数据处理装置的结构框图。FIG. 5 is a structural block diagram of a settlement data processing apparatus according to one or more embodiments.
图6为根据一个或多个实施例中计算机设备的内部结构图。FIG. 6 is an internal structural diagram of a computer device according to one or more embodiments.
具体实施方式detailed description
为了使本申请的技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。In order to make the technical solution and advantages of the present application more clear and clear, the present application is further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the application, and are not used to limit the application.
本申请提供的结算数据处理方法,可以应用于如图1所示的应用环境中。终端102通过网络与服务器104通过网络进行通信。终端102可以但不限于是各种个人计算机、笔记本电脑、智能手机、平板电脑和便携式可穿戴设备,服务器104可以用独立的服务器或者是多个服务器组成的服务器集群来实现。服务器104可以获取多个终端102发送的多个初始结算目录,初始结算目录中包括多个待结算数据和对应的多个待结算字段。服务器104进一步获取规范数据表,规范数据表中包括规范结算数据和对应的多个规范字段,规范结算数据设定有结算属性和类别。将待结算字段与规范结算字段进行匹配。若存在匹配度达到预设阈值的待结算字段,确定待结算字段对应的待结算数据为规范结算数据,将规范字段相对应的规范结算数据设定的结算属性和类别添加至待结算数据。若存在匹配度未达到预设阈值的待结算字段,确定待结算字段对应的待结算数据为不规范结算数据,根据不规范待结算数据生成待决策结算数据。服务器104则将待决策结算数据输入至已训练的决策模型中进行决策,输出待决策结算数据对应的结算属性和类别,并添加至待决策结算数据中。服务器104根据多个添加了结算属性和类别后的结算数据按照对应的类别生成项目结算目录,并根据项目结算目录进行项目结算处理。The settlement data processing method provided in this application can be applied to the application environment shown in FIG. 1. The terminal 102 communicates with the server 104 through the network through the network. The terminal 102 may be, but is not limited to, various personal computers, notebook computers, smart phones, tablet computers, and portable wearable devices. The server 104 may be implemented by an independent server or a server cluster composed of multiple servers. The server 104 may obtain multiple initial settlement directories sent by multiple terminals 102. The initial settlement directory includes multiple data to be settled and corresponding multiple fields to be settled. The server 104 further obtains a authority data table, and the authority data table includes authority settlement data and corresponding authority fields. The authority settlement data is set with a settlement attribute and a category. Match the fields to be settled with the canonical settlement fields. If there is a field to be settled whose matching degree reaches a preset threshold, it is determined that the data to be settled corresponding to the field to be settled is standard settlement data, and the settlement attributes and categories set by the standard settlement data corresponding to the standard field are added to the data to be settled. If there is a field to be settled whose matching degree does not reach a preset threshold, it is determined that the data to be settled corresponding to the field to be settled is irregular settlement data, and the settlement data to be decided is generated according to the non-standard settled data. The server 104 inputs the settlement data to be decided into the trained decision model for decision making, outputs the settlement attributes and categories corresponding to the settlement data to be decided, and adds it to the settlement data to be decided. The server 104 generates a project settlement directory according to a corresponding category based on a plurality of settlement data to which settlement attributes and categories are added, and performs project settlement processing according to the project settlement directory.
在一些实施例中,如图2所示,提供了一种结算数据处理方法,以该方法应用于图1中的服务器为例进行说明,包括以下步骤:In some embodiments, as shown in FIG. 2, a method for processing settlement data is provided. The method is applied to the server in FIG. 1 as an example, and includes the following steps:
步骤202,获取终端发送的多个初始结算目录,初始结算目录包括多个待结算数据和对应的多个待结算字段。Step 202: Obtain multiple initial settlement directories sent by the terminal. The initial settlement directory includes multiple data to be settled and corresponding multiple fields to be settled.
其中,初始结算目录可以是各地发布的结算目录,例如社会保险结算目录、医疗保险结算目录和药品结算目录等。待结算数据指待结算项目的结算信息。The initial settlement directory may be a settlement directory released in various places, such as a social insurance settlement directory, a medical insurance settlement directory, and a drug settlement directory. The data to be settled refers to the settlement information of the items to be settled.
服务器可以获取多个终端发送的多个初始结算目录,初始结算目录中包括了多个待结算数据,每条待结算数据包括对应的多个待结算字段。例如,药品结算目录可以包括多个药品结算数据,每个药品结算数据可以包括“通用名”、“剂型”、“规格”以 及“生产厂家”等多个字段信息。The server can obtain multiple initial settlement directories sent by multiple terminals. The initial settlement directory includes multiple data to be settled, and each piece of data to be settled includes corresponding multiple fields to be settled. For example, the drug settlement directory may include multiple drug settlement data, and each drug settlement data may include multiple field information such as "general name", "dosage form", "specification", and "manufacturer".
步骤204,获取规范数据表,规范数据表中包括规范结算数据和对应的多个规范字段,规范结算数据设定有结算属性和类别。Step 204: Obtain a normative data table. The normative data table includes normative settlement data and corresponding normative fields. The normative settlement data is set with settlement attributes and categories.
服务器获取多个初始结算目录后,进一步获取规范数据表。规范数据表可以是根据预设规则预先定义的规范结算数据。还可以是服务器通过获取大量的结算数据后,对大量的结算数据进行大数据分析,并根据分析结果和预设规则定义出的多个规范结算数据。规范数据表中包括了规范结算数据对应的多个规范字段,规范结算数据还包括对应的结算属性和类别。After the server obtains multiple initial settlement directories, it further obtains the specification data table. The authority data table may be authority settlement data defined in advance according to a preset rule. It may also be that after the server obtains a large amount of settlement data, it performs a big data analysis on the large amount of settlement data, and according to the analysis result and a plurality of standardized settlement data defined by preset rules. The authority data table includes a plurality of authority fields corresponding to authority settlement data. The authority settlement data also includes corresponding settlement attributes and categories.
步骤206,将待结算字段与规范数据表中的规范字段进行匹配,计算待结算字段与多个规范字段的匹配度。Step 206: Match the fields to be settled with the specification fields in the specification data table, and calculate the degree of matching between the field to be settled and the plurality of specification fields.
步骤208,若存在匹配度达到预设阈值的待结算字段,确定待结算字段对应的待结算数据为规范结算数据,将规范字段对应的规范结算数据设定的结算属性和类别添加至待结算数据中。In step 208, if there is a field to be settled whose matching degree reaches a preset threshold, it is determined that the data to be settled corresponding to the field to be settled is standardized settlement data, and the settlement attributes and categories set by the standard settlement data corresponding to the specification field are added to the data to be settled. in.
服务器获取规范数据表后,就将待结算数据中的待结算字段与规范结算数据中的规范字段进行匹配,计算待结算字段与多个规范字段的匹配度。当待结算数据中的待结算字段与规范结算数据中的规范字段的匹配度达到预设阈值时,表示该待结算数据为规范的结算数据,则确定与待结算字段对应的待结算数据为规范结算数据,并将规范字段对应的规范结算数据设定的结算属性和类别添加至待结算数据中。由此可以直接识别出待结算数据中规范的结算数据,并添加对应的结算属性和类别。After the server obtains the specification data table, it matches the to-be-settled fields in the to-be-settled data with the specification fields in the specification-settlement data, and calculates the degree of matching between the to-be-settled fields and multiple specification fields. When the degree of matching between the to-be-settled field in the to-be-settled data and the canonical field in the to-be-settled data reaches a preset threshold, it indicates that the to-be-settled data is normative settlement data, and it is determined that the to-be-settled data corresponding to the to-be-settled field is normative. Settle the data, and add the settlement attributes and categories set by the standard settlement data corresponding to the specification field to the data to be settled. In this way, the standardized settlement data in the data to be settled can be directly identified, and corresponding settlement attributes and categories are added.
步骤210,若存在匹配度未达到预设阈值的待结算字段,确定待结算字段对应的待结算数据为不规范结算数据,根据不规范待结算数据生成待决策结算数据,将待决策结算数据输入至已训练的决策模型中进行决策,输出待决策结算数据对应的结算属性和类别,并添加至待决策结算数据中。In step 210, if there is a field to be settled whose matching degree does not reach a preset threshold, it is determined that the data to be settled corresponding to the field to be settled is non-standardized settlement data, and to-be-determined settlement data is generated based on the non-standardized to-be-settled data. Make a decision in the trained decision model, output settlement attributes and categories corresponding to the settlement data to be decided, and add it to the settlement data to be decided.
若存在匹配度未达到预设阈值的待结算字段时,表示该待结算数据为不规范的结算数据,则需要对不规范的结算数据进行进一步地处理。服务器则确定与待结算字段对应的待结算数据为不规范结算数据,并根据不规范待结算数据生成待决策结算数据。具体地,服务器将待结算数据中的待结算字段与规范结算数据中的规范字段进行匹配时,提取出未达到预设匹配度的待结算数据,未达到预设匹配度的待结算数据则为与规范结算数据不一致的待结算数据。If there is a field to be settled whose matching degree does not reach a preset threshold, it indicates that the data to be settled is non-standardized settlement data, and the non-standardized settlement data needs to be further processed. The server determines that the data to be settled corresponding to the field to be settled is irregular settlement data, and generates the settlement data to be decided according to the non-standardized settlement data. Specifically, when the server matches the field to be settled in the data to be settled with the specification field in the standard settlement data, the server extracts data to be settled that does not reach the preset matching degree, and the data to be settled that does not reach the preset matching degree is Data to be settled that are inconsistent with the standard settlement data.
服务器则根据不规范待结算数据生成待决策结算数据后,进而获取已经训练构建完成的决策模型,决策模型中包括多个节点,将待决策数据输入至决策模型中,并按照决策模型的节点顺序进行遍历,直到得到待决策结算数据对应的结算属性和类别。通过根据决策模型对待决策结算数据进行决策,可以准确地决策出待决策结算数据所 对应的结算属性和类别,并将决策出的结算属性和类别添加至待决策结算数据中。The server generates the settlement data for decision-making based on the non-standard data to be settled, and then obtains a decision model that has been trained and constructed. The decision model includes multiple nodes, and the data for decision-making is input into the decision model, and the node order of the decision model is followed. Iterate until the settlement attributes and categories corresponding to the settlement data to be determined are obtained. By making decision on the settlement data to be decided according to the decision model, the settlement attributes and categories corresponding to the settlement data to be decided can be accurately determined, and the settlement attributes and categories decided on are added to the settlement data to be decided.
步骤212,根据多个添加结算属性和类别后的结算数据按照类别生成项目结算目录,使得服务器根据项目结算目录进行项目结算处理。Step 212: Generate a project settlement directory according to the category according to a plurality of settlement data after adding settlement attributes and categories, so that the server performs project settlement processing according to the project settlement directory.
服务器根据决策模型对待决策结算数据策出对应的结算属性和类别后,则根据多个添加结算属性和类别后的结算数据按照类别生成项目结算目录。其中,添加了结算属性和类别后的结算数据包括添加了结算属性和类别的待结算数据,以及添加了结算属性和类别的待决策结算数据。After the server determines the corresponding settlement attribute and category for the decision settlement data according to the decision model, it generates a project settlement directory according to the category based on multiple settlement data after adding settlement attributes and categories. The settlement data after adding the settlement attribute and category includes the pending settlement data with the settlement attribute and category added, and the pending decision settlement data with the settlement attribute and category added.
服务器生成项目结算目录后,形成规范化的项目结算目录,则可以根据项目结算目录对相应的项目进行结算处理。After the server generates the project settlement directory and forms a standardized project settlement directory, the corresponding project can be settled according to the project settlement directory.
上述结算数据处理方法中,服务器获取终端发送的多个初始结算目录,初始结算目录包括多个待结算数据,待结算数据包括对应的多个待结算字段。服务器获取规范数据表,规范数据表中包括规范结算数据对应的多个规范字段,规范结算数据设定有结算属性和类别。服务器将待结算字段与规范数据表中的规范字段进行匹配,计算待结算字段与所述多个规范字段的匹配度。若存在匹配度达到预设阈值的待结算字段,确定待结算字段对应的待结算数据为规范结算数据,将规范字段相对应的规范结算数据设定的结算属性和类别添加至待结算数据中。若存在匹配度未达到预设阈值的,确定待结算字段对应的待结算数据为不规范结算数据,并根据不规范待结算数据生成待决策结算数据。服务器将待决策结算数据输入至已训练的决策模型中进行决策,输出待决策结算数据对应的结算属性和类别,并添加至待决策结算数据中。服务器根据多个添加结算属性和类别后的结算数据按照类别生成项目结算目录,由此可以根据项目结算目录进行项目结算处理。通过对初始结算目录中的结算数据进行匹配和分类,由此能够准确地得到待结算数据对应的结算属性和类别,并形成规范的项目结算目录,进而能够有效提高对结算数据的进行处理效率。In the method for processing settlement data, the server obtains multiple initial settlement directories sent by the terminal. The initial settlement directory includes multiple data to be settled, and the data to be settled includes corresponding multiple fields to be settled. The server obtains the authority data table, and the authority data table includes a plurality of authority fields corresponding to the authority settlement data. The authority settlement data is set with a settlement attribute and a category. The server matches the field to be settled with the specification field in the specification data table, and calculates a degree of matching between the field to be settled and the plurality of specification fields. If there is a field to be settled whose matching degree reaches a preset threshold, it is determined that the data to be settled corresponding to the field to be settled is standard settlement data, and the settlement attributes and categories set by the standard settlement data corresponding to the standard field are added to the data to be settled. If there is a matching degree that does not reach the preset threshold, it is determined that the data to be settled corresponding to the field to be settled is non-standardized settlement data, and the decision-making settlement data is generated based on the non-standardized data to be settled. The server inputs the settlement data to be decided into a trained decision model for decision making, outputs the settlement attributes and categories corresponding to the settlement data to be decided, and adds it to the settlement data to be decided. The server generates a project settlement directory according to the category based on a plurality of settlement data after adding settlement attributes and categories, so that the project settlement processing can be performed according to the project settlement directory. By matching and classifying the settlement data in the initial settlement directory, the settlement attributes and categories corresponding to the data to be settled can be accurately obtained, and a standardized project settlement directory can be formed, which can effectively improve the processing efficiency of settlement data.
在其中一个实施例中,将待结算字段与规范数据表中的规范字段进行匹配包括:获取语义匹配模型,语义匹配模型中包括多个规范结算数据和对应的多个规范字段向量;通过语义匹配模型提取出待结算数据中多个待结算字段对应的字段向量;计算待结算数据对应的字段向量与多个规范字段向量之间的匹配度。In one embodiment, matching the fields to be settled with the canonical fields in the canonical data table includes: obtaining a semantic matching model, which includes multiple canonical settlement data and corresponding multiple canonical field vectors; through semantic matching The model extracts field vectors corresponding to a plurality of fields to be settled in the data to be settled; and calculates a degree of matching between the field vectors corresponding to the data to be settled and a plurality of canonical field vectors.
服务器获取多个初始结算目录,初始结算目录中包括了多个待结算数据,每条待结算数据包括对应的多个待结算字段。例如,药品结算目录可以包括多个药品结算数据,每个药品结算数据可以包括“通用名”、“剂型”、“规格”以及“生产厂家”等多个字段信息。The server obtains multiple initial settlement directories. The initial settlement directory includes multiple pending settlement data, and each piece of pending settlement data includes a corresponding multiple pending settlement fields. For example, the drug settlement directory may include multiple drug settlement data, and each drug settlement data may include multiple field information such as "general name", "dosage form", "specification", and "manufacturer".
服务器获取多个初始结算目录后,进一步获取规范数据表,规范数据表中包括了规范结算数据对应的多个规范字段,规范结算数据设定有结算属性和类别。服务器则 根据官方数据表获取对应的语义匹配模型,语义匹配模型中包括了多个规范结算数据对应的多个规范字段向量。服务器提取出待结算数据中的多个待结算字段对应的字段向量,并将待结算数据对应的字段向量与多个规范结算数据对应的多个规范字段向量进行匹配,计算出待结算数据对应的字段向量与多个规范字段向量之间的匹配度。通过将待结算数据与规范结算数据进行匹配,能够有效地匹配出与规范结算数据一致的待结算数据,由此能够提高对待结算数据进一步处理的效率。After the server obtains a plurality of initial settlement directories, it further obtains a specification data table. The specification data table includes a plurality of specification fields corresponding to the specification settlement data, and the specification settlement data is set with settlement attributes and categories. The server obtains the corresponding semantic matching model according to the official data table. The semantic matching model includes a plurality of canonical field vectors corresponding to a plurality of canonical settlement data. The server extracts the field vectors corresponding to the plurality of fields to be settled in the data to be settled, and matches the field vectors corresponding to the data to be settled with the plurality of standard field vectors corresponding to the plurality of standard settlement data to calculate the corresponding The degree of match between the field vector and multiple canonical field vectors. By matching the data to be settled with the standard settlement data, it is possible to effectively match the data to be settled with the standard settlement data, thereby improving the efficiency of further processing of the data to be settled.
例如,以药品结算目录为例,药品结算目录中包括多个药品数据,每个药品数据包括“通用名”、“剂型”、“规格”以及“生产厂家”等对应的多个待结算字段的内容,提取出多个待结算字段的内容对应的字段向量。服务器获取了相应的规范数据表后,进一步获取对应的语义匹配模型,语义匹配模型中包括了多个规范结算数据对应的多个规范字段向量。服务器将药品数据对应的多个字段向量与多个规范结算数据对应的多个规范字段向量进行匹配,即将药品数据的“通用名”、“剂型”、“规格”以及“生产厂家”等对应的多个待结算字段的内容与多个规范结算数据对应的多个规范字段向量进行匹配。For example, taking the drug settlement directory as an example, the drug settlement directory includes multiple drug data, and each drug data includes the corresponding multiple to-be-cleared fields such as "general name", "dosage form", "specification", and "manufacturer". Content, and extract field vectors corresponding to the contents of multiple fields to be settled. After the server obtains the corresponding canonical data table, it further obtains the corresponding semantic matching model. The semantic matching model includes multiple canonical field vectors corresponding to multiple canonical settlement data. The server matches the multiple field vectors corresponding to the drug data with the multiple standard field vectors corresponding to multiple regulatory settlement data, that is, the corresponding "common name", "dosage form", "specification", and "manufacturer" of the pharmaceutical data. The contents of multiple to-be-settled fields are matched with multiple canonical field vectors corresponding to multiple canonical settlement data.
在其中一个实施例中,如图3所示,决策模型包括多个节点,将待决策结算数据输入至已训练的决策模型中进行决策的步骤具体包括以下内容:In one embodiment, as shown in FIG. 3, the decision model includes a plurality of nodes, and the steps of inputting settlement data to be decided into the trained decision model for decision specifically include the following:
步骤302,利用决策模型提取待决策结算数据对应的多个字段向量。Step 302: Use a decision model to extract multiple field vectors corresponding to the settlement data to be decided.
步骤304,将待决策结算数据对应的多个字段向量按照决策模型的节点顺序进行遍历匹配,计算字段向量与多个节点的匹配度。In step 304, the multiple field vectors corresponding to the settlement data to be decided are traversed and matched according to the node order of the decision model, and the matching degree between the field vector and the multiple nodes is calculated.
步骤306,直到多个字段向量匹配得到对应的目标结算属性和目标类别,通过决策模型输出待决策结算数据对应的结算属性和类别。Step 306: Until a plurality of field vectors are matched to obtain the corresponding target settlement attribute and target category, the decision model outputs the settlement attribute and category corresponding to the settlement data to be decided.
服务器获取多个初始结算目录后,获取规范数据表,初始结算目录包括多个待结算数据,待结算数据包括对应的多个待结算字段。规范数据表中包括规范结算数据对应的多个规范字段,规范结算数据包括对应的结算属性和类别。服务器将待结算字段与规范字段进行匹配,计算待结算字段与多个规范字段的匹配度。若存在匹配度达到预设阈值的待结算字段时,表示待结算字段与规范字段一致,则确定与待结算字段对应的待结算数据为规范结算数据,将规范字段相对应的规范结算数据设定的结算属性和类别添加至待结算数据。After the server obtains multiple initial settlement directories, it obtains a specification data table. The initial settlement directory includes multiple data to be settled, and the data to be settled includes corresponding multiple fields to be settled. The authority data table includes a plurality of authority fields corresponding to authority settlement data, and authority data includes corresponding settlement attributes and categories. The server matches the field to be settled with the specification field, and calculates the matching degree of the field to be settled with multiple specification fields. If there is a field to be settled when the matching degree reaches a preset threshold, it means that the field to be settled is consistent with the specification field, then the data to be settled corresponding to the field to be settled is the standard settlement data, and the standard settlement data corresponding to the specification field is set Added to the pending data.
若存在匹配度未达到预设阈值的待结算字段时,表示待结算字段与规范字段不一致,则确定与待结算字段对应的待结算数据为不规范结算数据,根据不规范待结算数据生成待决策结算数据。服务器进而提取出待决策结算数据对应的多个字段向量,并获取决策模型。决策模型中包括决策树,决策树中预先设置有多个节点。将待决策结算数据对应的多个字段向量输入至决策模型,并按照决策模型中的节点顺序进行遍历 匹配,计算字段向量与多个节点的匹配度。根据匹配度确定每个字段向量的目标结算属性和对应的目标类别。直到多个字段向量匹配得到对应的目标结算属性和目标类别时,服务器则通过根据决策模型输出待决策结算数据对应的结算属性和类别,并将对应的结算属性和类别添加至待决策结算数据中。通过根据决策模型按照节点顺序对待决策结算数据进行决策,能够提高决策的效率,由此能够有效准确地得到待决策结算数据对应的结算属性和类别。If there is a field to be settled that does not reach the preset threshold, it indicates that the field to be settled is inconsistent with the standard field, then the data to be settled corresponding to the field to be settled is non-standardized settlement data, and a pending decision is generated based on the non-standardized settled data Billing data. The server further extracts multiple field vectors corresponding to the settlement data to be decided, and obtains a decision model. The decision model includes a decision tree, and a plurality of nodes are set in the decision tree in advance. Multiple field vectors corresponding to the settlement data to be decided are input to the decision model, and the traversal matching is performed according to the order of the nodes in the decision model to calculate the matching degree of the field vector with multiple nodes. The target settlement attribute and the corresponding target category of each field vector are determined according to the matching degree. Until multiple field vectors are matched to obtain the corresponding target settlement attributes and target categories, the server outputs the settlement attributes and categories corresponding to the settlement data to be decided according to the decision model, and adds the corresponding settlement attributes and categories to the settlement data to be decided . By making decisions on the settlement data according to the decision model in the order of nodes, the efficiency of the decision can be improved, and the settlement attributes and categories corresponding to the settlement data to be determined can be effectively and accurately obtained.
在其中一个实施例中,如图4所示,在所述获取决策模型之前,还包括构建决策模型的步骤,该步骤具体包括以下内容:In one embodiment, as shown in FIG. 4, before the obtaining a decision model, a step of constructing a decision model is further included. The step specifically includes the following content:
步骤402,获取多个数据库中的多个结算数据,结算数据包括多个字段名和对应的字段值。Step 402: Obtain multiple settlement data in multiple databases. The settlement data includes multiple field names and corresponding field values.
步骤404,对结算数据的字段名进行聚类分析,得到每个字段名的优先级参数。Step 404: Perform cluster analysis on the field names of the settlement data to obtain priority parameters for each field name.
步骤406,根据字段名的优先级参数计算多个字段名的权重。Step 406: Calculate the weights of multiple field names according to the priority parameters of the field names.
步骤408,根据多个结算数据的字段名和对应的字段值训练出多个结算数据与结算属性和对应类别之间的关联关系。Step 408: Train the association relationship between the multiple settlement data and the settlement attribute and the corresponding category according to the field names and corresponding field values of the multiple settlement data.
步骤410,根据多个字段名的权重和训练后的结算数据与结算属性以及类别之间的关联关系构建决策树,并根据决策树生成决策模型。Step 410: Construct a decision tree according to the weights of multiple field names and the correlation between the training settlement data, settlement attributes, and categories, and generate a decision model according to the decision tree.
服务器在获取决策模型之前,需要预先构建决策模型。具体地,服务器可以从多个网站的数据库中获取多个结算数据,获取的结算数据可以是各地或各机构发布的初始结算目录。Before the server obtains the decision model, it needs to build a decision model in advance. Specifically, the server may obtain multiple settlement data from databases of multiple websites, and the obtained settlement data may be an initial settlement directory issued by each place or institution.
结算数据中包括多个字段名和对应的字段值。服务器首先对结算数据中的多个字段名进行聚类分析。具体地,服务器通过对结算数据中的多个字段名进行大数据分析,分析出每个字段名在结算领域中使用的概率,使用比较频繁的字段名的重要性更高,进而根据字段名的概率得到每个字段名的优先级参数。服务器进而根据每个字段名的优先级概率对字段名进行排序。通过对字段名进行优先级排序后,计算出多个字段名的权重,可以有效地提高对结算数据进行决策的效率。The settlement data includes multiple field names and corresponding field values. The server first performs cluster analysis on multiple field names in the settlement data. Specifically, the server analyzes multiple field names in the settlement data to analyze the probability of each field name being used in the settlement field. The more frequently used field names are more important, and according to the field names, Probability gets the priority parameter for each field name. The server further sorts the field names according to the priority probability of each field name. By prioritizing the field names and calculating the weights of multiple field names, the efficiency of decision-making on settlement data can be effectively improved.
服务器进一步根据多个结算数据的字段名和对应的字段值训练出多个结算数据与结算属性和类别之间的关联关系。具体地,服务器可以将每个结算数据作为一个样本,将每个结算数据中的字段名和对应的字段值作为一个维度,每个结算数据则有多个维度。服务器进一步对每个结算数据的每个维度进行聚类分析。服务器可以采用聚类分析算法,例如K-means算法,将每个结算数据的多个维度作为数据对象依次对多个样本进行迭代计算,计算出每个结算数据对应的聚类结果。例如,服务器可以获取结算数据中多个字段名和对应的字段值所对应的多个维度变量,进而对多个结算数据样本进行聚类分析,由此可以分析出结算数据与结算属性和类别之间的关联关系。The server further trains association relationships between the multiple settlement data and settlement attributes and categories according to the field names and corresponding field values of the multiple settlement data. Specifically, the server may take each settlement data as a sample, take the field name and corresponding field value in each settlement data as one dimension, and each settlement data has multiple dimensions. The server further performs cluster analysis on each dimension of each settlement data. The server can use a cluster analysis algorithm, such as the K-means algorithm, and iteratively calculates multiple samples using multiple dimensions of each settlement data as data objects in order to calculate the clustering result corresponding to each settlement data. For example, the server can obtain multiple dimensional variables corresponding to multiple field names and corresponding field values in the settlement data, and then perform cluster analysis on multiple settlement data samples, thereby analyzing the relationship between settlement data and settlement attributes and categories. Relationship.
进而服务器则可以根据多个字段名的权重和训练后的结算数据与结算属性以及类别之间的关联关系构建决策树,并根据决策树生成决策模型。通过根据优先级排序后的字段名训练后的结算数据与结算属性以及类别之间的关联关系构建决策模型,可以有效提高根据决策模型进行决策的效率和准确率。Furthermore, the server can construct a decision tree based on the weights of multiple field names and the association between the training settlement data and settlement attributes and categories, and generate a decision model based on the decision tree. By constructing a decision model based on the relationship between the training data of the sorted field names and the settlement attributes and categories, it is possible to effectively improve the efficiency and accuracy of decision-making based on the decision model.
在其中一个实施例中,该方法还包括:获取多个更新结算数据,更新结算数据设定有结算属性和类别;对多个更新结算数据进行聚类分析,得到多个字段名的更新权重和更新关联关系参数;根据更新权重和更新关联关系参数调节决策模型的参数,以对决策模型进行优化。In one embodiment, the method further includes: obtaining a plurality of update settlement data, the update settlement data is set with a settlement attribute and a category; performing cluster analysis on the plurality of update settlement data to obtain the update weights and Update the association relationship parameters; adjust the parameters of the decision model according to the update weight and the update association parameter to optimize the decision model.
随着各种因素的变化,决策模型也需要随着各种因素的变化和时间的推移进行调整,以提高模型的稳定性。服务器根据获取的多个结算数据的字段名的优先级概率和训练出结算数据与结算属性以及类别之间的关联关系构建决策模型后,还可以进一步对决策模型进行优化,使得决策模型的准确率更高。With the changes of various factors, the decision model also needs to be adjusted with the changes of various factors and time to improve the stability of the model. After the server constructs a decision model according to the priority probability of the field names of the plurality of settlement data and the training relationship between the settlement data and the settlement attributes and categories, it can further optimize the decision model to make the decision model accurate. higher.
服务器可以从数据库中获取预设时间段内的更新结算数据,预设时间可以是一年,也可以是半年,一个季度或一个月等。更新结算数据可以是对结算数据的相关规则进行更新后的结算数据,也可以是按照预设规则定义的更新的规范结算数据。更新结算数据中设定有结算属性和类别,服务器进一步通过对多个更新结算数据进行聚类分析,得到多个字段名的更新权重和更新关联关系参数。服务器进而根据更新权重和更新关联关系参数调节决策模型的相关承诺书,以对决策模型进行优化,可以有效地保证决策模型的稳定新,进而能够有效提高根据决策模型对结算数据息进行分类的准确性。The server may obtain the updated settlement data within a preset time period from the database. The preset time may be one year, or half a year, one quarter, or one month. The updated settlement data may be updated settlement data after updating relevant rules of the settlement data, or may be updated normative settlement data defined in accordance with a preset rule. Settlement attributes and categories are set in the update settlement data. The server further performs cluster analysis on multiple update settlement data to obtain update weights and update association parameters for multiple field names. The server further adjusts the relevant commitments of the decision model according to the update weight and the update relationship parameter to optimize the decision model, which can effectively ensure the stability and newness of the decision model, and can effectively improve the accuracy of classification of settlement data according to the decision model. Sex.
在其中一个实施例中,类别包括多个层级类别,在根据多个添加结算属性和类别后的结算数据按照类别生成项目结算目录之后,还包括:根据结算数据的结算属性和多个层级类别对结算数据进行排序;按照预设方式对多个层级类别的结算数据进行编码;对编码后的结算目录进行存储。In one of the embodiments, the category includes multiple hierarchical categories. After generating the project settlement directory according to the category based on the multiple settlement attributes and category-added settlement data, the method further includes: according to the settlement attributes of the settlement data and multiple hierarchical category pairs. Sorting settlement data; encoding settlement data of multiple hierarchical categories according to a preset manner; storing the encoded settlement directory.
服务器根据多个添加结算属性和类别后的结算数据按照类别生成项目结算目录之后,根据结算数据的结算属性和类别对结算数据进行排序。结算数据的类别还可能包含多个层级类别,进而服务器将项目结算目录中的结算数据根据对应的类别或层级类别进行排序。服务器对项目结算目录进行排序后,按照预设方式对各个层级类别的结算数据进行编码,并对编码后的项目结算目录进行存储。其中,编码字符至少包括两种。编码字符可以是字母加数字,例如ABS10001、ABS100012。再将每个层级类别下的结算数据按照对应的编码字符按序添加子字符。例如ABS10001001、ABS10001002。通过对细化分类后的项目结算目录数据进行编码,有效地提高了结算数据的管理效率。After the server generates the project settlement directory according to the category based on the multiple settlement data after adding the settlement attribute and category, the server sorts the settlement data according to the settlement attribute and category of the settlement data. The category of settlement data may also include multiple hierarchical categories, and the server sorts the settlement data in the project settlement directory according to the corresponding category or hierarchical category. After the server sorts the project settlement directory, it encodes the settlement data of each hierarchical category according to a preset method, and stores the encoded project settlement directory. Among them, the coded characters include at least two types. Coded characters can be letters and numbers, such as ABS10001, ABS100012. Sub-characters are added in sequence to the settlement data under each hierarchical category in accordance with the corresponding encoded characters. For example ABS10001001, ABS10001002. By encoding the detailed classified item settlement catalog data, the management efficiency of settlement data is effectively improved.
应该理解的是,虽然图2-4的流程图中的各个步骤按照箭头的指示依次显示,但 是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,这些步骤可以以其它的顺序执行。而且,图2-4中的至少一部分步骤可以包括多个子步骤或者多个阶段,这些子步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,这些子步骤或者阶段的执行顺序也不必然是依次进行,而是可以与其它步骤或者其它步骤的子步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that although the steps in the flowcharts of FIGS. 2-4 are sequentially displayed in accordance with the directions of the arrows, these steps are not necessarily performed in the order indicated by the arrows. Unless explicitly stated in this document, the execution of these steps is not strictly limited, and these steps can be performed in other orders. Moreover, at least a part of the steps in Figure 2-4 may include multiple sub-steps or multiple stages. These sub-steps or stages are not necessarily performed at the same time, but may be performed at different times. These sub-steps or stages The execution order of is not necessarily performed sequentially, but may be performed in turn or alternately with at least a part of another step or a sub-step or stage of another step.
在一个实施例中,如图5所示,提供了一种结算数据处理装置,包括:获取模块502、匹配模块504、决策模块506和目录生成模块508,其中:In one embodiment, as shown in FIG. 5, a settlement data processing device is provided, which includes: an acquisition module 502, a matching module 504, a decision module 506, and a directory generation module 508, where:
获取模块502,用于获取终端发送的多个初始结算目录,初始结算目录包括多个待结算数据和对应的多个待结算字段;获取结算数据表,结算数据表中包括规范结算数据和对应的多个规范字段,规范结算数据设定有结算属性和类别;The obtaining module 502 is configured to obtain multiple initial settlement directories sent by the terminal. The initial settlement directory includes multiple pending settlement data and corresponding multiple pending settlement fields; and obtains a settlement data table, where the settlement data table includes standardized settlement data and corresponding Multiple specification fields. Settlement attributes and categories are set in the standard settlement data;
匹配模块504,用于将待结算字段与规范数据表中的规范字段进行匹配,计算待结算字段与多个规范字段的匹配度;若存在匹配度达到预设阈值的待结算字段,确定待结算字段对应的待结算数据为规范结算数据,将规范字段相对应的规范结算数据设定的结算属性和类别添加至待结算数据中;A matching module 504 is configured to match a field to be settled with a standard field in a specification data table, and calculate a degree of matching between the field to be settled and a plurality of standard fields; if there is a field to be settled with a matching degree that reaches a preset threshold, determine a field to be settled The data to be settled corresponding to the field is standard settlement data, and the settlement attributes and categories set by the standard settlement data corresponding to the standard field are added to the data to be settled;
决策模块506,用于若存在匹配度未达到预设阈值的待结算字段,确定待结算字段对应的待结算数据为不规范结算数据,根据不规范待结算数据生成待决策结算数据,将待决策结算数据输入至已训练的决策模型中进行决策,输出待决策结算数据对应的结算属性和类别,并添加至待决策结算数据中;A decision module 506, configured to determine that the pending data corresponding to the pending settlement field is non-standard settlement data if there is a pending field that does not reach a preset threshold, and generate pending decision settlement data based on the non-standard pending settlement data, and Settlement data is input into the trained decision model for decision-making, output settlement attributes and categories corresponding to the settlement data to be decided, and added to the settlement data to be decided;
目录生成模块508,用于根据多个添加结算属性和类别后的结算数据按照类别生成项目结算目录,使得服务器根据项目结算目录进行项目结算处理。The directory generation module 508 is configured to generate a project settlement directory according to categories based on a plurality of settlement data after adding settlement attributes and categories, so that the server performs project settlement processing according to the project settlement directory.
在其中一个实施例中,匹配模块504还用于获取语义匹配模型,语义匹配模型中包括多个规范结算数据和对应的多个规范字段向量;通过语义匹配模型提取出待结算数据中多个待结算字段对应的字段向量;及计算待结算数据对应的字段向量与多个规范字段向量之间的匹配度。In one embodiment, the matching module 504 is further configured to obtain a semantic matching model. The semantic matching model includes multiple canonical settlement data and corresponding multiple canonical field vectors. A plurality of to-be-settled data in the to-be-settled data are extracted through the semantic matching model. A field vector corresponding to the settlement field; and calculating a matching degree between the field vector corresponding to the data to be settled and a plurality of canonical field vectors.
在其中一个实施例中,决策模型包括多个节点,决策模块506还用于利用决策模型提取待决策结算数据对应的多个字段向量;将待决策结算数据对应的多个字段向量按照决策模型的节点顺序进行遍历匹配,计算字段向量与多个节点的匹配度;及直到多个字段向量匹配得到对应的目标结算属性和目标类别,通过决策模型输出待决策结算数据对应的结算属性和类别。In one embodiment, the decision model includes multiple nodes, and the decision module 506 is further configured to use the decision model to extract multiple field vectors corresponding to the settlement data to be decided; Nodes are traversed and matched sequentially to calculate the matching degree between the field vector and multiple nodes; and until the multiple field vectors are matched to obtain the corresponding target settlement attributes and target categories, the decision model outputs the settlement attributes and categories corresponding to the settlement data to be decided.
在其中一个实施例中,该装置还包括模型构建模块,用于获取多个数据库中的多个结算数据,结算数据包括多个字段名和对应的字段值;对结算数据的字段名进行聚 类分析,得到每个字段名的优先级参数;根据字段名的优先级参数计算多个字段名的权重;根据多个结算数据的字段名和对应的字段值训练出多个结算数据与结算属性和对应类别之间的关联关系;及根据多个字段名的权重和训练后的结算数据与结算属性以及类别之间的关联关系构建决策树,并根据决策树生成决策模型。In one of the embodiments, the device further includes a model building module for obtaining multiple settlement data in multiple databases, the settlement data includes multiple field names and corresponding field values; and performs cluster analysis on the field names of the settlement data To obtain the priority parameters of each field name; calculate the weight of multiple field names according to the priority parameters of the field names; train multiple settlement data and settlement attributes and corresponding categories based on the field names and corresponding field values of multiple settlement data The association relationship between them; and constructing a decision tree based on the weights of multiple field names and the association relationship between the training settlement data and settlement attributes and categories, and generating a decision model based on the decision tree.
在其中一个实施例中,该装置还包括模型优化模块,用于获取获取多个更新结算数据,更新结算数据设定有结算属性和类别;对多个更新结算数据进行聚类分析,得到多个字段名的更新权重和更新关联关系参数;及根据更新权重和更新关联关系参数调节决策模型的参数,以对决策模型进行优化。In one of the embodiments, the device further includes a model optimization module for obtaining and acquiring multiple updated settlement data, the updated settlement data is set with a settlement attribute and category; cluster analysis is performed on the multiple updated settlement data to obtain multiple Update weights of field names and update association relationship parameters; and adjust parameters of the decision model according to the update weights and update association relationship parameters to optimize the decision model.
在其中一个实施例中,类别包括多个层级类别,目录生成模块508还用于根据结算数据的结算属性和多个层级类别对结算数据进行排序;按照预设方式对多个层级类别的结算数据进行编码;及对编码后的结算目录进行存储。In one embodiment, the category includes multiple hierarchical categories, and the directory generation module 508 is further configured to sort the settlement data according to the settlement attributes and multiple hierarchical categories of the settlement data; and to settle the settlement data of the multiple hierarchical categories in a preset manner Encode; and store the encoded settlement directory.
关于结算数据处理装置的具体限定可以参见上文中对于结算数据处理方法的限定,在此不再赘述。上述结算数据处理装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中,也可以以软件形式存储于计算机设备中的存储器中,以便于处理器调用执行以上各个模块对应的操作。For the specific limitation of the settlement data processing device, refer to the foregoing limitation on the settlement data processing method, which is not repeated here. Each module in the above-mentioned settlement data processing device may be implemented in whole or in part by software, hardware, and a combination thereof. The above-mentioned modules may be embedded in the hardware form or independent of the processor in the computer device, or may be stored in the memory of the computer device in the form of software, so that the processor calls and performs the operations corresponding to the above modules.
在一个实施例中,提供了一种计算机设备,该计算机设备可以是服务器,其内部结构图可以如图6所示。该计算机设备包括通过系统总线连接的处理器、存储器、网络接口和数据库。其中,该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统、计算机可读指令和数据库。该内存储器为非易失性存储介质中的操作系统和计算机可读指令的运行提供环境。该计算机设备的数据库用于存储初始结算数据目录、规范结算数据表和项目结算数据目录等。该计算机设备的网络接口用于与外部的终端通过网络连接通信。该计算机可读指令被处理器执行时以实现一种结算数据处理方法。In one embodiment, a computer device is provided. The computer device may be a server, and its internal structure diagram may be as shown in FIG. 6. The computer device includes a processor, a memory, a network interface, and a database connected through a system bus. The processor of the computer device is used to provide computing and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, computer-readable instructions, and a database. The internal memory provides an environment for operating the operating system and computer-readable instructions in a non-volatile storage medium. The computer equipment database is used to store the initial settlement data directory, the standard settlement data table, and the project settlement data directory. The network interface of the computer device is used to communicate with an external terminal through a network connection. The computer-readable instructions are executed by a processor to implement a settlement data processing method.
本领域技术人员可以理解,图6中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。Those skilled in the art can understand that the structure shown in FIG. 6 is only a block diagram of a part of the structure related to the scheme of the present application, and does not constitute a limitation on the computer equipment to which the scheme of the present application is applied. The specific computer equipment may be Include more or fewer parts than shown in the figure, or combine certain parts, or have a different arrangement of parts.
一种计算机设备,包括存储器和一个或多个处理器,存储器中存储有计算机可读指令,计算机可读指令被处理器执行时,使得一个或多个处理器执行以下步骤:A computer device includes a memory and one or more processors. Computer-readable instructions are stored in the memory. When the computer-readable instructions are executed by the processor, the one or more processors execute the following steps:
获取终端发送的多个初始结算目录,初始结算目录包括多个待结算数据和对应的多个待结算字段;Obtaining multiple initial settlement directories sent by the terminal, the initial settlement directory includes multiple data to be settled and corresponding multiple fields to be settled;
获取规范数据表,规范数据表中包括规范结算数据和对应的多个规范字段,规范结算数据设定有结算属性和类别;Obtaining the authority data table, which includes authority settlement data and corresponding authority fields, and authority properties are set in the authority settlement data;
将待结算字段与规范数据表中的规范字段进行匹配,计算待结算字段与多个规范字段的匹配度;Match the field to be settled with the specification field in the specification data table, and calculate the degree of matching between the field to be settled and multiple specification fields;
若存在匹配度达到预设阈值的待结算字段,确定待结算字段对应的待结算数据为规范结算数据,将规范字段对应的规范结算数据设定的结算属性和类别添加至待结算数据中;If there is a field to be settled whose matching degree reaches a preset threshold, determine that the data to be settled corresponding to the field to be settled is standard settlement data, and add the settlement attributes and categories set by the standard settlement data corresponding to the standard field to the data to be settled;
若存在匹配度未达到预设阈值的待结算字段,确定待结算字段对应的待结算数据为不规范结算数据,根据不规范待结算数据生成待决策结算数据,将待决策结算数据输入至已训练的决策模型中进行决策,输出待决策结算数据对应的结算属性和类别,并添加至待决策结算数据中;及If there is a field to be settled that does not reach a preset threshold, determine that the data to be settled corresponding to the field to be settled is non-standardized settlement data, generate settlement data for decision-making based on the non-standardized settlement data, and enter the settlement data for decision-making into the trained Make a decision in the decision model of the company, output the settlement attributes and categories corresponding to the settlement data to be decided, and add it to the settlement data to be decided; and
根据多个添加结算属性和类别后的结算数据按照类别生成项目结算目录,使得服务器根据项目结算目录进行项目结算处理。Generate a project settlement directory according to the category based on multiple settlement data after adding settlement attributes and categories, so that the server performs project settlement processing according to the project settlement directory.
在其中一个实施例中,处理器执行计算机可读指令时还实现以下步骤:获取语义匹配模型,语义匹配模型中包括多个规范结算数据和对应的多个规范字段向量;通过语义匹配模型提取出待结算数据中多个待结算字段对应的字段向量;及计算待结算数据对应的字段向量与多个规范字段向量之间的匹配度。In one of the embodiments, when the processor executes the computer-readable instructions, the processor further implements the following steps: obtaining a semantic matching model, the semantic matching model including multiple canonical settlement data and corresponding multiple canonical field vectors; and extracting the data through the semantic matching model Field vectors corresponding to a plurality of fields to be settled in the data to be settled; and a degree of matching between a field vector corresponding to the data to be settled and a plurality of canonical field vectors.
在其中一个实施例中,处理器执行计算机可读指令时还实现以下步骤:利用决策模型提取待决策结算数据对应的多个字段向量;将待决策结算数据对应的多个字段向量按照决策模型的节点顺序进行遍历匹配,计算字段向量与多个节点的匹配度;及直到多个字段向量匹配得到对应的目标结算属性和目标类别,通过决策模型输出待决策结算数据对应的结算属性和类别。In one embodiment, when the processor executes the computer-readable instructions, the processor further implements the following steps: using a decision model to extract multiple field vectors corresponding to the settlement data to be decided; and dividing the multiple field vectors corresponding to the settlement data to be decided according to the decision model. Nodes are traversed and matched sequentially to calculate the matching degree between the field vector and multiple nodes; and until the multiple field vectors are matched to obtain the corresponding target settlement attributes and target categories, the decision model outputs the settlement attributes and categories corresponding to the settlement data to be decided.
在其中一个实施例中,处理器执行计算机可读指令时还实现以下步骤:获取多个数据库中的多个结算数据,结算数据包括多个字段名和对应的字段值;对结算数据的字段名进行聚类分析,得到每个字段名的优先级参数;根据字段名的优先级参数计算多个字段名的权重;根据多个结算数据的字段名和对应的字段值训练出多个结算数据与结算属性和对应类别之间的关联关系;及根据多个字段名的权重和训练后的结算数据与结算属性以及类别之间的关联关系构建决策树,并根据决策树生成决策模型。In one of the embodiments, when the processor executes the computer-readable instructions, the processor further implements the following steps: obtaining multiple settlement data in multiple databases, the settlement data including multiple field names and corresponding field values; performing field names on the settlement data Cluster analysis to obtain the priority parameters of each field name; calculate the weight of multiple field names based on the priority parameters of the field names; train multiple settlement data and settlement attributes based on the field names and corresponding field values of multiple settlement data Association with corresponding categories; and constructing a decision tree based on the weights of multiple field names and the association between training data and settlement attributes and categories, and generating a decision model based on the decision tree.
在其中一个实施例中,处理器执行计算机可读指令时还实现以下步骤:获取多个更新结算数据,更新结算数据设定有结算属性和类别;对多个更新结算数据进行聚类分析,得到多个字段名的更新权重和更新关联关系参数;及根据更新权重和更新关联关系参数调节决策模型的参数,以对决策模型进行优化。In one of the embodiments, when the processor executes the computer-readable instructions, the processor further implements the following steps: acquiring multiple updated settlement data, the updated settlement data is set with a settlement attribute and category; and performing cluster analysis on the multiple updated settlement data to obtain Update weights and update association relationship parameters of multiple field names; and adjust parameters of the decision model according to the update weights and update association relationship parameters to optimize the decision model.
在其中一个实施例中,类别包括多个层级类别,处理器执行计算机可读指令时还实 现以下步骤:根据结算数据的结算属性和多个层级类别对结算数据进行排序;按照预设方式对多个层级类别的结算数据进行编码;及对编码后的结算目录进行存储。In one embodiment, the category includes multiple hierarchical categories. When the processor executes the computer-readable instructions, the following steps are further implemented: sorting the settlement data according to the settlement attributes of the settlement data and multiple hierarchical categories; Encoding settlement data of each hierarchical category; and storing the encoded settlement directory.
一个或多个存储有计算机可读指令的非易失性计算机可读存储介质,计算机可读指令被一个或多个处理器执行时,使得一个或多个处理器执行以下步骤:One or more non-volatile computer-readable storage media storing computer-readable instructions. When the computer-readable instructions are executed by one or more processors, the one or more processors execute the following steps:
获取终端发送的多个初始结算目录,初始结算目录包括多个待结算数据和对应的多个待结算字段;Obtaining multiple initial settlement directories sent by the terminal, the initial settlement directory includes multiple data to be settled and corresponding multiple fields to be settled;
获取规范数据表,规范数据表中包括规范结算数据和对应的多个规范字段,规范结算数据设定有结算属性和类别;Obtaining the authority data table, which includes authority settlement data and corresponding authority fields, and authority properties are set in the authority settlement data;
将待结算字段与规范数据表中的规范字段进行匹配,计算待结算字段与多个规范字段的匹配度;Match the field to be settled with the specification field in the specification data table, and calculate the degree of matching between the field to be settled and multiple specification fields;
若存在匹配度达到预设阈值的待结算字段,确定待结算字段对应的待结算数据为规范结算数据,将规范字段对应的规范结算数据设定的结算属性和类别添加至待结算数据中;If there is a field to be settled whose matching degree reaches a preset threshold, determine that the data to be settled corresponding to the field to be settled is standard settlement data, and add the settlement attributes and categories set by the standard settlement data corresponding to the standard field to the data to be settled;
若存在匹配度未达到预设阈值的待结算字段,确定待结算字段对应的待结算数据为不规范结算数据,根据不规范待结算数据生成待决策结算数据,将待决策结算数据输入至已训练的决策模型中进行决策,输出待决策结算数据对应的结算属性和类别,并添加至待决策结算数据中;及If there is a field to be settled that does not reach a preset threshold, determine that the data to be settled corresponding to the field to be settled is non-standardized settlement data, generate settlement data for decision-making based on the non-standardized settlement data, and enter the settlement data for decision-making into the trained Make a decision in the decision model of the company, output the settlement attributes and categories corresponding to the settlement data to be decided, and add it to the settlement data to be decided; and
根据多个添加结算属性和类别后的结算数据按照类别生成项目结算目录,使得服务器根据项目结算目录进行项目结算处理。Generate a project settlement directory according to the category based on multiple settlement data after adding settlement attributes and categories, so that the server performs project settlement processing according to the project settlement directory.
在其中一个实施例中,计算机可读指令被处理器执行时还实现以下步骤:获取语义匹配模型,语义匹配模型中包括多个规范结算数据和对应的多个规范字段向量;通过语义匹配模型提取出待结算数据中多个待结算字段对应的字段向量;及计算待结算数据对应的字段向量与多个规范字段向量之间的匹配度。In one of the embodiments, when the computer-readable instructions are executed by the processor, the following steps are further implemented: obtaining a semantic matching model, the semantic matching model including multiple canonical settlement data and corresponding multiple canonical field vectors; and extracting through the semantic matching model A field vector corresponding to a plurality of fields to be settled in the data to be settled is calculated; and a matching degree between a field vector corresponding to the data to be settled and a plurality of canonical field vectors is calculated.
在其中一个实施例中,计算机可读指令被处理器执行时还实现以下步骤:利用决策模型提取待决策结算数据对应的多个字段向量;将待决策结算数据对应的多个字段向量按照决策模型的节点顺序进行遍历匹配,计算字段向量与多个节点的匹配度;及直到多个字段向量匹配得到对应的目标结算属性和目标类别,通过决策模型输出待决策结算数据对应的结算属性和类别。In one embodiment, when the computer-readable instructions are executed by the processor, the following steps are further implemented: using a decision model to extract multiple field vectors corresponding to the settlement data to be decided; and using the multiple field vectors corresponding to the settlement data to be decided according to the decision model The nodes are traversed and matched sequentially to calculate the matching degree between the field vector and multiple nodes; and until the multiple field vectors are matched to obtain the corresponding target settlement attributes and target categories, the settlement attributes and categories corresponding to the settlement data to be decided are output through the decision model.
在其中一个实施例中,计算机可读指令被处理器执行时还实现以下步骤:获取多个数据库中的多个结算数据,结算数据包括多个字段名和对应的字段值;对结算数据的字段名进行聚类分析,得到每个字段名的优先级参数;根据字段名的优先级参数计算多个字段名的权重;根据多个结算数据的字段名和对应的字段值训练出多个结算数 据与结算属性和对应类别之间的关联关系;及根据多个字段名的权重和训练后的结算数据与结算属性以及类别之间的关联关系构建决策树,并根据决策树生成决策模型。In one embodiment, when the computer-readable instructions are executed by the processor, the following steps are further implemented: obtaining multiple settlement data in multiple databases, the settlement data including multiple field names and corresponding field values; and field names of the settlement data Perform cluster analysis to obtain the priority parameters of each field name; calculate the weight of multiple field names based on the priority parameters of the field names; train multiple settlement data and settlements based on the field names and corresponding field values of multiple settlement data Association between attributes and corresponding categories; and constructing a decision tree based on the weights of multiple field names and the association between training data and settlement attributes and categories, and generating a decision model based on the decision tree.
在其中一个实施例中,计算机可读指令被处理器执行时还实现以下步骤:获取多个更新结算数据,更新结算数据设定有结算属性和类别;对多个更新结算数据进行聚类分析,得到多个字段名的更新权重和更新关联关系参数;及根据更新权重和更新关联关系参数调节决策模型的参数,以对决策模型进行优化。In one of the embodiments, when the computer-readable instructions are executed by the processor, the following steps are also implemented: obtaining multiple updated settlement data, the updated settlement data is set with a settlement attribute and category; performing cluster analysis on the multiple updated settlement data, Obtain the update weights and update association relationship parameters of multiple field names; and adjust the parameters of the decision model according to the update weights and update association relationship parameters to optimize the decision model.
在其中一个实施例中,类别包括多个层级类别,计算机可读指令被处理器执行时还实现以下步骤:根据结算数据的结算属性和多个层级类别对结算数据进行排序;按照预设方式对多个层级类别的结算数据进行编码;及对编码后的结算目录进行存储。In one of the embodiments, the category includes multiple hierarchical categories, and when the computer-readable instructions are executed by the processor, the following steps are further implemented: sorting the settlement data according to the settlement attributes of the settlement data and multiple hierarchical categories; Encoding settlement data for multiple hierarchical categories; and storing the encoded settlement directory.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机可读指令来指令相关的硬件来完成,所述的计算机可读指令可存储于一非易失性计算机可读取存储介质中,该计算机可读指令在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双数据率SDRAM(DDRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink)DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented by computer-readable instructions to instruct related hardware. The computer-readable instructions can be stored in a non-volatile computer. In the readable storage medium, the computer-readable instructions, when executed, may include the processes of the embodiments of the methods described above. Wherein, any reference to the memory, storage, database or other media used in the embodiments provided in this application may include non-volatile and / or volatile memory. Non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory can include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in various forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), dual data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Synchlink DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
以上实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above embodiments can be arbitrarily combined. In order to make the description concise, all possible combinations of the technical features in the above embodiments have not been described. However, as long as there is no contradiction in the combination of these technical features, they should be It is considered to be the range described in this specification.
以上所述实施例仅表达了本申请的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本申请构思的前提下,还可以做出若干变形和改进,这些都属于本申请的保护范围。因此,本申请专利的保护范围应以所附权利要求为准。The above-mentioned embodiments only express several implementation manners of the present application, and their descriptions are more specific and detailed, but they cannot be understood as limiting the scope of the invention patent. It should be noted that, for those of ordinary skill in the art, without departing from the concept of the present application, several modifications and improvements can be made, which all belong to the protection scope of the present application. Therefore, the protection scope of this application patent shall be subject to the appended claims.

Claims (20)

  1. 一种结算数据处理方法,包括:A method for processing settlement data, including:
    获取终端发送的多个初始结算目录,所述初始结算目录包括多个待结算数据和对应的多个待结算字段;Acquiring multiple initial settlement directories sent by the terminal, where the initial settlement directory includes multiple data to be settled and corresponding multiple fields to be settled;
    获取规范数据表,所述规范数据表中包括规范结算数据和对应的多个规范字段,所述规范结算数据设定有结算属性和类别;Obtaining a authority data table, where the authority data table includes authority settlement data and corresponding authority fields, and the authority settlement data is set with settlement attributes and categories;
    将所述待结算字段与所述规范数据表中的规范字段进行匹配,计算所述待结算字段与所述多个规范字段的匹配度;Matching the field to be settled with a specification field in the specification data table, and calculating a degree of matching between the field to be settled and the plurality of specification fields;
    若存在所述匹配度达到预设阈值的待结算字段,确定所述待结算字段对应的待结算数据为规范结算数据,将所述规范字段对应的规范结算数据设定的结算属性和类别添加至所述待结算数据中;If there is a field to be settled where the matching degree reaches a preset threshold, determine that the data to be settled corresponding to the field to be settled is standard settlement data, and add the settlement attributes and categories set by the standard settlement data corresponding to the standard field to In the data to be settled;
    若存在所述匹配度未达到预设阈值的待结算字段,确定所述待结算字段对应的待结算数据为不规范结算数据,根据不规范待结算数据生成待决策结算数据,将所述待决策结算数据输入至已训练的决策模型中进行决策,输出所述待决策结算数据对应的结算属性和类别,并添加至所述待决策结算数据中;及If there is a field to be settled where the matching degree does not reach a preset threshold, it is determined that the data to be settled corresponding to the field to be settled is irregular settlement data, and to-be-determined settlement data is generated based on the non-standard to-be-settled data, and Settlement data is input into a trained decision model for decision making, outputting settlement attributes and categories corresponding to the pending decision settlement data, and added to the pending decision settlement data; and
    根据多个添加结算属性和类别后的结算数据按照所述类别生成项目结算目录,使得服务器根据所述项目结算目录进行项目结算处理。Generate a project settlement directory according to the category based on multiple settlement data after adding settlement attributes and categories, so that the server performs project settlement processing according to the project settlement directory.
  2. 根据权利要求1所述的方法,其特征在于,所述将所述待结算字段与所述规范数据表中的规范字段进行匹配包括:The method according to claim 1, wherein the matching the field to be settled with a specification field in the specification data table comprises:
    获取语义匹配模型,所述语义匹配模型中包括多个规范结算数据和对应的多个规范字段向量;Obtaining a semantic matching model, where the semantic matching model includes multiple canonical settlement data and corresponding multiple canonical field vectors;
    通过所述语义匹配模型提取出所述待结算数据中多个待结算字段对应的字段向量;及Extracting a field vector corresponding to a plurality of fields to be settled in the data to be settled through the semantic matching model; and
    计算所述待结算数据对应的字段向量与多个规范字段向量之间的匹配度。Calculate a matching degree between a field vector corresponding to the data to be settled and a plurality of canonical field vectors.
  3. 根据权利要求1所述的方法,其特征在于,所述决策模型包括多个节点,所述将所述待决策结算数据输入至已训练的决策模型中进行决策的步骤,包括:The method according to claim 1, wherein the decision model includes a plurality of nodes, and the step of inputting the settlement data to be decided into a trained decision model for decision making comprises:
    利用所述决策模型提取所述待决策结算数据对应的多个字段向量;Using the decision model to extract a plurality of field vectors corresponding to the settlement data to be decided;
    将所述待决策结算数据对应的多个字段向量按照所述决策模型的节点顺序进行遍历匹配,计算字段向量与多个节点的匹配度;及Performing traversal matching on the plurality of field vectors corresponding to the settlement data to be decided according to the node order of the decision model, and calculating the degree of matching between the field vector and the plurality of nodes; and
    直到所述多个字段向量匹配得到对应的目标结算属性和目标类别,通过所述决策模型输出所述待决策结算数据对应的结算属性和类别。Until the plurality of field vectors are matched to obtain the corresponding target settlement attribute and target category, the decision model outputs the settlement attribute and category corresponding to the settlement data to be decided.
  4. 根据权利要求1所述的方法,其特征在于,在所述获取决策模型之前,还包括:The method according to claim 1, before the obtaining a decision model, further comprising:
    获取多个数据库中的多个结算数据,所述结算数据包括多个字段名和对应的字段 值;Obtaining multiple settlement data in multiple databases, where the settlement data includes multiple field names and corresponding field values;
    对所述结算数据的字段名进行聚类分析,得到每个字段名的优先级参数;Performing cluster analysis on the field names of the settlement data to obtain priority parameters for each field name;
    根据所述字段名的优先级参数计算多个字段名的权重;Calculating weights of multiple field names according to the priority parameters of the field names;
    根据多个结算数据的字段名和对应的字段值训练出多个结算数据与结算属性和对应类别之间的关联关系;及Training association relationships between multiple settlement data and settlement attributes and corresponding categories according to field names and corresponding field values of the multiple settlement data; and
    根据多个字段名的权重和训练后的结算数据与结算属性以及类别之间的关联关系构建决策树,并根据所述决策树生成决策模型。A decision tree is constructed according to the weights of multiple field names and the association relationship between the training settlement data and settlement attributes and categories, and a decision model is generated according to the decision tree.
  5. 根据权利要求1所述的方法,其特征在于,所述方法还包括:The method according to claim 1, further comprising:
    获取多个更新结算数据,所述更新结算数据设定有结算属性和类别;Obtaining a plurality of updated settlement data, wherein the updated settlement data is set with a settlement attribute and a category;
    对所述多个更新结算数据进行聚类分析,得到多个字段名的更新权重和更新关联关系参数;及Perform cluster analysis on the multiple update settlement data to obtain update weights and update association parameters of multiple field names; and
    根据所述更新权重和更新关联关系参数调节所述决策模型的参数,以对所述决策模型进行优化。The parameters of the decision model are adjusted according to the update weight and the update association parameter to optimize the decision model.
  6. 根据权利要求1所述的方法,其特征在于,所述类别包括多个层级类别,在所述根据多个添加结算属性和类别后的结算数据按照所述类别生成项目结算目录之后,还包括:The method according to claim 1, wherein the category comprises a plurality of hierarchical categories, and after the generating a project settlement directory according to the category based on a plurality of settlement attributes after adding settlement attributes and categories, further comprising:
    根据所述结算数据的结算属性和多个层级类别对所述结算数据进行排序;Sorting the settlement data according to the settlement attributes and multiple hierarchical categories of the settlement data;
    按照预设方式对多个层级类别的结算数据进行编码;及Encoding settlement data for multiple tier categories in a preset manner; and
    对编码后的结算目录进行存储。Store the encoded settlement directory.
  7. 一种结算数据处理装置,包括:A settlement data processing device includes:
    获取模块,用于获取终端发送的多个初始结算目录,所述初始结算目录包括多个待结算数据,所述待结算数据包括对应的多个待结算字段;获取结算数据表,所述结算数据表中包括规范结算数据对应的多个规范字段,所述规范结算数据设定有结算属性和类别;An obtaining module, configured to obtain multiple initial settlement directories sent by the terminal, where the initial settlement directories include multiple data to be settled, the data to be settled includes corresponding multiple fields to be settled; and a settlement data table, the settlement data The table includes a plurality of specification fields corresponding to the specification settlement data, and the specification settlement data is set with settlement attributes and categories;
    匹配模块,用于将所述待结算字段与所述规范数据表中的规范字段进行比对,计算所述待结算字段与所述多个规范字段的匹配度;若存在所述匹配度达到预设阈值的待结算字段,确定所述待结算字段对应的待结算数据为规范结算数据,将所述规范字段相对应的规范结算数据设定的结算属性和类别添加至所述待结算数据中;A matching module, configured to compare the field to be settled with a specification field in the specification data table, and calculate a degree of matching between the field to be settled and the plurality of specification fields; Setting a threshold to-be-settled field, determining that the to-be-settled data corresponding to the to-be-settled field is normative settlement data, and adding the settlement attribute and category set by the normative settlement data corresponding to the specification field to the to-be-settled data;
    决策模块,用于若存在所述匹配度未达到预设阈值的待结算字段,确定所述待结算字段对应的待结算数据为不规范结算数据,根据不规范待结算数据生成待决策结算数据,将所述待决策结算数据输入至已训练的决策模型中进行决策,输出所述待决策结算数据对应的结算属性和类别,并添加至所述待决策结算数据中;及A decision module, configured to determine that the pending data corresponding to the pending settlement field is non-standard settlement data if there is a pending field to be settled where the matching degree does not reach a preset threshold, and generate the pending decision settlement data according to the non-standard pending settlement data; Inputting the settlement data to be decided into a trained decision model for decision making, outputting settlement attributes and categories corresponding to the settlement data to be decided, and adding to the settlement data to be decided; and
    目录生成模块,用于根据多个添加结算属性和类别后的结算数据按照所述类别生成项目结算目录,使得服务器根据所述项目结算目录进行项目结算处理。The directory generating module is configured to generate a project settlement directory according to the category based on a plurality of settlement data after adding settlement attributes and categories, so that the server performs project settlement processing according to the project settlement directory.
  8. 根据权利要求7所述的装置,其特征在于,所述匹配模块还用于获取语义匹配模型,所述语义匹配模型中包括多个规范结算数据和对应的多个规范字段向量;通过所述语义匹配模型提取出所述待结算数据中多个待结算字段对应的字段向量;及计算所述待结算数据对应的字段向量与多个规范字段向量之间的匹配度。The device according to claim 7, wherein the matching module is further configured to obtain a semantic matching model, wherein the semantic matching model includes a plurality of canonical settlement data and a corresponding plurality of canonical field vectors; The matching model extracts field vectors corresponding to a plurality of fields to be settled in the data to be settled; and calculates a degree of matching between a field vector corresponding to the data to be settled and a plurality of canonical field vectors.
  9. 根据权利要求7所述的装置,其特征在于,所述决策模型包括多个节点,所述决策模块还用于利用所述决策模型提取所述待决策结算数据对应的多个字段向量;将所述待决策结算数据对应的多个字段向量按照所述决策模型的节点顺序进行遍历匹配,计算字段向量与多个节点的匹配度;及直到所述多个字段向量匹配得到对应的目标结算属性和目标类别,通过所述决策模型输出所述待决策结算数据对应的结算属性和类别。The device according to claim 7, wherein the decision model includes a plurality of nodes, and the decision module is further configured to use the decision model to extract multiple field vectors corresponding to the settlement data to be decided; The multiple field vectors corresponding to the settlement data to be determined are traversed and matched according to the node order of the decision model, and the degree of matching between the field vector and the multiple nodes is calculated; and until the multiple field vectors are matched to obtain the corresponding target settlement attributes and The target category outputs the settlement attribute and category corresponding to the settlement data to be decided through the decision model.
  10. 根据权利要求7所述的装置,其特征在于,所述装置还包括模型构建模块,用于获取多个数据库中的多个结算数据,所述结算数据包括多个字段名和对应的字段值;对所述结算数据的字段名进行聚类分析,得到每个字段名的优先级参数;根据所述字段名的优先级参数计算多个字段名的权重;根据多个结算数据的字段名和对应的字段值训练出多个结算数据与结算属性和对应类别之间的关联关系;及根据多个字段名的权重和训练后的结算数据与结算属性以及类别之间的关联关系构建决策树,并根据所述决策树生成决策模型。The device according to claim 7, wherein the device further comprises a model building module, configured to obtain multiple settlement data in multiple databases, where the settlement data includes multiple field names and corresponding field values; Perform cluster analysis on the field names of the settlement data to obtain priority parameters for each field name; calculate the weight of multiple field names according to the priority parameters of the field names; and according to the field names and corresponding fields of the multiple settlement data Value training out the correlation between multiple settlement data and settlement attributes and corresponding categories; and constructing a decision tree based on the weights of multiple field names and the correlation between the training settlement data and settlement attributes and categories, and according to the Describe the decision tree generation decision model.
  11. 根据权利要求7所述的装置,其特征在于,所述装置还包括模型优化模块,用于获取多个更新结算数据,所述更新结算数据设定有结算属性和类别;对所述多个更新结算数据进行聚类分析,得到多个字段名的更新权重和更新关联关系参数;及根据所述更新权重和更新关联关系参数调节所述决策模型的参数,以对所述决策模型进行优化。The device according to claim 7, characterized in that the device further comprises a model optimization module for obtaining a plurality of updated settlement data, the updated settlement data is set with a settlement attribute and a category; and the plurality of updates The settlement data is subjected to cluster analysis to obtain update weights and update association relationship parameters of multiple field names; and adjust parameters of the decision model according to the update weights and update association relationship parameters to optimize the decision model.
  12. 根据权利要求7所述的装置,其特征在于,所述类别包括多个层级类别,所述目录生成模块还用于根据所述结算数据的结算属性和多个层级类别对所述结算数据进行排序;按照预设方式对多个层级类别的结算数据进行编码;及对编码后的结算目录进行存储。The device according to claim 7, wherein the category includes multiple hierarchical categories, and the directory generation module is further configured to sort the settlement data according to the settlement attributes of the settlement data and multiple hierarchical categories. ; Encoding settlement data of multiple hierarchical categories in a preset manner; and storing the encoded settlement directory.
  13. 一种计算机设备,包括存储器及一个或多个处理器,所述存储器中储存有计算机可读指令,所述计算机可读指令被所述一个或多个处理器执行时,使得所述一个或多个处理器执行以下步骤:A computer device includes a memory and one or more processors. The memory stores computer-readable instructions. When the computer-readable instructions are executed by the one or more processors, the one or more processors are caused. Each processor performs the following steps:
    获取终端发送的多个初始结算目录,所述初始结算目录包括多个待结算数据和对应的多个待结算字段;Acquiring multiple initial settlement directories sent by the terminal, where the initial settlement directory includes multiple data to be settled and corresponding multiple fields to be settled;
    获取规范数据表,所述规范数据表中包括规范结算数据和对应的多个规范字段,所述规范结算数据设定有结算属性和类别;Obtaining a authority data table, where the authority data table includes authority settlement data and corresponding authority fields, and the authority settlement data is set with settlement attributes and categories;
    将所述待结算字段与所述规范数据表中的规范字段进行匹配,计算所述待结算字段与所述多个规范字段的匹配度;Matching the field to be settled with a specification field in the specification data table, and calculating a degree of matching between the field to be settled and the plurality of specification fields;
    若存在所述匹配度达到预设阈值的待结算字段,确定所述待结算字段对应的待结算数据为规范结算数据,将所述规范字段对应的规范结算数据设定的结算属性和类别添加至所述待结算数据中;If there is a field to be settled where the matching degree reaches a preset threshold, determine that the data to be settled corresponding to the field to be settled is standard settlement data, and add the settlement attributes and categories set by the standard settlement data corresponding to the standard field to In the data to be settled;
    若存在所述匹配度未达到预设阈值的待结算字段,确定所述待结算字段对应的待结算数据为不规范结算数据,根据不规范待结算数据生成待决策结算数据,将所述待决策结算数据输入至已训练的决策模型中进行决策,输出所述待决策结算数据对应的结算属性和类别,并添加至所述待决策结算数据中;及If there is a field to be settled where the matching degree does not reach a preset threshold, it is determined that the data to be settled corresponding to the field to be settled is irregular settlement data, and to-be-determined settlement data is generated based on the non-standard to-be-settled data, and Settlement data is input into a trained decision model for decision making, outputting settlement attributes and categories corresponding to the pending decision settlement data, and added to the pending decision settlement data; and
    根据多个添加结算属性和类别后的结算数据按照所述类别生成项目结算目录,使得服务器根据所述项目结算目录进行项目结算处理。Generate a project settlement directory according to the category based on multiple settlement data after adding settlement attributes and categories, so that the server performs project settlement processing according to the project settlement directory.
  14. 根据权利要求13所述的计算机设备,其特征在于,所述处理器执行所述计算机可读指令时还执行以下步骤:获取语义匹配模型,所述语义匹配模型中包括多个规范结算数据和对应的多个规范字段向量;通过所述语义匹配模型提取出所述待结算数据中多个待结算字段对应的字段向量;及计算所述待结算数据对应的字段向量与多个规范字段向量之间的匹配度。The computer device according to claim 13, wherein when the processor executes the computer-readable instructions, the processor further executes the following steps: acquiring a semantic matching model, the semantic matching model comprising a plurality of normative settlement data and correspondences A plurality of canonical field vectors; extracting a field vector corresponding to a plurality of to-be-cleared fields in the data to be settled through the semantic matching model; and calculating a field vector corresponding to the data to be settled and a plurality of canonical field vectors Match.
  15. 根据权利要求13所述的计算机设备,其特征在于,所述决策模型包括多个节点,所述处理器执行所述计算机可读指令时还执行以下步骤:利用所述决策模型提取所述待决策结算数据对应的多个字段向量;将所述待决策结算数据对应的多个字段向量按照所述决策模型的节点顺序进行遍历匹配,计算字段向量与多个节点的匹配度;及直到所述多个字段向量匹配得到对应的目标结算属性和目标类别,通过所述决策模型输出所述待决策结算数据对应的结算属性和类别。The computer device according to claim 13, wherein the decision model includes a plurality of nodes, and the processor further executes the following steps when executing the computer-readable instructions: using the decision model to extract the pending decision Multiple field vectors corresponding to the settlement data; performing traversal matching on the multiple field vectors corresponding to the settlement data to be determined according to the node order of the decision model, and calculating the degree of matching between the field vector and the multiple nodes; and up to the multiple The field vectors are matched to obtain the corresponding target settlement attribute and target category, and the settlement attribute and category corresponding to the settlement data to be decided are output through the decision model.
  16. 根据权利要求13所述的计算机设备,其特征在于,所述处理器执行所述计算机可读指令时还执行以下步骤:获取多个数据库中的多个结算数据,所述结算数据包括多个字段名和对应的字段值;对所述结算数据的字段名进行聚类分析,得到每个字段名的优先级参数;根据所述字段名的优先级参数计算多个字段名的权重;根据多个结算数据的字段名和对应的字段值训练出多个结算数据与结算属性和对应类别之间的关联关系;及根据多个字段名的权重和训练后的结算数据与结算属性以及类别之间的关联关系构建决策树,并根据所述决策树生成决策模型。The computer device according to claim 13, wherein when the processor executes the computer-readable instructions, the processor further performs the following steps: obtaining multiple settlement data in multiple databases, the settlement data including multiple fields Cluster name and corresponding field value; perform cluster analysis on the field names of the settlement data to obtain priority parameters for each field name; calculate the weights of multiple field names according to the priority parameters of the field names; and based on multiple settlements The field names of the data and the corresponding field values train the association relationships between multiple settlement data and settlement attributes and corresponding categories; and the association relationships between the training field settlement data and settlement attributes and categories according to the weights of multiple field names A decision tree is constructed, and a decision model is generated based on the decision tree.
  17. 一个或多个存储有计算机可读指令的非易失性计算机可读存储介质,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行以下步骤:One or more non-transitory computer-readable storage media storing computer-readable instructions that, when executed by one or more processors, cause the one or more processors to perform the following steps:
    获取终端发送的多个初始结算目录,所述初始结算目录包括多个待结算数据和对应的多个待结算字段;Acquiring multiple initial settlement directories sent by the terminal, where the initial settlement directory includes multiple data to be settled and corresponding multiple fields to be settled;
    获取规范数据表,所述规范数据表中包括规范结算数据和对应的多个规范字段,所述规范结算数据设定有结算属性和类别;Obtaining a authority data table, where the authority data table includes authority settlement data and corresponding authority fields, and the authority settlement data is set with settlement attributes and categories;
    将所述待结算字段与所述规范数据表中的规范字段进行匹配,计算所述待结算字段与所述多个规范字段的匹配度;Matching the field to be settled with a specification field in the specification data table, and calculating a degree of matching between the field to be settled and the plurality of specification fields;
    若存在所述匹配度达到预设阈值的待结算字段,确定所述待结算字段对应的待结算数据为规范结算数据,将所述规范字段对应的规范结算数据设定的结算属性和类别添加至所述待结算数据中;If there is a field to be settled where the matching degree reaches a preset threshold, determine that the data to be settled corresponding to the field to be settled is standard settlement data, and add the settlement attributes and categories set by the standard settlement data corresponding to the standard field to In the data to be settled;
    若存在所述匹配度未达到预设阈值的待结算字段,确定所述待结算字段对应的待结算数据为不规范结算数据,根据不规范待结算数据生成待决策结算数据,将所述待决策结算数据输入至已训练的决策模型中进行决策,输出所述待决策结算数据对应的结算属性和类别,并添加至所述待决策结算数据中;及If there is a field to be settled where the matching degree does not reach a preset threshold, it is determined that the data to be settled corresponding to the field to be settled is irregular settlement data, and to-be-determined settlement data is generated based on the non-standard to-be-settled data, and Settlement data is input into a trained decision model for decision making, outputting settlement attributes and categories corresponding to the pending decision settlement data, and added to the pending decision settlement data; and
    根据多个添加结算属性和类别后的结算数据按照所述类别生成项目结算目录,使得服务器根据所述项目结算目录进行项目结算处理。Generate a project settlement directory according to the category based on multiple settlement data after adding settlement attributes and categories, so that the server performs project settlement processing according to the project settlement directory.
  18. 根据权利要求17所述的存储介质,其特征在于,所述计算机可读指令被所述处理器执行时还执行以下步骤:获取语义匹配模型,所述语义匹配模型中包括多个规范结算数据和对应的多个规范字段向量;通过所述语义匹配模型提取出所述待结算数据中多个待结算字段对应的字段向量;及计算所述待结算数据对应的字段向量与多个规范字段向量之间的匹配度。The storage medium according to claim 17, wherein when the computer-readable instructions are executed by the processor, the following steps are further performed: obtaining a semantic matching model, the semantic matching model comprising a plurality of normative settlement data and Corresponding multiple canonical field vectors; extracting field vectors corresponding to multiple to-be-cleared fields in the to-be-settled data through the semantic matching model; and calculating a field vector corresponding to the to-be-settled data and a plurality of canonical field vectors Match.
  19. 根据权利要求17所述的存储介质,其特征在于,所述决策模型包括多个节点,所述计算机可读指令被所述处理器执行时还执行以下步骤:利用所述决策模型提取所述待决策结算数据对应的多个字段向量;将所述待决策结算数据对应的多个字段向量按照所述决策模型的节点顺序进行遍历匹配,计算字段向量与多个节点的匹配度;及直到所述多个字段向量匹配得到对应的目标结算属性和目标类别,通过所述决策模型输出所述待决策结算数据对应的结算属性和类别。The storage medium according to claim 17, wherein the decision model includes a plurality of nodes, and when the computer-readable instructions are executed by the processor, the following step is further performed: using the decision model to extract the pending Multiple field vectors corresponding to the decision settlement data; performing traversal matching on the multiple field vectors corresponding to the settlement data to be decided according to the node order of the decision model, and calculating the degree of matching between the field vector and the multiple nodes; and until the A plurality of field vectors are matched to obtain a corresponding target settlement attribute and target category, and the settlement attribute and category corresponding to the settlement data to be decided are output through the decision model.
  20. 根据权利要求17所述的存储介质,其特征在于,所述计算机可读指令被所述处理器执行时还执行以下步骤:获取多个数据库中的多个结算数据,所述结算数据包括多个字段名和对应的字段值;对所述结算数据的字段名进行聚类分析,得到每个字段名的优先级参数;根据所述字段名的优先级参数计算多个字段名的权重;根据多个结算数据的字段名和对应的字段值训练出多个结算数据与结算属性和对应类别之间的关联关系;及根据多个字段名的权重和训练后的结算数据与结算属性以及类别之间的关联关系构建决策树,并根据所述决策树生成决策模型。The storage medium according to claim 17, wherein when the computer-readable instructions are executed by the processor, the following steps are further performed: acquiring multiple settlement data in multiple databases, and the settlement data includes multiple Field names and corresponding field values; performing cluster analysis on the field names of the settlement data to obtain priority parameters for each field name; calculating weights of multiple field names according to the priority parameters of the field names; and The field names and corresponding field values of the settlement data train the association relationships between multiple settlement data and settlement attributes and corresponding categories; and the association between the training field settlement data and settlement attributes and categories according to the weights of multiple field names The relationship builds a decision tree, and generates a decision model based on the decision tree.
PCT/CN2019/096961 2018-09-03 2019-07-22 Settlement data processing method and apparatus, and computer device and storage medium WO2020048247A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811020169.8A CN109410069A (en) 2018-09-03 2018-09-03 Settlement data processing method, device, computer equipment and storage medium
CN201811020169.8 2018-09-03

Publications (1)

Publication Number Publication Date
WO2020048247A1 true WO2020048247A1 (en) 2020-03-12

Family

ID=65463897

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/096961 WO2020048247A1 (en) 2018-09-03 2019-07-22 Settlement data processing method and apparatus, and computer device and storage medium

Country Status (2)

Country Link
CN (1) CN109410069A (en)
WO (1) WO2020048247A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109410069A (en) * 2018-09-03 2019-03-01 平安医疗健康管理股份有限公司 Settlement data processing method, device, computer equipment and storage medium
CN113420077B (en) * 2021-06-10 2024-01-30 浙江大华技术股份有限公司 Data processing method, device, equipment and medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110016162A1 (en) * 2009-07-15 2011-01-20 David Booth Onboarding resources to an identity management system
CN107403199A (en) * 2017-08-07 2017-11-28 北京京东尚科信息技术有限公司 Data processing method and device
CN107704625A (en) * 2017-10-30 2018-02-16 锐捷网络股份有限公司 Fields match method and apparatus
CN107844560A (en) * 2017-10-30 2018-03-27 北京锐安科技有限公司 A kind of method, apparatus of data access, computer equipment and readable storage medium storing program for executing
CN109410069A (en) * 2018-09-03 2019-03-01 平安医疗健康管理股份有限公司 Settlement data processing method, device, computer equipment and storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101079851B (en) * 2007-07-09 2011-01-05 华为技术有限公司 Email type judgement method,device and system
CN103729386B (en) * 2012-10-16 2017-08-04 阿里巴巴集团控股有限公司 Information query system and method
CN106547742B (en) * 2016-11-30 2019-05-03 百度在线网络技术(北京)有限公司 Semantic parsing result treating method and apparatus based on artificial intelligence
CN108108477B (en) * 2018-01-04 2019-04-05 焦点科技股份有限公司 A kind of the KPI system and Rights Management System of linkage

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110016162A1 (en) * 2009-07-15 2011-01-20 David Booth Onboarding resources to an identity management system
CN107403199A (en) * 2017-08-07 2017-11-28 北京京东尚科信息技术有限公司 Data processing method and device
CN107704625A (en) * 2017-10-30 2018-02-16 锐捷网络股份有限公司 Fields match method and apparatus
CN107844560A (en) * 2017-10-30 2018-03-27 北京锐安科技有限公司 A kind of method, apparatus of data access, computer equipment and readable storage medium storing program for executing
CN109410069A (en) * 2018-09-03 2019-03-01 平安医疗健康管理股份有限公司 Settlement data processing method, device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN109410069A (en) 2019-03-01

Similar Documents

Publication Publication Date Title
US20210257066A1 (en) Machine learning based medical data classification method, computer device, and non-transitory computer-readable storage medium
WO2020057022A1 (en) Associative recommendation method and apparatus, computer device, and storage medium
WO2021128679A1 (en) Data decision-making-based test data generation method and apparatus, and computer device
CN109063217B (en) Work order classification method and device in electric power marketing system and related equipment thereof
WO2020177365A1 (en) Data mining-based social insurance data processing method and apparatus, and computer device
CN109783604B (en) Information extraction method and device based on small amount of samples and computer equipment
CN110458324B (en) Method and device for calculating risk probability and computer equipment
WO2020057021A1 (en) Data table processing method and device, computer device and storage medium
WO2020048048A1 (en) Unbalanced sample data preprocessing method and apparatus, and computer device
CN112015900B (en) Medical attribute knowledge graph construction method, device, equipment and medium
WO2020034801A1 (en) Medical feature screening method and apparatus, computer device, and storage medium
WO2021151358A1 (en) Triage information recommendation method and apparatus based on interpretation model, and device and medium
WO2020048247A1 (en) Settlement data processing method and apparatus, and computer device and storage medium
CN110362798B (en) Method, apparatus, computer device and storage medium for judging information retrieval analysis
WO2020015140A1 (en) Passenger rating model generation method and apparatus, and computer device and storage medium
CN110135943B (en) Product recommendation method, device, computer equipment and storage medium
WO2020052183A1 (en) Method and apparatus for identifying trademark infringement, and computer device and storage medium
CN110727836B (en) Social network analysis system based on Spark GraphX and implementation method thereof
CN115062016A (en) Incidence relation extraction method and device and computer equipment
CN112800179B (en) Associated database query method and device, storage medium and electronic equipment
CN111552810B (en) Entity extraction and classification method, entity extraction and classification device, computer equipment and storage medium
CN112069269A (en) Big data and multidimensional feature-based data tracing method and big data cloud server
CN112765183B (en) Multi-source data fusion method and device, storage medium and electronic equipment
WO2022127124A1 (en) Meta learning-based entity category recognition method and apparatus, device and storage medium
US20230138491A1 (en) Continuous learning for document processing and analysis

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19857437

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19857437

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 19857437

Country of ref document: EP

Kind code of ref document: A1