CN111967790B - Credit scoring method capable of automatically calculating and terminal - Google Patents

Credit scoring method capable of automatically calculating and terminal Download PDF

Info

Publication number
CN111967790B
CN111967790B CN202010882083.7A CN202010882083A CN111967790B CN 111967790 B CN111967790 B CN 111967790B CN 202010882083 A CN202010882083 A CN 202010882083A CN 111967790 B CN111967790 B CN 111967790B
Authority
CN
China
Prior art keywords
data
credit
normative
cleaning
credit data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010882083.7A
Other languages
Chinese (zh)
Other versions
CN111967790A (en
Inventor
张美跃
刘侃
范章华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hengruitong Fujian Information Technology Co ltd
Original Assignee
Hengruitong Fujian Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hengruitong Fujian Information Technology Co ltd filed Critical Hengruitong Fujian Information Technology Co ltd
Priority to CN202010882083.7A priority Critical patent/CN111967790B/en
Publication of CN111967790A publication Critical patent/CN111967790A/en
Application granted granted Critical
Publication of CN111967790B publication Critical patent/CN111967790B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/067Enterprise or organisation modelling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Human Resources & Organizations (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Physics & Mathematics (AREA)
  • Economics (AREA)
  • Quality & Reliability (AREA)
  • General Business, Economics & Management (AREA)
  • Development Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Educational Administration (AREA)
  • Marketing (AREA)
  • Databases & Information Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Game Theory and Decision Science (AREA)
  • Operations Research (AREA)
  • Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)

Abstract

The invention relates to a credit score method and a terminal capable of automatically calculating, wherein a credit information list of each province is obtained according to a state-level credit information directory, original credit data of the credit information list is obtained, a first single-word-segment weight of the original credit data is configured, a first algorithm model is determined according to the first single-word-segment weight, the original credit data is substituted into the first algorithm model to obtain a first credit score, an actual credit score and actual policy information matched with the original credit data are obtained, the first algorithm model is corrected according to the actual credit score, the first credit score and the actual policy information to obtain a normative algorithm model, and the normative algorithm model is stored; cleaning and filtering the original credit data according to a preset rule to obtain cleaning credit data, verifying the cleaning credit data, and if verification is successful, taking the cleaning credit data as normative credit data; and calculating the normative credit data according to the normative algorithm model to obtain the normative credit score.

Description

Credit scoring method capable of automatic calculation and terminal
Technical Field
The invention relates to the field of computer software, in particular to a credit scoring method capable of automatically calculating and a terminal.
Background
The collection of the corporate credit data is an important function in a corporate credit platform, and the collection of the corporate credit data is mainly used for pertinently extracting corporate credit data of different business types of 10 units of a city industry and commerce bureau, a city tax bureau, a city quality supervision bureau, a city food and drug supervision bureau, a city traffic bureau, a city government bureau, a city public security bureau, a city court and a city inspection and quarantine bureau.
The legal credit data acquisition is realized by relying on public information service platforms and public basic databases (hereinafter referred to as 'one database for short') of Fujian province and the province, the public information platforms are built one by one according to planning, the public information platforms are mainly responsible for gathering data of all committees and offices of various cities, meanwhile, the public basic databases, public services and the public service databases are built, and finally an information resource pool facing city sharing coordination is formed.
Due to various objective factors, credit data reported in various cities often have regional characteristics, collected data types, data dimensions, data frequency and the like can change due to specific conditions of the cities, and corresponding credit scores cannot be calculated by using the same algorithm model; in addition, aiming at the problems of ambiguous data, incompleteness, violation of business rules and the like which may occur in the public credit information of the legal person and the natural person reported in various places, the reasonable credit score cannot be calculated according to the collected credit data of the legal person.
Disclosure of Invention
Technical problem to be solved
In order to solve the above problems in the prior art, the present invention provides an automatic calculation credit score method, which can establish a reasonable credit score algorithm model to ensure the reasonability of the calculated credit score.
(II) technical scheme
In order to achieve the purpose, the invention adopts a technical scheme that: an automatically calculable credit scoring method, comprising:
s1, obtaining a credit information list of each province according to a state-level credit information directory, obtaining original credit data of the credit information list, configuring first single-word-segment weights of the original credit data, determining a first algorithm model according to the first single-word-segment weights, substituting the original credit data into the first algorithm model to obtain a first credit score, obtaining actual credit scores and actual policy information matched with the original credit data, correcting the first algorithm model according to the actual credit scores, the first credit score and the actual policy information to obtain a normative algorithm model, and storing the normative algorithm model;
s2, cleaning and filtering the original credit data according to a preset rule to obtain cleaning credit data, verifying the cleaning credit data according to an integrity rule, a uniqueness rule, a consistency rule, a legality rule and an authority rule, and taking the cleaning credit data as normative credit data if verification is successful;
and S3, calculating the normative credit data according to the normative algorithm model to obtain the normative credit score.
The other technical scheme adopted by the invention is as follows: an automatically calculable credit terminal comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the following steps when executing the computer program:
s1, obtaining a credit information list of each province according to a state-level credit information directory, obtaining original credit data of the credit information list, configuring first single-word-segment weights of the original credit data, determining a first algorithm model according to the first single-word-segment weights, substituting the original credit data into the first algorithm model to obtain a first credit score, obtaining actual credit scores and actual policy information matched with the original credit data, correcting the first algorithm model according to the actual credit scores, the first credit score and the actual policy information to obtain a normative algorithm model, and storing the normative algorithm model;
s2, cleaning and filtering the original credit data according to a preset rule to obtain cleaning credit data, verifying the cleaning credit data according to an integrity rule, a uniqueness rule, a consistency rule, a legality rule and an authority rule, and taking the cleaning credit data as normative credit data if verification is successful;
and S3, calculating the normative credit data according to the normative algorithm model to obtain the normative credit score.
(III) advantageous effects
The invention has the beneficial effects that: configuring a first single-word-segment weight through original credit data to determine a first algorithm model, and correcting the first algorithm model by combining actual credit score and actual policy information to obtain a normative algorithm model, so that the accuracy and reliability of the normative algorithm model are ensured; in addition, the original credit data are cleaned and verified successfully to obtain the normative credit data, so that the problems of ambiguity, incompleteness, violation of business rules and the like of the original credit data are avoided, the validity of the data is guaranteed, and the rationality of the normative credit data can be guaranteed through the normative credit data and the normative credit score obtained by the normative algorithm model.
Drawings
FIG. 1 is a flow chart of an automatically calculable credit scoring method of the present invention;
FIG. 2 is a schematic diagram of a credit point terminal capable of automatic calculation according to the present invention;
[ description of reference ]
1. A credit sub-terminal capable of automatic calculation; 2. a memory; 3. a processor.
Detailed Description
For the purpose of better explaining the present invention and to facilitate understanding, the present invention will be described in detail by way of specific embodiments with reference to the accompanying drawings.
Referring to fig. 1, a credit scoring method capable of automatic calculation includes:
s1, obtaining a credit information list of each province according to a state-level credit information directory, obtaining original credit data of the credit information list, configuring first single-word-segment weights of the original credit data, determining a first algorithm model according to the first single-word-segment weights, substituting the original credit data into the first algorithm model to obtain a first credit score, obtaining actual credit scores and actual policy information matched with the original credit data, correcting the first algorithm model according to the actual credit scores, the first credit score and the actual policy information to obtain a normative algorithm model, and storing the normative algorithm model;
s2, cleaning and filtering the original credit data according to a preset rule to obtain cleaning credit data, verifying the cleaning credit data according to an integrity rule, a uniqueness rule, a consistency rule, a legality rule and an authority rule, and taking the cleaning credit data as normative credit data if verification is successful;
and S3, calculating the normative credit data according to the normative algorithm model to obtain the normative credit score.
From the above description, the beneficial effects of the present invention are: configuring a first single-word-segment weight through original credit data to determine a first algorithm model, and modifying the first algorithm model by combining actual credit score and actual policy information to obtain a normative algorithm model, so that the accuracy and reliability of the normative algorithm model can be ensured according to actual conditions; in addition, the original credit data are cleaned and verified successfully to obtain the normative credit data, so that the problems of ambiguity, incompleteness, violation of business rules and the like of the original credit data are avoided, the validity of the data is guaranteed, and the rationality of the normative credit data can be guaranteed through the normative credit data and the normative credit score obtained by the normative algorithm model.
Further, the S3 includes before:
judging whether the normative credit data is missing or not, if so, sending the normative credit data to a modification terminal, configuring a second single-word segment weight of the normative credit data, determining a second algorithm model according to the second single-word segment weight, calculating the normative credit data according to the second algorithm model to obtain a second credit score, sending the second credit score to an application end, returning a modification suggestion returned by the application end, and modifying the normative algorithm model according to the modification suggestion and the second single-word segment weight to obtain a final normative algorithm model.
The S3 specifically comprises the following steps:
and S3, calculating the normative credit data according to the final normative algorithm model to obtain the normative credit score.
As can be seen from the above description, because there is a difference in data amount between the original credit data of the credit information lists of each city and each city, there may be a situation that some of the original credit data of the legal person in the city and some of the original credit data may have partial data missing, and the original credit data cannot be calculated by using the general normative algorithm model, when the normative credit data are missing, a second single-field weight needs to be configured correspondingly to obtain a second algorithm model, and the second algorithm model is used to obtain a second credit score for the normative credit data, and the second algorithm model needs to be corrected according to a modification opinion returned by an application end (for example, when the second algorithm model is used by a bank in the city to calculate the normative credit data missing in the city and the second credit score is not within a reasonable range, the modification opinion is sent), so as to obtain a final normative algorithm model, thereby ensuring reliability of the algorithm model.
Further, the S2 further includes:
if the verification is unsuccessful, analyzing the cleaning credit data, and judging whether the cleaning credit data is data capable of automatically correcting errors;
if so, automatically correcting the cleaning credit data to obtain error correction data, and taking the error correction data as normative credit data;
if not, classifying and storing according to the zone attribution of the cleaning credit data, and simultaneously storing the error description and the error data field of the cleaning credit data.
As can be seen from the above description, after the verification of the cleaning credit data is unsuccessful, whether the data is automatically error-correctable or not is analyzed, if so, the cleaning credit data can be automatically error-corrected and then stored, the degree of automation is high, and the manpower consumption is reduced, and if not, the cleaning credit data is classified and stored according to the division attribution of the cleaning credit data (namely, the cleaning credit data is classified according to the specific city), and the error description and the error data field of the cleaning credit data are simultaneously stored, so that the cleaning credit data with problems can be classified, and the manual verification of the cleaning credit data by a worker can be facilitated according to the error description and the error data field of the cleaning credit data.
If not, classifying and storing according to the zone attribution of the cleaning credit data, and simultaneously storing the error description and the error data field of the cleaning credit data, the method further comprises the following steps:
and acquiring processing data obtained by processing the cleaning credit data according to the error description and the error data field of the cleaning credit data, and taking the processing data as normative credit data if the processing data is successfully processed.
From the above description, it can be known that, when the processed data after manual processing is acquired, whether the processing is successful or not is judged, and the processed data is used as normative credit data after the processing is successful, so that the data quality is ensured.
Further, the preset rules comprise basic preset rules and newly added preset rules;
counting and monitoring the data quality and the data quantity of cleaning credit data obtained by cleaning and filtering the original credit data according to a preset rule, respectively obtaining the front data quality, the front data quantity, the rear data quality and the rear data quantity of the cleaning credit data obtained by cleaning and filtering the original credit data within a preset time period before and after adding the newly added preset rule in a basic preset rule, respectively comparing the front data quality and the front data quantity with the rear data quality and the rear data quantity to obtain a comparison result, and judging whether to alarm or not according to the comparison result.
From the above description, after the preset rule is newly added to the basic preset rule, the change of the subsequent data quality and data quantity is counted to judge whether to alarm, so that the preset rule is convenient to adjust.
Referring to fig. 2, a credit score method capable of automatic calculation and a terminal thereof include a memory, a processor and a computer program stored in the memory and running on the processor, wherein the processor implements the following steps when executing the computer program:
s1, obtaining a credit information list of each province according to a state-level credit information directory, obtaining original credit data of the credit information list, configuring first single-word-segment weights of the original credit data, determining a first algorithm model according to the first single-word-segment weights, substituting the original credit data into the first algorithm model to obtain a first credit score, obtaining actual credit scores and actual policy information matched with the original credit data, correcting the first algorithm model according to the actual credit scores, the first credit score and the actual policy information to obtain a normative algorithm model, and storing the normative algorithm model;
s2, cleaning and filtering the original credit data according to a preset rule to obtain cleaning credit data, verifying the cleaning credit data according to an integrity rule, a uniqueness rule, a consistency rule, a legality rule and an authority rule, and taking the cleaning credit data as normative credit data if verification is successful;
and S3, calculating the normative credit data according to the normative algorithm model to obtain the normative credit score.
From the above description, the beneficial effects of the present invention are: configuring a first single-word-segment weight through original credit data to determine a first algorithm model, and modifying the first algorithm model by combining actual credit score and actual policy information to obtain a normative algorithm model, so that the accuracy and reliability of the normative algorithm model are guaranteed; in addition, the original credit data are cleaned and verified successfully to obtain the normative credit data, so that the problems of ambiguity, incompleteness, violation of business rules and the like of the original credit data are avoided, the validity of the data is guaranteed, and the rationality of the normative credit data can be guaranteed through the normative credit data and the normative credit score obtained by the normative algorithm model.
Further, the S3 includes before:
judging whether the normative credit data is missing or not, if so, sending the normative credit data to a modification terminal, configuring a second single-word segment weight of the normative credit data, determining a second algorithm model according to the second single-word segment weight, calculating the normative credit data according to the second algorithm model to obtain a second credit score, sending the second credit score to an application end, returning a modification suggestion returned by the application end, and modifying the normative algorithm model according to the modification suggestion and the second single-word segment weight to obtain a final normative algorithm model.
The S3 specifically comprises the following steps:
and S3, calculating the normative credit data according to the final normative algorithm model to obtain the normative credit score.
As can be seen from the above description, because there is a difference in data amount between the original credit data of the credit information lists of each city and each city, there may be a situation that some of the original credit data of the legal person in the city and some of the original credit data may have partial data missing, and the original credit data cannot be calculated by using the general normative algorithm model, when the normative credit data are missing, a second single-field weight needs to be configured correspondingly to obtain a second algorithm model, and the second algorithm model is used to obtain a second credit score for the normative credit data, and the second algorithm model needs to be corrected according to a modification opinion returned by an application end (for example, when the second algorithm model is used by a bank in the city to calculate the normative credit data missing in the city and the second credit score is not within a reasonable range, the modification opinion is sent), so as to obtain a final normative algorithm model, thereby ensuring reliability of the algorithm model.
Further, the S2 further includes:
if the verification is unsuccessful, analyzing the cleaning credit data, and judging whether the cleaning credit data is data capable of automatically correcting errors;
if so, automatically correcting the cleaning credit data to obtain error correction data, and taking the error correction data as normative credit data;
if not, classifying and storing according to the zone attribution of the cleaning credit data, and simultaneously storing the error description and the error data field of the cleaning credit data.
As can be seen from the above description, after the verification of the cleaning credit data is unsuccessful, whether the data is automatically error-correctable or not is analyzed, if so, the cleaning credit data can be automatically error-corrected and then stored, the degree of automation is high, and the manpower consumption is reduced, and if not, the cleaning credit data is classified and stored according to the division attribution of the cleaning credit data (namely, the cleaning credit data is classified according to the specific city), and the error description and the error data field of the cleaning credit data are simultaneously stored, so that the cleaning credit data with problems can be classified, and the manual verification of the cleaning credit data by a worker can be facilitated according to the error description and the error data field of the cleaning credit data.
If not, classifying and storing according to the zone attribution of the cleaning credit data, and simultaneously storing the error description and the error data field of the cleaning credit data, the method further comprises the following steps:
and acquiring processing data obtained by processing the cleaning credit data according to the error description and the error data field of the cleaning credit data, and if the processing data is successfully processed, taking the processing data as normative credit data.
From the above description, it can be known that, when the processed data after manual processing is acquired, whether the processing is successful or not is judged, and the processed data is used as normative credit data after the processing is successful, so that the data quality is ensured.
Further, the preset rules comprise basic preset rules and newly added preset rules;
counting and monitoring the data quality and the data quantity of cleaning credit data obtained by cleaning and filtering the original credit data according to a preset rule, respectively obtaining the front data quality, the front data quantity, the rear data quality and the rear data quantity of the cleaning credit data obtained by cleaning and filtering the original credit data within a preset time period before and after adding the newly added preset rule in a basic preset rule, respectively comparing the front data quality and the front data quantity with the rear data quality and the rear data quantity to obtain a comparison result, and judging whether to alarm or not according to the comparison result.
From the above description, after the preset rule is newly added to the basic preset rule, the change of the subsequent data quality and data quantity is counted to judge whether to alarm, so that the preset rule is convenient to adjust.
Example one
Referring to fig. 1, a credit scoring method capable of automatic calculation includes:
s1, obtaining a credit information list of each province according to a state-level credit information directory, obtaining original credit data of the credit information list, configuring first single-word-segment weights of the original credit data, determining a first algorithm model according to the first single-word-segment weights, substituting the original credit data into the first algorithm model to obtain a first credit score, obtaining actual credit scores and actual policy information matched with the original credit data, correcting the first algorithm model according to the actual credit scores, the first credit score and the actual policy information to obtain a normative algorithm model, and storing the normative algorithm model;
s2, cleaning and filtering the original credit data according to a preset rule to obtain cleaning credit data, verifying the cleaning credit data according to an integrity rule, a uniqueness rule, a consistency rule, a legality rule and an authority rule, and taking the cleaning credit data as normative credit data if verification is successful;
and S3, calculating the normative credit data according to the normative algorithm model to obtain the normative credit score.
Specifically, acquiring the original credit data of the credit information list comprises: and acquiring original credit data of the credit information list, and formatting the original credit data, so that the disordered format is avoided after format conversion, and the uniformity of the data format is ensured.
Specifically, the following preset rules may be configured for the integrity rule, the uniqueness rule, the consistency rule, the validity rule, and the authority rule, respectively:
1. the preset rules for completeness, such as partial data missing in the original credit data, can be complemented by the following preset rules:
(1) Completing by other information, for example, the original credit data lacks the administrative division code, and automatically filling the administrative division code according to the administrative division based on the administrative division code comparison table; if the time format is not correct, the error can be automatically corrected according to the time bits in the code.
(2) Through the completion of the previous and subsequent data, for example, the time series in the original credit data has short data, the average value before and after the data is used, the short data is more, the data can be omitted by using the smoothing treatment, and the cleaning of other data is not influenced.
2. The preset rules aiming at the legality rules can be processed through the following preset rules:
(1) Setting a forced legal rule, and forcibly setting the rule to be a maximum value or judging the rule to be invalid and rejecting the rule if the rule is not in the rule range;
(2) Field type legal rules: the date field format is "2010-10-10";
(3) The legal rule of the field content is as follows: the sex of the corporate legal person is 'male or female or unknown'; birth date < = today;
(4) Setting a warning rule, warning if the rule is not in the rule range, and then manually processing;
(5) And (4) carrying out manual special treatment on the outlier, and finding the outlier by using modes of box separation, clustering, regression and the like.
3. The preset rule for data uniqueness, i.e. removing duplicate records, only keeps one, for example:
(1) Pressing a main key to remove duplicate, and triggering an instruction of 'removing duplicate records' in the sql or excel;
(2) And (4) removing duplication according to rules, and compiling a corresponding series of rules according to complex data with repeated conditions to remove duplication. For example, data from different channels can be matched through the same key information, and the duplication is removed through combination.
4. Aiming at the preset rule of data consistency, for example, corresponding names in each table are kept consistent, for example, the existence names of the household locations in the same type of information submitted in each city are different, some are the household locations, and other are the household mouths and the native place, and the names are cleaned and filtered uniformly to be the household locations; for example, in the same type of information filling form submitted by each city, if the name of "the xi information filling form of the province of Fujian province" is different, if some are "xx information filling forms of the province of Fujian province", and if some are "xx information filling forms of the Fujian province", these names are cleaned and filtered uniformly to be "xx information filling forms of the province of Fujian province".
5. According to the preset rule of data authority, original credit data come from different channels, different authority levels are set for the different channels, and when the original credit data input by the channel with lower authority level and the original credit data input by the channels with higher authority level are the same group of data but have differences, the original credit data input by the channel with lower authority level are cleaned and filtered.
Wherein S3 is preceded by:
judging whether the normative credit data is missing or not, if so, sending the normative credit data to a modification terminal, configuring a second single-word segment weight of the normative credit data, determining a second algorithm model according to the second single-word segment weight, calculating the normative credit data according to the second algorithm model to obtain a second credit score, sending the second credit score to an application end, returning a modification suggestion returned by the application end, and modifying the normative algorithm model according to the modification suggestion and the second single-word segment weight to obtain a final normative algorithm model.
The S3 specifically comprises the following steps:
and S3, calculating the normative credit data according to the final normative algorithm model to obtain the normative credit score.
Wherein the S2 further comprises:
if the verification is unsuccessful, analyzing the cleaning credit data, and judging whether the cleaning credit data is data capable of automatically correcting errors;
if so, automatically correcting the cleaning credit data to obtain error correction data, and taking the error correction data as normative credit data;
if not, classifying and storing according to the zone attribution of the cleaning credit data, and simultaneously storing the error description and the error data field of the cleaning credit data.
If not, classifying and storing according to the zone attribution of the cleaning credit data, and simultaneously storing the error description and the error data field of the cleaning credit data, and then the method further comprises the following steps:
acquiring processing data obtained by processing the cleaning credit data according to the error description and the error data field of the cleaning credit data, and taking the processing data as normative credit data if the processing data is successfully processed; if the processing data processing is unsuccessful, returning to the step of storing the error description and the error data field of the cleaning credit data to wait for the next data processing.
The preset rules comprise basic preset rules and newly added preset rules;
counting and monitoring the data quality and the data quantity of cleaning credit data obtained by cleaning and filtering the original credit data according to a preset rule, respectively obtaining the front data quality, the front data quantity, the rear data quality and the rear data quantity of the cleaning credit data obtained by cleaning and filtering the original credit data within a preset time period before and after adding the newly added preset rule in a basic preset rule, respectively comparing the front data quality and the front data quantity with the rear data quality and the rear data quantity to obtain a comparison result, and judging whether to alarm or not according to the comparison result.
Example two
Referring to fig. 2, a credit terminal 1 capable of automatic calculation includes a memory 2, a processor 3 and a computer program stored on the memory 2 and capable of running on the processor 3, wherein the processor 3 implements the steps of the first embodiment when executing the computer program.
In summary, according to the credit score method and the terminal capable of automatic calculation provided by the invention, the first algorithm model is determined by configuring the first single-field weight through the original credit data, and the first algorithm model is corrected by combining the actual credit score and the actual policy information to obtain the normative algorithm model, so that the accuracy and reliability of the normative algorithm model can be ensured according to the actual situation; in addition, the original credit data are cleaned and verified successfully to obtain the normative credit data, so that the problems of ambiguity, incompleteness, violation of business rules and the like of the original credit data are avoided, the validity of the data is guaranteed, and the rationality of the normative credit data can be guaranteed through the normative credit data and the normative credit score obtained by the normative algorithm model.
The above description is only an embodiment of the present invention, and not intended to limit the scope of the present invention, and all equivalent changes made by using the contents of the present specification and the drawings, or applied directly or indirectly to the related technical fields, are included in the scope of the present invention.

Claims (4)

1. An automatically calculable credit scoring method, comprising:
s1, obtaining a credit information list of each province according to a state-level credit information directory, obtaining original credit data of the credit information list, configuring first single-word-segment weights of the original credit data, determining a first algorithm model according to the first single-word-segment weights, substituting the original credit data into the first algorithm model to obtain a first credit score, obtaining actual credit scores and actual policy information matched with the original credit data, correcting the first algorithm model according to the actual credit scores, the first credit score and the actual policy information to obtain a normative algorithm model, and storing the normative algorithm model;
s2, cleaning and filtering the original credit data according to a preset rule to obtain cleaning credit data, verifying the cleaning credit data according to an integrity rule, a uniqueness rule, a consistency rule, a legality rule and an authority rule, and taking the cleaning credit data as normative credit data if verification is successful;
judging whether the normative credit data is missing or not, if so, sending the normative credit data to a modification terminal, configuring a second single-word segment weight of the normative credit data, determining a second algorithm model according to the second single-word segment weight, calculating the normative credit data according to the second algorithm model to obtain a second credit score, sending the second credit score to an application end, returning a modification suggestion returned by the application end, and modifying the normative algorithm model according to the modification suggestion and the second single-word segment weight to obtain a final normative algorithm model;
s3, calculating normative credit data according to the final normative algorithm model to obtain normative credit scores;
the S2 further comprises:
if the verification is unsuccessful, analyzing the cleaning credit data, and judging whether the cleaning credit data is data capable of automatically correcting errors;
if so, automatically correcting the cleaning credit data to obtain error correction data, and taking the error correction data as normative credit data;
if not, classifying and storing according to the region attribution of the cleaning credit data, and simultaneously storing the error description and the error data field of the cleaning credit data;
if not, classifying and storing according to the zone attribution of the cleaning credit data, and simultaneously storing the error description and the error data field of the cleaning credit data, and then the method further comprises the following steps:
and acquiring processing data obtained by processing the cleaning credit data according to the error description and the error data field of the cleaning credit data, and taking the processing data as normative credit data if the processing data is successfully processed.
2. The automatically calculable credit scoring method according to claim 1, wherein the preset rules include basic preset rules and additional preset rules;
counting and monitoring the data quality and the data quantity of cleaning credit data obtained by cleaning and filtering the original credit data according to a preset rule, respectively obtaining the front data quality, the front data quantity, the rear data quality and the rear data quantity of the cleaning credit data obtained by cleaning and filtering the original credit data within a preset time period before and after adding the newly added preset rule in a basic preset rule, respectively comparing the front data quality and the front data quantity with the rear data quality and the rear data quantity to obtain a comparison result, and judging whether to alarm or not according to the comparison result.
3. An automatically calculable credit terminal comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor when executing the computer program implements the steps of:
s1, obtaining a credit information list of each province according to a state-level credit information directory, obtaining original credit data of the credit information list, configuring first single-word-segment weights of the original credit data, determining a first algorithm model according to the first single-word-segment weights, substituting the original credit data into the first algorithm model to obtain a first credit score, obtaining actual credit scores and actual policy information matched with the original credit data, correcting the first algorithm model according to the actual credit scores, the first credit score and the actual policy information to obtain a normative algorithm model, and storing the normative algorithm model;
s2, cleaning and filtering the original credit data according to a preset rule to obtain cleaning credit data, verifying the cleaning credit data according to an integrity rule, a uniqueness rule, a consistency rule, a legality rule and an authority rule, and taking the cleaning credit data as normative credit data if verification is successful;
judging whether the normative credit data is missing or not, if so, sending the normative credit data to a modification terminal, configuring a second single-word segment weight of the normative credit data, determining a second algorithm model according to the second single-word segment weight, calculating the normative credit data according to the second algorithm model to obtain a second credit score, sending the second credit score to an application end, returning a modification suggestion returned by the application end, and modifying the normative algorithm model according to the modification suggestion and the second single-word segment weight to obtain a final normative algorithm model;
s3, calculating normative credit data according to the final normative algorithm model to obtain normative credit scores;
the S2 further comprises:
if the verification is unsuccessful, analyzing the cleaning credit data, and judging whether the cleaning credit data is data capable of automatically correcting errors;
if so, automatically correcting the cleaning credit data to obtain error correction data, and taking the error correction data as normative credit data;
if not, classifying and storing according to the region attribution of the cleaning credit data, and simultaneously storing the error description and the error data field of the cleaning credit data;
if not, classifying and storing according to the zone attribution of the cleaning credit data, and simultaneously storing the error description and the error data field of the cleaning credit data, and then the method further comprises the following steps:
and acquiring processing data obtained by processing the cleaning credit data according to the error description and the error data field of the cleaning credit data, and taking the processing data as normative credit data if the processing data is successfully processed.
4. The credit terminal capable of automatic calculation according to claim 3, wherein the preset rules include basic preset rules and new preset rules;
counting and monitoring the data quality and the data quantity of cleaning credit data obtained by cleaning and filtering the original credit data according to a preset rule, respectively obtaining the front data quality, the front data quantity, the back data quality and the back data quantity of the cleaning credit data obtained by cleaning and filtering the original credit data in a preset time period before and after adding the newly added preset rule in a basic preset rule, respectively comparing the front data quality and the front data quantity with the back data quality and the back data quantity to obtain a comparison result, and judging whether to alarm or not according to the comparison result.
CN202010882083.7A 2020-08-28 2020-08-28 Credit scoring method capable of automatically calculating and terminal Active CN111967790B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010882083.7A CN111967790B (en) 2020-08-28 2020-08-28 Credit scoring method capable of automatically calculating and terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010882083.7A CN111967790B (en) 2020-08-28 2020-08-28 Credit scoring method capable of automatically calculating and terminal

Publications (2)

Publication Number Publication Date
CN111967790A CN111967790A (en) 2020-11-20
CN111967790B true CN111967790B (en) 2023-04-07

Family

ID=73401072

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010882083.7A Active CN111967790B (en) 2020-08-28 2020-08-28 Credit scoring method capable of automatically calculating and terminal

Country Status (1)

Country Link
CN (1) CN111967790B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017148269A1 (en) * 2016-02-29 2017-09-08 阿里巴巴集团控股有限公司 Method and apparatus for acquiring score credit and outputting feature vector value
CN107545356A (en) * 2017-06-29 2018-01-05 惠国征信服务股份有限公司 Personal social credibility methods of marking based on government data and application
CN108256993A (en) * 2017-12-29 2018-07-06 浪潮天元通信信息系统有限公司 A kind of credit score appraisal procedure and credit score Evaluation Platform
CN108564286A (en) * 2018-04-19 2018-09-21 天合泽泰(厦门)征信服务有限公司 A kind of artificial intelligence finance air control credit assessment method and system based on big data reference
CN109636181A (en) * 2018-12-11 2019-04-16 北京首汽智行科技有限公司 A kind of user credit divides calculation method and system
CN110264340A (en) * 2019-06-12 2019-09-20 重庆无界领智普惠商务信息咨询有限公司 A kind of P2P net loan customers' credit methods of marking and system based on machine learning

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017148269A1 (en) * 2016-02-29 2017-09-08 阿里巴巴集团控股有限公司 Method and apparatus for acquiring score credit and outputting feature vector value
CN107545356A (en) * 2017-06-29 2018-01-05 惠国征信服务股份有限公司 Personal social credibility methods of marking based on government data and application
CN108256993A (en) * 2017-12-29 2018-07-06 浪潮天元通信信息系统有限公司 A kind of credit score appraisal procedure and credit score Evaluation Platform
CN108564286A (en) * 2018-04-19 2018-09-21 天合泽泰(厦门)征信服务有限公司 A kind of artificial intelligence finance air control credit assessment method and system based on big data reference
CN109636181A (en) * 2018-12-11 2019-04-16 北京首汽智行科技有限公司 A kind of user credit divides calculation method and system
CN110264340A (en) * 2019-06-12 2019-09-20 重庆无界领智普惠商务信息咨询有限公司 A kind of P2P net loan customers' credit methods of marking and system based on machine learning

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
基于优化CBR的个人信用评分研究;姜明辉等;《中国软件学》;20141231(第12期);全文 *
基于数据的信用评级处理和分析系统的设计与实现;程冠皓;《中国优秀硕士学位论文全文数据库(电子期刊)信息科技辑》;20160215(第02期);全文 *

Also Published As

Publication number Publication date
CN111967790A (en) 2020-11-20

Similar Documents

Publication Publication Date Title
US8316114B2 (en) Updating IP address database based on transaction information
CN110209660B (en) Cheating group mining method and device and electronic equipment
CN104462604A (en) Data processing method and system
CN107909516A (en) A kind of problem source of houses recognition methods and system
CN113535689B (en) Database establishment management system for digital therapy
CN111428095B (en) Graph data quality verification method and graph data quality verification device
CN116012178B (en) Automatic financial accounting method based on artificial intelligence
CN113010903A (en) Catering industry oil smoke online monitoring method and system based on block chain
CN111967790B (en) Credit scoring method capable of automatically calculating and terminal
US20070130316A1 (en) System and method to generate hosting company statistics
CN114049016A (en) Index similarity judgment method, system, terminal device and computer storage medium
WO2013067575A1 (en) A method of analysing data
CN111966675A (en) Fixed asset investment project data cleaning method and terminal
CN111413952A (en) Robot fault detection method and device, electronic equipment and readable storage medium
CN114741381B (en) Data cleaning method based on association dependency
CN113553336B (en) Talent resume database updating system and method based on Internet
WO2021129024A1 (en) Method for recognizing state of communication device, and communication system and storage medium
CN113918705A (en) Contribution auditing method and system with early warning and recommendation functions
CN112199396A (en) Industrial Internet identification query method and system facing MES
CN112468444A (en) Internet domain name abuse identification method and device independent of content analysis
CN111882289A (en) Device and method for measuring and calculating item data audit index interval
CN112416993A (en) Trademark change judgment method, system, equipment and readable storage medium
CN112162991B (en) Intelligent management method and device for data
CN115330497A (en) Data processing method, system and computer readable storage medium
CN110022304A (en) A kind of website hung Trojan method for early warning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant