CN112559578A - Data processing method and device, electronic equipment and storage medium - Google Patents

Data processing method and device, electronic equipment and storage medium Download PDF

Info

Publication number
CN112559578A
CN112559578A CN202011506330.XA CN202011506330A CN112559578A CN 112559578 A CN112559578 A CN 112559578A CN 202011506330 A CN202011506330 A CN 202011506330A CN 112559578 A CN112559578 A CN 112559578A
Authority
CN
China
Prior art keywords
data
user certificate
certificate data
user
similarity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011506330.XA
Other languages
Chinese (zh)
Inventor
毛剑
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Saiante Technology Service Co Ltd
Original Assignee
Shenzhen Saiante Technology Service Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Saiante Technology Service Co Ltd filed Critical Shenzhen Saiante Technology Service Co Ltd
Priority to CN202011506330.XA priority Critical patent/CN112559578A/en
Publication of CN112559578A publication Critical patent/CN112559578A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2457Query processing with adaptation to user needs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures

Abstract

The application discloses a data processing method, a data processing device, electronic equipment and a storage medium, and relates to the technical field of computers. The data processing method comprises the steps of acquiring two user certificate data subjected to index merging and the certificate types of the user certificate data, wherein the two user certificate data come from different data systems; determining each target field needing to be verified in the user certificate data, the weight corresponding to each target field and a first similarity threshold corresponding to the user certificate data of the certificate type according to the certificate type to which the user certificate data belongs; and comparing the field values in the target fields contained in the two user certificate data. The data processing method is suitable for a user medical information query scene or a user personal information query scene. The technical scheme provided by the application avoids repeated display when inquiring the user certificate data of a certain certificate type of a user, and improves the accuracy of data inquiry to a certain extent.

Description

Data processing method and device, electronic equipment and storage medium
Technical Field
The present application relates to the field of computer technologies, and in particular, to a method and an apparatus for processing data, an electronic device, and a storage medium.
Background
In the data query field, the query system may query the user certificate information from a plurality of different data systems storing the user certificate information, and the data query under a plurality of scenarios, such as a user medical information query scenario for querying a medical archive or a case archive of a user, and a user personal information query scenario for querying an execution certificate or a degree certificate of the user. Because the user certificate information stored in different data systems has the condition of consistent information or approximately similar information, and the data systems are difficult to interconnect and intercommunicate, when the user certificate information is inquired through the inquiry system, the situation that the same user certificate information is stored in different data systems cannot be identified repeatedly, so that repeated display is caused, and the technical problem that how to avoid the repeated display of the user certificate information becomes urgent to be solved is solved.
Disclosure of Invention
Based on the above, the application provides a data processing method, a data processing device, an electronic device and a storage medium, which can avoid repeated display of user certificate information to a certain extent and improve the accuracy of data query.
In a first aspect, the present application provides a data processing method, including: acquiring two user certificate data subjected to index combination and a certificate type to which the user certificate data belongs, wherein the two user certificate data are from different data systems; according to the certificate type of the user certificate data, determining each target field needing to be verified in the user certificate data, the weight corresponding to each target field and a first similarity threshold corresponding to the user certificate data of the certificate type; comparing field values in all target fields contained in the two pieces of user certificate data, and determining the field value matching similarity between the field values in all target fields contained in the two pieces of user certificate data; determining data matching similarity between two pieces of user certificate data based on field value matching similarity between field values in target fields contained in the two pieces of user certificate data and corresponding weights of the target fields; and if the data matching similarity between the two user certificate data is higher than a first similarity threshold, carrying out index merging on the two user certificate data.
In a second aspect, the present application provides a data processing apparatus comprising: the first acquisition unit is used for acquiring two pieces of user certificate data for index combination and certificate types to which the user certificate data belong, wherein the two pieces of user certificate data are from different data systems; the first execution unit is used for determining each target field needing to be verified in the user certificate data, the weight corresponding to each target field and a first similarity threshold corresponding to the user certificate data of the certificate type according to the certificate type to which the user certificate data belongs; the comparison unit is used for comparing field values in all target fields contained in the two pieces of user certificate data and determining the field value matching similarity between the field values in all the target fields contained in the two pieces of user certificate data; the second execution unit is used for determining the data matching similarity between the two user certificate data based on the field value matching similarity between the field values in the target fields contained in the two user certificate data and the corresponding weight of each target field; and the merging unit is used for performing index merging on the two user certificate data if the data matching similarity between the two user certificate data is higher than a first similarity threshold.
In a third aspect, the present application provides an electronic device comprising a memory and a processor, wherein the memory stores computer readable instructions, and the computer readable instructions, when executed by the processor, cause the processor to execute the steps of the data processing method.
In a fourth aspect, the present application provides a storage medium having stored thereon computer-readable instructions which, when executed by one or more processors, cause the one or more processors to perform the steps of the above-described data processing method.
The technical scheme provided by the embodiment of the application can have the following beneficial effects: by determining each target field which is required to be verified and contained in the user certificate data, the weight corresponding to each target field and a first similarity threshold corresponding to the user certificate data of the certificate type according to the certificate type corresponding to the user certificate data, determining the field value matching similarity between the field values in each target field contained in the two user certificate data by comparing the field values in each target field contained in the two user certificate data, and finally determining whether the user certificate data of two same certificate types from different systems are the user certificate data of the same user or not based on the contrast relation between the field value matching similarity and the first similarity threshold, the user certificate data of the same user stored in different data systems are indexed and merged, and repeated display when the user certificate data of a certain certificate type of the user is inquired is avoided, the accuracy of data query is improved to a certain extent.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the application.
Drawings
Fig. 1 shows a schematic diagram of an exemplary system architecture to which the technical solution of the embodiments of the present application can be applied.
Fig. 2 is a flowchart illustrating a data processing method according to an exemplary embodiment of the present application.
Fig. 3 is a specific flowchart of step S230 of the data processing method according to an exemplary embodiment of the present application.
Fig. 4 is a specific flowchart of step S250 of the data processing method according to an exemplary embodiment of the present application.
Fig. 5 is a flowchart illustrating a data processing method according to an exemplary embodiment of the present application.
Fig. 6 is a block diagram of a data processing apparatus according to an exemplary embodiment of the present application.
Fig. 7 is a block diagram illustrating an example of an electronic device for implementing the data processing method according to an example embodiment of the present application.
Fig. 8 is a computer-readable storage medium for implementing the above-described data processing method according to an exemplary embodiment of the present application.
Detailed Description
Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many different forms and should not be construed as limited to the examples set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of example embodiments to those skilled in the art.
Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided to give a thorough understanding of embodiments of the application. One skilled in the relevant art will recognize, however, that the subject matter of the present application can be practiced without one or more of the specific details, or with other methods, components, devices, steps, and so forth. In other instances, well-known methods, devices, implementations, or operations have not been shown or described in detail to avoid obscuring aspects of the application.
The block diagrams shown in the figures are functional entities only and do not necessarily correspond to physically separate entities. I.e. these functional entities may be implemented in the form of software, or in one or more hardware modules or integrated circuits, or in different networks and/or processor means and/or microcontroller means.
The flow charts shown in the drawings are merely illustrative and do not necessarily include all of the contents and operations/steps, nor do they necessarily have to be performed in the order described. For example, some operations/steps may be decomposed, and some operations/steps may be combined or partially combined, so that the actual execution sequence may be changed according to the actual situation.
Fig. 1 shows a schematic diagram of an exemplary system architecture to which the technical solution of the embodiments of the present application can be applied.
As shown in fig. 1, the system architecture may include a client 101, a network 102, and a server 103, where the server 103 may be a server providing a data query service, the client 101 is a target electronic device performing the data query service, and the client 101 may be one or more of a smartphone, a tablet computer, and a portable computer, but may also be a desktop computer, and the like. Network 102 serves as a medium for providing communication links between clients 101 and servers 103. Network 102 may include various connection types, such as wired communication links, wireless communication links, and so forth.
It should be understood that the number of clients 101, networks 102, and servers 103 in fig. 1 is merely illustrative. There may be any number of clients 101, networks 102, and servers 103, as desired for implementation, e.g., a server 103 may be a server cluster of multiple servers, etc.
Optionally, the two user certificate data acquired by the server 103 of the present application are from different data systems, each data system may be any node server 104 of the blockchain data server system, and the server 103 may further feed back the result of index merging to the multiple node servers 104 of the blockchain data server system, and store the result by the node servers 104, so that the security and reliability of data processing are effectively ensured based on the security and the irreplaceable characteristic of blockchain data sharing.
In this embodiment, the server 103 obtains two pieces of user certificate data for index merging and the certificate type to which the user certificate data belongs, where the two pieces of user certificate data are from different data systems; determining each target field needing to be verified in the user certificate data, the weight corresponding to each target field and a first similarity threshold corresponding to the user certificate data of the certificate type according to the certificate type to which the user certificate data belongs; comparing field values in all target fields contained in the two user certificate data, and determining the field value matching similarity between the field values in all target fields contained in the two user certificate data; determining data matching similarity between two user certificate data based on field value matching similarity between field values in target fields contained in the two user certificate data and corresponding weights of the target fields; and if the data matching similarity between the two user certificate data is higher than a first similarity threshold, carrying out index merging on the two user certificate data.
By determining each target field which is required to be verified and contained in the user certificate data, the weight corresponding to each target field and a first similarity threshold corresponding to the user certificate data of the certificate type according to the certificate type corresponding to the user certificate data, determining the field value matching similarity between the field values in each target field contained in the two user certificate data by comparing the field values in each target field contained in the two user certificate data, and finally determining whether the user certificate data of two same certificate types from different systems are the user certificate data of the same user or not based on the contrast relation between the field value matching similarity and the first similarity threshold, the user certificate data of the same user stored in different data systems are indexed and merged, and repeated display when the user certificate data of a certain certificate type of the user is inquired is avoided, the accuracy of data query is improved to a certain extent.
It should be noted that the data processing method provided in the embodiment of the present application is generally executed by the server 103, and accordingly, the data processing apparatus is generally disposed in the server 103. The details of implementation of the technical solution of the embodiments of the present application are set forth in the following.
Referring to fig. 2, fig. 2 is a flowchart of a data processing method shown in an exemplary embodiment of the present application, an execution subject of the data processing method in the present embodiment is the server 103, and the data processing method shown in fig. 2 may include the following steps S210 to S260, which are described in detail below.
In step S210, two pieces of user certificate data for index merging and the certificate type to which the user certificate data belongs are acquired, and the two pieces of user certificate data come from different data systems.
In one embodiment, the certificate type is a certificate type to which user certificate data including multiple types of user attribute data belongs, and corresponds to different types of user information in different application scenarios of data query. Such as in a data query scenario of the user's medical information, the certificate type refers to the user's medical or case profile, such as in a data query scenario of the user's personal information, the certificate type refers to the user's identification card, driver's license, lawyer's license, and so on. User credential data for the same credential type can come from different data systems, which can be database systems that store user credential data for a certain credential type. It should be noted that the same database system may also store user credential data for multiple credential types. The two user certificate data in this embodiment are specifically user certificate data of the same certificate type.
In step S220, according to the certificate type to which the user certificate data belongs, each target field that needs to be verified in the user certificate data, the weight corresponding to each target field, and the first similarity threshold corresponding to the user certificate data of the certificate type are determined.
In one embodiment, the user certificate data comprises a plurality of fields, each field has a field value for storing certain attribute data of the user, the fields can be name, gender, mobile phone number, birth date, ethnicity, contacts, contact phone numbers, addresses and the like, and the types and the number of the fields contained in different types of user certificate data are generally different.
The target field is a field to be checked in fields included in the user certificate data, and it can be understood that the target field may be a partial field included in the user certificate data or all fields included in the user certificate data, and the target field to be checked in the user certificate data has an association relationship with a certificate type to which the user certificate data belongs. The server can pre-establish the corresponding relation between the certificate type of the user certificate data and the target field needing to be checked in the user certificate data according to the user certificate data of various different certificate types and the target field needing to be checked in the user certificate data, and further, each target field needing to be checked in the user certificate data can be directly determined according to the certificate type of the user certificate data.
Optionally, because the importance degrees of different target fields for determining whether the two user certificate data are the user certificate data of the same user are inconsistent, corresponding weights can be assigned to the target fields in advance according to the importance degrees of the target fields to the user certificate data of the same user, which are checked as required. It can be understood that, for a certain type of user certificate data, the server may determine, according to the importance degree of the target field pair to determine whether two user certificate data are user certificate data of the same user, and pre-establish a correspondence between the certificate type to which the user certificate data belongs and the weight corresponding to each target field, thereby facilitating to directly determine the weight corresponding to each target field according to the certificate type to which the user certificate data belongs.
Optionally, the first similarity threshold is a preset similarity threshold used for determining whether the two user certificate data are the user certificate data of the same user. Because field values in each target field contained in user certificate data stored in the database system can have an entry error during entry, in order to improve the accuracy of checking whether two user certificate data are user certificate data of the same user, corresponding first similarity threshold values can be allocated to user certificate data of different certificate types in advance, so that under the condition that an entry error exists in a field value in one target field of the user certificate data and the field values in other target fields are determined to be the same, whether two user certificate data are the user certificate data of the same user can be determined.
The server can pre-establish the corresponding relation between the certificate type of the user certificate data and the similarity threshold value of the user certificate data, and further, the similarity threshold value corresponding to the user certificate data of the certificate type can be directly determined according to the certificate type of the user certificate data.
In step S230, the field values in the respective target fields included in the two user certificate data are compared, and the field value matching similarity between the field values in the respective target fields included in the two user certificate data is determined.
In one embodiment, for each target field of the two user certificate data for comparison and verification, a corresponding field value may be extracted from the target field corresponding to the user certificate data. For example, if the target fields to be compared are fields such as "name", "gender", "mobile phone number", "date of birth", "nationality", etc., corresponding field values are extracted from the target fields, and similarity calculation is performed based on the extracted two field values, so as to determine matching similarity between the two field values as field value matching similarity between the two field values.
Referring to fig. 3, fig. 3 is a specific flowchart of step S230 of the data processing method shown in an exemplary embodiment of the present application, in which embodiment, step S230 may specifically include step S310 to step S340, which is described in detail as follows.
In step S310, character conversion processing is performed on field values in respective destination fields included in the two pieces of user certificate data, so as to obtain converted field values.
In one embodiment, for some specific target fields, such as name fields, when the field value of the name field is a Chinese name, the field value in the name field may be subjected to a character conversion process to obtain a converted field value, where the converted field value is generally a character string.
Specifically, for the field value in the name field, the field value can be converted into the corresponding pinyin, so as to obtain the converted field value. The similarity between the field values in the target fields contained in the user certificate data can be determined more accurately by performing character conversion processing on the field values in the target fields contained in the user certificate data.
For example, for target fields of which the input data is Chinese data, different areas have the condition that the input Chinese data has larger difference due to difference of spoken languages, and the corresponding Chinese pinyins of the Chinese data are similar and identical, for example, the 'king' word is wrongly input into the 'yellow' word, the 'king' word and the 'yellow' word belong to different Chinese characters, and the 'wang' word and the 'huang' word belong to the similar condition, so that the input error can be effectively identified by performing character conversion processing on field values in each target field included in the user certificate data.
In step S320, the converted field values corresponding to the target fields included in the two user certificate data are compared to determine a first similarity.
In one embodiment, the converted field values corresponding to the target fields included in the two user certificate data are compared to determine the second similarity. Alternatively, for a field value obtained by performing the character conversion process, a first similarity between two converted field values may be determined according to a similarity algorithm. Specifically, the first similarity between the two converted field values may be determined by a similarity algorithm that calculates euclidean metrics, but may be calculated by other similarity algorithms, and is not limited herein.
In step S330, the field values in the target fields included in the two user certificate data are compared to determine a second similarity.
In one embodiment, when the field value in the target field is Chinese data, the similarity between the two field values can be determined by directly comparing whether the Chinese characters in the two field values are the same. Specifically, the second similarity may be determined by a ratio of the number of the same chinese characters in the two field values to the total number of the chinese characters in the two field values, and of course, the second similarity may also be determined by first determining a sum of the total number of the chinese characters in the two field values and the number of the same chinese characters in the two field values, and determining the second similarity according to the ratio of the number of the same chinese characters in the two field values to the sum of the two chinese characters.
In step S340, based on the first similarity and the second similarity, a field value matching similarity between field values in respective object fields included in the two user certificate data is determined.
In the technical solution of the embodiment shown in fig. 3, character conversion processing is performed on field values in each target field included in the user certificate data to obtain converted field values, and the similarity between the field values in each target field included in the user certificate data is determined by the converted field values and the unconverted field values, so that the accuracy of determining whether two user certificate data are user certificate data of the same user can be further improved.
In one embodiment, after the first similarity and the second similarity are determined, the field value matching similarity between the field values in the target fields included in the two user certificate data is determined according to the first similarity and the second similarity. Specifically, the sum of the first similarity and the second similarity may be directly calculated, and the sum is used as the field value matching similarity between the field values in the target fields included in the two user certificate data.
Optionally, step S340 may specifically include: and determining field value matching similarity between field values in the target fields contained in the two user certificate data based on the weighted sum of the first similarity and the second similarity.
In this embodiment, when determining the field value matching similarity between the field values in the respective target fields included in the two user certificate data based on the first similarity and the second similarity, the field value matching similarity between the field values in the respective target fields included in the two user certificate data may also be calculated according to a weighted sum of the first similarity and the second similarity.
Alternatively, the weight of the second similarity may be set to be larger than the weight of the first similarity.
Still referring to fig. 2, in step S240, a data matching similarity between two user certificate data is determined based on a field value matching similarity between field values in respective target fields included in the two user certificate data and a corresponding weight of each target field.
In one embodiment, when determining field value matching similarity between field values in respective target fields included in two user certificate data, a product between field matching similarity between field values in each target field included in the two user certificate data and a weight corresponding to each target field may be determined, and a first matching similarity between the user certificate data may be calculated according to a sum of corresponding products.
In step S250, if the data matching similarity between the two user certificate data is higher than the first similarity threshold, the two user certificate data are indexed and merged.
In one embodiment, the first similarity threshold is a preset similarity threshold for determining whether the user certificate data of two same certificate types are the same user, and if it is determined that the first matching similarity between the user certificate data is higher than the first similarity threshold, it is determined that the user certificate data of two same certificate types are the same user, the two user certificate data may be indexed and merged to avoid repeated display when querying the user certificate data of a certain certificate type of the user.
Referring to fig. 4, fig. 4 is a specific flowchart of step S250 of the data processing method shown in an exemplary embodiment of the present application, in which step S250 may specifically include step S410 to step S430, which is described in detail as follows.
In step S410, if the data matching similarity between two user certificate data is higher than the first similarity threshold, the user identification information is extracted from any one of the user certificate data.
In one embodiment, on the premise that the data matching similarity between two user certificate data is higher than a first similarity threshold, the two user certificate data need to be indexed and merged. When index merging is performed, user identification information can be extracted from any one of the user certificate data.
Specifically, a specific field in the user certificate data, in which the user identification information is stored, may be determined first, and a field value in the specific field is extracted, that is, the user identification information in the user certificate data is extracted. The specific field can be a field which can uniquely identify the user, such as 'mobile phone number' or 'identification number'.
In step S420, index information is generated according to the user identification information and the certificate type to which the user certificate data belongs.
In one embodiment, in acquiring the user identification information, index information for indexing the user certificate data may be generated according to the user identification information and the certificate type to which the user certificate data belongs.
In step S430, the generated index information is stored in association with the two user certificate data.
In one embodiment, the generated index information and the two user certificate data are stored in an associated manner, so that the two associated user certificate data can be directly found according to the generated index information.
Optionally, step S250 may specifically include: and if the data matching similarity between the two user certificate data is higher than a first similarity threshold value, and the field value matching similarity between the user identification information contained in the two user certificate data is higher than a second similarity threshold value corresponding to a field for identifying the user identification information, extracting the user identification information from any one user certificate data.
In this embodiment, when the user identification information in the field identifying the user identification information included in the user certificate data is acquired as the index information, it is further required to determine whether the field value matching similarity between the field values in the field identifying the user identification information included in the user certificate data is greater than the similarity threshold corresponding to the field identifying the user identification information.
If the field value matching similarity between the field values in the fields for identifying the user identification information contained in the user certificate data is greater than the similarity threshold corresponding to the fields for identifying the user identification information, it is described that no entry error exists in the field values in each target field of the user certificate data stored in the data database system during entry, and the user identification information contained in the user certificate data can be used as index information for indexing the user certificate data.
In the technical scheme of the embodiment shown in fig. 4, the generation of index information according to the input wrong user identification information is avoided, the situation that the user certificate data exists but cannot be inquired is reduced, and the accuracy of inquiring the user certificate data is improved.
It can be seen from the above that, by determining each target field included in the user certificate data to be verified, the weight corresponding to each target field, and the first similarity threshold corresponding to the user certificate data of the certificate type according to the certificate type corresponding to the user certificate data, and by comparing the field values in each target field included in the two user certificate data, determining the field value matching similarity between the field values in each target field included in the two user certificate data, and finally determining whether the two user certificate data of the same certificate type from different systems are the user certificate data of the same user based on the comparison relationship between the field value matching similarity and the first similarity threshold, the user data of the same user stored in different data systems are indexed and merged, and repeated display when querying the user certificate data of a certain certificate type of the user is avoided, the accuracy of data query is improved to a certain extent.
In one embodiment, the data processing method of the present application may further include: and if the data matching similarity between the two user certificate data is higher than a first similarity threshold value and the field value matching similarity between the user identification information contained in the two user certificate data is smaller than or equal to a second similarity threshold value corresponding to the field for identifying the user identification information, generating an alarm notice that the index merging fails.
In this embodiment, when the user identification information in the field identifying the user identification information included in the user certificate data is acquired as the index information, it is further required to determine whether the field value matching similarity between the field values in the field identifying the user identification information included in the user certificate data is higher than the similarity threshold corresponding to the field identifying the user identification information.
If the field value matching similarity between the field values in the fields for identifying the user identification information contained in the user certificate data is less than or equal to the similarity threshold corresponding to the fields for identifying the user identification information, the situation that the field values in each target field of the user certificate data stored in the data database system have an entry error during entry is explained, so that the two user certificate data cannot be indexed and merged. Therefore, an alarm notification of index merging failure needs to be generated, and the alarm notification may specifically be "the user identification information in the user certificate data does not meet the requirement, and the index merging failure" is not limited herein.
The warning notice is used for prompting related personnel to update the field value in the field for identifying the user identification information contained in the user certificate data so as to smoothly index and merge the two user certificate data.
Referring to fig. 5, fig. 5 is a flowchart of a data processing method according to an exemplary embodiment of the present application, and the data processing method in this embodiment may further include steps S510 to S530, which are described in detail as follows.
In step S510, an inquiry request is obtained, where the inquiry request carries a certificate type for inquiry and user identification information for inquiry.
In one embodiment, when user certificate data of a certain user needs to be queried, the query can be implemented through a query request, and the query request carries certificate types used for querying and user identification information used for querying. In general, the user may trigger the query request through a virtual button or a physical key provided by the client, and the client sends the query request to the server.
In step S520, if target index information that is consistent with both the certificate type used for performing the query and the user identification information used for performing the query is monitored, any user certificate data associated with the target index information is acquired as a query result.
In one embodiment, after the server acquires the query request, the server compares the certificate type for query and the user identification information for query carried in the query request with the certificate type and the user identification information in the target index information, and if the target index information which is consistent with both the certificate type for query and the user identification information for query is monitored, acquires any user certificate data associated with the target index information as a query result.
In step S530, presentation data is generated based on the query result.
In one embodiment, when any user certificate data associated with the target index information is acquired, presentation data for presentation can be generated according to any user certificate data associated with the target index information so as to meet the query requirement of the user.
Referring to fig. 6, fig. 6 schematically shows a block diagram of a data processing apparatus according to an embodiment of the present application, which may be integrated in an electronic device, and a data processing apparatus 600 according to an embodiment of the present application may include: a first obtaining unit 610, a first executing unit 620, a comparing unit 630, a second executing unit 640 and a merging unit 650; a first obtaining unit 610, configured to obtain two pieces of user certificate data subjected to index merging and a certificate type to which the user certificate data belongs, where the two pieces of user certificate data are from different data systems; a first executing unit 620, configured to determine, according to a certificate type to which the user certificate data belongs, each target field that needs to be verified in the user certificate data, a weight corresponding to each target field, and a first similarity threshold corresponding to the user certificate data of the certificate type; a comparing unit 630, configured to compare field values in respective target fields included in the two pieces of user certificate data, and determine field value matching similarity between the field values in the respective target fields included in the two pieces of user certificate data; a second executing unit 640, configured to determine data matching similarity between two pieces of user certificate data based on field value matching similarity between field values in target fields included in the two pieces of user certificate data and a weight corresponding to each target field; a merging unit 650, configured to index and merge two pieces of user certificate data if data matching similarity between the two pieces of user certificate data is higher than a first similarity threshold.
Optionally, the comparing unit 630 is configured to: respectively carrying out character conversion processing on field values in each target field contained in the two user certificate data to obtain converted field values; comparing the converted field values corresponding to the target fields contained in the two user certificate data to determine a first similarity; comparing field values in each target field contained in the two user certificate data to determine a second similarity; and determining field value matching similarity between field values in all target fields contained in the two pieces of user certificate data based on the first similarity and the second similarity.
Optionally, the comparing unit 630 is configured to: and determining field value matching similarity between field values in various target fields contained in the two pieces of user certificate data based on the weighted sum of the first similarity and the second similarity.
Optionally, the merging unit 650 is configured to: if the data matching similarity between the two user certificate data is higher than a first similarity threshold, extracting user identification information from any one user certificate data; generating index information according to the user identification information and the certificate type to which the user certificate data belongs; and storing the generated index information and the two user certificate data in an associated manner.
Optionally, the merging unit 650 is configured to: and if the data matching similarity between the two user certificate data is higher than a first similarity threshold value, and the field value matching similarity between the user identification information contained in the two user certificate data is higher than a second similarity threshold value corresponding to a field for identifying the user identification information, extracting the user identification information from any one user certificate data.
Optionally, the data processing apparatus further includes: and the first generating unit is used for generating an alarm notice that the index merging fails if the data matching similarity between the two user certificate data is higher than a first similarity threshold value and the field value matching similarity between the user identification information contained in the two user certificate data is less than or equal to a second similarity threshold value corresponding to a field for identifying the user identification information.
Optionally, the data processing apparatus further includes: the second acquisition unit is used for acquiring an inquiry request, wherein the inquiry request carries certificate types used for inquiry and user identification information used for inquiry; a third obtaining unit, configured to obtain, if target index information that is consistent with both the certificate type used for performing the query and the user identification information used for performing the query is monitored, any user certificate data associated with the target index information as a query result; and the second generating unit is used for generating display data based on the query result.
The implementation process of the functions and actions of each module in the device is specifically described in the implementation process based on the corresponding steps in the data processing method, and is not described herein again.
It should be noted that although in the above detailed description several modules or units of the device for action execution are mentioned, such a division is not mandatory. Indeed, the features and functionality of two or more modules or units described above may be embodied in one module or unit, according to embodiments disclosed herein. Conversely, the features and functions of one module or unit described above may be further divided into embodiments by a plurality of modules or units.
Moreover, although the steps of the methods herein are depicted in the drawings in a particular order, this does not require or imply that the steps must be performed in this particular order, or that all of the depicted steps must be performed, to achieve desirable results. Additionally or alternatively, certain steps may be omitted, multiple steps combined into one step execution, and/or one step broken down into multiple step executions, etc.
Through the above description of the embodiments, those skilled in the art will readily understand that the exemplary embodiments described herein may be implemented by software, or by software in combination with necessary hardware. Therefore, the technical solution according to the embodiments of the present application can be embodied in the form of a software product, which can be stored in a non-volatile storage medium (which can be a CD-ROM, a usb disk, a removable hard disk, etc.) or on a network, and includes several instructions to enable a computing device (which can be a personal computer, a server, a mobile terminal, or a network device, etc.) to execute the method according to the embodiments of the present application.
In an exemplary embodiment of the present application, there is also provided an electronic device capable of implementing the above method.
As will be appreciated by one skilled in the art, aspects of the present application may be embodied as a system, method or program product. Accordingly, various aspects of the present application may be embodied in the form of: an entirely hardware embodiment, an entirely software embodiment (including firmware, microcode, etc.) or an embodiment combining hardware and software aspects that may all generally be referred to herein as a "circuit," module "or" system.
Referring to fig. 7, fig. 7 is a block diagram illustrating an example of an electronic device for implementing the data processing method according to an example embodiment of the present application. The electronic device 700 shown in fig. 7 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present application.
As shown in fig. 7, electronic device 700 is embodied in the form of a general purpose computing device. The components of the electronic device 700 may include, but are not limited to: the at least one processing unit 710, the at least one memory unit 720, and a bus 730 that couples various system components including the memory unit 720 and the processing unit 710.
Wherein the storage unit stores program code that is executable by the processing unit 710 to cause the processing unit 710 to perform steps according to various exemplary embodiments of the present application described in the above section "exemplary method" of the present specification. For example, the processing unit 710 may perform steps S210 to S250 as shown in fig. 2.
The storage unit 720 may include readable media in the form of volatile memory units, such as a random access memory unit (RAM)7201 and/or a cache memory unit 7202, and may further include a read only memory unit (ROM) 7203.
The storage unit 720 may also include a program/utility 7204 having a set (at least one) of program modules 7205, such program modules 7205 including, but not limited to: an operating system, one or more application programs, other program modules, and program data, each of which, or some combination thereof, may comprise an implementation of a network environment.
Bus 730 may be any representation of one or more of several types of bus structures, including a memory unit bus or memory unit controller, a peripheral bus, an accelerated graphics port, a processing unit, or a local bus using any of a variety of bus architectures.
The electronic device 700 may also communicate with one or more external devices 900 (e.g., keyboard, pointing device, bluetooth device, etc.), with one or more devices that enable a user to interact with the electronic device 700, and/or with any devices (e.g., router, modem, etc.) that enable the electronic device 700 to communicate with one or more other computing devices. Such communication may occur via an input/output (I/O) interface 750. The electronic device 700 also includes a display unit 740. Also, the electronic device 700 may communicate with one or more networks (e.g., a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network such as the internet) via the network adapter 760. As shown, the network adapter 760 communicates with the other modules of the electronic device 700 via the bus 730. It should be appreciated that although not shown in the figures, other hardware and/or software modules may be used in conjunction with the electronic device 700, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, and data backup storage systems, among others.
Through the above description of the embodiments, those skilled in the art will readily understand that the exemplary embodiments described herein may be implemented by software, or by software in combination with necessary hardware. Therefore, the technical solution according to the embodiments of the present application can be embodied in the form of a software product, which can be stored in a non-volatile storage medium (which can be a CD-ROM, a usb disk, a removable hard disk, etc.) or on a network, and includes several instructions to make a computing device (which can be a personal computer, a server, a terminal device, or a network device, etc.) execute the method according to the embodiments of the present application.
In an exemplary embodiment of the present application, there is also provided a computer-readable storage medium having stored thereon a program product capable of implementing the above-described method of the present specification. In some possible embodiments, various aspects of the present application may also be implemented in the form of a program product comprising program code means for causing an electronic device to carry out the steps according to various exemplary embodiments of the present application described in the above-mentioned "exemplary methods" section of the present description, when said program product is run on a terminal device.
Referring to fig. 8, fig. 8 is a computer-readable storage medium for implementing the above-mentioned data verification method according to an exemplary embodiment of the present application. Fig. 8 depicts a program product 800, which may employ a portable compact disc read only memory (CD-ROM) and include program code, and may be run on an electronic device, such as a personal computer, for implementing the above-described methods according to embodiments of the present application. However, the program product of the present application is not limited thereto, and in this document, a readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
The program product may employ any combination of one or more readable media. The readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
A computer readable signal medium may include a propagated data signal with readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A readable signal medium may also be any readable medium that is not a readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Program code embodied on a readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Program code for carrying out operations of the present application may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server. In the case of a remote computing device, the remote computing device may be connected to the user computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., through the internet using an internet service provider).
Furthermore, the above-described figures are merely schematic illustrations of processes involved in methods according to exemplary embodiments of the present application, and are not intended to be limiting. It will be readily understood that the processes shown in the above figures are not intended to indicate or limit the chronological order of the processes. In addition, it is also readily understood that these processes may be performed synchronously or asynchronously, e.g., in multiple modules.
Other embodiments of the present application will be apparent to those skilled in the art from consideration of the specification and practice of the application disclosed herein. This application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the application and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the application being indicated by the following claims.

Claims (10)

1. A data processing method, comprising:
acquiring two user certificate data subjected to index combination and a certificate type to which the user certificate data belongs, wherein the two user certificate data are from different data systems;
according to the certificate type of the user certificate data, determining each target field needing to be verified in the user certificate data, the weight corresponding to each target field and a first similarity threshold corresponding to the user certificate data of the certificate type;
comparing field values in all target fields contained in the two pieces of user certificate data, and determining the field value matching similarity between the field values in all target fields contained in the two pieces of user certificate data;
determining data matching similarity between two pieces of user certificate data based on field value matching similarity between field values in target fields contained in the two pieces of user certificate data and corresponding weights of the target fields;
and if the data matching similarity between the two user certificate data is higher than a first similarity threshold, carrying out index merging on the two user certificate data.
2. The data processing method according to claim 1, wherein the comparing the field values in the respective target fields included in the two pieces of user certificate data, and the determining the similarity of the field values matching between the field values in the respective target fields included in the two pieces of user certificate data comprises:
respectively carrying out character conversion processing on field values in each target field contained in the two user certificate data to obtain converted field values;
comparing the converted field values corresponding to the target fields contained in the two user certificate data to determine a first similarity;
comparing field values in each target field contained in the two user certificate data to determine a second similarity;
and determining field value matching similarity between field values in all target fields contained in the two pieces of user certificate data based on the first similarity and the second similarity.
3. The data processing method according to claim 2, wherein the determining, based on the first similarity and the second similarity, a field value matching similarity between field values in respective target fields included in the two pieces of user certificate data comprises:
and determining field value matching similarity between field values in various target fields contained in the two pieces of user certificate data based on the weighted sum of the first similarity and the second similarity.
4. The data processing method of claim 1, wherein if the data matching similarity between the two user certificate data is higher than a first similarity threshold, the indexing and merging the two user certificate data comprises:
if the data matching similarity between the two user certificate data is higher than a first similarity threshold, extracting user identification information from any one user certificate data;
generating index information according to the user identification information and the certificate type to which the user certificate data belongs;
and storing the generated index information and the two user certificate data in an associated manner.
5. The data processing method of claim 1, wherein if the data matching similarity between the two user certificate data is higher than a first similarity threshold, the indexing and merging the two user certificate data comprises:
and if the data matching similarity between the two user certificate data is higher than a first similarity threshold value, and the field value matching similarity between the user identification information contained in the two user certificate data is higher than a second similarity threshold value corresponding to a field for identifying the user identification information, extracting the user identification information from any one user certificate data.
6. The data processing method of claim 5, further comprising:
and if the data matching similarity between the two user certificate data is higher than a first similarity threshold value, and the field value matching similarity between the user identification information contained in the two user certificate data is smaller than or equal to a second similarity threshold value corresponding to a field for identifying the user identification information, generating an alarm notice of index merging failure.
7. The data processing method of claim 1, further comprising:
acquiring a query request, wherein the query request carries certificate types used for querying and user identification information used for querying;
if target index information which is consistent with the certificate type for inquiring and the user identification information for inquiring is monitored, acquiring any user certificate data associated with the target index information as an inquiring result;
and generating display data based on the query result.
8. A data processing apparatus, comprising:
the first acquisition unit is used for acquiring two pieces of user certificate data for index combination and certificate types to which the user certificate data belong, wherein the two pieces of user certificate data are from different data systems;
the first execution unit is used for determining each target field needing to be verified in the user certificate data, the weight corresponding to each target field and a first similarity threshold corresponding to the user certificate data of the certificate type according to the certificate type to which the user certificate data belongs;
the comparison unit is used for comparing field values in all target fields contained in the two pieces of user certificate data and determining the field value matching similarity between the field values in all the target fields contained in the two pieces of user certificate data;
the second execution unit is used for determining the data matching similarity between the two user certificate data based on the field value matching similarity between the field values in the target fields contained in the two user certificate data and the corresponding weight of each target field;
and the merging unit is used for performing index merging on the two user certificate data if the data matching similarity between the two user certificate data is higher than a first similarity threshold.
9. An electronic device comprising a memory and a processor, the memory having stored therein computer-readable instructions that, when executed by the processor, cause the processor to perform the data processing method of any of claims 1 to 7.
10. A storage medium storing computer-readable instructions which, when executed by one or more processors, cause the one or more processors to perform a data processing method as claimed in any one of claims 1 to 7.
CN202011506330.XA 2020-12-18 2020-12-18 Data processing method and device, electronic equipment and storage medium Pending CN112559578A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011506330.XA CN112559578A (en) 2020-12-18 2020-12-18 Data processing method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011506330.XA CN112559578A (en) 2020-12-18 2020-12-18 Data processing method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN112559578A true CN112559578A (en) 2021-03-26

Family

ID=75030379

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011506330.XA Pending CN112559578A (en) 2020-12-18 2020-12-18 Data processing method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN112559578A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115497633A (en) * 2022-10-19 2022-12-20 联仁健康医疗大数据科技股份有限公司 Data processing method, device, equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111178069A (en) * 2019-12-25 2020-05-19 平安健康保险股份有限公司 Data processing method and device, computer equipment and storage medium
CN111785341A (en) * 2020-06-30 2020-10-16 平安国际智慧城市科技股份有限公司 Patient main index data merging method and device based on similarity
CN112016488A (en) * 2020-09-01 2020-12-01 中国银行股份有限公司 Service information processing method, related device and computer storage medium

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111178069A (en) * 2019-12-25 2020-05-19 平安健康保险股份有限公司 Data processing method and device, computer equipment and storage medium
CN111785341A (en) * 2020-06-30 2020-10-16 平安国际智慧城市科技股份有限公司 Patient main index data merging method and device based on similarity
CN112016488A (en) * 2020-09-01 2020-12-01 中国银行股份有限公司 Service information processing method, related device and computer storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115497633A (en) * 2022-10-19 2022-12-20 联仁健康医疗大数据科技股份有限公司 Data processing method, device, equipment and storage medium
CN115497633B (en) * 2022-10-19 2024-01-30 联仁健康医疗大数据科技股份有限公司 Data processing method, device, equipment and storage medium

Similar Documents

Publication Publication Date Title
CN109558748B (en) Data processing method and device, electronic equipment and storage medium
CN108776696B (en) Node configuration method and device, storage medium and electronic equipment
CN110705235B (en) Information input method and device for business handling, storage medium and electronic equipment
CN111178687B (en) Financial risk classification method and device and electronic equipment
CN111343162A (en) System secure login method, device, medium and electronic equipment
CN111191677B (en) User characteristic data generation method and device and electronic equipment
CN112130781A (en) Log printing method and device, electronic equipment and storage medium
CN109785867B (en) Double-recording flow configuration method and device, computer equipment and storage medium
CN110704390A (en) Method, device, electronic equipment and medium for obtaining server maintenance script
CN112559578A (en) Data processing method and device, electronic equipment and storage medium
US20210334597A1 (en) Confident peak-aware response time estimation by exploiting telemetry data from different system configurations
CN116720119A (en) Big data identification method and system applied to multi-terminal service interaction
US20220172102A1 (en) Machine learning model trained using features extracted from n-grams of mouse event data
CN110532304B (en) Data processing method and device, computer readable storage medium and electronic device
CN114579580A (en) Data storage method and data query method and device
CN111753548A (en) Information acquisition method and device, computer storage medium and electronic equipment
CN108768742B (en) Network construction method and device, electronic equipment and storage medium
CN115022002B (en) Verification mode determining method and device, storage medium and electronic equipment
CN110781188B (en) Form information processing method and device, electronic equipment and storage medium
CN110647519B (en) Method and device for predicting missing attribute value in test sample
CN116483735B (en) Method, device, storage medium and equipment for analyzing influence of code change
CN113420037B (en) Asset data real-time changing method and device
JP2019020979A (en) Information processing device and credibility calculation method
US20210272402A1 (en) Authentication security
CN111444393A (en) Method and device for acquiring data processing result, electronic equipment and medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination