Disclosure of Invention
To solve the technical problem described above, the present application provides a data dictionary construction method and apparatus, so that a system developer who needs to use a data field can query the data dictionary and avoid using the wrong data.
In a first aspect, an embodiment of the present application provides a data dictionary construction method, where the method includes:
acquiring data fields in all databases in a bank system;
analyzing the data fields, and determining whether any two data fields have an association relationship;
if so, querying the sources of all data fields that have an association relationship, and determining the data flow among the data fields according to the sources;
and constructing a data dictionary according to the data flow, wherein the data dictionary reflects system information and evolution information of the data fields.
Optionally, the method further includes:
determining a corresponding target data dictionary according to a field to be queried.
Optionally, the method further includes:
if the field to be queried is not found in the data dictionary, adding the field to be queried to the data dictionary.
Optionally, analyzing the data fields and determining whether any two data fields have an association relationship includes:
analyzing the data fields, and determining the similarity between any two data fields;
and determining whether the association relationship exists between any two data fields according to whether the similarity reaches a preset threshold value.
Optionally, constructing a data dictionary according to the data flow includes:
presenting the data flow;
and constructing the data dictionary according to feedback information on the data flow.
Optionally, the feedback information is confirmation information or data flow supplementary information.
In a second aspect, an embodiment of the present application provides a data dictionary construction apparatus, where the apparatus includes:
an acquisition unit, configured to acquire data fields in all databases in a bank system;
an analysis unit, configured to analyze the data fields and determine whether any two data fields have an association relationship;
a determining unit, configured to, if so, query the sources of all data fields that have an association relationship, and determine the data flow among the data fields according to the sources;
and a construction unit, configured to construct a data dictionary according to the data flow, where the data dictionary reflects system information and evolution information of the data fields.
Optionally, the determining unit is further configured to:
determine a corresponding target data dictionary according to a field to be queried.
Optionally, the apparatus further includes:
a supplementing unit, configured to add the field to be queried to the data dictionary if the field to be queried is not found in the data dictionary.
Optionally, the analysis unit is configured to:
analyze the data fields, and determine the similarity between any two data fields;
and determine whether the association relationship exists between any two data fields according to whether the similarity reaches a preset threshold.
Optionally, the construction unit is configured to:
present the data flow;
and construct the data dictionary according to feedback information on the data flow.
Optionally, the feedback information is confirmation information or data flow supplementary information.
As can be seen from the above technical solution, the data dictionary construction method includes: acquiring data fields in all databases in a bank system; analyzing the data fields, and determining whether any two data fields have an association relationship; if so, querying the sources of all data fields that have an association relationship, and determining the data flow among the data fields according to the sources; and constructing a data dictionary according to the data flow, where the data dictionary reflects system information and evolution information of the data fields. The method provided by the embodiments of the present application analyzes the data fields in all systems of the bank and builds an intelligent data dictionary for the bank that reflects both the evolution information (that is, the association relationships among different data fields) and the system information (that is, the sources of the data fields). When a system developer needs to use a data field, querying the data dictionary prevents the wrong data from being used.
Detailed Description
To help those skilled in the art better understand the technical solutions of the present application, the technical solutions in the embodiments of the present application are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are only some, rather than all, of the embodiments of the present application. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments given herein without creative effort shall fall within the protection scope of the present application.
A bank, especially a large bank, contains many systems, and each system holds a large amount of data with similar names. The names may be identical, or only partially similar. For example, data A and data B have very similar names but come from different systems and, after different processing, have different contents. A developer who intends to use data A may use data B instead because of the similar names, even though the content of data B is different. Such data misuse occurs while staff are developing systems, especially for systems that have been in service for a long time, and easily leads to production problems that cause unnecessary losses to the bank.
To solve the above technical problem, the present application provides a data dictionary construction method, including: acquiring data fields in all databases in a bank system; analyzing the data fields, and determining whether any two data fields have an association relationship; if so, querying the sources of all data fields that have an association relationship, and determining the data flow among the data fields according to the sources; and constructing a data dictionary according to the data flow, where the data dictionary reflects system information and evolution information of the data fields. The method analyzes the data fields in all systems of the bank and builds an intelligent data dictionary for the bank that reflects both the evolution information (that is, the association relationships among different data fields) and the system information (that is, the sources of the data fields). When a system developer needs to use a data field, querying the data dictionary prevents the wrong data from being used.
The method provided by the embodiments of the present application may be applied to a terminal device, such as a computer, a personal digital assistant (PDA), or a tablet computer.
The method may also be applied to a server, in which case the server executes the method.
Next, the data dictionary construction method provided by the present application is described with reference to the accompanying drawings, mainly taking a terminal device as the executing entity. Referring to FIG. 1, the method includes:
S101, acquiring data fields in all databases in the bank system.
A bank may include multiple systems, and the names of data in these systems may be identical, or partially similar but different. When data needs to be used, data retrieved from a system by name may not be the data that is actually needed, which causes data misuse and unnecessary losses to the bank. To avoid this, this embodiment builds a data dictionary, so that the correct data can be looked up in the data dictionary and then processed.
At this stage, the data dictionary is built for the existing systems, so all data fields need to be acquired from all databases of the bank system; for example, the data fields may be sampled and extracted at time intervals.
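As a purely illustrative sketch of step S101, and not the implementation of the present application, the following Python code enumerates the data fields of several databases. It assumes that each bank system is reachable as an SQLite connection and that reading column metadata is sufficient; the time-interval sampling mentioned above is not shown. The names `FieldRecord`, `collect_fields`, `demo_systems`, and the table and system names are hypothetical.

```python
import sqlite3
from dataclasses import dataclass


@dataclass(frozen=True)
class FieldRecord:
    system: str  # hypothetical: name of the bank system that owns the database
    table: str   # table in which the data field is defined
    name: str    # name of the data field (column)
    dtype: str   # declared column type


def collect_fields(systems):
    """Return one FieldRecord per column of every table in every system."""
    records = []
    for system, conn in systems.items():
        tables = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
        for (table,) in tables:
            # PRAGMA table_info yields (cid, name, type, notnull, dflt_value, pk).
            rows = conn.execute(f'PRAGMA table_info("{table}")')
            for _cid, name, dtype, *_rest in rows:
                records.append(FieldRecord(system, table, name, dtype))
    return records


def demo_systems():
    """Build two in-memory databases standing in for two bank systems."""
    core = sqlite3.connect(":memory:")
    core.execute("CREATE TABLE account (cust_id TEXT, balance REAL)")
    loan = sqlite3.connect(":memory:")
    loan.execute("CREATE TABLE loan (customer_id TEXT, amount REAL)")
    return {"core_banking": core, "loan_system": loan}


fields = collect_fields(demo_systems())
```

Each record keeps the owning system next to the field name, since the later steps rely on knowing which system a data field comes from.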
S102, analyzing the data fields, and determining whether any two data fields have an association relationship.
The terminal device may analyze all the data fields to determine whether any two data fields have an association relationship. An association relationship may mean, for example, that one data field is obtained by performing business processing on another data field, or that one data field is obtained by modifying another data field.
In a possible implementation, the terminal device analyzes the data fields to determine the similarity between any two data fields, and then determines whether the two data fields have an association relationship according to whether the similarity reaches a preset threshold.
If the similarity exceeds the preset threshold but is below one hundred percent, the two data fields are considered to have an association relationship, and the data fields having association relationships are extracted for use in determining the data flow. If the similarity reaches one hundred percent, the two data fields are considered to be the same. If the similarity does not exceed the preset threshold, the two data fields are not considered to have an association relationship.
The similarity may be determined from information related to the data fields, for example important information such as the system information of a data field (for instance, the name of the system the data field comes from). By analyzing the acquired data fields, the similarity between them can be determined, and from it whether two data fields have an association relationship. The preset threshold may be set based on practical experience, for example to forty percent.
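The threshold test of step S102 can be sketched as follows. This is a minimal illustration under stated assumptions, not the similarity measure of the present application: similarity is computed from field names only, using `difflib.SequenceMatcher` from the Python standard library, and a similarity equal to the threshold is treated as reaching it. The function names and sample field names are hypothetical; the value 0.4 is the forty-percent example given above.

```python
from difflib import SequenceMatcher
from itertools import combinations

THRESHOLD = 0.4  # the forty-percent example value from the description


def similarity(a, b):
    """Name-based similarity in [0, 1]; an assumed stand-in measure."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def classify(a, b, threshold=THRESHOLD):
    """Classify a pair of data fields as same, associated, or unrelated."""
    score = similarity(a, b)
    if score >= 1.0:
        return "same"        # one hundred percent: treated as the same field
    if score >= threshold:
        return "associated"  # reaches the threshold but is below 100%
    return "unrelated"


def associated_pairs(names, threshold=THRESHOLD):
    """Return every pair of field names that has an association relationship."""
    return [
        (a, b)
        for a, b in combinations(names, 2)
        if classify(a, b, threshold) == "associated"
    ]


pairs = associated_pairs(["cust_id", "customer_id", "balance"])
```

A production measure would presumably also weigh the related information mentioned above, such as the system name, rather than the field name alone.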
S103, if so, querying the sources of all data fields that have an association relationship, and determining the data flow among the data fields according to the sources.
If two data fields have an association relationship, all data fields having association relationships are extracted, the sources of these data fields are queried from the corresponding systems by means of an artificial intelligence model, and the data flow among the data fields is determined. The source of a data field may be the system the data field comes from; for example, for a branch such as the ### branch under the head office, it can be determined which system the data field comes from. Alternatively, the source may refer to the data fields from which the data field evolved, and so on.
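To make step S103 concrete, the sketch below derives a data flow from source information. It assumes the source of each associated field has already been obtained and is available as a simple lookup table; the artificial intelligence model that performs the source query is not modelled here. `Source`, `build_data_flow`, `lineage`, and all field and system names are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Source:
    system: str                         # system the data field comes from
    derived_from: Optional[str] = None  # field it evolved from, if any


def build_data_flow(sources):
    """Return directed edges (upstream, downstream) between known fields."""
    edges = []
    for field_name, src in sources.items():
        if src.derived_from is not None and src.derived_from in sources:
            edges.append((src.derived_from, field_name))
    return sorted(edges)


def lineage(field_name, sources):
    """Trace a field back to its origin, listed from origin to the field."""
    chain = [field_name]
    seen = {field_name}
    while True:
        parent = sources[chain[-1]].derived_from
        if parent is None or parent not in sources or parent in seen:
            break  # reached an origin, an unknown field, or a cycle
        chain.append(parent)
        seen.add(parent)
    return list(reversed(chain))


sources = {
    "core.cust_id": Source("core_banking"),
    "crm.customer_id": Source("crm", derived_from="core.cust_id"),
    "report.customer_id": Source("reporting", derived_from="crm.customer_id"),
}
flow = build_data_flow(sources)
origin_chain = lineage("report.customer_id", sources)
```

Representing the data flow as directed edges keeps its direction explicit, which matters when the direction is later reviewed.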
S104, constructing a data dictionary according to the data flow, where the data dictionary reflects system information and evolution information of the data fields.
The terminal device constructs the data dictionary from the determined data flow by automated machine analysis.
In some possible embodiments, the terminal device may present the data flow and construct the data dictionary according to feedback information on the data flow. The feedback information may be confirmation information or data flow supplementary information.
For example, when the data flow is presented, a staff member may judge whether the direction of the data flow is correct and whether any association relationship needs to be supplemented. If the direction is correct and nothing needs to be supplemented, the staff member triggers confirmation information on the terminal device, and the terminal device constructs the data dictionary according to this confirmation information (the feedback information). If the direction is incorrect or an association relationship needs to be supplemented, the staff member triggers supplementary information on the terminal device. The supplementary information is used to supplement the association relationship or correct the direction, and the terminal device constructs the data dictionary according to this supplementary information (the feedback information).
The supplementary information may be obtained by staff who, relying on their experience, judge the data fields that could not be analyzed automatically and verify the data.
The back-end system may then query the relevant data according to the association relationships in the data dictionary to judge whether the data dictionary is correct. After this final verification, a valid data dictionary for the bank system is obtained.
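The feedback-driven construction in step S104 might be sketched as below. The sketch assumes the data flow is a list of directed (upstream, downstream) edges and that supplementary information takes the form of edges to add and edges to remove; correcting a direction is then expressed as removing one edge and adding its reverse. `Feedback`, `apply_feedback`, `build_dictionary`, and the entry layout are hypothetical, and the final back-end verification is not shown.

```python
from dataclasses import dataclass, field


@dataclass
class Feedback:
    confirmed: bool = True  # True means confirmation information
    add_edges: list = field(default_factory=list)     # supplementary edges
    remove_edges: list = field(default_factory=list)  # edges judged incorrect


def apply_feedback(edges, feedback):
    """Return the data flow after confirmation or supplementation."""
    if feedback.confirmed:
        return list(edges)
    kept = [edge for edge in edges if edge not in feedback.remove_edges]
    for edge in feedback.add_edges:
        if edge not in kept:
            kept.append(edge)
    return kept


def build_dictionary(systems, edges, feedback):
    """Build entries holding system information and evolution information."""
    dictionary = {
        name: {"system": system, "evolved_from": [], "evolves_into": []}
        for name, system in systems.items()
    }

    def entry(name):
        # Fields named only in the feedback get an entry with no known system.
        return dictionary.setdefault(
            name, {"system": None, "evolved_from": [], "evolves_into": []}
        )

    for upstream, downstream in apply_feedback(edges, feedback):
        entry(downstream)["evolved_from"].append(upstream)
        entry(upstream)["evolves_into"].append(downstream)
    return dictionary


systems = {"core.cust_id": "core_banking", "crm.customer_id": "crm"}
proposed = [("crm.customer_id", "core.cust_id")]  # presented with wrong direction
correction = Feedback(
    confirmed=False,
    add_edges=[("core.cust_id", "crm.customer_id")],
    remove_edges=[("crm.customer_id", "core.cust_id")],
)
corrected = build_dictionary(systems, proposed, correction)
```

Passing `Feedback()` instead of `correction` corresponds to the confirmation case, in which the presented data flow is used unchanged.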
When a staff member needs to develop a new system, he or she only needs to query the required field, that is, the field to be queried. The terminal device determines the corresponding target data dictionary according to the field to be queried. Based on the system information and evolution information of each data field in the target data dictionary, the staff member can determine which system is most convenient to call for the field to be queried and can process the obtained field accordingly, which improves efficiency and accuracy.
If the field to be queried is not found in the data dictionary, no target data dictionary can be determined for it, and the field to be queried is added to the data dictionary. That is, if the query result shows that the searched field is not in the data dictionary, the staff member can supply the information of the new field so that it is added to the data dictionary. This makes later queries and further supplementation easier and helps form a unified standard for the bank system.
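The query and supplementation behaviour can be sketched as follows. This sketch interprets the target data dictionary as the set of entries reachable from the queried field through evolution links, which is an assumption rather than something the description states. The entry layout, the functions `target_dictionary` and `supplement`, and all field and system names are hypothetical.

```python
def target_dictionary(dictionary, name):
    """Return the entries linked to the queried field, or {} if it is absent."""
    if name not in dictionary:
        return {}
    pending, seen = [name], set()
    while pending:
        current = pending.pop()
        if current in seen or current not in dictionary:
            continue
        seen.add(current)
        entry = dictionary[current]
        pending.extend(entry["evolved_from"] + entry["evolves_into"])
    return {key: dictionary[key] for key in sorted(seen)}


def supplement(dictionary, name, system, evolved_from=()):
    """Add a missing field to the dictionary and link it to its upstream fields."""
    dictionary[name] = {
        "system": system,
        "evolved_from": list(evolved_from),
        "evolves_into": [],
    }
    for parent in evolved_from:
        if parent in dictionary:
            dictionary[parent]["evolves_into"].append(name)


demo = {
    "core.cust_id": {"system": "core_banking", "evolved_from": [],
                     "evolves_into": ["crm.customer_id"]},
    "crm.customer_id": {"system": "crm", "evolved_from": ["core.cust_id"],
                        "evolves_into": []},
    "core.balance": {"system": "core_banking", "evolved_from": [],
                     "evolves_into": []},
}
hit = target_dictionary(demo, "crm.customer_id")
miss = target_dictionary(demo, "risk.cust_no")
if not miss:
    # The queried field is not found, so the staff member supplies its details.
    supplement(demo, "risk.cust_no", "risk", evolved_from=["core.cust_id"])
after = target_dictionary(demo, "risk.cust_no")
```

After the supplement, a later query for the same field returns its entry together with the fields it is linked to.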
Based on the data dictionary construction method provided by the foregoing embodiment, an embodiment of the present application further provides a data dictionary construction apparatus. Referring to FIG. 2, the apparatus includes:
an acquisition unit 201, configured to acquire data fields in all databases in a bank system;
an analysis unit 202, configured to analyze the data fields and determine whether any two data fields have an association relationship;
a determining unit 203, configured to, if so, query the sources of all data fields that have an association relationship, and determine the data flow among the data fields according to the sources;
and a construction unit 204, configured to construct a data dictionary according to the data flow, where the data dictionary reflects system information and evolution information of the data fields.
Optionally, the determining unit is further configured to:
determine a corresponding target data dictionary according to a field to be queried.
Optionally, the apparatus further includes:
a supplementing unit, configured to add the field to be queried to the data dictionary if the field to be queried is not found in the data dictionary.
Optionally, the analysis unit is configured to:
analyze the data fields, and determine the similarity between any two data fields;
and determine whether the association relationship exists between any two data fields according to whether the similarity reaches a preset threshold.
Optionally, the construction unit is configured to:
present the data flow;
and construct the data dictionary according to feedback information on the data flow.
Optionally, the feedback information is confirmation information or data flow supplementary information.
Those of ordinary skill in the art will understand that all or part of the steps of the above method embodiments may be implemented by program instructions controlling the relevant hardware. The program may be stored in a computer-readable storage medium and, when executed, performs the steps of the above method embodiments. The storage medium may be at least one of the following media capable of storing program code: a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk.
It should be noted that the embodiments in this specification are described in a progressive manner. For identical or similar parts, the embodiments may be referred to one another, and each embodiment focuses on its differences from the other embodiments. In particular, because the apparatus and system embodiments are substantially similar to the method embodiments, they are described relatively briefly, and the relevant parts of the method embodiments may be referred to. The apparatus and system embodiments described above are merely illustrative. The units described as separate parts may or may not be physically separate, and the parts displayed as units may or may not be physical units; they may be located in one place or distributed over a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment. A person of ordinary skill in the art can understand and implement the embodiments without inventive effort.
The above description is only one specific embodiment of the present application, but the scope of the present application is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present application should be covered by the scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.