CN110879799B - Method and device for labeling technical metadata - Google Patents

Method and device for labeling technical metadata Download PDF

Info

Publication number
CN110879799B
CN110879799B CN201911118020.8A CN201911118020A CN110879799B CN 110879799 B CN110879799 B CN 110879799B CN 201911118020 A CN201911118020 A CN 201911118020A CN 110879799 B CN110879799 B CN 110879799B
Authority
CN
China
Prior art keywords
term
information
target
technical metadata
version
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911118020.8A
Other languages
Chinese (zh)
Other versions
CN110879799A (en
Inventor
于阳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Mininglamp Software System Co ltd
Original Assignee
Beijing Mininglamp Software System Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Mininglamp Software System Co ltd filed Critical Beijing Mininglamp Software System Co ltd
Priority to CN201911118020.8A priority Critical patent/CN110879799B/en
Publication of CN110879799A publication Critical patent/CN110879799A/en
Application granted granted Critical
Publication of CN110879799B publication Critical patent/CN110879799B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/14Details of searching files based on file metadata
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/38Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Library & Information Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the application discloses a method and a device for managing technical metadata. The method comprises the following steps: when the marking operation of the service information is carried out on the technical metadata in the service system, the target type of the service system to which the technical metadata belongs is obtained; searching a target term corresponding to the target type of the service system in a pre-stored term base to obtain identification information of the target term; determining version information of the target term required to be used from at least two versions corresponding to the identification information of the target term; searching a label combination corresponding to the target term according to the identification information and the version information of the target term to obtain a target combination; and outputting prompt information for labeling the technical metadata by using the label information in the target combination.

Description

Method and device for labeling technical metadata
Technical Field
The present disclosure relates to the field of information processing, and more particularly, to a method and apparatus for labeling technical metadata.
Background
Metadata (Metadata) is data describing data, and descriptive information about data and information resources is data describing other data (data about other data), or structured data (structured data) for providing information about a resource. Functionally, the metadata may provide user-based information, such as metadata that records business description information for data items that can assist a user in using the data; in addition, management and maintenance of data by the system may also be supported, as metadata about the data item storage method can support the system accessing data in the most efficient manner.
In the field of data warehousing, metadata is divided into technical metadata and business metadata by purpose. The technical metadata is obscure, the naming of the technical metadata is not standard, the formats are not uniform, the searching and the classification are difficult according to the difference of the data quality and the difference of the service range, and the association between the service information and the service is more difficult to find.
In the related art, in order to conveniently analyze the technical metadata and obtain the related information with the service based on the analyzed result, the technical metadata may be labeled by using the label information, and the service information included in the technical metadata may be obtained by using the labeled content. Along with the continuous increase of technical metadata, the quantity of the label information is also continuously increased, so that great difficulty is brought to the selection and the use of the label information in the labeling process, and the processing efficiency of the labeling operation is influenced.
Disclosure of Invention
In order to solve any technical problem, embodiments of the present application provide a method and an apparatus for tagging technical metadata.
To achieve the purpose of the embodiments of the present application, an embodiment of the present application provides a method for labeling technical metadata, including:
when the marking operation of the service information is carried out on the technical metadata in the service system, the target type of the service system to which the technical metadata belongs is obtained;
searching a target term corresponding to the target type of the service system in a pre-stored term base to obtain identification information of the target term;
determining version information of the target term required to be used from at least two versions corresponding to the identification information of the target term;
searching a label combination corresponding to the target term according to the identification information and the version information of the target term to obtain a target combination;
and outputting prompt information for labeling the technical metadata by using the label information in the target combination.
In an exemplary embodiment, before determining the version information of the target term to be used from the at least two versions corresponding to the identification information of the target term, the method further includes:
when detecting that the target terms in the target service system in the term library cannot be matched with the technical metadata, acquiring service information of the service system corresponding to the technical metadata;
creating a new term matched with the technical metadata according to the information carried in the technical metadata and the service information of the service system;
configuring the tag information corresponding to the target term as the tag information corresponding to the new term.
In an exemplary embodiment, after configuring the tag information corresponding to the target term as the tag information corresponding to the new term, the method further includes:
acquiring a management request for the tag information in the new term, wherein the management request comprises newly added tag information and/or deleted existing tag information;
and adjusting the content of the tag information included in the new term according to the management request.
In an exemplary embodiment, after determining version information of a target term to be used from at least two versions corresponding to the identification information of the target term, the method further includes:
recording identification information and version information of a target term used by each business system;
establishing a corresponding relation between each business system and the identification information and the version information of the target term;
and when an operation request for determining the terms of the target version required to be used by the business system is detected, outputting the identification information and the version information of the target terms corresponding to the business system according to the corresponding relationship.
In an exemplary embodiment, after the outputting the prompt information for labeling the technical metadata with the tag information in the target combination, the method further includes:
recording terms of a target version corresponding to a target combination used by the technical metadata of which the labeling operation is finished;
when detecting that the version information of the term of the target version in the term library changes, acquiring the label information with changed content from the label information corresponding to the term of the target version with changed version;
and updating the tag information of the technical metadata which has finished the tag operation by using the tag information after the content is changed.
An apparatus for annotating technical metadata comprising a processor and a memory, the memory storing a computer program, the processor invoking the computer program in the memory to implement operations comprising:
when the marking operation of the service information is carried out on the technical metadata in the service system, the target type of the service system to which the technical metadata belongs is obtained;
searching a target term corresponding to the target type of the service system in a pre-stored term library to obtain identification information of the target term;
determining version information of the target term required to be used from at least two versions corresponding to the identification information of the target term;
searching a label combination corresponding to the target term according to the identification information and the version information of the target term to obtain a target combination;
and outputting prompt information for labeling the technical metadata by using the label information in the target combination.
In an exemplary embodiment, before the operation of determining the version information of the target term to be used from the at least two versions corresponding to the identification information of the target term, the processor calls the computer program in the memory to further implement the following operations, including:
when detecting that the target terms in the target service system in the term library cannot be matched with the technical metadata, acquiring service information of the service system corresponding to the technical metadata;
according to the information carried in the technical metadata and the service information of the service system, creating a new term matched with the technical metadata;
configuring the label information corresponding to the target term as the label information corresponding to the new term.
In an exemplary embodiment, after the processor invokes the computer program in the memory to implement the operation of configuring the tag information corresponding to the target term as the tag information corresponding to the new term, the processor invokes the computer program in the memory to further implement the following operations, including:
acquiring a management request for the tag information in the new term, wherein the management request comprises added tag information and/or deleted existing tag information;
and adjusting the content of the tag information included in the new term according to the management request.
In an exemplary embodiment, after the operation of determining the version information of the target term to be used from the at least two versions corresponding to the identification information of the target term, the processor calls the computer program in the memory to further implement the following operations, including:
recording identification information and version information of a target term used by each business system;
establishing a corresponding relation between each business system and the identification information and the version information of the target term;
and when an operation request for determining the terms of the target version required to be used by the business system is detected, outputting the identification information and the version information of the target terms corresponding to the business system according to the corresponding relationship.
In an exemplary embodiment, after the processor invokes the computer program in the memory to implement the operation of outputting the hint information labeling the technical metadata with the tag information in the target combination, the processor invokes the computer program in the memory to further implement operations comprising:
recording terms of a target version corresponding to a target combination used by the technical metadata of which the labeling operation is finished;
when detecting that the version information of the term of the target version in the term library changes, acquiring the label information with changed content from the label information corresponding to the term of the target version with changed version;
and updating the tag information of the technical metadata which has finished the tag operation by using the tag information after the content is changed.
According to the scheme provided by the embodiment of the application, when service information labeling operation is performed on technical metadata in a service system, the target type of the service system to which the technical metadata belongs is obtained, a target term corresponding to the target type of the service system is searched in a pre-stored term library to obtain identification information of the target term, version information of the target term to be used is determined from at least two versions corresponding to the identification information of the target term, a label combination corresponding to the target term is searched according to the identification information and the version information of the target term to obtain the target combination, prompt information for labeling the technical metadata by using the label information in the target combination is output, the corresponding relation between the technical metadata and the service information is established by using the term used by the service system, and the efficiency of the labeling operation of the technical metadata is improved by using the label combination corresponding to the term; the use of different service systems is completed by means of the versions of the terms, so that the maintenance of the terms and the label information is facilitated, and the maintenance efficiency is improved.
Additional features and advantages of the embodiments of the application will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by the practice of the embodiments of the application. The objectives and other advantages of the embodiments of the application may be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
Drawings
The accompanying drawings are included to provide a further understanding of the embodiments of the present application and are incorporated in and constitute a part of this specification, illustrate embodiments of the present application and together with the examples of the embodiments of the present application do not limit the embodiments of the present application.
FIG. 1 is a flowchart of a method for tagging technical metadata provided by an embodiment of the present application;
FIG. 2 is a diagram illustrating a method for managing metadata based on a graph database according to an embodiment of the present application.
Detailed Description
To make the objects, technical solutions and advantages of the embodiments of the present application more apparent, the embodiments of the present application will be described in detail below with reference to the accompanying drawings. It should be noted that, in the embodiments of the present application, features in the embodiments and the examples may be arbitrarily combined with each other without conflict.
In order to solve the problem of difficult association between technical metadata and business, the inventor finds that business metadata and technical metadata need to be associated; after the association is realized, the business analyst can operate the corresponding technical metadata only by understanding and retrieving the business metadata, so that the data is more efficiently used and integrated to be used as data support for an upper-layer business system.
First, terms related to the embodiments of the present application will be explained:
the technical metadata is real metadata; the metadata information can be collected from various data sources, and the data sources can be data systems such as Mysql, oracle, hive, hbase and the like;
service metadata: a business system constructed by business analysts according to the business system; tags may be generated based on the business metadata to identify the classification of the technical metadata or its content;
the term is a term used to denote a concept in a particular business domain, enriched in its content by tags; for example, the term "bank" contains the labels "finance," "manager," "clerk," and the like;
glossaries are a specific business domain category enriched by terms, e.g., the glossaries "financial domain" contains the terms "bank", "securities", etc.;
a directory is a specific combination of terms; it includes the combination of terms for direct multiplexing when annotating technology metadata.
Fig. 1 is a flowchart of a method for annotating technical metadata according to an embodiment of the present application. The method shown in fig. 1 comprises:
step 101, when performing service information labeling operation on technical metadata in a service system, acquiring a target type of the service system to which the technical metadata belongs;
in one exemplary embodiment, the label information used by the annotation operation is determined based on the label determined by the business attribute; the user can select proper tag information from the stored tag information to complete the labeling operation of the technical metadata according to the understanding of the content of the technical metadata, so that the corresponding relation between the technical metadata and the service metadata is established.
In an exemplary embodiment, the type of the business system is determined according to different business functions in the technical field, for example, in the 5G communication field, the business functions may be divided into a plurality of functional modules, such as a product development function, a product production function, and a product sales function, and the development functions are taken as an example and are subdivided according to a development subject, and may also be a development company, a scientific and technological institution, a research institute, or an individual. The particle size of the business system can be set according to actual needs, and can be selected from the above-mentioned research and development companies, or a specific research and development company, such as the research and development company with company name a.
The same type of service system may include at least two service systems.
Step 102, searching a target term corresponding to the target type of the service system in a prestored term library to obtain identification information of the target term;
in an exemplary embodiment, the terms in the term library are managed according to the service system, and each service system may specifically include a plurality of terms according to the internal configuration of the service system; the target term corresponding to the identification information can be searched in the term library by utilizing the identification information of the service system.
Taking a business system as a research and development company A as an example, the research and development company A can comprise a research and development department, a financial department, a personnel department, a legal department and the like, each department can be used as a business subsystem of the business system, and each business subsystem can have respective terms;
in an exemplary embodiment, before searching a target term corresponding to a target type of the service system in a pre-stored term library to obtain identification information of the target term, the method further includes:
determining a term used by each service subsystem when the service system comprises at least two service subsystems with different service functions;
establishing a directory structure based on the service subsystem for the terms of the service system according to the division strategy of the service subsystem in the service system;
the searching, in a pre-stored term library, for a target term corresponding to a target type of the service system to obtain identification information of the target term, includes:
determining a target service subsystem corresponding to the technical metadata in the service system;
and querying terms used by the target service subsystem in the directory structure from a pre-stored directory structure to obtain target terms.
The division strategy of the service subsystem can be divided according to the internal organization architecture or the service function of the service system; the method can be divided into at least two stages according to actual requirements; that is, after the first division of the service system, the first-level service subsystem may be determined; and selecting one first-stage service subsystem from the first-stage service subsystems for division to determine a second-stage service subsystem.
After the terms of the business system are obtained, the business subsystem to which the terms belong is determined based on the determined business subsystem, for example, the term "finance" may be determined to belong to the business subsystem corresponding to the finance department, and the term "test" may be determined to belong to the business subsystem corresponding to the development department.
After determining the operation on the service subsystem to which the term belongs, a corresponding directory structure can be established based on the determined service subsystem, and the establishment of the corresponding relationship between the service subsystem and the term is completed.
When the required terms are searched, the terms can be inquired according to the directory, and the purpose of quick inquiry is achieved.
In one exemplary embodiment, the identification information of the term may be a number.
103, determining version information of the target term required to be used from at least two versions corresponding to the identification information of the target term;
in an exemplary embodiment, the inventor finds that, in the related art, when the name of a service is changed, the tagging operation has no way to be adaptively changed, only new terms can be created for re-tagging, and the efficiency is low, and a change record of the technical metadata tagged by the same service metadata also does not exist. Based on the technical analysis, the term is composed of a plurality of labels and can be multiplexed, one term can exist in a plurality of business systems, and different business systems can have some fine adjustment on the term, so that the term configuration version information can well accord with the above use scenes, namely, the term configuration is accurate, the multiplexing performance is high, and the labeling efficiency is high.
Taking the term "personnel department" as an example, the description information of the term in a part of business systems is "human resource department", although the characters used for description are different, the described information can be determined to be consistent. Any name may be chosen for use in different companies.
In an exemplary embodiment, before determining the version information of the target term to be used from the at least two versions corresponding to the identification information of the target term, the method further includes:
when detecting that the target terms in the target service system in the term library cannot be matched with the technical metadata, acquiring service information of the service system corresponding to the technical metadata;
creating a new term matched with the technical metadata according to the information carried in the technical metadata and the service information of the service system;
configuring the tag information corresponding to the target term as the tag information corresponding to the new term.
Explaining by taking technical metadata as a piece of new employee enrollment information as an example, terms which can be provided in a term library are personnel department, and terms matched with a business system are determined to be human resource department according to an organization structure of the business system and information carried by the technical metadata; therefore, on the basis that the term is the human resource department, a new version is created, and the corresponding term name of the version is the human resource department.
When recording the term, the identifier of the term and the corresponding version number may be recorded; for example, the term number of the personnel department is 001, the corresponding version is 01, the term number of the human resources department is 001, and the corresponding version is 02.
Each term has a corresponding label combination, and the label combination corresponding to the target term can be used as the label of the new term, for example, the label corresponding to the personnel department is used as the label of the human resource department, so that the maintenance efficiency of the terms and the labels is improved, and the terms and the labels are convenient to use in different business systems.
Step 104, searching a label combination corresponding to the target term according to the identification information and the version information of the target term to obtain a target combination;
in one exemplary embodiment, each term corresponds to tag information; the content of the label can be modified according to the needs of the user; alternatively, the user may add a new tag and set the term to which the new tag belongs.
In an exemplary embodiment, after configuring the tag information corresponding to the target term as the tag information corresponding to the new term, the method further includes:
acquiring a management request for the tag information in the new term, wherein the management request comprises newly added tag information and/or deleted existing tag information;
and adjusting the content of the tag information included in the new term according to the management request.
Because the same term has corresponding description modes in different business systems, labels used in different business systems also have certain differences. When detecting that the term is a newly created term, the tag information corresponding to the term may be adjusted, a required new tag may be added, or a tag that does not match the current business system may be deleted.
105, outputting prompt information for labeling the technical metadata by using the label information in the target combination;
in an exemplary embodiment, the notification is used for labeling by using the label information in the target combination, so that the operation of selecting a proper label from a large number of labels in the labeling operation process is avoided, and the efficiency of the labeling operation is improved by providing the label related to the technical metadata for the labeling operation.
The method provided by the embodiment of the application, when performing service information labeling operation on technical metadata in a service system, obtains a target type of the service system to which the technical metadata belongs, searches a target term corresponding to the target type of the service system in a pre-stored term library to obtain identification information of the target term, determines version information of the target term to be used from at least two versions corresponding to the identification information of the target term, searches a tag combination corresponding to the target term according to the identification information and the version information of the target term to obtain the target combination, outputs prompt information for labeling the technical metadata by using the tag information in the target combination, establishes a corresponding relation between the technical metadata and the service information by using the term used by the service system, and improves efficiency of the labeling operation of the technical metadata by using the tag combination corresponding to the term; the use of different service systems is completed by means of the version of the terms, so that the maintenance of the terms and the label information is facilitated, and the maintenance efficiency is improved.
The method provided by the embodiments of the present application is explained as follows:
FIG. 2 is a diagram illustrating a method for managing metadata based on a graph database according to an embodiment of the present application. As shown in FIG. 2, the graph database includes four top-level models, a label model, a term model, a vocabulary model, and a directory model; and each model is created with corresponding instance nodes, and the relationship of the instance nodes is an instance relationship. Wherein:
the label model comprises a plurality of label instances, and the label instances are in labeling relation with the term instances in the term module, namely, the terms can be labeled by at least two labels to enrich the content of the terms.
The relationship between instances in the vocabulary module and instances in the terminology module is an aggregate relationship, i.e., the vocabulary is composed of several terms.
The relationship between the instances in the directory module and the instances in the term module is a combinatorial relationship, i.e., the directory is made up of terms. Additionally, there may be containment relationships between directory instances that are used to construct subdirectories.
In annotating technical metadata, annotation operations can be performed using instances of terms and tags together.
In addition, corresponding labels and terms can be created according to the requirements of the business system, technical metadata are labeled through fine-grained labels and brief generalized terms, so that a complex business system outline is constructed, a label library and a term library are formed, terms are organized by a catalog, and the terms are arranged and combined. In addition, terms in the constructed term library can be multiplexed to form a new service system, the flow direction and the dependency relationship of data are determined, and the term library of one category forms a vocabulary table, namely, the whole service field is described.
The following describes implementations of the above functions, respectively:
a metadata management method based on graph database is based on a management structure chart formed by terms and used for describing complex service systems, mining the data flow direction, data relationship, data dependence and relationship between the service systems, realizing service multiplexing and getting through the service systems, service metadata and technical metadata.
In an exemplary embodiment, the combination of the included tags in the term is dynamically changeable, and the tags in the tag combination can be added or deleted according to the use frequency of the tag information in the term so as to meet the personalized use of the user; or selecting the label combination with high use frequency to be combined into a new label combination for subsequent use.
In an exemplary embodiment, after the outputting the prompt information for labeling the technical metadata with the tag information in the target combination, the method further includes:
after at least two technical metadata are labeled by using the labeling combination corresponding to the same target term, a labeling result is obtained;
determining common tag information from the tag information used by the labeling results of the at least two technical metadata;
and adjusting the label combination corresponding to the target term by using the determined common label information.
In an exemplary embodiment, the combination of the included tags in the term is dynamically changeable, and the tags in the tag combination can be added or deleted according to the use frequency of the tag information in the term so as to meet the personalized use of the user; or selecting the label combination with high use frequency to be a new label combination for subsequent use.
The inventor finds that the same term is used in a plurality of business systems, for example, the term "IT department" exists in various business systems, such as science and technology companies, banks, government departments, and these terms do not belong to the same business system and can not be described by a simple term, so the embodiment of the present application proposes that a directory can be established, existing terms can be arranged and combined to form a new business system or business meaning, and creation of duplicate terms is avoided.
In an exemplary embodiment, after the querying the terms used by the target service subsystem in the directory structure from the pre-stored directory structure, the method further comprises:
if the term corresponding to the business subsystem is not searched in a prestored term library, whether the target business subsystem is included in other business systems except the target business system or not is inquired;
if at least one service system except the target service system comprises the target subsystem, acquiring terms used by the target subsystem from the at least one service system, and taking the terms as target terms corresponding to the target service subsystem.
Taking a business system as an example of a research and development company A, the research and development company A adds a market research part, because the term corresponding to the research and development company A does not include the term of the market research part, whether the term of the department is included in other business systems is inquired, if the term corresponding to the research and development company B is inquired to include the department, the catalog information of the research and development company B is obtained, the term of the market research part is searched from the catalog information, and the obtained term is used as the target term corresponding to the market research part in the research and development company A.
By the mode, terms and labels can be reused, the operation of repeatedly constructed terms and labels is reduced, and the management efficiency is improved.
In an exemplary embodiment, after determining the term used by each service subsystem when the service system includes service subsystems of at least two different service functions, the method further includes:
acquiring terms used by service subsystems with the same service function to obtain vocabulary information corresponding to the service subsystems with the same service function;
after the outputting of the prompt information for labeling the technical metadata by using the tag information in the target combination, the method further includes:
if the technical metadata cannot be labeled by the label information corresponding to the target term inquired from the business subsystem, searching a new target term from the vocabulary information corresponding to the business subsystem;
determining a label combination corresponding to the new target term;
and outputting prompt information labeled by the label combination corresponding to the new target term.
The label information configured for the term in different service systems can be respectively obtained, and all the label information corresponding to the term is obtained in a summary manner, wherein the label information comprises common label information and non-common label information.
When the tag information of the target combination output by a certain service subsystem can not meet the use requirement of the labeling operation, the tag information can be selected from the non-shared tag information in the summarized tag information to form a new target combination for the labeling operation, so that the sharing of the tag information in different service systems is realized, the occurrence of the operation of randomly increasing the tag information is reduced, and the smooth completion of the labeling operation is ensured on the premise of effectively controlling the number of tags.
In an exemplary embodiment, after determining the version information of the target term to be used from the at least two versions corresponding to the identification information of the target term, the method further includes:
recording identification information and version information of a target term used by each business system;
establishing a corresponding relation between each business system and the identification information and the version information of the target term;
and when an operation request for determining the terms of the target version required to be used by the business system is detected, outputting the identification information and the version information of the target terms corresponding to the business system according to the corresponding relationship.
In an exemplary embodiment, the term used by the recording business system 1 includes the term identified as 001 and the term identified as 01, and the term used by the recording business system 2 includes the term identified as 001 and the term identified as 01, so that the later labeling operation of the technical metadata in the business system is facilitated by recording the term information used by different business systems.
In an exemplary embodiment, after the outputting the prompt information for labeling the technical metadata with the tag information in the target combination, the method further includes:
recording terms of a target version corresponding to a target combination used by the technical metadata of which the labeling operation is finished;
when detecting that the version information of the term of the target version in the term library changes, acquiring the label information with changed content from the label information corresponding to the term of the target version with changed version;
and updating the tag information of the technical metadata which has finished the tag operation by using the tag information after the content is changed.
Based on the mode, when the tag information changes, the technical metadata which is already subjected to the tagging operation does not need to be tagged again, and only the content of the tag information tagged by the technical metadata needs to be updated, so that when the tag information changes, the tagged technical metadata can be ensured to be adaptively changed, and the processing efficiency of the tagging operation is improved.
The application in a graph database is taken as an example for explanation, and the marking behavior is optimized by means of the relation between the nodes of the graph database and the nodes; each node in the graph has a Label value to indicate what type of node it belongs to; one label is a node, the type of the label is a label type, each item of technical metadata is a node, the type of the node is a metadata type, and the node is associated to the technical metadata node through a one-way labeling relation initiated by the label node, namely, one-time labeling is completed.
If the content of one label is changed, a new node is copied for the label node stored in the graph database corresponding to the label, the original attribute is kept, all the attributes of the new node are consistent with the source node, and the node type is changed from the label node type to a waste node type; the attribute content of the source node is updated, and then a change relationship is initiated to associate with a new node, wherein the change time and the change type are stored in the relationship attribute. Thus, even if the label is changed due to business meaning, the label is not required to be recreated and marked again.
In addition, if a business process of a certain technical metadata (i.e. a change process of the business process) is queried, historical versions of all tags corresponding to the business process can be queried. When the query operation is executed, the corresponding label nodes are found in the graph database, then all nodes corresponding to the change relations are traversed, and then all the change versions can be obtained according to the change time sequence.
An embodiment of the present application provides an apparatus for tagging technical metadata, including a processor and a memory, where the memory stores a computer program, and the processor calls the computer program in the memory to implement the following operations, including:
when carrying out service information labeling operation on technical metadata in a service system, acquiring a target type of the service system to which the technical metadata belongs;
searching a target term corresponding to the target type of the service system in a pre-stored term library to obtain identification information of the target term;
determining version information of the target term required to be used from at least two versions corresponding to the identification information of the target term;
searching a label combination corresponding to the target term according to the identification information and the version information of the target term to obtain a target combination;
and outputting prompt information for labeling the technical metadata by using the label information in the target combination.
In an exemplary embodiment, before the operation of determining the version information of the target term to be used from the at least two versions corresponding to the identification information of the target term, the processor calls the computer program in the memory to further implement the following operations, including:
when detecting that the target terms in the target service system in the term library cannot be matched with the technical metadata, acquiring service information of the service system corresponding to the technical metadata;
creating a new term matched with the technical metadata according to the information carried in the technical metadata and the service information of the service system;
configuring the tag information corresponding to the target term as the tag information corresponding to the new term.
In an exemplary embodiment, after the processor invokes the computer program in the memory to implement the operation of configuring the tag information corresponding to the target term as the tag information corresponding to the new term, the processor invokes the computer program in the memory to further implement the following operations, including:
acquiring a management request for the tag information in the new term, wherein the management request comprises added tag information and/or deleted existing tag information;
and adjusting the content of the tag information included in the new term according to the management request.
In an exemplary embodiment, after the operation of determining the version information of the target term to be used from the at least two versions corresponding to the identification information of the target term, the processor calls the computer program in the memory to further implement the following operations, including:
recording identification information and version information of a target term used by each business system;
establishing a corresponding relation between each business system and the identification information and the version information of the target term;
and when an operation request for determining the terms of the target version required to be used by the business system is detected, outputting the identification information and the version information of the target terms corresponding to the business system according to the corresponding relationship.
In one exemplary embodiment, after the processor invokes the computer program in the memory to implement the operation of outputting the hint information for tagging the technical metadata with the tag information in the target combination, the processor invokes the computer program in the memory to further implement operations comprising:
recording terms of a target version corresponding to a target combination used by the technical metadata with the labeling operation completed;
when detecting that the version information of the term of the target version in the term library changes, acquiring the label information with changed content from the label information corresponding to the term of the target version with changed version;
and updating the tag information of the technical metadata which has finished the tag operation by using the tag information after the content is changed.
The device provided by the embodiment of the application, when performing service information labeling operation on technical metadata in a service system, obtains a target type of the service system to which the technical metadata belongs, searches a target term corresponding to the target type of the service system in a pre-stored term library to obtain identification information of the target term, determines version information of the target term to be used from at least two versions corresponding to the identification information of the target term, searches a tag combination corresponding to the target term according to the identification information and the version information of the target term to obtain the target combination, outputs prompt information for labeling the technical metadata by using tag information in the target combination, establishes a corresponding relationship between the technical metadata and the service information by using the term used by the service system, and improves efficiency of the labeling operation of the technical metadata by using the tag combination corresponding to the term; the use of different service systems is completed by means of the version of the terms, so that the maintenance of the terms and the label information is facilitated, and the maintenance efficiency is improved.
It will be understood by those of ordinary skill in the art that all or some of the steps of the methods, systems, functional modules/units in the devices disclosed above may be implemented as software, firmware, hardware, or suitable combinations thereof. In a hardware implementation, the division between functional modules/units mentioned in the above description does not necessarily correspond to the division of physical components; for example, one physical component may have multiple functions, or one function or step may be performed by several physical components in cooperation. Some or all of the components may be implemented as software executed by a processor, such as a digital signal processor or microprocessor, or as hardware, or as an integrated circuit, such as an application specific integrated circuit. Such software may be distributed on computer readable media, which may include computer storage media (or non-transitory media) and communication media (or transitory media). The term computer storage media includes volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data, as is well known to those skilled in the art. Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital Versatile Disks (DVD) or other optical disk storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can accessed by a computer. In addition, communication media typically embodies computer readable instructions, data structures, program modules or other data in a modulated data signal such as a carrier wave or other transport mechanism and includes any information delivery media as is well known to those skilled in the art.

Claims (4)

1. A method of annotating technical metadata, the method comprising:
labeling technical metadata by using a label combination corresponding to a term, wherein the term is a title used for expressing a concept in a specific business field, comprises a plurality of labels and configures version information for the term; the tag is generated based on the business metadata to identify the classification or content of the technical metadata;
when the marking operation of the service information is carried out on the technical metadata in the service system, the target type of the service system to which the technical metadata belongs is obtained;
searching a target term corresponding to the target type of the service system in a pre-stored term library to obtain identification information of the target term;
when detecting that the target terms in the target service system in the term library cannot be matched with the technical metadata, acquiring service information of the service system corresponding to the technical metadata;
creating a new term matched with the technical metadata according to the information carried in the technical metadata and the service information of the service system;
configuring the label information corresponding to the target term as the label information corresponding to the new term;
acquiring a management request for the tag information in the new term, wherein the management request comprises added tag information and/or deleted existing tag information;
according to the management request, adjusting the content of the tag information included by the new term;
determining version information of the target term required to be used from at least two versions corresponding to the identification information of the target term;
establishing a corresponding relation between each business system and the identification information and the version information of the target term;
when an operation request for determining a term of a target version required to be used by the business system is detected, outputting identification information and version information of the target term corresponding to the business system according to the corresponding relation;
searching a label combination corresponding to the target term according to the identification information and the version information of the target term to obtain a target combination;
and outputting prompt information for labeling the technical metadata by using the label information in the target combination.
2. The method of claim 1, wherein after outputting hint information for tagging the technical metadata with tag information in the target combination, the method further comprises:
recording terms of a target version corresponding to a target combination used by the technical metadata of which the labeling operation is finished;
when detecting that the version information of the term of the target version in the term library changes, acquiring the label information with changed content from the label information corresponding to the term of the target version with changed version;
and updating the tag information of the technical metadata which has finished the tag operation by using the tag information after the content is changed.
3. An apparatus for annotating technical metadata, comprising a processor and a memory, the memory storing a computer program, the processor invoking the computer program in the memory to perform operations comprising:
labeling technical metadata by using a label combination corresponding to a term, wherein the term is a title used for expressing a concept in a specific business field, comprises a plurality of labels and configures version information for the term; the tag is generated based on the business metadata to identify the classification or content of the technical metadata;
when the marking operation of the service information is carried out on the technical metadata in the service system, the target type of the service system to which the technical metadata belongs is obtained;
searching a target term corresponding to the target type of the service system in a pre-stored term base to obtain identification information of the target term;
when detecting that the target terms in the target service system in the term library cannot be matched with the technical metadata, acquiring service information of the service system corresponding to the technical metadata;
according to the information carried in the technical metadata and the service information of the service system, creating a new term matched with the technical metadata;
configuring the label information corresponding to the target term as the label information corresponding to the new term;
acquiring a management request for the tag information in the new term, wherein the management request comprises newly added tag information and/or deleted existing tag information;
according to the management request, adjusting the content of the tag information included by the new term;
determining version information of the target term required to be used from at least two versions corresponding to the identification information of the target term;
recording identification information and version information of a target term used by each business system;
establishing a corresponding relation between each business system and the identification information and the version information of the target term;
when an operation request for determining a term of a target version required to be used by the business system is detected, outputting identification information and version information of the target term corresponding to the business system according to the corresponding relation;
searching a label combination corresponding to the target term according to the identification information and the version information of the target term to obtain a target combination;
and outputting prompt information for labeling the technical metadata by using the label information in the target combination.
4. The apparatus of claim 3, wherein after the processor invokes the computer program in the memory to implement the operation of outputting the hint information for tagging the technical metadata with tag information in the target combination, the processor invokes the computer program in the memory to further implement operations comprising:
recording terms of a target version corresponding to a target combination used by the technical metadata with the labeling operation completed;
when detecting that the version information of the term of the target version in the term library changes, acquiring the label information with changed content from the label information corresponding to the term of the target version with changed version;
and updating the tag information of the technical metadata which has finished the tag operation by using the tag information after the content is changed.
CN201911118020.8A 2019-11-15 2019-11-15 Method and device for labeling technical metadata Active CN110879799B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911118020.8A CN110879799B (en) 2019-11-15 2019-11-15 Method and device for labeling technical metadata

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911118020.8A CN110879799B (en) 2019-11-15 2019-11-15 Method and device for labeling technical metadata

Publications (2)

Publication Number Publication Date
CN110879799A CN110879799A (en) 2020-03-13
CN110879799B true CN110879799B (en) 2023-04-07

Family

ID=69729017

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911118020.8A Active CN110879799B (en) 2019-11-15 2019-11-15 Method and device for labeling technical metadata

Country Status (1)

Country Link
CN (1) CN110879799B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111752920A (en) * 2020-06-22 2020-10-09 杭州数澜科技有限公司 Method, system, and storage medium for managing metadata
CN114547018B (en) * 2022-04-24 2022-08-16 西安热工研究院有限公司 Method and system for automatically cleaning waste points of SIS real-time database

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109241358A (en) * 2018-08-14 2019-01-18 中国平安财产保险股份有限公司 Metadata management method, device, computer equipment and storage medium
CN110245149A (en) * 2019-06-25 2019-09-17 北京明略软件系统有限公司 The method for edition management and device of metadata

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7685084B2 (en) * 2007-02-09 2010-03-23 Yahoo! Inc. Term expansion using associative matching of labeled term pairs

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109241358A (en) * 2018-08-14 2019-01-18 中国平安财产保险股份有限公司 Metadata management method, device, computer equipment and storage medium
CN110245149A (en) * 2019-06-25 2019-09-17 北京明略软件系统有限公司 The method for edition management and device of metadata

Also Published As

Publication number Publication date
CN110879799A (en) 2020-03-13

Similar Documents

Publication Publication Date Title
US7243110B2 (en) Searchable archive
CN110929120B (en) Method and apparatus for managing technical metadata
US20140351241A1 (en) Identifying and invoking applications based on data in a knowledge graph
US20120246154A1 (en) Aggregating search results based on associating data instances with knowledge base entities
US20150081732A1 (en) Subscription for integrating external data from external system
US20130013590A1 (en) Searching and Displaying Data Objects Residing in Data Management Systems
CN102893281A (en) Information retrieval device, information retrieval method, computer program, and data structure
JP2010520549A (en) Data storage and management methods
US20130198117A1 (en) Systems and methods for semantic data integration
US9158599B2 (en) Programming framework for applications
CN110879799B (en) Method and device for labeling technical metadata
JP2004030221A (en) Method for automatically detecting table to be modified
CN111090656A (en) Method and system for dynamically constructing object portrait
CN110851663A (en) Method and apparatus for managing metadata
US20050080820A1 (en) Method and system for generating, associating and employing user-defined fields in a relational database within an information technology system
KR102153259B1 (en) Data domain recommendation method and method for constructing integrated data repository management system using recommended domain
Malinova et al. Automatic extraction of process categories from process model collections
CN116561181A (en) Data query method, device, computer equipment and computer readable storage medium
WO2020041827A1 (en) Data deduplication and data merging
CN110928979B (en) Method and apparatus for managing technical metadata
US20170323015A1 (en) Automated metadata cleanup and distribution platform
CN114490644A (en) Data storage method, device and storage medium
JP2004192657A (en) Information retrieval system, and recording medium recording information retrieval method and program for information retrieval
CN112925817A (en) Library book retrieval method and system
CN110609926A (en) Data tag storage management method and device

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant