WO2023124209A1 - 数据目录生成方法和装置、数据检索方法和装置、电子设备 - Google Patents

数据目录生成方法和装置、数据检索方法和装置、电子设备 Download PDF

Info

Publication number
WO2023124209A1
WO2023124209A1 PCT/CN2022/117489 CN2022117489W WO2023124209A1 WO 2023124209 A1 WO2023124209 A1 WO 2023124209A1 CN 2022117489 W CN2022117489 W CN 2022117489W WO 2023124209 A1 WO2023124209 A1 WO 2023124209A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
industry
directory
handle
retrieval
Prior art date
Application number
PCT/CN2022/117489
Other languages
English (en)
French (fr)
Inventor
张海生
崔晓峰
王波
周文明
Original Assignee
杭州数梦工场科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 杭州数梦工场科技有限公司 filed Critical 杭州数梦工场科技有限公司
Publication of WO2023124209A1 publication Critical patent/WO2023124209A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/901Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/907Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

Definitions

  • the present application relates to the field of retrieval, and in particular to a method and device for generating a data directory, a method and device for data retrieval, electronic equipment and a storage medium.
  • each industry generally forms a data directory that conforms to the industry standards to facilitate data retrieval within the industry.
  • the embodiments of this specification provide a method, device, and equipment for generating a data directory.
  • the technical scheme is as follows.
  • a method for generating a data directory which is applied to any local handle service in a cross-industry data system;
  • the cross-industry data system includes different local handle services corresponding to users in different industries; the industry users corresponding to the local handle service are preset
  • There is an original data directory ;
  • the original data directory contains a plurality of digital objects, and different digital objects correspond to different original data identifiers;
  • the method includes: obtaining the handle prefix corresponding to the local handle service, and the cross-industry data system Different local handle services correspond to different handle prefixes; for the original data directory preset by the corresponding industry users, the original data identifier corresponding to each digital object is used as the handle suffix, and the obtained corresponding handle prefix is spliced to obtain the Describe the new data identifier corresponding to each digital object, and generate a new data directory according to the obtained new data identifier.
  • the method further includes: sending the obtained new data identifiers to a global handle registration, and the global handle registration is used to summarize the new data identifiers obtained by various local handle services in the cross-industry data system.
  • the obtaining the handle prefix corresponding to the local handle service includes: obtaining the handle prefix applied by the industry user corresponding to the local handle service as the corresponding handle prefix, and the applied handle prefix uniquely corresponds to the Industry users corresponding to the local handle service.
  • the method further includes: updating the catalog metadata information of the raw data catalog according to the information of the corresponding industry user according to the preset raw data catalog of the corresponding industry user, so that the updated catalog metadata information can represent Find out the actual business meaning of the field value of the corresponding digital object, wherein the directory metadata information refers to the field information of one or more fields included in the digital object; the generation of a new data directory according to the obtained new data identification includes: A new data directory is generated according to the updated directory metadata information and the obtained new data identifier; the directory metadata information of the generated new data directory is the updated directory metadata information, and the directory number is the obtained new data identifier.
  • the format definition of the new data directory is defined by using the Type mechanism to define the directory metadata information of the new data directory, so as to generate the updated directory metadata information.
  • a data retrieval method for a data directory the data directory is generated by the above-mentioned data directory generation method, and the data retrieval method is applied to a management device; the management device is a newly added device for the cross-industry data system;
  • the data retrieval method includes: receiving a retrieval request sent by a target user, the retrieval request at least including a retrieval condition; the retrieval request is used to request to obtain a digital object that meets the contained retrieval condition; searching for a target that can respond to the retrieval request Local handle service: retrieve digital objects that meet the search criteria for the new data directory of the found target local handle service, and return the search result to the target user.
  • the management device stores a first correspondence set between the identifier of the local handle service in the cross-industry data system and the corresponding industry user information; the search for the target local handle service capable of responding to the retrieval request , comprising: in the case of determining that the retrieval request contains industry user information, searching the stored first correspondence set for the identifier of the local handle service corresponding to the industry user information contained therein, and using the found local handle service The local handle service corresponding to the identifier of is determined to be the target local handle service.
  • the industry user information includes at least one of an industry identifier, an industry user identifier, and address information of an industry user.
  • the management device stores a second set of correspondences between local handle service identifiers and corresponding handle prefixes in the cross-industry data system; the search for a target local handle service capable of responding to the retrieval request, Including: when it is determined that the retrieval request includes the target data identifier, in the stored second correspondence set, search for the identifier of the local handle service corresponding to the handle prefix in the target data identifier, and convert the found local The local handle service corresponding to the identifier of the handle service is determined to be the target local handle service.
  • the global handle registration stores a second correspondence set between the identifier of the local handle service in the cross-industry data system and the corresponding handle prefix;
  • the searching for the target local handle service that can respond to the retrieval request includes :
  • the management device searches for the local handle service corresponding to the handle prefix in the target data identifier from the second correspondence set registered by the global handle ID, and determine the local handle service corresponding to the found local handle service ID as the target local handle service.
  • the global handle registration stores a third correspondence set between the identifier of the local handle service in the cross-industry data system and the obtained new data identifier; the search for the target that can respond to the retrieval request
  • the local handle service includes: when it is determined that the retrieval request includes the target data identifier, the management device searches for the local handle corresponding to the target data identifier from the third correspondence set registered by the global handle The identifier of the service, determining the local handler service corresponding to the found identifier of the local handler service as the target local handler service.
  • the method further includes: in the case that no target local handle service capable of responding to the retrieval request can be found, for a new data catalog of each local handle service in the cross-industry data system, search for The digital object of the search condition, and return the search result to the target user.
  • the management device stores the directory metadata information of the new data directory generated by the local handle service in the cross-industry data system; the return of the search result to the target user includes: the search result includes a number In the case of an object, determine the directory metadata information of the new data directory generated by the target local handle service; display the digital object contained in the search result to the target user in combination with the determined directory metadata information.
  • the catalog metadata information of the new data catalog includes field information
  • the sending the digital object included in the retrieval result to the target user in combination with the determined catalog metadata information includes: Combining the field value of the digital object with the corresponding field information and sending it to the target user.
  • the management device stores a fourth correspondence set between the identification of the local handle service in the cross-industry data system and the directory metadata information of the new data directory correspondingly generated by the local handle service; wherein, the Returning the retrieval result to the target user includes: determining the directory metadata information of the corresponding new data directory according to the searched target local handle service identifier; combining the retrieved digital objects that meet the retrieval conditions with the determined Directory metadata information is sent to the target user.
  • the management device is configured to receive a new data directory generated by a local handle service in the cross-industry data system, and merge the received new data directories of different local handle services into a cross-industry data directory.
  • a data directory generating device which is applied to any local handle service in a cross-industry data system;
  • the cross-industry data system includes different local handle services corresponding to users in different industries; the industry users corresponding to the local handle service are preset
  • the original data directory includes a plurality of digital objects, and different digital objects correspond to different original data identifiers;
  • the device includes: an acquisition unit, configured to acquire the handle prefix corresponding to the local handle service, the Different local handle services in the above-mentioned cross-industry data system correspond to different handle prefixes;
  • the splicing unit is used to use the original data identifier corresponding to each digital object as the handle suffix for the original data directory preset by the corresponding industry user.
  • the acquired corresponding handle prefixes are concatenated to obtain a new data identifier corresponding to each digital object, and a new data directory is generated according to the obtained new data identifier.
  • a data retrieval device for a data directory the data directory is generated by the above-mentioned data directory generation device, and the data retrieval device is applied to a management device;
  • the management device is a newly added device for the cross-industry data system;
  • the data retrieval device includes: a receiving unit, configured to receive a retrieval request sent by a target user, and the retrieval request includes at least a retrieval condition; the retrieval request is used to request acquisition of a digital object that meets the contained retrieval condition; a search unit, configured to To find the target local handle service that can respond to the retrieval request; the result unit is used to retrieve the digital object that meets the retrieval condition for the new data directory of the found target local handle service, and return it to the target user Search Results.
  • An electronic device including a processor and a memory; the memory stores a program that can be called by the processor; wherein, when the processor executes the program, the method for generating a data directory or data retrieval as described above is implemented method.
  • a machine-readable storage medium on which a program is stored, and when the program is executed by a processor, the method for generating a data directory or the method for retrieving data as described above can be realized.
  • the above technical solution splices the globally unique handle prefix corresponding to a local handle service in the cross-industry data system with the original data identifier uniquely corresponding to a digital object in the local handle service to obtain the unique corresponding to a digital object in the cross-industry data system New data identification of digital objects, so as to facilitate the subsequent summary of new data catalogs generated by different industries based on new data identification in cross-industry data systems.
  • FIG. 1 is a schematic flowchart of a method for generating a data catalog provided in an embodiment of the present application
  • Fig. 2 is a schematic flow chart of a data retrieval method provided by the embodiment of the present application.
  • FIG. 3 is a schematic diagram of the principle of a data retrieval method provided by an embodiment of the present application.
  • FIG. 4 is a schematic structural diagram of a device for generating a data directory provided by an embodiment of the present application
  • Fig. 5 is a schematic structural diagram of a data retrieval device provided by an embodiment of the present application.
  • Fig. 6 is a schematic structural diagram of a device for configuring the method of the embodiment of the application.
  • first, second, third, etc. may be used in this application to describe various objects, the information should not be limited to these terms. These terms are only used to distinguish objects of the same type from each other. For example, without departing from the scope of the present application, a first object may also be called a second object, and similarly, a second object may also be called a first object. Depending on the context, the word “if” as used herein may be interpreted as “at” or "when” or "in response to a determination.”
  • each industry generally forms a data directory that conforms to the industry standards to facilitate data retrieval within the industry.
  • the numbering specification of the edition number can usually be used to form an internal data directory in the publishing industry, which is convenient for quickly searching the unique corresponding book according to the edition number.
  • the product code In the case of data fusion between the publishing industry and the retail industry, since the product code usually follows the format agreed within the retail industry, it may overlap with the version number of the publishing industry. If the product code within the retail industry is directly used as the product code of the department store , it cannot uniquely correspond to a commodity. For example, a publication and a daily necessities may be retrieved at the same time through a commodity number, which makes it difficult to form a unified data catalog when cross-industry data is fused.
  • the catalog number (for example, commodity code) within the industry uniquely corresponds to a digital object in the industry, and the corresponding digital object can be uniquely identified by using the catalog number.
  • the digital object may contain data information indicating various attributes of the target object in the industry.
  • the target object is a TV produced by a TV manufacturer, which belongs to the household appliance industry; the digital object corresponding to the target object contains data information related to various attributes of the TV, for example, various production parameters of the TV, sales information, etc.
  • One method is to select only non-repeated catalog numbers for cross-industry data fusion, but this method will reduce the number of digital objects in the industry; the other method is to uniformly number all digital objects after cross-industry data fusion, However, this method consumes more resources and modifies the original data directory within the industry.
  • an embodiment of the present application provides a method for generating a data directory.
  • a unique identification prefix can be assigned to the industry, and the industry can be determined by the identification prefix.
  • the catalog number of the internal data catalog of the industry is used as the identification suffix, the identification prefix and the identification suffix are spliced, and the splicing result is a unique identification that can be retrieved across industries.
  • the splicing result can still uniquely correspond to a specific digital object after cross-industry data fusion, so as to facilitate the formation of new Unified data catalog.
  • the commodity code (catalogue number) in the retail industry is represented by 111
  • the edition number (catalog number) in the publishing industry is also represented by 111
  • the commodity code and edition number overlap.
  • a unique identification prefix 10 is assigned to the retail industry
  • a unique identification prefix 11 is assigned to the publishing industry.
  • the commodity code 111 in the retail industry is changed to 10111
  • the edition number 111 in the publishing industry is changed to 11111. Different splicing results can be obtained, and each industry uniquely corresponds to one digital object.
  • This method can reuse the original data directory within the industry, and by simply assigning a unique identification prefix to the industry, and then through simple splicing, a cross-industry unique splicing result corresponding to a digital object can be obtained, which can save resources and facilitate generation A unified cross-industry data directory, and minor changes to the original data directory within the industry, without reducing digital objects, improving the efficiency of data directory generation.
  • the local handle service and the global handle registry (Global Handle Registry) in the handle system can also be directly used to help generate a unified data directory, further improving development efficiency.
  • both the local handle service and the global handle registration can be regarded as a server or a service function deployed on the server.
  • the global handle registration can see all the local handle services.
  • the local handle service can be considered as a small area.
  • the local handle service will report some information (such as industry catalog) to the global handle registration.
  • the client needs to access the local handle service It needs to be registered through the global handle, and only after the global handle is registered can we know which objects are located in which local handle service.
  • the digital object architecture can be used to help produce data directories
  • the local handle service and global handle registration structure and interaction protocols can be used, for example, DOIP (Digital Object Interface Protocol) and DO-IRP (Digital Object Identifier Resolution Protocol) protocol to further improve development efficiency.
  • DOIP Digital Object Interface Protocol
  • DO-IRP Digital Object Identifier Resolution Protocol
  • DOIP is used to specify the interaction protocol between the client and the server
  • DO-IRP is based on the early Handle System, which is used to specify the process of creating, modifying, and parsing digital objects.
  • a cross-industry data directory can be formed, and cross-industry data retrieval can be performed based on the data directory.
  • FIG. 1 it is a schematic flow chart of a method for generating a data catalog provided in the embodiment of the present application. This method can be applied to any local handle service in a cross-industry data system.
  • the cross-industry data system may include different local handle services respectively corresponding to users in different industries.
  • one industry user uniquely correspond to one industry, and different industry users correspond to different industries, so as to avoid the situation where a unified numbering specification has been formed within the industry, Different merchants repeatedly assign different identification prefixes, wasting the resources of identification prefixes.
  • the industry user can be a data integration platform within the industry, which can connect with multiple merchants in the industry, so as to facilitate the synthesis of catalog numbers of multiple merchants.
  • the cross-industry data system may include different local handle services corresponding to different industries, and the industry corresponding to any local handle service is preset with Raw data directory.
  • industry users can actively request to participate in cross-industry data fusion.
  • the industry user can establish a corresponding relationship with the deployed local handle service, so that based on The local handle service generates a new data directory, so that after cross-industry data fusion, a globally unique directory number is obtained.
  • any industry user corresponding to the local handle service may be preset with an original data directory; the original data directory contains multiple digital objects, and different digital objects correspond to different original data identifiers.
  • each digital object in the original data directory uniquely identifies the digital object through the original data identifier corresponding to the digital object.
  • any local handle service in the cross-industry data system can execute the following method to generate a new data directory, and the directory number in the new data directory is globally unique and corresponds to a digital object in the cross-industry data system.
  • the method may include steps S101 and S102.
  • different local handle services in the cross-industry data system correspond to different handle prefixes.
  • the handle prefix uniquely corresponds to a local handle service in the cross-industry data system.
  • This method flow does not limit the method for the local handle service to obtain the handle prefix, which can be obtained directly from the local, or the corresponding handle prefix can be applied for allocation.
  • the handle prefix may be pre-applied for allocation and stored locally in the local handle service, or may be directly pre-allocated for the local handle service without applying.
  • the specific application for allocation of handle prefixes can be applied by industry users, or applied by the local handle service itself.
  • obtaining the handle prefix corresponding to the local handle service may include: obtaining the handle prefix applied by the industry user corresponding to the local handle service, as the corresponding handle prefix, and the applied handle prefix uniquely corresponds to the corresponding handle prefix of the local handle service industry users.
  • obtaining a handle prefix corresponding to the local handle service may include: obtaining a handle prefix applied for by the local handle service, and the applied handle prefix uniquely corresponds to the local handle service.
  • the global handle registration can be used to manage handle prefixes, and assign unique corresponding handle prefixes to local handle service or industry users who apply for handle prefixes.
  • the global handle registration can store a set of correspondences between handle prefixes and local handle services, and can also store a set of correspondences between handle prefixes and industry users.
  • applying for a handle prefix may include: registering with the global handle to apply for a handle prefix uniquely corresponding to a local handle service or an industry user corresponding to the application handle prefix.
  • DONA DONA MPA member
  • MPA Multi-Primary Administrator, multi-master administrator
  • the global handle registration can be through one or more DONA Authorized and authenticated MPAs interact with the public.
  • the globally unique splicing in the cross-industry data system can be obtained by splicing with the original data identifier in the original data directory result. Therefore, the resulting new data identifier uniquely corresponds to a digital object in the cross-industry data system.
  • a handle can be obtained by concatenating the handle prefix and the handle suffix together, and the handle can be used as a new data identifier corresponding to the digital object.
  • the new data catalog can be regarded as a part of the cross-industry data catalog, which facilitates subsequent generation of the cross-industry data catalog.
  • the process of this method does not limit the storage location of the generated new data directory, which can be distributed and stored in each local handle service, that is, any local handle service is stored locally after generating a new data directory; in addition, due to The new data identifier in the new data directory is globally unique and corresponds to a digital object. Therefore, it is also possible to integrate the new data directories generated by various local handle services of cross-industry data systems, and directly merge these new data directories into a comprehensive data directory. stored in a specified location.
  • the local handle service in the handle system since the local handle service in the handle system is used, it is usually necessary to register the spliced handle, specifically, register with the global handle registration.
  • the above method flow may further include: sending the obtained new data identifier to the global handle registration.
  • the global handle registration can be used to summarize new data identifiers obtained by various local handle services in cross-industry data systems.
  • the global handle registration can register and store the handles spliced by various local handle services in the cross-industry data system (that is, the obtained new data identifier), specifically, the local handle service and the local handle service corresponding to The handle and related information of the local handle service are stored in association to facilitate subsequent retrieval.
  • Information related to the local handle service such as address information of the local handle service, is not specifically limited in this application.
  • the metadata information may be field information of one or more fields included in the digital object, and each field corresponds to one or more field values. Since a unified specification may have been formed within the industry, some fields can uniquely express a meaning within the industry, but in cross-industry scenarios, due to different specifications in different industries, some field information may cause ambiguity in cross-industry scenarios.
  • the "logo" field can unambiguously represent the logo of the vehicle, but after cross-industry data fusion, the "logo” field itself has a wide range of meanings, and may be misunderstood by users in other industries as the identity of other products. logo.
  • recall rate can be used to measure the performance of algorithms
  • recall rate can be used to measure product the quality of.
  • S102 in the above method flow may further include: updating the catalog metadata information for the raw data catalog preset by the corresponding industry user.
  • it may specifically be to update the metadata information of the directory according to the information of the corresponding industry user according to the raw data directory preset for the corresponding industry user.
  • This method flow does not specifically limit the method for updating catalog metadata information, as long as the updated catalog metadata information can show actual business meanings and will not be misunderstood by people in other industries.
  • business personnel corresponding to industry users may update the directory metadata information for the preset original data directory. Specifically, it may be updated to the actual meaning of the directory metadata information.
  • the specific update of the catalog metadata information can be based on the catalog metadata information of the original data catalog, Add information about corresponding industry users.
  • the information of industry users may at least include industry information.
  • the vehicle industry information "vehicle” can be added to obtain the “vehicle identification”, so as to clarify the actual meaning of the field;
  • the "recall rate” field within the software industry you can add the software industry Information “software algorithm” to get “software algorithm recall rate”, so that the actual meaning of this field can be clarified.
  • the information corresponding to the industry user may also include the brand of the merchant.
  • this embodiment can also retain the directory metadata information in the original data directory, avoid modification as much as possible, and can also improve update efficiency.
  • generating a new data directory according to the obtained new data identifier may include: generating a new data directory according to the updated directory metadata information and the obtained new data identifier, and the directory element of the generated new data directory
  • the data information is the updated directory metadata information
  • the directory number is the obtained new data identifier
  • the problem caused by inconsistent metadata information specifications can be further solved in a cross-industry scenario, so that personnel in other industries in the cross-industry scenario can understand the metadata information of a certain industry.
  • the Type mechanism provided by the DOA can be used to redefine the directory metadata information, so as to facilitate the generation of a new data directory according to the defined Type.
  • the Type mechanism can also be used to directly bind the corresponding handle prefix of the local handle service and the defined Type set (including the updated directory metadata information), so as to facilitate subsequent acquisition of the updated directory metadata information.
  • the development cost can be saved and the development efficiency can be improved.
  • the process of the above method splices the globally unique handle prefix corresponding to a local handle service in the cross-industry data system with the original data identifier uniquely corresponding to a digital object in the local handle service to obtain the unique corresponding to a digital object in the cross-industry data system New data identification of digital objects, so that a new data catalog can be generated according to the obtained new data identification, so that the catalog number in the new data catalog can uniquely correspond to a digital object in the cross-industry data system, thus facilitating cross-industry data fusion , to facilitate the generation of cross-industry data catalogs.
  • the original data identifier can be retained as much as possible, the modification of the original data directory can be reduced, and the original data directory can be quickly accessed without intrusion; on the other hand, resources can be saved and the generation of the data directory can be improved. Efficiency, which facilitates the rapid completion of the construction of cross-industry data catalogs.
  • the interpretability of the catalog metadata information in cross-industry scenarios can be improved, making it easier for people in other industries to understand the catalog metadata information of a certain industry.
  • DOA global handle registration and local handle service can be included.
  • Global handle registration It is maintained by DONA MPA members and is responsible for assigning handle prefixes (handle prefixes) to users in various industries.
  • GCS Global Handle Service
  • Industry directory the original data directory, which can be maintained by industry users, usually the original directory system or asset management system in the industry.
  • DO-IRP is the core of DOA, which is essentially a Key, and the Value database provides the ability to parse the Value based on the Key. Key is also called handle in DO-IRP.
  • the format of the handle stipulated in the protocol is as follows: prefix/suffix, where the prefix is managed by DONA MPA members, a prefix can only be assigned to one user, and the suffix is managed by the user itself.
  • handle can be used to uniquely mark a digital object in DOA.
  • the global directory number can be regarded as a handle, and a prefix can be assigned to each industry user. Since the internal directory number of an industry user is already unique, it is only necessary to use the industry directory number as a suffix Splicing with the prefix can form a globally unique handle.
  • This method generates a globally unique catalog number, which is non-invasive to industry users.
  • the local handle service registers the handle to the global handle registration through the handle protocol.
  • DOA can be composed of global handle registration and local handle service, wherein the global handle registration gathers all the handle information, and the local handle service is used to store the specific content of the digital object corresponding to the handle.
  • the global handle registration is a large routing table, which records the mapping relationship between the handle and the local handle service. Later, the location of the local handle service can be found through the handle, and then the specific data corresponding to the handle can be obtained from the local handle service.
  • the aggregation of all directory numbers (handles) can be completed through global handle registration, and the specific industry directory corresponding to the handle can be stored using the local handle service.
  • DOA For the DOA Type mechanism, the DOA protocol itself does not care about the specific content of the data, and it leaves the definition of the data format to the user.
  • DOA provides the Type mechanism through the DOIP protocol, and the format definition of the data object can be completed through the Type.
  • the format of the data object can refer to the existence mode of the data object, and the industry data directory can be considered as a data object. Therefore, industry users use the Type mechanism
  • the catalog metadata information of the industry can be easily defined, so the aggregation of catalog numbers can be completed at low cost.
  • the format definition of the new data directory can be completed by using the Type mechanism to define the directory metadata information of the new data directory, so as to generate updated directory metadata information.
  • the whole process is relatively simple.
  • the cost of development is very low, borrowing the ability of DOA, can quickly promote the collection of catalog numbers in various industries, gather the catalog numbers into the global handle registration, and the industry catalogs of users in various industries can be stored in each local handle service middle.
  • the cross-industry data retrieval capability can be provided according to the generated new data directory.
  • the global handle registration may store new data identifiers obtained by various local handle services in the cross-industry data system, so that retrieval may be performed according to the new data identifiers stored in the global handle registration.
  • the global handle registration can first determine whether there is a new data identifier contained in the request, and if there is a new data identifier, it can further determine the corresponding local handle service, from Continue to search in the local handle service, get the digital object corresponding to the new data identifier and return it.
  • retrieval requests from different industries can be processed uniformly by the global handle registration, providing cross-industry data retrieval capabilities.
  • the embodiment of this specification also provides a data retrieval method based on the above method flow, which can provide cross-industry data retrieval capabilities according to the generated new data directory.
  • FIG. 2 it is a schematic flowchart of a data retrieval method provided by the embodiment of this specification.
  • the global handle registration is usually maintained by members of DONA MPA, it is usually difficult to develop, for the convenience of development, optionally, a new management device can be added for cross-industry data systems, the The management device can connect to several local handle services in the cross-industry data system to provide cross-industry data retrieval capabilities.
  • the management device may be maintained by a management user, and the management user may specifically be a business party that provides retrieval capabilities.
  • the method may be applied to a management device, and may specifically include steps S201 to S203.
  • S201 Receive a retrieval request sent by a target user; the retrieval request includes at least a retrieval condition.
  • a retrieval request is used to request to obtain digital objects that meet the contained retrieval conditions.
  • the target user may be any user with retrieval needs, for example, an individual, a merchant, an institution, and the like.
  • any user with retrieval needs is called a target user.
  • S202 Search for a target local handle service that can respond to the retrieval request.
  • the target local handle service that can respond to the retrieval request may be a local handle service that may store digital objects that meet the contained retrieval conditions, specifically, the target local handle service may be a local handle service specified according to the retrieval request , or a local handle service that meets the search criteria.
  • S203 retrieve digital objects that meet the retrieval conditions for the found new data directory served by the target local handle, and return a retrieval result to the target user.
  • the flow of the method does not limit the form of the retrieval request or retrieval condition.
  • the target user may perform retrieval in the form of a retrieval formula or keyword to generate a retrieval request and retrieval condition.
  • the data directory itself can provide retrieval capabilities, specifically, digital objects that meet the retrieval conditions can be retrieved according to the data directory, for example, directly retrieve the corresponding digital object according to the directory number (that is, the new data identifier), or according to a certain digital object attribute value (such as , the attribute value of the product type is electronic product) to retrieve digital objects with this attribute value, therefore, you can first determine the local handle service that can respond to the retrieval request, narrow the search scope, and quickly search for the new data directory generated by the local handle service .
  • the directory number that is, the new data identifier
  • a certain digital object attribute value such as , the attribute value of the product type is electronic product
  • the process of the above method can use the retrieval capability of the new data directory generated by the local handle service to perform fast retrieval by determining the target local handle service that can respond to the retrieval request, so that various retrieval capabilities can be provided according to the new data directory of different local handle services , which can also provide cross-industry data retrieval capabilities.
  • the management device acts as an intermediary, on the one hand, it connects to several local handle services, and on the other hand, it corresponds to several target users, which can save connection resources and avoid separately establishing a connection between the target user and the local handle service for each retrieval request.
  • the management device can maintain connections with several local handle services, and receive retrieval requests from several target users.
  • the management device can provide information about the local handle service, so that the target user does not know the local handle service to be retrieved, or does not know the address information of the local handle service to be retrieved, so that the connection cannot be established.
  • this method flow does not limit the specific implementation method.
  • the target local handle service can be searched according to the content included in the retrieval request.
  • the management device may store a first correspondence set between identifiers of local handle services in the cross-industry data system and user information in corresponding industries.
  • the first set of correspondences may be acquired through interaction between the management device and several docked local handle services, and the first set of correspondences may include identifiers of one or more local handle services in the cross-industry data system.
  • the identifier of the local handle service may be information uniquely corresponding to the local handle service, for example, the address information of the local handle service may specifically be the physical address or IP of the local handle service, which can be conveniently based on the address of the local handle service information to establish a connection.
  • the corresponding industry user information may include at least one of industry identifiers, industry user identifiers, address information of industry users, and the like.
  • Finding the target local handle service that can respond to the retrieval request may include: in the case of determining that the retrieval request contains industry user information, searching for the identifier of the local handle service corresponding to the industry user information contained in the stored first correspondence relationship set , determining the local handle service corresponding to the found local handle service identifier as the target local handle service.
  • the industry users targeted by the retrieval request can be determined, and the determined industry users can be one or more.
  • the determined industry users can be one or more.
  • the retrieval request may contain the industry user information as "retail industry”, therefore, it may be determined that the local handle service corresponding to each industry user of "retail industry” is the target local handle service.
  • the management device may store a second correspondence set between identifiers of local handle services in the cross-industry data system and corresponding handle prefixes.
  • the second set of correspondences may be acquired through interaction between the management device and several docked local handle services, and the second set of correspondences may include identifiers of one or more local handle services in the cross-industry data system.
  • the handle prefix may uniquely correspond to a local handle service
  • the uniquely corresponding local handle service may be found according to the handle prefix, and the local handle service may be determined as the target local handle service.
  • Finding the target local handle service that can respond to the retrieval request may include: in the case of determining that the retrieval request includes the target data identifier, in the stored second correspondence set, searching for the local handle service corresponding to the handle prefix in the target data identifier ID, and determine the local handle service corresponding to the found local handle service ID as the target local handle service.
  • the unique corresponding local handle service can be quickly found by searching the handle prefix identified by the target data in the request.
  • the global handle registry may store a third set of correspondences between identifiers of local handler services in cross-industry data systems and obtained new data identifiers.
  • the third correspondence set may be obtained through interaction between the global handle registration and the local handle service in the cross-industry data system, and the third correspondence set may include one or more local handle services in the cross-industry data system logo.
  • Finding the target local handle service that can respond to the retrieval request may include: in the case where it is determined that the retrieval request includes the target data identifier, the management device searches for the local handle service corresponding to the target data identifier from the third correspondence set registered by the global handle ID, and determine the local handle service corresponding to the found local handle service ID as the target local handle service.
  • the target data identifier may be any new data identifier, and for ease of description, any new data identifier is referred to as a target data identifier.
  • the third corresponding relationship set stored in the global handle registration can be used. On the one hand, it is determined whether the target data identifier is successfully registered in the global handle registration, and on the other hand, the target data can also be quickly found. Identifies the uniquely corresponding local handle service.
  • the global handle registry may also store a second set of correspondences between identifiers of local handle services in cross-industry data systems and corresponding handle prefixes.
  • identifiers of local handle services in cross-industry data systems and corresponding handle prefixes.
  • searching for the target local handle service that can respond to the retrieval request may include: when it is determined that the retrieval request includes the target data identifier, the management device searches for the handle prefix correspondence in the target data identifier from the second set of correspondences registered by the global handle. The identification of the local handle service identified, and the local handle service corresponding to the found identification of the local handle service is determined as the target local handle service.
  • the above-mentioned target data identifier and/or industry user information may be included in the retrieval condition.
  • the cross-industry data can be directly Every local handle service in the system is retrieved.
  • the target local handle service that can respond to the retrieval request cannot be found, for the new data directory of each local handle service in the cross-industry data system, retrieve the digital object that meets the retrieval conditions, and send the target user Return the search results.
  • FIG. 3 it is a schematic diagram of the principle of a data retrieval method provided by the embodiment of this specification.
  • the target local handle service uniquely corresponding to the target data identifier can be directly retrieved, so as to facilitate subsequent retrieval according to the retrieval conditions.
  • Target data identifier If it does not contain the target data identifier, continue to judge whether it contains industry user information, and if it contains industry user information, you can directly retrieve the target local handle service corresponding to the industry user information, so as to facilitate subsequent retrieval according to the retrieval conditions.
  • Industry user information may correspond to one or more target local handle services.
  • the industry user information is not included, it can be retrieved according to the retrieval conditions for each local handle service.
  • the retrieved digital object can be returned to the target user as a retrieval result; if no digital object meeting the retrieval condition is retrieved, the The information used to characterize the digital objects that have not been retrieved that meet the retrieval conditions is returned to the target user as a retrieval result.
  • Subsequent sorting or filtering can be performed on several digital objects that meet the retrieval conditions.
  • all or part of the digital objects meeting the retrieval condition may also be returned.
  • the management device can also summarize new data directories served by each local handle in the cross-industry data system to obtain a cross-industry data directory.
  • the new data catalog sent by each local handle service in the cross-industry data system can be directly obtained and summarized to obtain a cross-industry data catalog, which is convenient for displaying the cross-industry data catalog.
  • the catalog number in the data catalog usually has no practical meaning, only some or all of the metadata information in the data catalog can be selected for display, for example, the attribute type of the digital object, the value range of the attribute value, the business of each field Type and description information and so on.
  • the above method process may also include: receiving new data directory information sent by each local handle service in the cross-industry data system, and summarizing to obtain a cross-industry data directory; the new data directory information includes at least the directory metadata of the new data directory information.
  • the management device can also be used to display the obtained cross-industry data catalog.
  • the management device may be used to receive a new data directory generated by a local handle service in a cross-industry data system, for example, it may receive a new data directory generated by one or more local handle services in a cross-industry data system, and The new data catalogs received from different local handle services are merged into a cross-industry data catalog.
  • the catalog number (namely, the new data identifier) of the new data catalog is globally unique in the cross-industry data system, it can be directly merged to obtain the cross-industry data catalog.
  • the cross-industry data catalog can contain digital objects and corresponding catalog numbers of users in different industries.
  • the management device can also display the cross-industry data directory to users who need it. Specifically, in the case of being affected by data security or insufficient display resources, the management device can display part of the data in the cross-industry data directory according to user needs, specifically part of the digital object or part of the content of the digital object.
  • a certain shopping platform can be connected to several industries, such as retail industry, electrical appliance industry, service industry, and so on.
  • industries such as retail industry, electrical appliance industry, service industry, and so on.
  • the management device may display some products or some field values of products in the cross-industry data catalog to the user.
  • the management device can obtain the directory metadata information of the new data directory served by the local handle, and then can compare the digital objects in the cross-industry data directory with the corresponding The directory metadata information is combined and displayed.
  • the directory metadata information may include field information of one or more fields of the digital object, and the field value of the digital object and the corresponding field information may be combined and displayed to the user, so that the user can view the field value and field information to determine the actual business meaning of the field value.
  • the original data directory metadata information for industry users can be updated, so that personnel in other industries can easily understand the updated directory metadata information in cross-industry scenarios.
  • the directory metadata information here may be updated directory metadata information.
  • the management device may store directory metadata information of the new data directory generated by the local handle service in the cross-industry data system.
  • any local handle service updates the metadata information of the directory according to the information of the corresponding industry user for the original data directory preset by the corresponding industry user, and generates a new The data directory; the directory metadata information of the generated new data directory is the updated directory metadata information, and the directory number is the obtained new data identifier.
  • the directory metadata information of the new data directory stored in the management device may be updated directory metadata information.
  • the management device can receive the new data directory generated by the local handle service in the cross-industry data system, and merge the received new data directories of different local handle services into a cross-industry data directory, and determine the cross-industry data directory according to user needs
  • the demand digital object in the industry data catalog and then combine the determined demand digital object with the catalog metadata information (specifically, field information) of the new data catalog where it is located, and return it to the user (target user) who has the user demand. It can also be displayed to users who have the user needs.
  • the management device determines the product data recommended for the user in the cross-industry data directory, and returns it to the user in combination with the field information of the new data directory where the product data is located.
  • it may be "Product type: electrical appliances, product brand: xx brand, product price: 60 yuan”.
  • the retrieval result may be directly displayed to the target user.
  • the retrieved digital object can be returned to the target user in combination with the catalog metadata information, so that the target user can view it conveniently.
  • the management device may store the directory metadata information of the new data directory generated by the local handle service in the cross-industry data system.
  • the management device may store directory metadata information of a new data directory generated by one or more local handle services in the cross-industry data system.
  • the management device may obtain the directory metadata information of the new data directory from the local handle service in the cross-industry data system.
  • what the management device stores may be a fourth correspondence set between the identifier of the local handle service and the directory metadata information of the new data directory correspondingly generated by the local handle service.
  • the identifier of the local handle service may specifically be the handle prefix or address information corresponding to the local handle service.
  • the directory metadata information of the corresponding new data directory can be directly determined according to the identification of the target local handle service found in step S202, and then the retrieved The digital objects that meet the retrieval conditions are sent to the target users in combination with the determined directory metadata information.
  • returning the retrieval result to the target user may include: if the retrieval result includes a digital object, determining directory metadata information of a new data directory generated by the target local handle service. Send the digital object included in the retrieval result to the target user in combination with the determined directory metadata information.
  • the digital objects contained in the retrieval results may be displayed to the target user in combination with the determined directory metadata information.
  • the directory metadata information may be field information, and in combination with the digital object and directory metadata information, it may be to fill the corresponding field information with the field value of the digital object included in the retrieval result.
  • the determined directory metadata information may be updated directory metadata information.
  • the determined directory metadata information may be updated directory metadata information.
  • industry users can use the Type mechanism of DOA to redefine the directory metadata information (that is, generate updated directory metadata information), and then obtain the Type set corresponding to the local handle service, where Contains updated catalog metadata information.
  • These industry user-defined Type collections can be stored in the management device. Since the Type collection directly corresponds to the local handle service, specifically, it can correspond to the corresponding handle prefix assigned to the local handle service. Therefore, the management device can directly use the Type mechanism to obtain The directory metadata information (that is, the updated directory metadata information) of the new data directory generated by the local handle service in the cross-industry data system.
  • the embodiment of this specification also provides an apparatus embodiment.
  • FIG. 4 it is a schematic structural diagram of a device for generating a data directory provided by the embodiment of this specification.
  • the device can be applied to any local handle service in the cross-industry data system.
  • the cross-industry data system may include different local handle services corresponding to users in different industries; the industry users corresponding to the local handle services are preset with original data directories; in the original data directories, different digital objects correspond to different original data identifiers.
  • the apparatus may include units 401 and 402 .
  • the obtaining unit 401 is used to obtain the handle prefix corresponding to the local handle service, and different local handle services in the cross-industry data system correspond to different handle prefixes.
  • the splicing unit 402 is configured to use the original data identifier corresponding to each digital object as a handle suffix for the original data directory preset by the corresponding industry user, and splice it with the obtained corresponding handle prefix to obtain a new data identifier corresponding to the digital object , and generate a new data directory according to the obtained new data identifier.
  • the device may further include a sending unit 403, configured to send the obtained new data identifiers to the global handle registration, and the global handle registration is used to summarize the new data identifiers obtained by various local handle services in the cross-industry data system.
  • a sending unit 403 configured to send the obtained new data identifiers to the global handle registration, and the global handle registration is used to summarize the new data identifiers obtained by various local handle services in the cross-industry data system.
  • the acquiring unit 401 may be configured to: acquire the handle prefix applied by the industry user corresponding to the local handle service, as the corresponding handle prefix, and the applied handle prefix uniquely corresponds to the industry user corresponding to the local handle service.
  • the device may further include an update unit 403, configured to update the catalog metadata information of the original data catalog according to the information of the corresponding industry user for the raw data catalog preset by the corresponding industry user.
  • an update unit 403 configured to update the catalog metadata information of the original data catalog according to the information of the corresponding industry user for the raw data catalog preset by the corresponding industry user.
  • the splicing unit 402 can be used to: generate a new data directory according to the updated directory metadata information and the obtained new data identifier; the directory metadata information of the generated new data directory is the updated directory metadata information, and the directory number is The resulting new data identifier.
  • the updating unit 403 may be configured to: use the Type mechanism to redefine the directory metadata information of the original data directory, so as to generate the updated directory metadata information.
  • FIG. 5 it is a schematic structural diagram of a data retrieval device provided by the embodiment of this specification.
  • the device is a data retrieval device for a data directory, wherein the data directory is generated based on the above-mentioned data directory generation device, and the data retrieval device can be applied to a management device; the management device is a newly added device for a cross-industry data system.
  • the data retrieval apparatus may include units 501 to 503 .
  • the receiving unit 501 is configured to receive a retrieval request sent by a target user.
  • the retrieval request includes at least a retrieval condition; the retrieval request is used to request to obtain a digital object that meets the contained retrieval condition.
  • the search unit 502 is configured to search for a target local handle service that can respond to the retrieval request.
  • the result unit 503 is configured to retrieve digital objects that meet the retrieval conditions for the found new data directory served by the target local handle, and return the retrieval result to the target user.
  • the management device stores a first correspondence set between identifiers of local handle services in the cross-industry data system and corresponding industry user information.
  • the search unit 502 can be specifically configured to: in the case of determining that the retrieval request contains industry user information, search the stored first correspondence set for the identifier of the local handle service corresponding to the industry user information contained therein, and store the found The local handle service corresponding to the identifier of the local handle service is determined to be the target local handle service.
  • the industry user information may include at least one of industry identifiers, industry user identifiers, and address information of industry users.
  • the management device stores a second correspondence set between identifiers of local handle services in the cross-industry data system and corresponding handle prefixes.
  • the search unit 502 can specifically be configured to: in the case where it is determined that the retrieval request includes the target data identifier, in the stored second set of correspondences, search for the identifier of the local handle service corresponding to the handle prefix in the target data identifier, and retrieve the found The local handle service corresponding to the identifier of the local handle service is determined as the target local handle service.
  • the global handle registry stores a second set of correspondences between identifiers of local handle services in the cross-industry data system and corresponding handle prefixes.
  • the searching unit 502 may specifically be configured to: in the case of determining that the retrieval request includes a target data identifier, the management device searches for the handle in the target data identifier from the second correspondence set registered by the global handle The identifier of the local handle service corresponding to the prefix, and the local handle service corresponding to the found identifier of the local handle service is determined as the target local handle service.
  • the global handle registration stores a third correspondence set between the identifier of the local handle service in the cross-industry data system and the obtained new data identifier.
  • the search unit 502 may specifically be configured to: in the case that the retrieval request includes the target data identifier, the management device searches for the identifier of the local handle service corresponding to the target data identifier from the third correspondence set registered by the global handle, and searches the found The local handle service corresponding to the identifier of the local handle service is determined as the target local handle service.
  • the device may also include a returning unit, which is used to search for a new data catalog of each local handle service in the cross-industry data system if the target local handle service that can respond to the retrieval request cannot be found A numeric object of the search criteria and returns the search results to the target user.
  • a returning unit which is used to search for a new data catalog of each local handle service in the cross-industry data system if the target local handle service that can respond to the retrieval request cannot be found A numeric object of the search criteria and returns the search results to the target user.
  • the device may also include a summary unit, configured to receive new data directory information sent by each local handle service in the cross-industry data system, and summarize to obtain a cross-industry data directory; the new data directory information includes at least the directory elements of the new data directory Data information; the management facility is also used to present the resulting cross-industry data catalog.
  • a summary unit configured to receive new data directory information sent by each local handle service in the cross-industry data system, and summarize to obtain a cross-industry data directory; the new data directory information includes at least the directory elements of the new data directory Data information; the management facility is also used to present the resulting cross-industry data catalog.
  • the management device stores directory metadata information of the new data directory generated by the local handle service in the cross-industry data system.
  • the result unit 503 can be specifically used to: if the retrieval result includes digital objects, determine the directory metadata information of the new data directory generated by the target local handle service; combine the digital objects contained in the retrieval results with the determined directory metadata The data information is displayed to the target user.
  • the directory metadata information of the new data directory may include field information
  • the result unit 503 may be specifically configured to: combine the field value of the digital object with the corresponding field information and send it to the target user.
  • the management device stores a fourth correspondence set between the identification of the local handle service in the cross-industry data system and the directory metadata information of the new data directory correspondingly generated by the local handle service, and the result unit 503 can be specifically used for: Determine the directory metadata information of the corresponding new data directory according to the searched identifier of the target local handle service; send the retrieved digital objects that meet the retrieval conditions, combined with the determined directory metadata information, to the target user.
  • the management device may be configured to receive a new data directory generated by a local handle service in a cross-industry data system, and merge the received new data directories of different local handle services into a cross-industry data directory.
  • the embodiment of this specification also provides an electronic device, including a processor and a memory; the memory stores a program that can be called by the processor; wherein, when the processor executes the program, the data directory as described in any of the above method embodiments is realized. Generation methods or data retrieval methods.
  • the embodiment of this specification also provides a machine-readable storage medium, on which a program is stored.
  • the program is executed by a processor, the method for generating a data directory or the method for retrieving data as described in any of the above method embodiments is implemented.
  • the embodiment of this specification also provides a computer device, which at least includes a memory, a processor, and a computer program stored in the memory and operable on the processor, wherein, when the processor executes the program, it can implement any of the methods described above.
  • a computer device which at least includes a memory, a processor, and a computer program stored in the memory and operable on the processor, wherein, when the processor executes the program, it can implement any of the methods described above.
  • FIG. 6 shows a schematic diagram of a more specific hardware structure of a computer device provided by the embodiment of this specification.
  • the device may include: a processor 1010 , a memory 1020 , an input/output interface 1030 , a communication interface 1040 and a bus 1050 .
  • the processor 1010 , the memory 1020 , the input/output interface 1030 and the communication interface 1040 are connected to each other within the device through the bus 1050 .
  • the processor 1010 may be implemented by a general-purpose CPU (Central Processing Unit, central processing unit), a microprocessor, an application-specific integrated circuit (Application Specific Integrated Circuit, ASIC), or one or more integrated circuits, and is used to execute related programs to realize the technical solutions provided by the embodiments of this specification.
  • a general-purpose CPU Central Processing Unit, central processing unit
  • a microprocessor an application-specific integrated circuit (Application Specific Integrated Circuit, ASIC), or one or more integrated circuits
  • ASIC Application Specific Integrated Circuit
  • the memory 1020 can be implemented in the form of ROM (Read Only Memory, read-only memory), RAM (Random Access Memory, random access memory), static storage device, dynamic storage device, and the like.
  • the memory 1020 can store operating systems and other application programs. When implementing the technical solutions provided by the embodiments of this specification through software or firmware, the relevant program codes are stored in the memory 1020 and invoked by the processor 1010 for execution.
  • the input/output interface 1030 is used to connect the input/output module to realize information input and output.
  • the input/output module can be configured in the device as a component (not shown in the figure), or can be externally connected to the device to provide corresponding functions.
  • the input device may include a keyboard, mouse, touch screen, microphone, various sensors, etc.
  • the output device may include a display, a speaker, a vibrator, an indicator light, and the like.
  • the communication interface 1040 is used to connect a communication module (not shown in the figure), so as to realize the communication interaction between the device and other devices.
  • the communication module can realize communication through wired means (such as USB, network cable, etc.), and can also realize communication through wireless means (such as mobile network, WIFI, Bluetooth, etc.).
  • Bus 1050 includes a path that carries information between the various components of the device (eg, processor 1010, memory 1020, input/output interface 1030, and communication interface 1040).
  • the above device only shows the processor 1010, the memory 1020, the input/output interface 1030, the communication interface 1040, and the bus 1050, in the specific implementation process, the device may also include other components.
  • the above-mentioned device may only include components necessary to implement the solutions of the embodiments of this specification, and does not necessarily include all the components shown in the figure.
  • the embodiment of this specification also provides a computer-readable storage medium, on which a computer program is stored, and when the program is executed by a processor, the method for generating a data catalog or the method for retrieving data as described in any one of the above method embodiments is implemented.
  • Computer-readable media including both permanent and non-permanent, removable and non-removable media, can be implemented by any method or technology for storage of information.
  • Information may be computer readable instructions, data structures, modules of a program, or other data.
  • Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read only memory (ROM), Electrically Erasable Programmable Read-Only Memory (EEPROM), Flash memory or other memory technology, Compact Disc Read-Only Memory (CD-ROM), Digital Versatile Disc (DVD) or other optical storage, A magnetic tape cartridge, disk storage or other magnetic storage device or any other non-transmission medium that can be used to store information that can be accessed by a computing device.
  • computer-readable media excludes transitory computer-readable media, such as modulated data signals and carrier waves.
  • a typical implementing device is a computer, which may take the form of a personal computer, laptop computer, cellular phone, camera phone, smart phone, personal digital assistant, media player, navigation device, e-mail device, game control device, etc. desktops, tablets, wearables, or any combination of these.
  • each embodiment in this specification is described in a progressive manner, the same and similar parts of each embodiment can be referred to each other, and each embodiment focuses on the differences from other embodiments.
  • the description is relatively simple, and for relevant parts, please refer to part of the description of the method embodiment.
  • the device embodiments described above are only illustrative, and the modules described as separate components may or may not be physically separated, and the functions of each module may be integrated in the same or multiple software and/or hardware implementations. Part or all of the modules can also be selected according to actual needs to achieve the purpose of the solution of this embodiment. It can be understood and implemented by those skilled in the art without creative effort.

Abstract

本说明书公开了一种数据目录生成方法和装置、数据检索方法和装置、设备以及机器可读存储介质。该数据目录生成方法应用于跨行业数据系统中任一本地句柄服务;该方法包括:获取本地句柄服务对应的句柄前缀,跨行业数据系统中不同本地句柄服务分别对应于不同的句柄前缀;针对对应行业用户预先设置的原始数据目录,将每个数据对象对应的原数据标识作为句柄后缀,与所获取的对应句柄前缀进行拼接,得到所述每个数据对象对应的新数据标识,并根据所得到的新数据标识生成新数据目录。

Description

数据目录生成方法和装置、数据检索方法和装置、电子设备 技术领域
本申请涉及检索领域,尤其涉及一种数据目录生成方法和装置、数据检索方法和装置、电子设备及存储介质。
背景技术
目前,随着信息化技术的不断发展,各个行业内部一般会形成符合本行业规范的数据目录,方便本行业内部的数据检索。
而随着大数据的发展,不同行业之间进行跨行业的数据融合的需求越来越多,但是由于不同行业之间数据目录的规范并不相同,导致不同行业之间的数据目录难以汇总。
发明内容
为了解决上述技术问题,本说明书实施例提供了一种数据目录生成方法、装置及设备。技术方案如下所示。
一种数据目录生成方法,应用于跨行业数据系统中任一本地句柄服务;所述跨行业数据系统包括分别对应于不同行业用户的不同本地句柄服务;所述本地句柄服务对应的行业用户预先设置有原始数据目录;所述原始数据目录包含多个数字对象,不同数字对象分别对应于不同的原数据标识;所述方法包括:获取所述本地句柄服务对应的句柄前缀,所述跨行业数据系统中不同本地句柄服务分别对应于不同的句柄前缀;针对对应行业用户预先设置的原始数据目录,将每个数字对象对应的原数据标识作为句柄后缀,与所获取的对应句柄前缀进行拼接,得到所述每个数字对象对应的新数据标识,并根据所得到的新数据标识生成新数据目录。
可选地,所述方法还包括:将所得到的新数据标识发送给全局句柄注册,所述全局句柄注册用于汇总所述跨行业数据系统中各个本地句柄服务所得到的新数据标识。
可选地,所述获取所述本地句柄服务对应的句柄前缀,包括:获取所述本地句柄服务对应的行业用户所申请的句柄前缀,作为对应的句柄前缀,所申请的句柄前缀唯一对应于该本地句柄服务对应的行业用户。
可选地,所述方法还包括:针对对应行业用户预先设置的原始数据目录,根据对应行业用户的信息更新所述原始数据目录的目录元数据信息,以使更新后的目录元数据信息能够表现出对应的数字对象的字段值的实际业务含义,其中所述目录元数据信息指数字对象包括的一个或多个字段的字段信息;所述根据所得到的新数据标识生成新数据目录,包括:根据更新后的目录元数据信息和所得到的新数据标识生成新数据目录;所生成的新数据目录的目录元数据信息为更新后的目录元数据信息,目录编号为所得到的新数据标识。
可选地,通过利用Type机制完成所述新数据目录的格式定义来定义所述新数据目录的目录元数据信息,以生成所述更新后的目录元数据信息。
一种针对数据目录的数据检索方法,所述数据目录通过上述数据目录生成方法生成,所述数据检索方法应用于管理设备;所述管理设备是针对所述跨行业数据系统新增的设备;所述数据检索方法包括:接收目标用户发送的检索请求,所述检索请求至少包含检索条件;所述检索请求用于请求获取符合所包含的检索条件的数字对象;查找能够响应所述检索请求的目标本地句柄服务;针对所查找到的目标本地句柄服务的新数据目录,检索符合所述检索条件的数字对象,并向所述目标用户返回检索结果。
可选地,所述管理设备存储有所述跨行业数据系统中本地句柄服务的标识与对应行 业用户信息之间的第一对应关系集合;所述查找能够响应所述检索请求的目标本地句柄服务,包括:在确定所述检索请求包含行业用户信息的情况下,在所存储的第一对应关系集合中查找所包含的行业用户信息对应的本地句柄服务的标识,将所查找到的本地句柄服务的标识对应的本地句柄服务确定为目标本地句柄服务。
可选地,所述行业用户信息包括行业标识、行业用户标识、行业用户的地址信息中的至少一项。
可选地,所述管理设备存储有所述跨行业数据系统中本地句柄服务的标识与对应句柄前缀之间的第二对应关系集合;所述查找能够响应所述检索请求的目标本地句柄服务,包括:在确定所述检索请求包括目标数据标识的情况下,在所存储的第二对应关系集合中,查找所述目标数据标识中句柄前缀对应的本地句柄服务的标识,将所查找到的本地句柄服务的标识对应的本地句柄服务确定为目标本地句柄服务。
可选地,全局句柄注册存储有所述跨行业数据系统中本地句柄服务的标识与对应句柄前缀之间的第二对应关系集合;所述查找能够响应所述检索请求的目标本地句柄服务,包括:在确定所述检索请求包括目标数据标识的情况下,所述管理设备从所述全局句柄注册的所述第二对应关系集合中,查找所述目标数据标识中句柄前缀对应的本地句柄服务的标识,将所查找到的本地句柄服务的标识对应的本地句柄服务确定为目标本地句柄服务。
可选地,所述全局句柄注册存储有所述跨行业数据系统中本地句柄服务的标识与所得到的新数据标识之间的第三对应关系集合;所述查找能够响应所述检索请求的目标本地句柄服务,包括:在确定所述检索请求包括目标数据标识的情况下,所述管理设备从所述全局句柄注册的所述第三对应关系集合中,查找所述目标数据标识对应的本地句柄服务的标识,将所查找到的本地句柄服务的标识对应的本地句柄服务确定为目标本地句柄服务。
可选地,所述方法还包括:在无法查找到能够响应所述检索请求的目标本地句柄服务的情况下,针对所述跨行业数据系统中的每个本地句柄服务的新数据目录,检索符合所述检索条件的数字对象,并向所述目标用户返回检索结果。
可选地,所述管理设备存储有所述跨行业数据系统中本地句柄服务所生成的新数据目录的目录元数据信息;所述向所述目标用户返回检索结果,包括:在检索结果包括数字对象的情况下,确定所述目标本地句柄服务所生成的新数据目录的目录元数据信息;将所述检索结果中包含的数字对象,结合所确定的目录元数据信息展示给所述目标用户。
可选地,所述新数据目录的目录元数据信息包括字段信息,其中,所述将所述检索结果中包含的数字对象,结合所确定的目录元数据信息发送给所述目标用户,包括:将所述数字对象的字段值与对应的字段信息结合后发送给所述目标用户。
可选地,所述管理设备存储有所述跨行业数据系统中本地句柄服务的标识与本地句柄服务对应生成的新数据目录的目录元数据信息之间的第四对应关系集合;其中,所述向所述目标用户返回检索结果,包括:根据查找到的目标本地句柄服务的标识确定对应的新数据目录的目录元数据信息;将检索到的符合所述检索条件的数字对象,结合所确定的目录元数据信息发送给所述目标用户。
可选地,所述管理设备用于接收所述跨行业数据系统中本地句柄服务所生成的新数据目录,并将所接收到不同本地句柄服务的新数据目录合并为跨行业数据目录。
一种数据目录生成装置,应用于跨行业数据系统中任一本地句柄服务;所述跨行业数据系统包括分别对应于不同行业用户的不同本地句柄服务;所述本地句柄服务对应的行业用户预先设置有原始数据目录;所述原始数据目录包含多个数字对象,不同数字对象分别对应于不同的原数据标识;所述装置包括:获取单元,用于获取所述本地句柄服务对应的句柄前缀,所述跨行业数据系统中不同本地句柄服务分别对应于不同的句柄前缀;拼接单元,用于针对对应行业用户预先设置的原始数据目录,将每个数字对象对应 的原数据标识作为句柄后缀,与所获取的对应句柄前缀进行拼接,得到所述每个数字对象对应的新数据标识,并根据所得到的新数据标识生成新数据目录。
一种针对数据目录的数据检索装置,所述数据目录通过上述数据目录生成装置生成,所述数据检索装置应用于管理设备;所述管理设备是针对所述跨行业数据系统新增的设备;所述数据检索装置包括:接收单元,用于接收目标用户发送的检索请求,所述检索请求至少包含检索条件;所述检索请求用于请求获取符合所包含的检索条件的数字对象;查找单元,用于查找能够响应所述检索请求的目标本地句柄服务;结果单元,用于针对所查找到的目标本地句柄服务的新数据目录,检索符合所述检索条件的数字对象,并向所述目标用户返回检索结果。
一种电子设备,包括处理器及存储器;所述存储器存储有可被处理器调用的程序;其中,所述处理器执行所述程序时,实现如上任一所述的数据目录生成方法或数据检索方法。
一种机器可读存储介质,其上存储有程序,该程序被处理器执行时,实现如上任一所述的数据目录生成方法或数据检索方法。
上述技术方案通过将跨行业数据系统中全局唯一对应于一个本地句柄服务的句柄前缀,与本地句柄服务中唯一对应于一个数字对象的原数据标识进行拼接,得到跨行业数据系统中唯一对应于一个数字对象的新数据标识,从而便于后续汇总跨行业数据系统中不同行业基于新数据标识生成的新数据目录。
附图说明
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅用于辅助说明本说明书中记载的一些实施例,对于本领域普通技术人员来讲,还可以根据这些附图获得其他的附图。
图1是本申请实施例提供的一种数据目录生成方法的流程示意图;
图2是本申请实施例提供的一种数据检索方法的流程示意图;
图3是本申请实施例提供的一种数据检索方法的原理示意图;
图4是本申请实施例提供的一种数据目录生成装置的结构示意图;
图5是本申请实施例提供的一种数据检索装置的结构示意图;
图6是用于配置申请实施例方法的一种设备的结构示意图。
具体实施方式
为了使本领域技术人员更好地理解本申请的技术方案,下面将结合附图,对本申请实施例进行详细地描述,显然,所描述的实施例仅仅是本申请的一部分实施例,而不是全部的实施例。基于本说明书中记载的实施例,本领域普通技术人员所获得的所有其他实施例,都应当属于本申请公开的范围。
下面的描述涉及附图时,除非另有表示,不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本申请相一致的所有实施方式。相反,它们仅是与如所附权利要求书中所详述的、本申请的一些方面相一致的装置和方法的例子。
在本申请使用的术语是仅仅出于描述特定实施例的目的,而非旨在限制本申请。在本申请和所附权利要求书中所使用的单数形式的“一种”、“所述”和“该”也旨在包括多数形式,除非上下文清楚地表示其他含义。
应当理解,尽管在本申请可能采用术语第一、第二、第三等来描述各种对象,但这些信息不应限于这些术语。这些术语仅用来将同一类型的对象彼此区分开。例如,在不脱离本申请范围的情况下,第一对象也可以被称为第二对象,类似地,第二对象也可以 被称为第一对象。取决于语境,如在此所使用的词语“如果”可以被解释成为“在……时”或“当……时”或“响应于确定”。
目前,随着信息化技术的不断发展,各个行业内部一般会形成符合本行业规范的数据目录,方便本行业内部的数据检索。
例如,对于专利行业,通常可以根据专利号的编号规范形成行业内部的数据目录,方便针对专利进行数据检索,具体可以直接根据专利号查找唯一对应的专利文件。
针对零售行业,通常可以使用商品码的编号规范,形成零售行业内部的数据目录,方便根据商品码快速检索唯一对应的商品数据。
针对出版行业,通常可以使用版号的编号规范,形成出版行业内部的数据目录,方便根据版号快速检索唯一对应的书籍。
而随着大数据的发展,不同行业之间进行跨行业的数据融合的需求越来越多,但是由于不同行业之间数据目录的规范并不相同,可能存在不同行业的数据目录的编号出现重复的情况,导致不同行业之间的数据目录难以汇总。
为了便于理解,下面给出一个跨行业数据融合的示例。例如,在百货超市中,可能同时出售食品、服装、出版物、家用电器等等,这些货物分别属于不同行业。然而,为了方便货物的检索,可能要对这些分属于不同行业的货物进行统一编号,使得货物编号唯一对应于百货超市内的一件货物。
而在出版行业和零售行业进行数据融合时,由于商品码通常遵照零售行业内部约定的格式,可能会与出版行业的版号存在重复,如果直接使用零售行业内部的商品码作为百货超市的货物编号,则无法唯一对应于一件货物,例如通过一个货物编号可能同时检索到一个出版物和一件生活用品,从而导致在跨行业数据融合时难以形成统一的数据目录。
通过对上述问题的分析,行业内部的目录编号(例如,商品码)在本行业中唯一对应于一个数字对象,可以利用该目录编号唯一识别对应的数字对象。该数字对象可以包含指示行业内目标对象的各类属性的数据信息。例如,该目标对象是电视机生产厂商生产的电视机,属于家用电器行业;该目标对象对应的数字对象包含与电视机的各类属性相关的数据信息,例如,电视机的各类生产参数、售卖信息等。
在进行跨行业数据融合时,需要使得目录编号在数据融合之后仍然唯一对应于一个数字对象,才能方便形成统一的数据目录。
一种方法是只选择不会重复的目录编号进行跨行业数据融合,但这种方式会使得行业中的数字对象减少;另一种方法是针对跨行业数据融合后的全部数字对象进行统一编号,但这种方式耗费资源较多,并且对行业内部原本的数据目录进行了修改。
为了解决上述问题,本申请实施例提供了一种数据目录生成方法。
在本方法中,考虑到行业内部原本的数据目录已经形成,并且在行业内部唯一对应于一个数字对象,因此,可以针对行业分配一个唯一的标识前缀,由该标识前缀即可确定出该行业,将行业内部数据目录的目录编号作为标识后缀,将标识前缀和标识后缀进行拼接,得到拼接结果即为能够跨行业检索出的唯一标识。
这样,即使数据目录的目录编号在不同行业之间存在重复,由于不同行业的标识前缀不同,从而可以使得拼接结果在跨行业数据融合后仍然唯一对应于某个特定数字对象,从而方便形成新的统一的数据目录。
例如,零售行业内的商品码(目录编号)用111表示,出版行业内的版号(目录编号)也用111表示,则该商品码与版号出现重复。而针对零售行业分配唯一的标识前缀10,针对出版行业分配唯一的标识前缀11。经过标识前缀和标识后缀的拼接,零售行业内的商品码111被修改为10111,而出版行业内的版号111被修改为11111,可以得到不同的拼接结果,在行业之间分别唯一对应于一个数字对象。
本方法可以复用行业内部原有的数据目录,通过简单地针对行业分配唯一的标识前 缀,进而通过简单的拼接,得到跨行业唯一对应于一个数字对象的拼接结果,从而可以节约资源,方便生成跨行业的统一的数据目录,并且对于行业内部原本的数据目录改动较小,也无需减少数字对象,提高了数据目录的生成效率。
此外,针对一个需要进行跨行业数据融合的新行业,也可以直接分配一个唯一的标识前缀,可扩展性较强。
在本方法中,也可以直接利用handle系统中的本地句柄服务(Local Handle Service)与全局句柄注册(Global Handle Registry)帮助生成统一的数据目录,进一步提高开发效率。需要说明的是,本地句柄服务和全局句柄注册都可以看作是一种服务器或者服务器上部署的一种服务功能。例如,全局句柄注册能够看到所有的本地句柄服务,本地句柄服务可以认为是一个小的区域,本地句柄服务会把一些信息(如行业目录)上报给全局句柄注册,客户端要访问本地句柄服务时需要经由全局句柄注册,只有经过全局句柄注册后才能知道哪些对象位于哪个本地句柄服务。
具体地,可以利用数字对象架构(Digital Object Architecture,DOA)帮助生产数据目录,利用其中的本地句柄服务和全局句柄注册结构以及交互协议,例如,DOIP(Digital Object Interface Protocol)和DO-IRP(Digital Object Identifier Resolution Protocol)协议,进一步提高开发效率。其中DOIP用于规定客户端与服务端的交互协议,DO-IRP基于早期的Handle System,用于规定数字对象的创建、修改、解析过程。
通过利用DOA技术,能够形成跨行业的数据目录,并根据该数据目录进行跨行业数据检索。
下面结合附图,针对本申请实施例提供的一种数据目录生成方法进行详细解释。如图1所示,为本申请实施例提供的一种数据目录生成方法的流程示意图。该方法可以应用于跨行业数据系统中任一本地句柄服务。
其中,跨行业数据系统可以包括分别对应于不同行业用户的不同本地句柄服务。
在一种可选的实施例中,针对单个行业可能存在多个不同的商家,即使行业内部商家之间形成统一的编号规范,但是各个商家的产品数据通常难以共享,为了方便进行跨行业数据融合,综合更多的数字对象方便后续检索,需要综合多个商家的数据,因此需要针对各个商家部署本地句柄服务。
并且,在还未统一编号规范的行业内部,可能不同商家所生成的产品编号也存在重复的情况。针对这种情况,需要分别为每个商家分配标识前缀。为了便于描述,将独立的商家称为行业用户,单个行业用户唯一对应于一个行业,可以维护行业中的部分数字对象,而一个行业可以对应于一个或多个行业用户。
在另一种可选的实施例中,也可以使得一个行业用户唯一对应于一个行业,不同行业用户对应于不同行业,从而避免在行业内部已经形成统一的编号规范的情况下,针对该行业内的不同商家重复分配不同的标识前缀,浪费标识前缀的资源。
在又一种可选的实施例中,行业用户可以是行业内部的数据综合平台,可以对接行业内的多个商家,从而方便综合多个商家的目录编号。
可选地,由于一个行业用户唯一对应于一个行业,不同行业用户对应于不同行业,跨行业数据系统可以包括分别对应于不同行业的不同本地句柄服务,任一本地句柄服务对应的行业预先设置有原始数据目录。
在一种可选的实施例中,行业用户可以主动请求参与跨行业数据融合,在部署跨行业数据系统中的本地句柄服务后,行业用户可以与所部属的本地句柄服务构建对应关系,从而基于本地句柄服务生成新的数据目录,使得在跨行业数据融合之后,得到全局唯一的目录编号。
可选地,任一本地句柄服务对应的行业用户可以预先设置有原始数据目录;在原始数据目录中包含多个数字对象,不同数字对象分别对应于不同的原数据标识。换言之,在原始数据目录中的每个数字对象,都通过与该数字对象对应的原数据标识来唯一识别 数字对象。
需要说明的是,跨行业数据系统中的任意一个本地句柄服务都可以执行下述方法流程生成新数据目录,新数据目录中的目录编号在跨行业数据系统中全局唯一对应于一个数字对象。
如图1所示,该方法可以包括步骤S101和S102。
S101:获取本地句柄服务对应的句柄前缀。
可选地,跨行业数据系统中不同本地句柄服务分别对应于不同的句柄前缀。跨行业数据系统中句柄前缀唯一对应于一个本地句柄服务。
本方法流程并不限定本地句柄服务获取句柄前缀的方法,可以是直接从本地获取,也可以是申请分配对应的句柄前缀。
可选地,句柄前缀可以是预先申请分配并存储在本地句柄服务本地,也可以是无需申请而直接针对本地句柄服务预先分配的。
可选地,具体申请分配句柄前缀,可以是行业用户申请,也可以是本地句柄服务自行申请。
可选地,获取本地句柄服务对应的句柄前缀,可以包括:获取本地句柄服务对应的行业用户所申请的句柄前缀,作为对应的句柄前缀,所申请的句柄前缀唯一对应于该本地句柄服务对应的行业用户。
可选地,获取本地句柄服务对应的句柄前缀,可以包括:获取本地句柄服务所申请的句柄前缀,所申请的句柄前缀唯一对应于该本地句柄服务。
在一种可选的实施例中,全局句柄注册可以用于管理句柄前缀,为申请句柄前缀的本地句柄服务或行业用户分配唯一对应的句柄前缀。全局句柄注册可以存储有句柄前缀与本地句柄服务之间的对应关系集合,也可以存储有句柄前缀与行业用户之间的对应关系集合。
因此,可选地,申请句柄前缀可以包括:向全局句柄注册申请与申请句柄前缀的本地句柄服务或行业用户唯一对应的句柄前缀。
在一种可选的实施例中,在DOA中,存在DONA MPA成员,用于维护全局句柄注册,并为各个行业用户分配句柄前缀。其中,DONA为负责全局句柄注册的整体管理的组织,MPA(Multi-Primary Administrator,多主管理员)可以认为是该组织中的根管理员,例如,全局句柄注册可以通过一个或多个由DONA授权和认证的MPA与公众进行交互。
S102:针对对应行业用户预先设置的原始数据目录,将每个数字对象对应的原数据标识作为句柄后缀,与所获取的对应句柄前缀进行拼接,得到该数字对象对应的新数据标识,并根据所得到的新数据标识生成新数据目录。
可选地,由于本地句柄服务所获取的对应句柄前缀是跨行业数据系统中全局唯一的,因此,通过与原始数据目录中的原数据标识进行拼接,可以得到跨行业数据系统中全局唯一的拼接结果。因此,所得到的新数据标识在跨行业数据系统中唯一对应于一个数字对象。
可选地,将句柄前缀和句柄后缀拼接在一起可以得到句柄,句柄可以作为数字对象对应的新数据标识。
可选地,由于新数据标识在跨行业数据系统中全局唯一对应于一个数字对象,因此,可以将新数据目录看作是跨行业数据目录的一部分,方便之后生成跨行业数据目录。
可选地,本方法流程并不限定所生成的新数据目录的存储位置,可以分布式存储在各个本地句柄服务中,即任一本地句柄服务在生成新数据目录后存储到本地;此外,由于新数据目录中的新数据标识全局唯一对应于一个数字对象,因此,也可以综合跨行业数据系统各个本地句柄服务所生成的新数据目录,将这些新数据目录直接合并为一个综合数据目录,统一存储到一个指定位置。
在一种可选的实施例中,由于使用了handle系统中的本地句柄服务,因此,通常需要将所拼接的句柄进行注册,具体可以是向全局句柄注册进行注册。
可选地,上述方法流程还可以包括:将所得到的新数据标识发送给全局句柄注册。可选地,全局句柄注册可以用于汇总跨行业数据系统中各个本地句柄服务所得到的新数据标识。
可选地,全局句柄注册可以针对跨行业数据系统中各个本地句柄服务所拼接的句柄(也就是所得到的新数据标识)进行注册和存储,具体可以是将本地句柄服务、本地句柄服务对应的句柄和本地句柄服务的相关信息相关联地存储,以方便后续检索。本地句柄服务的相关信息,例如有本地句柄服务的地址信息等等,本申请并不具体进行限制。
此外,在进行跨行业数据融合时,还存在数据目录的元数据信息不明确的问题。
元数据信息具体可以是数字对象包括的一个或多个字段的字段信息,每个字段对应一个或多个字段值。由于行业内部可能已经形成统一的规范,对于部分字段可以在行业内部唯一表示一种含义,但是在跨行业场景中,由于不同行业的规范不同,部分字段信息可能在跨行业场景中产生歧义。
例如,在车辆行业内部,“标识”字段可以毫无疑义地表征车辆的标识,但是在跨行业数据融合后,“标识”字段本身具有广泛的含义,可能被其他行业的用户误解为其他产品的标识。
并且,不同行业之间,可能存在实际含义不同的相同名词作为字段,例如,在软件行业内部,可以使用“召回率”衡量算法的性能,而在零售行业内部,可以使用“召回率”衡量产品的质量。
因此,在进行跨行业数据融合时,还需要进一步明确不同行业之间的元数据信息所表示的实际含义,而不需要遵守行业内部原本统一的规范。
例如,针对车辆行业内部的“标识”字段,可以更新为“车辆标识”,以便于其他行业的用户查看。
当然,如果原本的元数据信息已经足以表示实际含义,也就无需进行更新。
在一种可选的实施例中,上述方法流程中的S102还可以包括:针对对应行业用户预先设置的原始数据目录,更新目录元数据信息。
可选地,具体可以是针对对应行业用户预先设置的原始数据目录,根据对应行业用户的信息更新目录元数据信息。
本方法流程并不具体限定更新目录元数据信息的方法,只要更新后的目录元数据信息可以表现出实际的业务含义,不会被其他行业的人员误解即可。
作为示例性说明,可选地,可以由对应行业用户的业务人员针对预先设置的原始数据目录,更新目录元数据信息。具体可以是更新为目录元数据信息的实际含义。
可选地,由于行业内部统一规范的元数据信息通常是因为省略行业信息而导致的含义不明确,因此,具体更新目录元数据信息,可以是在原始数据目录的目录元数据信息的基础上,增加对应行业用户的信息。行业用户的信息至少可以包括行业信息。
例如,针对车辆行业内部的“标识”字段,可以添加车辆行业信息“车辆”,得到“车辆标识”,从而明确该字段的实际含义;针对软件行业内部的“召回率”字段,可以添加软件行业信息“软件算法”,得到“软件算法召回率”,从而可以明确该字段的实际含义。
当然,由于行业用户可以是商户,因此,对应行业用户的信息也可以包括商户品牌。
并且,本实施例还可以保留原始数据目录中的目录元数据信息,尽量避免修改,也可以提高更新效率。
相对应地,在更新目录元数据信息后,需要利用更新后的目录元数据信息生成新数据目录。
因此,可选地,根据所得到的新数据标识生成新数据目录,可以包括:根据更新后 的目录元数据信息和所得到的新数据标识生成新数据目录,所生成的新数据目录的目录元数据信息为更新后的目录元数据信息,目录编号为所得到的新数据标识。
通过上述实施例,可以在跨行业场景中,进一步解决元数据信息规范不统一导致的问题,方便跨行业场景中其他行业的人员可以理解某一行业的元数据信息。
在一种具体的实施例中,在DOA中,可以使用DOA所提供的Type机制,针对目录元数据信息进行重新定义,从而方便根据所定义的Type生成新数据目录。
此外,还可以借助Type机制,直接绑定本地句柄服务的对应句柄前缀和所定义的Type集合(包含更新后的目录元数据信息),方便后续获取更新后的目录元数据信息。并且,由于借助现有机制进行更新,可以节约开发成本,提高开发效率。
上述方法流程通过将跨行业数据系统中全局唯一对应于一个本地句柄服务的句柄前缀,与本地句柄服务中唯一对应于一个数字对象的原数据标识进行拼接,得到跨行业数据系统中唯一对应于一个数字对象的新数据标识,从而可以根据所得到的新数据标识生成新数据目录,使得新数据目录中的目录编号可以在跨行业数据系统中唯一对应于一个数字对象,从而方便进行跨行业数据融合,便于生成跨行业的数据目录。
并且,由于复用了原数据标识,一方面可以尽量保留原数据标识,减少对原始数据目录的修改,可以无侵入地快速接入原始数据目录,另一方面可以节约资源,提高数据目录的生成效率,便于快速完成跨行业数据目录的构建。
此外,还可以通过更新目录元数据信息,提高跨行业场景中目录元数据信息的可解释性,方便其他行业的人员理解某一行业的目录元数据信息。
为了便于理解,下面给出一种具体的应用实施例。
在DOA中可以包括全局句柄注册和本地句柄服务。
全局句柄注册:由DONA MPA成员来维护,负责为各行业用户分配句柄前缀(handle前缀)。
本地句柄服务:由行业用户来维护,需要提交目录上报到全局句柄服务(Global Handle Service,GHS)和实现DOIP服务的能力。
行业目录:即原始数据目录,可以由行业用户来维护,通常是行业内原有的目录系统或者资产管理系统。
DO-IRP是DOA的核心,其本质上是一个Key,Value数据库提供了根据Key解析Value的能力。在DO-IRP中Key也称作句柄(handle),协议规定的handle格式如下:前缀/后缀,其中前缀由DONA MPA成员统一管理,一个前缀只能分配给一个用户,后缀由用户自己管理。
在DOA中handle可用于唯一的标记一个数字对象。在跨行业数据目录的场景下,可以把全局的目录编号看成handle,可以为每个行业用户分配一个前缀,由于行业用户内部目录编号已经是唯一的,因此只需要把行业内目录编号作为后缀与前缀进行拼接即可形成一个全局唯一的handle。
这种方式来生成全局唯一的目录编号,对行业用户无侵入。
因此,具体的步骤可以如下所示。
1.行业用户向MPA成员申请对应的handle前缀。
2.行业用户部署对应行业内的本地句柄服务。
3.行业用户通过DOA Type机制在本地句柄服务上定义行业目录的目录元数据信息。
4.从原有的数据目录或者资产管理系统上获取行业目录信息。
5.把行业目录(根据上述步骤3中定义的目录元数据信息形成的行业目录)导入到所部署的本地句柄服务中,其中handle前缀是MPA成员分配的,后缀为原有的行业内目录编号。
6.本地句柄服务通过handle协议把handle注册到全局句柄注册。
其中,DOA可以是由全局句柄注册和本地句柄服务组成的,其中全局句柄注册汇 聚了所有handle信息,本地句柄服务用于存储handle对应的数字对象的具体内容。可以认为全局句柄注册是一个大的路由表,记录了handle到本地句柄服务的映射关系,后续可以通过handle查找到本地句柄服务的位置,然后到本地句柄服务上获取handle对应的具体的数据。
在跨行业数据目录的场景中,可以通过全局句柄注册来完成所有目录编号(handle)的汇聚,同时可以利用本地句柄服务来存储handle对应的具体的行业目录。
针对DOA Type机制,DOA协议本身不关心数据的具体内容,其把数据格式的定义交给了使用者。DOA通过DOIP协议提供了Type机制,通过Type可以完成数据对象的格式定义,这里,数据对象的格式可以指数据对象的存在方式,行业数据目录可以认为是一种数据对象,因此行业用户通过Type机制可以很容易的定义出本行业的目录元数据信息,因此可以低成本的完成目录编号的汇聚。另外,利用Type机制可以完成新数据目录的格式定义,以定义新数据目录的目录元数据信息,从而生成更新后的目录元数据信息。
整个过程相对来说比较简单。开发的成本很低,借用DOA的能力,可以快速的推动各个行业快速的完成目录编号的汇聚工作,将目录编号汇聚到全局句柄注册中,并且各个行业用户的行业目录可以存储在各个本地句柄服务中。
在上述方法流程生成新数据目录之后,可以根据所生成的新数据目录提供跨行业数据的检索能力。
在一种可选的实施例中,全局句柄注册可以存储有跨行业数据系统中各个本地句柄服务所得到的新数据标识,从而可以根据全局句柄注册所存储的新数据标识进行检索。
可选地,针对包含新数据标识的检索请求,全局句柄注册可以先确定是否存在该请求所包含的新数据标识,在存在该新数据标识的情况下,可以进一步确定对应的本地句柄服务,从本地句柄服务中继续进行检索,得到该新数据标识对应的数字对象并返回。
换言之,针对不同行业的检索请求,可以由全局句柄注册统一处理,提供跨行业的数据检索能力。
因此,本说明书实施例还提供一种基于上述方法流程的数据检索方法,可以根据所生成的新数据目录提供跨行业数据的检索能力。如图2所示,为本说明书实施例提供的一种数据检索方法的流程示意图。
在一种可选的实施例中,由于全局句柄注册通常是由DONA MPA成员进行维护,通常难以进行开发,为了方便进行开发,可选地,可以针对跨行业数据系统新增一个管理设备,该管理设备可以对接跨行业数据系统中的若干本地句柄服务,提供跨行业数据的检索能力。可选地,该管理设备可以由管理用户进行维护,管理用户具体可以是提供检索能力的业务方。
如图2所示,该方法可以应用于管理设备,具体可以包括步骤S201至S203。
S201:接收目标用户发送的检索请求;检索请求至少包含检索条件。
检索请求用于请求获取符合所包含的检索条件的数字对象。
可选地,目标用户可以是任一具有检索需求的用户,例如,个人、商家、机构等。为了便于描述,将任一具有检索需求的用户称为目标用户。
S202:查找能够响应检索请求的目标本地句柄服务。
可选地,能够响应检索请求的目标本地句柄服务可以是可能存储有符合所包含的检索条件的数字对象的本地句柄服务,具体地,目标本地句柄服务可以是根据检索请求所指定的本地句柄服务,也可以是符合检索条件的本地句柄服务。
S203:针对所查找到的目标本地句柄服务的新数据目录,检索符合检索条件的数字对象,并向目标用户返回检索结果。
本方法流程并不限定检索请求或者检索条件的形式,可选地,可以是目标用户通过检索式或者关键字等形式进行检索,生成检索请求和检索条件。
由于数据目录本身可以提供检索能力,具体可以根据数据目录检索符合检索条件的数字对象,例如,根据目录编号(即新数据标识)直接检索对应的数字对象,或者根据某一数字对象属性值(例如,产品类型的属性值为电子产品)检索具有该属性值的数字对象,因此,可以先确定能够响应检索请求的本地句柄服务,缩小检索范围,针对本地句柄服务所生成的新数据目录进行快速检索。
上述方法流程可以借助本地句柄服务所生成的新数据目录的检索能力,通过确定能够响应检索请求的目标本地句柄服务,进行快速检索,从而可以根据不同本地句柄服务的新数据目录提供多种检索能力,也就可以提供跨行业数据的检索能力。
并且,由于管理设备作为中间方,一方面对接若干本地句柄服务,另一方面对应若干目标用户,可以节约连接资源,避免针对每次检索请求单独建立目标用户与本地句柄服务之间的连接。管理设备可以保持与若干本地句柄服务之间的连接,接收若干目标用户的检索请求。
此外,管理设备可以提供本地句柄服务的相关信息,避免目标用户在检索时不清楚所需要检索的本地句柄服务,或者不知道所需要检索的本地句柄服务的地址信息,从而无法建立连接。
而针对查找能够响应检索请求的目标本地句柄服务,本方法流程并不限定具体的实现方法,作为一种示例性说明,可以根据检索请求所包含的内容查找目标本地句柄服务。
在一种可选的实施例中,管理设备可以存储有跨行业数据系统中本地句柄服务的标识与对应行业用户信息之间的第一对应关系集合。
可选地,第一对应关系集合可以是管理设备与对接的若干本地句柄服务进行交互获取的,第一对应关系集合中可以包括跨行业数据系统中一个或多个本地句柄服务的标识。
可选地,本地句柄服务的标识具体可以是唯一对应于本地句柄服务的信息,例如,本地句柄服务的地址信息,具体可以是本地句柄服务的物理地址或者IP,可以方便根据本地句柄服务的地址信息建立连接。
可选地,对应行业用户信息可以包括行业标识、行业用户标识、行业用户的地址信息等等中的至少一项。
查找能够响应检索请求的目标本地句柄服务,可以包括:在确定检索请求包含行业用户信息的情况下,在所存储的第一对应关系集合中查找所包含的行业用户信息对应的本地句柄服务的标识,将所查找到的本地句柄服务的标识对应的本地句柄服务确定为目标本地句柄服务。
本实施例通过检索请求中的行业用户信息,可以确定检索请求所针对的行业用户,所确定的行业用户可以是一个或多个。相对应地,所查找到的目标本地句柄服务可以是一个或多个。
例如,检索请求中可以包含行业用户信息为“零售行业”,因此,可以确定“零售行业”的每个行业用户所对应的本地句柄服务为目标本地句柄服务。
在另一种可选的实施例中,管理设备可以存储有跨行业数据系统中本地句柄服务的标识与对应句柄前缀之间的第二对应关系集合。
可选地,第二对应关系集合可以是管理设备与对接的若干本地句柄服务进行交互获取的,第二对应关系集合中可以包括跨行业数据系统中一个或多个本地句柄服务的标识。
可选地,由于句柄前缀可以唯一对应于一个本地句柄服务,因此,可以根据句柄前缀查找到唯一对应的本地句柄服务,并将该本地句柄服务确定为目标本地句柄服务。
查找能够响应检索请求的目标本地句柄服务,可以包括:在确定检索请求包括目标数据标识的情况下,在所存储的第二对应关系集合中,查找目标数据标识中句柄前缀对应的本地句柄服务的标识,将所查找到的本地句柄服务的标识对应的本地句柄服务确定为目标本地句柄服务。
本实施例通过检索请求中的目标数据标识的句柄前缀,可以快速地查找到唯一对应 的本地句柄服务。
在另一种可选的实施例中,全局句柄注册可以存储有跨行业数据系统中本地句柄服务的标识与所得到的新数据标识之间的第三对应关系集合。
可选地,第三对应关系集合可以是全局句柄注册与跨行业数据系统中的本地句柄服务进行交互获取的,第三对应关系集合中可以包括跨行业数据系统中一个或多个本地句柄服务的标识。
查找能够响应检索请求的目标本地句柄服务,可以包括:在确定检索请求包括目标数据标识的情况下,管理设备从全局句柄注册的第三对应关系集合中,查找目标数据标识对应的本地句柄服务的标识,将所查找到的本地句柄服务的标识对应的本地句柄服务确定为目标本地句柄服务。
可选地,目标数据标识可以是任一新数据标识,为了便于描述,将任一新数据标识称为目标数据标识。
本实施例通过检索请求中的目标数据标识,可以利用全局句柄注册所存储的第三对应关系集合,一方面确定目标数据标识是否成功注册到全局句柄注册,另一方面也可以快速查找到目标数据标识唯一对应的本地句柄服务。
可选地,全局句柄注册也可以存储跨行业数据系统中本地句柄服务的标识与对应句柄前缀之间的第二对应关系集合。具体的查找目标本地句柄服务的方法可以参见上述实施例。
例如,查找能够响应检索请求的目标本地句柄服务,可以包括:在确定检索请求包括目标数据标识的情况下,管理设备从全局句柄注册的第二对应关系集合中,查找目标数据标识中句柄前缀对应的本地句柄服务的标识,将所查找到的本地句柄服务的标识对应的本地句柄服务确定为目标本地句柄服务。
可选地,上述目标数据标识和/或行业用户信息可以包含在检索条件中。
此外,在一种可选的实施例中,如果无法查找到目标本地句柄服务,例如,检索请求中包含的内容较少,不存在能够确定目标本地句柄服务的数据,则可以直接针对跨行业数据系统中的每个本地句柄服务进行检索。
可选地,在无法查找到能够响应检索请求的目标本地句柄服务的情况下,针对跨行业数据系统中的每个本地句柄服务的新数据目录,检索符合检索条件的数字对象,并向目标用户返回检索结果。
需要说明的是,上述查找目标本地句柄服务的实施例可以同时使用,使得针对不同类型的检索请求都可以进行检索。
为了便于理解,下面给出一种具体的应用实施例。
如图3所示,为本说明书实施例提供的一种数据检索方法的原理示意图。
其中,针对检索请求,可以首先判断其中是否包含有目标数据标识,如果包含目标数据标识,则可以直接检索到目标数据标识唯一对应的目标本地句柄服务,以便于后续根据检索条件进行检索。
如果不包含目标数据标识,则继续判断其中是否包含有行业用户信息,如果包含行业用户信息,则可以直接检索到行业用户信息对应的目标本地句柄服务,以便于后续根据检索条件进行检索。行业用户信息可以对应于一个或多个目标本地句柄服务。
如果不包含行业用户信息,则可以针对每个本地句柄服务根据检索条件进行检索。
在一种可选的实施例中,如果检索到符合检索条件的数字对象,则可以将检索到的数字对象作为检索结果返回给目标用户;如果没有检索到符合检索条件的数字对象,则可以将用于表征未检索到符合检索条件的数字对象的信息作为检索结果返回给目标用户。
可选地,在检索过程中可能存在若干符合检索条件的数字对象。后续可以针对若干符合检索条件的数字对象进行排序或者筛选。
例如,针对检索条件为“价格在50以下的零售产品”,也可以返回全部或部分符合检索条件的数字对象。
除了上述的检索,管理设备还可以针对跨行业数据系统中的各个本地句柄服务的新数据目录进行汇总,得到跨行业数据目录。
具体可以是直接获取跨行业数据系统中的各个本地句柄服务发送的新数据目录进行汇总,得到跨行业数据目录,方便针对跨行业数据目录进行展示。
此外,由于数据目录中的目录编号通常没有实际含义,因此,可以只选择数据目录中的部分或全部元数据信息进行展示,例如,数字对象的属性类型、属性值取值范围、各个字段的业务类型和描述信息等等。
因此,可选地,上述方法流程还可以包括:接收跨行业数据系统中各个本地句柄服务发送的新数据目录信息,汇总得到跨行业数据目录;新数据目录信息至少包括新数据目录的目录元数据信息。
可选地,管理设备还可以用于展示所得到的跨行业数据目录。
可选地,管理设备可以用于接收跨行业数据系统中本地句柄服务所生成的新数据目录,例如可以是接收跨行业数据系统中一个或多个本地句柄服务所生成的新数据目录,并将所接收到不同本地句柄服务的新数据目录合并为跨行业数据目录。
其中,由于新数据目录的目录编号(即新数据标识)在跨行业数据系统中全局唯一,因此可以直接进行合并,得到跨行业数据目录。
跨行业数据目录中可以包含不同行业用户的数字对象以及对应的目录编号。
针对这一跨行业数据目录,管理设备还可以向有需求的用户展示跨行业数据目录。具体地,在受到数据安全方面的影响或者显示资源不足的情况下,管理设备可以根据用户需求,展示跨行业数据目录中的部分数据,具体可以是部分数字对象,或者数字对象的部分内容。
例如,在跨行业的购物场景中,某购物平台可以对接若干行业,例如零售行业、电器行业、服务行业等等。用户在并未进行商品检索时,可能希望随机获取推荐的商品,因此,可以由管理设备向用户展示跨行业数据目录中的部分产品或者产品的部分字段值。
而在具体展示跨行业数据目录时,可选地,为了便于用户查看,管理设备可以获取本地句柄服务的新数据目录的目录元数据信息,进而可以将跨行业数据目录中的数字对象与对应的目录元数据信息结合后进行展示。
可选地,目录元数据信息可以包括数字对象的一个或多个字段的字段信息,可以将数字对象的字段值与对应的字段信息结合后向用户展示,使得用户可以查看到字段值和字段信息的对应关系,从而确定字段值的实际业务含义。
例如,用户查看到数字对象中的字段值60,通常难以理解,在结合对应的字段信息“零售产品价格”后,可以明确字段值的实际业务含义。
当然,在之前的实施例中,针对行业用户的原始数据目录元数据信息可以进行更新,使得跨行业场景中也可以方便其他行业的人员理解更新后的目录元数据信息。
因此,可选地,这里的目录元数据信息可以是更新后的目录元数据信息。
在一种可选的实施例中,管理设备可以存储有跨行业数据系统中本地句柄服务所生成的新数据目录的目录元数据信息。
可选地,任一本地句柄服务针对对应行业用户预先设置的原始数据目录,根据对应行业用户的信息更新目录元数据信息,并且根据更新后的目录元数据信息和所得到的新数据标识生成新数据目录;所生成的新数据目录的目录元数据信息为更新后的目录元数据信息,目录编号为所得到的新数据标识。在这种情况下,管理设备所存储的新数据目录的目录元数据信息可以是更新后的目录元数据信息。
可选地,管理设备可以接收跨行业数据系统中本地句柄服务所生成的新数据目录,并将所接收到的不同本地句柄服务的新数据目录合并为跨行业数据目录,并根据用户需 求确定跨行业数据目录中的需求数字对象,再将所确定的需求数字对象结合其所在新数据目录的目录元数据信息(具体可以是字段信息),返回给具有该用户需求的用户(目标用户),具体也可以是展示给具有该用户需求的用户。
例如,针对用户获取推荐产品的需求,管理设备确定跨行业数据目录中针对该用户所推荐的产品数据,并结合产品数据所在新数据目录的字段信息,返回给该用户,具体地,例如可以是“产品类型:电器,产品品牌:xx牌,产品价格:60元”。
而在具体向目标用户返回检索结果时,可选地,可以直接向目标用户展示检索结果。
可选地,在检索结果包含检索到的数字对象的情况下,可以将该检索到的数字对象结合目录元数据信息返回给目标用户,从而方便目标用户查看。具体可以参见上述实施例。
因此,在一种可选的实施例中,管理设备可以存储有跨行业数据系统中本地句柄服务所生成的新数据目录的目录元数据信息。
可选地,管理设备可以存储有跨行业数据系统中一个或多个本地句柄服务所生成的新数据目录的目录元数据信息。可选地,管理设备可以从跨行业数据系统中本地句柄服务获取新数据目录的目录元数据信息。
可选地,管理设备存储的可以是本地句柄服务的标识与本地句柄服务对应生成的新数据目录的目录元数据信息之间的第四对应关系集合。本地句柄服务的标识具体可以是本地句柄服务对应的句柄前缀或者地址信息。在这种情况下,在向目标用户返回检索结果时,可以根据步骤S202中查找到的目标本地句柄服务的标识直接确定出对应的新数据目录的目录元数据信息,然后,可以将检索到的符合检索条件的数字对象,结合所确定的目录元数据信息发送给目标用户。
可选地,向目标用户返回检索结果,可以包括:在检索结果包括数字对象的情况下,确定目标本地句柄服务所生成的新数据目录的目录元数据信息。将检索结果中包含的数字对象,结合所确定的目录元数据信息发送给目标用户。
可选地,也可以是将检索结果中包含的数字对象,结合所确定的目录元数据信息展示给目标用户。
可选地,目录元数据信息可以是字段信息,具体结合数字对象和目录元数据信息,可以是将检索结果中包含的数字对象字段值,填充到对应的字段信息。
此外,可选地,所确定的目录元数据信息可以是更新后的目录元数据信息。具体可以参见上述实施例。
本实施例可以通过结合目录元数据信息,方便目标用户查看检索结果。
在一种具体的实施例中,在DOA中行业用户可以使用DOA的Type机制重新定义目录元数据信息(即生成更新后的目录元数据信息)后,可以得到本地句柄服务对应的Type集合,其中包含更新后的目录元数据信息。这些行业用户定义的Type集合可以存储到管理设备中,由于Type集合直接对应于本地句柄服务,具体可以是对应于本地句柄服务被分配的对应句柄前缀,因此,管理设备可以直接利用Type机制获取到跨行业数据系统中本地句柄服务所生成的新数据目录的目录元数据信息(也就是更新后的目录元数据信息)。
对应于上述方法流程,本说明书实施例还提供了一种装置实施例。
如图4所示,为本说明书实施例提供的一种数据目录生成装置的结构示意图。该装置可以应用于跨行业数据系统中任一本地句柄服务。
跨行业数据系统可以包括分别对应于不同行业用户的不同本地句柄服务;本地句柄服务对应的行业用户预先设置有原始数据目录;在原始数据目录中,不同数字对象分别对应于不同的原数据标识。
如图4所示,该装置可以包括单元401和402。
获取单元401,用于获取本地句柄服务对应的句柄前缀,跨行业数据系统中不同本 地句柄服务分别对应于不同的句柄前缀。
拼接单元402,用于针对对应行业用户预先设置的原始数据目录,将每个数字对象对应的原数据标识作为句柄后缀,与所获取的对应句柄前缀进行拼接,得到该数字对象对应的新数据标识,并根据所得到的新数据标识生成新数据目录。
可选地,该装置还可以包括发送单元403,用于将所得到的新数据标识发送给全局句柄注册,全局句柄注册用于汇总跨行业数据系统中各个本地句柄服务所得到的新数据标识。
可选地,获取单元401可以用于:获取本地句柄服务对应的行业用户所申请的句柄前缀,作为对应的句柄前缀,所申请的句柄前缀唯一对应于该本地句柄服务对应的行业用户。
可选地,该装置还可以包括更新单元403,用于针对对应行业用户预先设置的原始数据目录,根据对应行业用户的信息更新原始数据目录的目录元数据信息。
拼接单元402可以用于:根据更新后的目录元数据信息和所得到的新数据标识生成新数据目录;所生成的新数据目录的目录元数据信息为更新后的目录元数据信息,目录编号为所得到的新数据标识。
可选地,更新单元403可以用于:利用Type机制重新定义所述原始数据目录的目录元数据信息,以生成所述更新后的目录元数据信息。
如图5所示,为本说明书实施例提供的一种数据检索装置的结构示意图。
该装置是针对数据目录的数据检索装置,其中数据目录基于上述数据目录生成装置而生成,该数据检索装置可以应用于管理设备;管理设备是针对跨行业数据系统新增的设备。
如图5所示,该数据检索装置可以包括单元501至503。
接收单元501,用于接收目标用户发送的检索请求,检索请求至少包含检索条件;检索请求用于请求获取符合所包含的检索条件的数字对象。
查找单元502,用于查找能够响应检索请求的目标本地句柄服务。
结果单元503,用于针对所查找到的目标本地句柄服务的新数据目录,检索符合检索条件的数字对象,并向目标用户返回检索结果。
可选地,管理设备存储有跨行业数据系统中本地句柄服务的标识与对应行业用户信息之间的第一对应关系集合。
查找单元502具体可以用于:在确定检索请求包含行业用户信息的情况下,在所存储的第一对应关系集合中查找所包含的行业用户信息对应的本地句柄服务的标识,将所查找到的本地句柄服务的标识对应的本地句柄服务确定为目标本地句柄服务。
可选地,所述行业用户信息可以包括行业标识、行业用户标识、行业用户的地址信息中的至少一项。
可选地,管理设备存储有跨行业数据系统中本地句柄服务的标识与对应句柄前缀之间的第二对应关系集合。
查找单元502具体可以用于:在确定检索请求包括目标数据标识的情况下,在所存储的第二对应关系集合中,查找目标数据标识中句柄前缀对应的本地句柄服务的标识,将所查找到的本地句柄服务的标识对应的本地句柄服务确定为目标本地句柄服务。
可选地,全局句柄注册存储有所述跨行业数据系统中本地句柄服务的标识与对应句柄前缀之间的第二对应关系集合。
查找单元502具体可以用于:在确定所述检索请求包括目标数据标识的情况下,所述管理设备从所述全局句柄注册的所述第二对应关系集合中,查找所述目标数据标识中句柄前缀对应的本地句柄服务的标识,将所查找到的本地句柄服务的标识对应的本地句柄服务确定为目标本地句柄服务。
可选地,全局句柄注册存储有跨行业数据系统中本地句柄服务的标识与所得到的新 数据标识之间的第三对应关系集合。
查找单元502具体可以用于:在确定检索请求包括目标数据标识的情况下,管理设备从全局句柄注册的第三对应关系集合中,查找目标数据标识对应的本地句柄服务的标识,将所查找到的本地句柄服务的标识对应的本地句柄服务确定为目标本地句柄服务。
可选地,该装置还可以包括返回单元,用于在无法查找到能够响应检索请求的目标本地句柄服务的情况下,针对跨行业数据系统中的每个本地句柄服务的新数据目录,检索符合检索条件的数字对象,并向目标用户返回检索结果。
可选地,该装置还可以包括汇总单元,用于接收跨行业数据系统中各个本地句柄服务发送的新数据目录信息,汇总得到跨行业数据目录;新数据目录信息至少包括新数据目录的目录元数据信息;管理设备还用于展示所得到的跨行业数据目录。
可选地,管理设备存储有跨行业数据系统中本地句柄服务所生成的新数据目录的目录元数据信息。
结果单元503具体可以用于:在检索结果包括数字对象的情况下,确定目标本地句柄服务所生成的新数据目录的目录元数据信息;将检索结果中包含的数字对象,结合所确定的目录元数据信息展示给所述目标用户。
可选地,新数据目录的目录元数据信息可以包括字段信息,结果单元503具体可以用于:将数字对象的字段值与对应的字段信息结合后发送给目标用户。
可选地,管理设备存储有跨行业数据系统中本地句柄服务的标识与本地句柄服务对应生成的新数据目录的目录元数据信息之间的第四对应关系集合,结果单元503具体可以用于:根据查找到的目标本地句柄服务的标识确定对应的新数据目录的目录元数据信息;将检索到的符合检索条件的数字对象,结合所确定的目录元数据信息发送给目标用户。
可选地,管理设备可以用于接收跨行业数据系统中本地句柄服务所生成的新数据目录,并将所接收到不同本地句柄服务的新数据目录合并为跨行业数据目录。
上述装置实施例的具体解释可以参见上述方法流程的解释。
本说明书实施例还提供一种电子设备,包括处理器及存储器;存储器存储有可被处理器调用的程序;其中,处理器执行该程序时,实现如上述任一方法实施例所述的数据目录生成方法或者数据检索方法。
本说明书实施例还提供一种机器可读存储介质,其上存储有程序,该程序被处理器执行时,实现如上述任一方法实施例所述的数据目录生成方法或者数据检索方法。
本说明书实施例还提供一种计算机设备,其至少包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,其中,处理器执行所述程序时实现如上述任一方法实施例所述的数据目录生成方法或者数据检索方法。
图6示出了本说明书实施例所提供的一种更为具体的计算机设备硬件结构示意图,该设备可以包括:处理器1010、存储器1020、输入/输出接口1030、通信接口1040和总线1050。其中处理器1010、存储器1020、输入/输出接口1030和通信接口1040通过总线1050实现彼此之间在设备内部的通信连接。
处理器1010可以采用通用的CPU(Central Processing Unit,中央处理器)、微处理器、应用专用集成电路(Application Specific Integrated Circuit,ASIC)、或者一个或多个集成电路等方式实现,用于执行相关程序,以实现本说明书实施例所提供的技术方案。
存储器1020可以采用ROM(Read Only Memory,只读存储器)、RAM(Random Access Memory,随机存取存储器)、静态存储设备,动态存储设备等形式实现。存储器1020可以存储操作系统和其他应用程序,在通过软件或者固件来实现本说明书实施例所提供的技术方案时,相关的程序代码保存在存储器1020中,并由处理器1010来调用执行。
输入/输出接口1030用于连接输入/输出模块,以实现信息输入及输出。输入输出/ 模块可以作为组件配置在设备中(图中未示出),也可以外接于设备以提供相应功能。其中输入设备可以包括键盘、鼠标、触摸屏、麦克风、各类传感器等,输出设备可以包括显示器、扬声器、振动器、指示灯等。
通信接口1040用于连接通信模块(图中未示出),以实现本设备与其他设备的通信交互。其中通信模块可以通过有线方式(例如USB、网线等)实现通信,也可以通过无线方式(例如移动网络、WIFI、蓝牙等)实现通信。
总线1050包括一通路,在设备的各个组件(例如处理器1010、存储器1020、输入/输出接口1030和通信接口1040)之间传输信息。
需要说明的是,尽管上述设备仅示出了处理器1010、存储器1020、输入/输出接口1030、通信接口1040以及总线1050,但是在具体实施过程中,该设备还可以包括实现正常运行所必需的其他组件。此外,本领域的技术人员可以理解的是,上述设备中也可以仅包含实现本说明书实施例方案所必需的组件,而不必包含图中所示的全部组件。
本说明书实施例还提供一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现如上述任一方法实施例所述的数据目录生成方法或者数据检索方法。
计算机可读介质包括永久性和非永久性、可移动和非可移动媒体可以由任何方法或技术来实现信息存储。信息可以是计算机可读指令、数据结构、程序的模块或其他数据。计算机的存储介质的例子包括,但不限于相变内存(PRAM)、静态随机存取存储器(SRAM)、动态随机存取存储器(DRAM)、其他类型的随机存取存储器(RAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、快闪记忆体或其他内存技术、只读光盘只读存储器(CD-ROM)、数字多功能光盘(DVD)或其他光学存储、磁盒式磁带,磁盘存储或其他磁性存储设备或任何其他非传输介质,可用于存储可以被计算设备访问的信息。按照本文中的界定,计算机可读介质不包括暂存电脑可读媒体(transitory media),如调制的数据信号和载波。
通过以上的实施方式的描述可知,本领域的技术人员可以清楚地了解到本说明书实施例可借助软件加必需的通用硬件平台的方式来实现。基于这样的理解,本说明书实施例的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品可以存储在存储介质中,如ROM/RAM、磁碟、光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本说明书实施例各个实施例或者实施例的某些部分所述的方法。
上述实施例阐明的系统、装置、模块或单元,具体可以由计算机芯片或实体实现,或者由具有某种功能的产品来实现。一种典型的实现设备为计算机,计算机的具体形式可以是个人计算机、膝上型计算机、蜂窝电话、相机电话、智能电话、个人数字助理、媒体播放器、导航设备、电子邮件收发设备、游戏控制台、平板计算机、可穿戴设备或者这些设备中的任意几种设备的组合。
本说明书中的各个实施例均采用递进的方式描述,各个实施例之间相同相似的部分互相参见即可,每个实施例重点说明的都是与其他实施例的不同之处。尤其,对于装置实施例而言,由于其基本相似于方法实施例,所以描述得比较简单,相关之处参见方法实施例的部分说明即可。以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件说明的模块可以是或者也可以不是物理上分开的,在实施本说明书实施例方案时可以把各模块的功能在同一个或多个软件和/或硬件中实现。也可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。本领域普通技术人员在不付出创造性劳动的情况下,即可以理解并实施。
以上所述仅是本说明书实施例的具体实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本说明书实施例原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本说明书实施例的保护。

Claims (20)

  1. 一种数据目录生成方法,应用于跨行业数据系统中任一本地句柄服务;所述跨行业数据系统包括分别对应于不同行业用户的不同本地句柄服务;所述本地句柄服务对应的行业用户预先设置有原始数据目录;所述原始数据目录包含多个数字对象,不同数字对象分别对应于不同的原数据标识;所述方法包括:
    获取所述本地句柄服务对应的句柄前缀,所述跨行业数据系统中不同本地句柄服务分别对应于不同的句柄前缀;
    针对对应行业用户预先设置的原始数据目录,将每个数字对象对应的原数据标识作为句柄后缀,与所获取的对应句柄前缀进行拼接,得到所述每个数字对象对应的新数据标识,并根据所得到的新数据标识生成新数据目录。
  2. 根据权利要求1所述的方法,还包括:
    将所得到的新数据标识发送给全局句柄注册,所述全局句柄注册用于汇总所述跨行业数据系统中各个本地句柄服务所得到的新数据标识。
  3. 根据权利要求1所述的方法,其中,所述获取所述本地句柄服务对应的句柄前缀,包括:
    获取所述本地句柄服务对应的行业用户所申请的句柄前缀,作为对应的句柄前缀,所申请的句柄前缀唯一对应于该本地句柄服务对应的行业用户。
  4. 根据权利要求1所述的方法,还包括:
    针对对应行业用户预先设置的原始数据目录,根据对应行业用户的信息更新所述原始数据目录的目录元数据信息,以使更新后的目录元数据信息能够表现出对应的数字对象的字段值的实际业务含义,其中所述目录元数据信息指数字对象包括的一个或多个字段的字段信息;
    所述根据所得到的新数据标识生成新数据目录,包括:
    根据所述更新后的目录元数据信息和所得到的新数据标识生成新数据目录;
    所生成的新数据目录的目录元数据信息为所述更新后的目录元数据信息,目录编号为所得到的新数据标识。
  5. 根据权利要求4所述的方法,其中,通过利用Type机制完成所述新数据目录的格式定义来定义所述新数据目录的目录元数据信息,以生成所述更新后的目录元数据信息。
  6. 一种针对数据目录的数据检索方法,所述数据目录通过如权利要求1至5任意一项所述的数据目录生成方法来生成,所述数据检索方法应用于管理设备;所述管理设备是针对所述跨行业数据系统新增的设备;
    所述数据检索方法包括:
    接收目标用户发送的检索请求,所述检索请求至少包含检索条件;所述检索请求用于请求获取符合所包含的检索条件的数字对象;
    查找能够响应所述检索请求的目标本地句柄服务;
    针对所查找到的目标本地句柄服务的新数据目录,检索符合所述检索条件的数字对象,并向所述目标用户返回检索结果。
  7. 根据权利要求6所述的数据检索方法,其中,所述管理设备存储有所述跨行业数据系统中本地句柄服务的标识与对应行业用户信息之间的第一对应关系集合;
    所述查找能够响应所述检索请求的目标本地句柄服务,包括:在确定所述检索请求包含行业用户信息的情况下,在所存储的第一对应关系集合中查找所包含的行业用户信息对应的本地句柄服务的标识,将所查找到的本地句柄服务的标识对应的本地句柄服务确定为目标本地句柄服务。
  8. 根据权利要求7所述的数据检索方法,其中,所述行业用户信息包括行业标识、行业用户标识、行业用户的地址信息中的至少一项。
  9. 根据权利要求6所述的数据检索方法,其中,所述管理设备存储有所述跨行业数据系统中本地句柄服务的标识与对应句柄前缀之间的第二对应关系集合;
    所述查找能够响应所述检索请求的目标本地句柄服务,包括:在确定所述检索请求包括目标数据标识的情况下,在所存储的第二对应关系集合中,查找所述目标数据标识中句柄前缀对应的本地句柄服务的标识,将所查找到的本地句柄服务的标识对应的本地句柄服务确定为目标本地句柄服务。
  10. 根据权利要求6所述的数据检索方法,其中,全局句柄注册存储有所述跨行业数据系统中本地句柄服务的标识与对应句柄前缀之间的第二对应关系集合;
    所述查找能够响应所述检索请求的目标本地句柄服务,包括:在确定所述检索请求包括目标数据标识的情况下,所述管理设备从所述全局句柄注册的所述第二对应关系集合中,查找所述目标数据标识中句柄前缀对应的本地句柄服务的标识,将所查找到的本地句柄服务的标识对应的本地句柄服务确定为目标本地句柄服务。
  11. 根据权利要求6所述的数据检索方法,其中,全局句柄注册存储有所述跨行业数据系统中本地句柄服务的标识与所得到的新数据标识之间的第三对应关系集合;
    所述查找能够响应所述检索请求的目标本地句柄服务,包括:
    在确定所述检索请求包括目标数据标识的情况下,所述管理设备从所述全局句柄注册的所述第三对应关系集合中,查找所述目标数据标识对应的本地句柄服务的标识,将所查找到的本地句柄服务的标识对应的本地句柄服务确定为目标本地句柄服务。
  12. 根据权利要求6所述的数据检索方法,还包括:
    在无法查找到能够响应所述检索请求的目标本地句柄服务的情况下,针对所述跨行业数据系统中的每个本地句柄服务的新数据目录,检索符合所述检索条件的数字对象,并向所述目标用户返回检索结果。
  13. 根据权利要求6所述的数据检索方法,其中,所述管理设备存储有所述跨行业数据系统中本地句柄服务所生成的新数据目录的目录元数据信息;
    所述向所述目标用户返回检索结果,包括:
    在检索结果包括数字对象的情况下,确定所述目标本地句柄服务所生成的新数据目录的目录元数据信息;
    将所述检索结果中包含的数字对象,结合所确定的目录元数据信息发送给所述目标用户。
  14. 根据权利要求13所述的数据检索方法,其中,所述新数据目录的目录元数据信息包括字段信息,其中,所述将所述检索结果中包含的数字对象,结合所确定的目录元数据信息发送给所述目标用户,包括:
    将所述数字对象的字段值与对应的字段信息结合后发送给所述目标用户。
  15. 根据权利要求6所述的数据检索方法,其中,所述管理设备存储有所述跨行业数据系统中本地句柄服务的标识与本地句柄服务对应生成的新数据目录的目录元数据信息之间的第四对应关系集合;
    其中,所述向所述目标用户返回检索结果,包括:
    根据查找到的目标本地句柄服务的标识确定对应的新数据目录的目录元数据信息;
    将检索到的符合所述检索条件的数字对象,结合所确定的目录元数据信息发送给所述目标用户。
  16. 根据权利要求6所述的数据检索方法,其中,所述管理设备用于接收所述跨行业数据系统中本地句柄服务所生成的新数据目录,并将所接收到不同本地句柄服务的新数据目录合并为跨行业数据目录。
  17. 一种数据目录生成装置,应用于跨行业数据系统中任一本地句柄服务;所述跨行业数据系统包括分别对应于不同行业用户的不同本地句柄服务;所述本地句柄服务对应的行业用户预先设置有原始数据目录;所述原始数据目录包含多个数字对象,不同数 字对象分别对应于不同的原数据标识;所述装置包括:
    获取单元,用于获取所述本地句柄服务对应的句柄前缀,所述跨行业数据系统中不同本地句柄服务分别对应于不同的句柄前缀;
    拼接单元,用于针对对应行业用户预先设置的原始数据目录,将每个数字对象对应的原数据标识作为句柄后缀,与所获取的对应句柄前缀进行拼接,得到所述每个数字对象对应的新数据标识,并根据所得到的新数据标识生成新数据目录。
  18. 一种针对数据目录的数据检索装置,所述数据目录通过权利要求17所述的数据目录生成装置生成,所述数据检索装置应用于管理设备;所述管理设备是针对所述跨行业数据系统新增的设备;
    所述数据检索装置包括:
    接收单元,用于接收目标用户发送的检索请求,所述检索请求至少包含检索条件;所述检索请求用于请求获取符合所包含的检索条件的数字对象;
    查找单元,用于查找能够响应所述检索请求的目标本地句柄服务;
    结果单元,用于针对所查找到的目标本地句柄服务的新数据目录,检索符合所述检索条件的数字对象,并向所述目标用户返回检索结果。
  19. 一种电子设备,包括处理器及存储器;所述存储器存储有可被处理器调用的程序;其中,所述处理器执行所述程序时,实现如权利要求1至5中任意一项所述的数据目录生成方法或者如权利要求6至16中任意一项所述的数据检索方法。
  20. 一种机器可读存储介质,其上存储有程序,该程序被处理器执行时,实现如权利要求1至5中任意一项所述的数据目录生成方法或者如权利要求6至16中任意一项所述的数据检索方法。
PCT/CN2022/117489 2021-12-30 2022-09-07 数据目录生成方法和装置、数据检索方法和装置、电子设备 WO2023124209A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111658097.1 2021-12-30
CN202111658097.1A CN114329091A (zh) 2021-12-30 2021-12-30 一种数据目录生成方法、装置及设备

Publications (1)

Publication Number Publication Date
WO2023124209A1 true WO2023124209A1 (zh) 2023-07-06

Family

ID=81019224

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/117489 WO2023124209A1 (zh) 2021-12-30 2022-09-07 数据目录生成方法和装置、数据检索方法和装置、电子设备

Country Status (2)

Country Link
CN (1) CN114329091A (zh)
WO (1) WO2023124209A1 (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114329091A (zh) * 2021-12-30 2022-04-12 杭州数梦工场科技有限公司 一种数据目录生成方法、装置及设备

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170163741A1 (en) * 2015-12-07 2017-06-08 Dell Products L.P. Method and system for execution of client-initiated operations on file handles in a distributed server system
CN111858496A (zh) * 2020-07-27 2020-10-30 北京大道云行科技有限公司 一种元数据的检索方法、装置、存储介质和电子设备
CN111917551A (zh) * 2020-06-23 2020-11-10 深圳奥联信息安全技术有限公司 一种基于无证书公钥的句柄访问保护方法和系统
CN112650855A (zh) * 2020-12-26 2021-04-13 曙光信息产业股份有限公司 知识图谱工程化构建方法、装置、计算机设备和存储介质
CN112653774A (zh) * 2020-12-16 2021-04-13 北京航天智造科技发展有限公司 工业互联网标识编码方法及装置
CN114329091A (zh) * 2021-12-30 2022-04-12 杭州数梦工场科技有限公司 一种数据目录生成方法、装置及设备

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170163741A1 (en) * 2015-12-07 2017-06-08 Dell Products L.P. Method and system for execution of client-initiated operations on file handles in a distributed server system
CN111917551A (zh) * 2020-06-23 2020-11-10 深圳奥联信息安全技术有限公司 一种基于无证书公钥的句柄访问保护方法和系统
CN111858496A (zh) * 2020-07-27 2020-10-30 北京大道云行科技有限公司 一种元数据的检索方法、装置、存储介质和电子设备
CN112653774A (zh) * 2020-12-16 2021-04-13 北京航天智造科技发展有限公司 工业互联网标识编码方法及装置
CN112650855A (zh) * 2020-12-26 2021-04-13 曙光信息产业股份有限公司 知识图谱工程化构建方法、装置、计算机设备和存储介质
CN114329091A (zh) * 2021-12-30 2022-04-12 杭州数梦工场科技有限公司 一种数据目录生成方法、装置及设备

Also Published As

Publication number Publication date
CN114329091A (zh) 2022-04-12

Similar Documents

Publication Publication Date Title
US20190303088A1 (en) Transferring an application interface from one device to another device
TWI689881B (zh) 一種定向顯示訊息方法及裝置
CN104823169B (zh) 用于网络中的可搜索数据的索引配置
US11276097B1 (en) Canonical order management system
US8661103B2 (en) Business application address determination
WO2016177250A1 (zh) 客户端页面渲染方法及装置
US10223180B2 (en) Extensible data interface for shared service module
CN110716783A (zh) 一种前端页面生成、部署方法及装置、存储介质和设备
JP2003122765A (ja) 対応する複数の主体によって複数の識別コードが割当てられた対象物に対して特定の識別コードを決定するための方法
CN107103011B (zh) 终端数据搜索的实现方法和装置
US20220114163A1 (en) Generating external identifiers for data entities using a data catalog system
CN107423037B (zh) 应用程序接口定位方法及设备
JP2012501492A (ja) インターネット上で商品のオブジェクト情報を提供するための方法およびシステム
WO2023124209A1 (zh) 数据目录生成方法和装置、数据检索方法和装置、电子设备
CN109726295A (zh) 品牌知识图谱显示方法、装置、图服务器及存储介质
CN111431962B (zh) 基于情景感知计算的跨域资源接入物联网服务发现方法
US8224797B2 (en) System and method for validating data record
CN114675982A (zh) 一种业务集成系统的数据获取通用方法及系统
JP2014067314A (ja) 電子商取引サーバ装置
CN110858209A (zh) 业务对象访问/发布方法、装置、系统及电子设备
US8131729B2 (en) System and method for best-fit lookup of multi-field key
CN111784376A (zh) 用于处理信息的方法和装置
CN110035108A (zh) 一种信息推送方法及系统
JP6577620B1 (ja) コンテンツ決定装置およびプログラム
CN110378758A (zh) 店铺展示信息的处理方法、装置及设备

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22913482

Country of ref document: EP

Kind code of ref document: A1