CN115640275A - Automatic generation and updating method for data products and assets - Google Patents

Automatic generation and updating method for data products and assets Download PDF

Info

Publication number
CN115640275A
CN115640275A CN202211036253.5A CN202211036253A CN115640275A CN 115640275 A CN115640275 A CN 115640275A CN 202211036253 A CN202211036253 A CN 202211036253A CN 115640275 A CN115640275 A CN 115640275A
Authority
CN
China
Prior art keywords
data
value
factor
freedom
factors
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202211036253.5A
Other languages
Chinese (zh)
Other versions
CN115640275B (en
Inventor
林少伟
马莺
李志男
龚䶮
张微
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Huayixin Technology Co ltd
Lin Shaowei
Original Assignee
Huajiao Lianchuang Xiamen Technology Co ltd
Beijing Huayixin Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huajiao Lianchuang Xiamen Technology Co ltd, Beijing Huayixin Technology Co ltd filed Critical Huajiao Lianchuang Xiamen Technology Co ltd
Priority to CN202211036253.5A priority Critical patent/CN115640275B/en
Publication of CN115640275A publication Critical patent/CN115640275A/en
Application granted granted Critical
Publication of CN115640275B publication Critical patent/CN115640275B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a method for automatically generating and updating data products and assets, which comprises the following steps: acquiring and dividing a data product and asset data resource library to obtain a minimum data unit which cannot be split; combining the minimum data units which cannot be split according to the value attributes to obtain a minimum value unit; processing the minimum value unit to obtain a data factor with a recombination degree of freedom, a circulation degree of freedom and a value attribute; processing the data factors to obtain a data factor dynamic data value map, processing the data factor dynamic data value map to obtain a data factor recombination freedom function and a data factor circulation freedom function, and processing the data factor recombination freedom function and the data factor circulation freedom function to obtain a user demand function and a value maximization function; and processing the user demand function and the value maximization function to obtain a data factor recombination scheme and a circulation scheme, and automatically generating and updating data products and assets. The method meets the characteristics and requirements of pursuing high value and high liquidity of data products and assets.

Description

Automatic generation and updating method for data products and assets
Technical Field
The invention relates to the technical field of automatic generation of data products and assets, in particular to an automatic generation and updating method of data products and assets.
Background
At present, a report automatic generation method for data mainly extracts, searches and replaces data, and then establishes an automatic generation report with relatively fixed modes such as detection, medical treatment, document and the like according to a certain scene, professional requirements or a regular template of a typical model through network or specialized software. These methods are passive data report generation methods, and the requirements of report users and the value of reports themselves are not fully reflected in the generation process. Although there is a certain dynamic update on the data values and contents, the method is limited by the regularity template, the possibility of reorganization of organization rules among data is low, the association among data is not fully mined, and the circulation among data and after combination cannot be guaranteed. In short, the current automatic data report generation methods are more suitable for the field application with unchanged scene, stable demand, fixed mode, no pursuit of high value and low circulation requirement.
However, unlike the application areas for which these current methods of automatic generation of data reports are suitable, the characteristics and requirements of data products and data assets are just the opposite. The data products and the data assets have the characteristics and the requirements of uncertain scenes, large demand difference, unfixed modes, pursuit of high value and high circulation requirement. This is because, data products and data assets have economic features, the relevance between the contained data is very important, the organization rule pattern between the data is more flexible, different data combinations can bring different values, and the more combinations, the more applicable scenarios, the higher the possibility of bringing more values. The currency and reorganization of data products and data assets formed between and after the combination of data is not only oriented to the maximization of their diverse and diverse user needs and value, but also closely related to the attributes of the data products and data assets themselves that tend to be of high value. Automatic updating and immediate updated version delivery of data products/assets are also highly necessary due to the characteristics and requirements of data products and data assets, and the high frequency and magnitude of dynamic changes of data resources and data elements.
Disclosure of Invention
The technical problem to be solved by the invention is to provide a method for automatically generating and updating data products and assets, which can extract and establish data factors with recombination and circulation freedom degrees and value attributes, construct a data factor historical dynamic data value map, establish a data factor recombination and circulation freedom degree function on the basis of the value map, construct a user demand and value maximization function as a demand and value target of data factor recombination and circulation, further determine a data factor recombination and circulation scheme, automatically generate data products and assets, and realize automatic updating of the data products and the assets.
In order to solve the technical problem, the embodiment of the invention discloses a method for automatically generating and updating data products and assets, which comprises the following steps:
s1, acquiring a data product and asset data resource library; the data product and asset data repository is composed of data and has value attributes and liquidity;
the value attribute is the actual value of the data product and the data asset in three dimensions of time, space and path, which accords with the interest of the demand side;
the currency is a social circulation of data products and data assets for their value to be sought and realized;
s2, dividing the data product and asset data resource library to obtain the smallest irreparable data unit;
combining the minimum data units which can not be split according to the value attributes to obtain a minimum value unit;
processing the minimum value unit to obtain a data factor with a recombination degree of freedom, a circulation degree of freedom and a value attribute;
s3, processing the data factors to obtain a data factor dynamic data value map;
s4, processing the data factor dynamic data value map to obtain a data factor recombination freedom function and a data factor circulation freedom function;
s5, processing the data factor recombination freedom function and the data factor circulation freedom function to obtain a user demand function and a value maximization function;
and S6, processing the user demand function and the value maximization function to obtain a data factor recombination scheme and a circulation scheme, and automatically generating and updating data products and assets.
As an optional implementation manner, in an embodiment of the present invention, the processing the minimum value unit to obtain a data factor having a reorganization degree of freedom, a circulation degree of freedom, and a value attribute includes:
when the minimum value unit has the condition of recombining with at least two other minimum value units to form a value, the minimum value unit is called to have a recombination degree of freedom;
when the value represented by the minimum value unit is not directionally supplied to a certain user and can not be only applied to a certain specific scene, the minimum value unit is called to have circulation freedom;
the least valuable unit with the reorganization and circulation freedom is the data factor.
As an optional implementation manner, in an embodiment of the present invention, the processing the data factor to obtain a data factor dynamic data value map includes:
s31, carrying out value marking on the data factors from three dimensions of time, space and path;
s32, mining the association value relation between the data factor and other data factors in three dimensions of time, space and path by using a data value association mining method;
s33, processing the time, space and path three-dimensional values of the data factors to obtain the influence coefficients of the correlation value relationship of the data factors on the time, space and path three-dimensional values;
influence coefficients of the correlation value relations of the data factors in the three dimensions of time, space and path on the three dimensional values of the time, space and path are multiplied by the three dimensional values of the time, space and path corresponding to other data factors respectively to obtain the correlation values of the data factors and the other data factors in the three dimensions of time, space and path;
s34, taking the three-dimensional values of time, space and path of each data factor as the three-dimensional coordinates of each data factor, and marking the associated values of the time, space and path of each data factor and other data factors to obtain a data factor value map of the data factor at a certain time point;
and S35, repeating the steps S31, S32, S33 and S34 at each historical time point to obtain a data factor dynamic data value map.
As an optional implementation manner, in the embodiment of the present invention, the time value includes a time cost and a data aging;
the space value comprises economic cost, resource cost, data content, data reliability and data technology;
the path value comprises a data category, a data positioning, a data association, a data path and a data source.
As an optional implementation manner, in an embodiment of the present invention, the calculating method of the time value, the space value, and the path value of each data factor includes:
the time value, the space value and the path value of each data factor are equal to the correlation value of the data factor and other data factors on the time value, the space value and the path value and the basic value of the data factor on the time value, the space value and the path value;
the basic value is a common factor of values of the data factors in time value, space value and path value at various historical time points.
As an optional implementation manner, in an embodiment of the present invention, the data factor reorganization degree of freedom function includes:
s41, extracting the associated value relation of the same data factor and the time, space and path three-dimensional values of other data factors at each historical time point from the data factor dynamic data value map;
s42, analyzing the correlation value relationship between the same data factor and other data factors in three dimensional values of time, space and path at each historical time point to obtain all non-null correlation value relationships in the correlation value relationship between the same data factor and other data factors in three dimensional values of time, space and path at each historical time point, and marking the non-null correlation value relationships;
s43, combining the data factor and other data factors, and the non-null associated value relation in three dimensions of time, space and path at each historical time point and the corresponding associated value into a non-null associated value set;
s44, regarding all the data factors and other data factors as non-empty associated value sets corresponding to non-empty associated values in the associated values of three dimensions of time, space and path at each historical time point, and referring to the non-empty associated value sets as recombination freedom degree sets of the data factors at each historical time point;
and S45, extracting a historical data factor recombination track function by using the recombination freedom degree set of the data factor at each historical time point to obtain a data factor recombination freedom degree function.
As an optional implementation manner, in an embodiment of the present invention, the data factor circulation degree of freedom function includes:
s46, judging whether the time cost, the data timeliness, the economic cost, the resource cost, the data content, the data reliability, the data technicality, the data category, the data positioning, the data association, the data path and the data source of the data factor in three dimensions of time, space and path at each historical time point are all directionally supplied to a certain user or only can be applied to a certain specific scene;
if so, the data factor has no circulation freedom; if not, marking a plurality of user requirements and a plurality of application scenes corresponding to each item in three dimensions of time, space and path of the data factor at each historical time point to obtain the self circulation freedom degree of the data factor;
s47, extracting other data factors with the recombination freedom degrees of the data factors at the historical time points by using the recombination freedom degree sets of the data factors at the historical time points;
s48, judging the self circulation freedom degree of the data factor at each historical time point, matching the self circulation freedom degree of the data factor at the same historical time point and the self circulation freedom degree of other data factors with recombination freedom degrees;
calculating the correlation value of the data factor and other data factors in the specific dimension with recombination freedom degrees of the data factors and other data factors;
combining the self circulation freedom degree of the data factor at each historical time point, the self circulation freedom degree of other data factors having a recombination freedom degree relation with the self circulation freedom degree, the data factor and the recombination dimension correlation value of other data factors having a recombination freedom degree relation with the data factor into a circulation freedom degree set of the data factor at each historical time point;
and analyzing the circulation freedom set of the data factors at each historical time point, extracting a historical circulation track function of the data factors, and obtaining the circulation freedom function of the data factors.
As an optional implementation manner, in an embodiment of the present invention, the processing is performed on the data factor reorganization degree of freedom function and the data factor circulation degree of freedom function to obtain a user requirement function, where the method includes:
aiming at user requirements, the data products and data assets can be divided into customized data products and data assets determined by buyers and scenes and non-customized data products and data assets determined by buyers and scenes;
respectively determining user demand functions of different types of data products and data asset characteristics;
the user demand function is used as a demand and value target for data factor recombination and circulation;
customizing user demand functions of data products and data assets, which change over time, including specific demands of buyers on time value, space value, path value and determined application scenarios;
the user demand function of the non-customized data products and data assets changes with time, and the specific demands and the potential application scenes of the potential buyers are predicted according to the historical user demands and the overall market demands.
As an optional implementation manner, in an embodiment of the present invention, the processing is performed on the data factor reorganization degree of freedom function and the data factor circulation degree of freedom function to obtain a value maximization function, and the method includes:
processing the data factor recombination freedom function and the data factor circulation freedom function to construct a value maximization function;
the value maximization function is a demand and value target of data factor recombination and circulation;
if all data factors are contained in the data product and the data asset which need to be automatically generated, processing the values of all the data factors on the time value, the space value and the path value, removing the associated values between two repeated data factors in each subentry of the value, and then summing to obtain the value sum of the time value, the space value and the path value of all the data factors contained in the data product and the data asset;
the cost sum maximum is a cost maximization function.
As an optional implementation manner, in an embodiment of the present invention, the processing is performed on the user demand function and the cost maximization function to obtain a data factor reorganization scheme and a circulation scheme, and automatically generate and update data products and assets, where the method includes:
s61, screening the time value, the space value and the path value of each data factor by taking the user demand function as a screening condition, and reserving the data factors which accord with the user demand function in the time value, the space value and the path value;
s62, combining the data factors which accord with the user demand function in the time value, the space value and the path value to obtain the data factors which accord with the circulation freedom degree requirement;
s63, taking the value maximization function as a constraint target of relevance and relevance value between the data factor meeting the requirement of circulation freedom degree and other data factors;
processing the data dynamic data value map and the relevance value between the data factor and other data factors to obtain a data factor recombination scheme which accords with a value maximization function and the requirement of circulation freedom;
the data factor reorganization scheme is automatically generated data products and data assets;
and S64, repeating S61, S62 and S63 when the data factor or the user requirement is changed, and obtaining the updated automatically generated data product and data asset.
Compared with the prior art, the embodiment of the invention has the following beneficial effects:
(1) The invention provides a data product and asset automatic generation and updating method based on free recombination and circulation of data factors of a value map, which has the advantages of breaking through the passive generation of the current data report automatic generation method, solving the problems that the requirements of report users and the value of reports are not fully embodied in the generation process, solving the problem that the recombination of organization rules among data is limited by regular templates, and solving the problems that the relevance among data is not fully mined and the circulation among data and after combination cannot be guaranteed. In a word, the method breaks through the limitation of the existing automatic generation method of the current data report, changes the current situation that the automatic generation method of the current data report is suitable for the field application of unchanged scene, stable demand, fixed mode, no pursuit of high value and low circulation requirement, is more suitable for the automatic generation and updating of data products and assets, and meets the characteristics and requirements of uncertain scene, large demand difference, unfixed mode, pursuit of high value and high circulation requirement of the data products and the data assets.
(2) The invention provides a data product and asset automatic generation and updating method based on free recombination and circulation of data factors of a value map, which has the advantages that the data factors with the recombination and circulation freedom degrees and the value attributes are extracted and established, the historical dynamic data value map of the data factors is established, and the data factor recombination and circulation freedom degree functions are established on the basis of the value map. Due to the data resources and the data elements, data products and data assets with different modes, higher values, meeting different scene requirements and higher circulation can be formed through the recombination of multiple data. The method develops research around data per se, constructs data factors with recombination freedom and circulation freedom, describes historical evolution situation of the data factors in a value level through construction of a historical dynamic data value map of the data factors, provides flexible recombination conditions and ranges for data product and data asset for data factor recombination, and provides a technical support basis for potential value realization of the data product and the data asset.
(3) The invention provides a data product and asset automatic generation and updating method based on data factor free recombination and circulation of a value map, which has the advantages that a user demand and value maximization function is constructed to serve as a demand and value target of data factor recombination and circulation, and then a data factor recombination and circulation scheme is determined. The currency and reorganization of data products and data assets formed by data to data and after combination are not only oriented to the maximization of their diverse and diverse user needs and values, but also closely related to the attributes of the data products and data assets themselves that tend to be of high value. Therefore, the method constructs a user demand function and a value maximization function aiming at the customized and non-customized data products and data assets respectively, and the value maximization function is used as a constraint condition of the future potential reorganization freedom and the future potential circulation freedom, so that the range of the future potential reorganization freedom and the future potential circulation freedom is further reduced, and the data factor reorganization and circulation scheme is accurately determined.
(4) The invention provides a method for automatically generating and updating data products and assets by freely recombining and circulating data factors based on a value map, which has the advantages of automatically generating the data products and the assets and realizing the automatic updating of the data products and the assets. The data products and the assets have the characteristics of range economy, the relevance between contained data is very important, the organization rule mode among the data is more flexible, different data combinations can bring different values, the more the combinations are, the more applicable scenes are, and the higher the possibility of bringing more values is. Automatic updating and immediate updated version delivery of data products/assets are also highly necessary due to the characteristics and requirements of data products and data assets, and the high frequency and large amplitude of dynamic changes of data resources and data elements. It is to these needs that the present invention is directed.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without creative efforts.
Fig. 1 is a schematic flow chart of a method for automatically generating and updating data products and assets according to an embodiment of the present invention.
Detailed Description
In order to make the technical solutions of the present invention better understood, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The terms "first," "second," and the like in the description and claims of the present invention and in the above-described drawings are used for distinguishing between different objects and not for describing a particular order. Furthermore, the terms "include" and "have," as well as any variations thereof, are intended to cover a non-exclusive inclusion. For example, a process, method, apparatus, product, or apparatus that comprises a list of steps or elements is not limited to those listed but may alternatively include other steps or elements not listed or inherent to such process, method, product, or apparatus.
Reference herein to "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the invention. The appearances of the phrase in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. It is explicitly and implicitly understood by one skilled in the art that the embodiments described herein can be combined with other embodiments.
Example one
Referring to fig. 1, fig. 1 is a schematic flow chart diagram of a method for automatically generating and updating data products and assets according to an embodiment of the present invention:
s1, acquiring a data product and asset data resource library; the data product and asset data repository is composed of data and has value attributes and liquidity;
the value attribute is the actual value of the data product and the data asset in three dimensions of time, space and path, which accords with the benefit of the demander;
the currency is a social currency of data products and data assets for their value to be sought and realized;
s2, dividing the data product and asset data resource library to obtain the minimum data unit which can not be split;
combining the minimum data units which cannot be split according to the value attributes to obtain minimum value units;
processing the minimum value unit to obtain a data factor with a recombination degree of freedom, a circulation degree of freedom and a value attribute;
s3, processing the data factor to obtain a data factor dynamic data value map;
s4, processing the data factor dynamic data value map to obtain a data factor recombination freedom function and a data factor circulation freedom function;
s5, processing the data factor recombination freedom function and the data factor circulation freedom function to obtain a user demand function and a value maximization function;
and S6, processing the user demand function and the value maximization function to obtain a data factor recombination scheme and a circulation scheme, and automatically generating and updating data products and assets.
Optionally, the minimum value unit is processed to obtain a data factor having a reorganization degree of freedom, a circulation degree of freedom, and a value attribute, and the method includes:
when the minimum value unit has the condition of forming value by recombining with at least two other minimum value units, the minimum value unit is called to have the recombination freedom degree;
when the value represented by the minimum value unit is not directionally supplied to a certain user and can not be only applied to a certain specific scene, the minimum value unit is called to have circulation freedom;
the least valuable unit with the reorganization and circulation freedom is the data factor.
Optionally, the data factor is processed to obtain a data factor dynamic data value map, and the method includes:
s31, carrying out value marking on the data factors from three dimensions of time, space and path;
s32, excavating the association value relation of the data factor and other data factors in three dimensions of time, space and path by using a data value association excavation method;
s33, processing the time, space and path three-dimensional values of the data factors to obtain the influence coefficients of the correlation value relationship of the data factors on the time, space and path three-dimensional values;
influence coefficients of the correlation value relationship of each data factor in three dimensions of time, space and path on the three dimensional values of time, space and path are multiplied by the three dimensional values of time, space and path corresponding to other data factors respectively to obtain the correlation values of the data factors and the other data factors in three dimensions of time, space and path;
s34, taking the three-dimensional values of time, space and path of each data factor as the three-dimensional coordinates of each data factor, and marking the associated values of each data factor and other data factors in the three dimensions of time, space and path to obtain a data factor value map of the data factor at a certain time point;
and S35, repeating the steps S31, S32, S33 and S34 at each historical time point to obtain a data factor dynamic data value map.
Optionally, the time value includes time cost and data aging;
the space value comprises economic cost, resource cost, data content, data reliability and data technology;
the path value comprises a data category, a data positioning, a data association, a data path and a data source.
Optionally, the time value, the space value, and the path value of each data factor are calculated by:
the time value, the space value and the path value of each data factor are equal to the correlation value of the data factor and other data factors on the time value, the space value and the path value plus the basic value of the data factor on the time value, the space value and the path value;
the basic value is a common factor of the values of the data factors in time value, space value and path value at each historical time point.
Optionally, the data factor is a recombination degree of freedom function, and the calculation method includes:
s41, extracting the associated value relation of the same data factor and the time, space and path three-dimensional values of other data factors at each historical time point from the data factor dynamic data value map;
s42, analyzing the correlation value relationship between the same data factor and other data factors in three dimensional values of time, space and path at each historical time point to obtain all non-null correlation value relationships in the correlation value relationship between the same data factor and other data factors in three dimensional values of time, space and path at each historical time point, and marking the non-null correlation value relationships;
s43, combining the data factor and other data factors, and combining the non-null association value relationship in three dimensions of time, space and path at each historical time point and the corresponding association value into a non-null association value set;
s44, regarding all the data factors and other data factors, and regarding the non-null associated value sets corresponding to the non-null associated values in the associated values of three dimensions of time, space and path at each historical time point as recombination freedom degree sets of the data factors at each historical time point;
and S45, extracting a historical data factor recombination track function by using the recombination freedom set of the data factor at each historical time point to obtain a data factor recombination freedom function.
Optionally, the data factor circulation degree of freedom function, the calculation method includes:
s45, judging whether time cost, data timeliness, economic cost, resource cost, data content, data reliability, data technology, data category, data positioning, data association, data path and data source of the data factor in three dimensions of time, space and path at each historical time point are all directionally supplied to a certain user or can only be applied to a certain specific scene;
if so, the data factor has no circulation freedom; if not, marking a plurality of user requirements and a plurality of application scenes corresponding to each item of the data factor in three dimensions of time, space and path at each historical time point to obtain the self circulation freedom degree of the data factor;
s46, extracting other data factors with recombination freedom degrees of the data factors at each historical time point by using the recombination freedom degree sets of the data factors at each historical time point;
s47, judging the self circulation freedom degree of the data factor at each historical time point, matching the self circulation freedom degree of the data factor at the same historical time point and the self circulation freedom degree of other data factors with recombination freedom degrees;
calculating the correlation value of the data factor and other data factors in the specific dimension with recombination freedom degrees of the data factors and other data factors;
combining the self circulation freedom degree of the data factor at each historical time point, the self circulation freedom degree of other data factors having a recombination freedom degree relation with the self circulation freedom degree, the data factor and the recombination dimension correlation value of other data factors having a recombination freedom degree relation with the data factor into a circulation freedom degree set of the data factor at each historical time point;
and analyzing the circulation freedom set of the data factors at each historical time point, extracting a historical circulation track function of the data factors, and obtaining the circulation freedom function of the data factors.
Optionally, the processing is performed on the data factor reorganization degree of freedom function and the data factor circulation degree of freedom function to obtain a user demand function, and the method includes:
aiming at user requirements, the data products and data assets can be divided into customized data products and data assets determined by buyers and scenes and non-customized data products and data assets determined by buyers and scenes;
respectively determining user demand functions of different types of data products and data asset characteristics;
the user demand function is used as a demand and value target for data factor recombination and circulation;
customizing user demand functions of data products and data assets, which change over time, including specific demands of buyers on time value, space value, path value and determined application scenarios;
the user demand function of the non-customized data products and data assets changes with time, and the specific demands and the potential application scenes of the potential buyers are predicted according to the historical user demands and the overall market demands.
Optionally, the data factor recombination degree of freedom function and the data factor circulation degree of freedom function are processed to obtain a value maximization function, and the method includes:
processing the data factor recombination freedom function and the data factor circulation freedom function to construct a value maximization function;
the value maximization function is a demand and value target of data factor recombination and circulation;
if all data factors are contained in the data product and the data asset which need to be automatically generated, processing the values of all the data factors on the time value, the space value and the path value, removing the associated values between two repeated data factors in each subentry of the value, and then summing to obtain the value sum of the time value, the space value and the path value of all the data factors contained in the data product and the data asset;
the cost sum maximum is a cost maximization function.
Optionally, the user demand function and the value maximization function are processed to obtain a data factor reorganization scheme and a circulation scheme, and data products and assets are automatically generated and updated, wherein the method includes:
s61, screening the time value, the space value and the path value of each data factor by taking the user demand function as a screening condition, and reserving the data factors which accord with the user demand function in the time value, the space value and the path value;
s62, combining the data factors which accord with the user demand function in the time value, the space value and the path value to obtain the data factors which accord with the circulation freedom degree requirement;
s63, taking the value maximization function as a constraint target of relevance and relevance value between the data factor meeting the circulation freedom degree requirement and other data factors;
processing the data dynamic data value map and the relevance value between the data factor and other data factors to obtain a data factor recombination scheme which accords with a value maximization function and the requirement of circulation freedom;
the data factor reorganization scheme is automatically generated data products and data assets;
and S64, repeating S61, S62 and S63 when the data factor or the user requirement is changed, and obtaining the updated automatically generated data product and data asset.
Example two
Data products and assets are made up of data, the data types including numbers, text, and the like. Data products and data assets have value attributes and currency, and thus the data that constitutes them also has value attributes and currency. The data is used as data resources and data elements, and data products and data assets which have different modes, higher values, meet different scene requirements and have higher circulation can be formed through recombination of multiple data. In order to meet the characteristics and requirements of data products and data assets, such as uncertain scenes, large demand difference, unfixed modes, pursuit of high value and high circulation requirements, a data product and asset automatic generation and updating method based on free recombination and circulation of data factors of a value map is provided, and the method comprises the following steps: extracting and establishing data factors with recombination and circulation freedom degrees and value attributes, constructing a historical dynamic data value map of the data factors, establishing data factor recombination and circulation freedom degree functions on the basis of the value map, constructing user demand and value maximization functions as demand and value targets of data factor recombination and circulation, further determining a data factor recombination and circulation scheme, automatically generating data products and assets, and realizing automatic updating of the data products and the assets.
(1) Extracting and establishing data factors with recombination and circulation freedom degrees and value attributes
Suppose the smallest unit of data a (n) in the data repository that is meaningfully non-separable (n =1,2, \8230;, n). According to the value property, a minimum value unit v ({ a (n ') }) is constructed by aggregating partial minimum data units (n' is the partial code set in n). When the minimum worth unit v ({ a (n') }) has at least two or more minimum worth units with the others
Figure BDA0003819184590000141
The least valuable unit v ({ a (n') }) is said to have the freedom to recombine, when the conditions for recombination to form value are present. The minimum value unit v ({ a (n ') }) is said to have a circulation degree of freedom when the value represented by the minimum value unit v ({ a (n') }) is not targeted to a certain user and is not applicable only to a certain scenario. The least valued unit v ({ a (n') }) with the reorganization and circulation freedom is called the data factor. Data of non-data factors cannot be recombined and can only be used for targeted provisioning or specific scenario applications.
(2) Construction of historical dynamic data value map of data factor
Values are marked for data factor v ({ a (n ') }) from three dimensions, time value tv (v ({ a (n') })), space value sv (v ({ a (n ') })), path value pv (v ({ a (n') })). Among them, the time value tv (v ({ a (n ') })) includes a time cost tc (v ({ a (n ') })) and a data age da (v ({ a (n ') })). The space value sv (v ({ a (n ') })) includes economic cost ec (v ({ a (n') })), resource cost rc (v ({ a (n ') })), data content dc (v ({ a (n') })), data reliability dr (v ({ a (n ') })), data technology dt (v ({ a (n') })). The path value pv (v ({ a (n ') })) includes a data category dc (v ({ a (n') })), a data location dl (v ({ a (n ') })), a data association da (v ({ a (n') })), a data path dp (v ({ a (n ') })), and a data source ds (v ({ a (n') })).
The data factor v ({ a (n') }) and other data factors are excavated by adopting a method of data value association mining, such as clustering, a grey association analysis method, an Apriori algorithm and the like
Figure BDA0003819184590000142
An associated value relationship RVT (tv (v ({ a (n ') })))) over a time value tv (v ({ a (n') })),
Figure BDA0003819184590000143
an associated value relationship RVS (sv (v ({ a (n ') })))) in the space value sv (v ({ a (n') })),
Figure BDA0003819184590000144
an associated value RVP (pv (v ({ a (n ') })))) on the path value pv (v ({ a (n') }))),
Figure BDA0003819184590000145
a time-value relationship RVT (tv (V ({ a (n ') })) for each data factor V ({ a (n ') })) can be derived from the time value, space value, and path value V (V ({ a (n ') })) = (VT (V ({ a (n ') })), VS (V ({ a (n ') })), VP (V ({ a (n ')))))))) for each data factor V ({ a (n ') }),
Figure BDA0003819184590000146
the coefficient of influence on value kt (RVT (tv (v ({ a (n') })),
Figure BDA0003819184590000147
spatial value relationship RVS (sv ({ a (n') })), sv
Figure BDA0003819184590000148
Value-influencing factor ks (RVS (sv (v ({ a (n') })),
Figure BDA0003819184590000149
Figure BDA00038191845900001410
the path-value relationship RVP (pv (v ({ a (n') })),
Figure BDA00038191845900001411
Figure BDA00038191845900001412
the coefficient of influence on value kp (RVP (pv (v ({ a (n'))))))))),
Figure BDA00038191845900001413
Figure BDA0003819184590000151
the formula is found as follows:
Figure BDA00038191845900001513
tv(v({-a(n’)}))))*tv(v({a(n’)}))]+VT0(v({a(n’)}))
Figure BDA00038191845900001514
sv(v({-a(n’)}))))*sv(v({a(n’)}))]+VS0(v({a(n’)}))
Figure BDA0003819184590000154
pv(v({-a(n’)}))))*pv(v({a(n’)}))]+VP0(v({a(n’)}))
where N is the total number of data factors, and VT0 (v ({ a (N ') })), VS0 (v ({ a (N') })), VP0 (v ({ a (N ') })) is the underlying value of the data factor v ({ a (N') }) in terms of time value, space value, and path value. These base value values are a common factor of the value of the data factor v ({ a (n') }) in three dimensions at various time points in history, and are usually obtained by a factor analysis method.
The influence coefficient kt (RVT (tv ({ a (n ') })) of the correlation between the time value, the space value and the path value of each data factor v ({ a (n') }) on the value,
Figure BDA0003819184590000155
ks(RVS(sv(v({a(n’)})),
Figure BDA0003819184590000156
kp(RVP(pv(v({a(n’)})),
Figure BDA0003819184590000157
the values are multiplied by the time value tv (v ({ a (n ') })), the space value sv (v ({ a (n') }) and the path value pv (v ({ a (n ') })) corresponding to each data factor v ({ a (n') }), to obtain the data factor v ({ a (n ') }) and other data factors v (v) (a (n') })
Figure BDA0003819184590000158
The correlation value RVDT ((tv (v ({ a (n'))))))))))),
Figure BDA0003819184590000159
the relevance value in terms of spatial value RVDS ((sv (v ({ a (n')))))))))),
Figure BDA00038191845900001510
the correlation value RVDP in path value ((pv (v ({ a (n') })),
Figure BDA00038191845900001511
the time value, the space value and the path value of each data factor v ({ a (n ') }) are used as the three-dimensional coordinates of each data factor v ({ a (n ') }), and each data factor v ({ a (n ') }) and other data factors are marked
Figure BDA00038191845900001512
The time value, the space value and the path value of the data factor value map VM { v ({ a (n ') }), v (n') }) are constructed according to the correlation value{-a(n’)}))}。
The data factor value map constructed at a certain time point is obtained through the steps. And repeating the steps at each historical time point, constructing a data factor value map at each historical time point, and obtaining a data factor historical dynamic data value map VMT { v ({ a (n ') }), v ({ -a (n') })), t } (t =1,2, \8230;, n) according to a time clue.
(3) Establishing data factor reorganization and circulation freedom function
(1) Establishing a data factor reorganization degree of freedom function
From the data factor historical dynamic data value map VMT v ({ a (n') }),
Figure BDA0003819184590000161
t } the same data factor v ({ a (n') }) is extracted at various historical time points with other data factors v (n }){-a(n’)}) A correlation value relationship RVT (tv (v ({ a (n'))))))))))),
Figure BDA0003819184590000162
Figure BDA0003819184590000163
the relevance value relationship RVS (sv (v ({ a (n')))))))))),
Figure BDA0003819184590000164
relevance value relationship on path value RVP (pv (v ({ a (n'))))))))),
Figure BDA0003819184590000165
analyzing the above associated value relationship RVT (tv (v ({ a (n') })) in time value at each historical time point,
Figure BDA0003819184590000166
the relevance value relationship RVS (sv (v ({ a (n')))))))))),
Figure BDA0003819184590000167
relevance value relationship in Path value RVP (pv (v ({ a (n') })),
Figure BDA0003819184590000168
extracting data factors v ({ a (n') }) and other data factors at various historical time points
Figure BDA0003819184590000169
Marking all non-null associated value relations among the associated value relations on the time value, the space value and the path value as RVT p (tv(v({a(n’)})),
Figure BDA00038191845900001610
RVS p (sv(v({a(n’)})),
Figure BDA00038191845900001611
RVP p (pv(v({a(n’)})),
Figure BDA00038191845900001612
And obtaining the data factor v ({ a (n') }) corresponding to the non-null association value relationship and other data factors at each historical time point
Figure BDA00038191845900001613
The correlation value RVDT on the time value of p ((tv(v({a(n’)})),
Figure BDA00038191845900001614
Relevance value RVDS on space value p ((sv(v({a(n’)})),
Figure BDA00038191845900001615
Relevance value RVDP on Path value p ((pv(v({a(n’)})),
Figure BDA00038191845900001616
The data factor v ({ a (n') }) is compared with other data factors
Figure BDA00038191845900001617
The non-null associated value relationship among the time value, the space value and the path value at each historical time point and the corresponding associated value set are a non-null associated value set { (RVTp (v ({ a (n') })),
Figure BDA00038191845900001618
Figure BDA00038191845900001619
RVDTp((tv(v({a(n’)})),
Figure BDA00038191845900001620
(RVSp(sv(v({a(n’)})),sv(v({-a(n’)})),t),RVDSp((sv(v({a(n’)})),
Figure BDA00038191845900001621
(RVPp(pv(v({a(n’)})),pv(v({-a(n’)})),t),RVDPp((pv(v({a(n’)})),pv(v({-a(n’)}))),t)))}。
judging the data factor v ({ a (n ') }) and other data factors according to the definition of the recombination degree of freedom of the data factor v ({ a (n') })
Figure BDA0003819184590000171
The non-null associated value set of time value, space value, path value at each historical time point, and all data factors v ({ a (n') }) and other data factors
Figure BDA0003819184590000172
The correlation value RVDTp in time value, space value and path value at each historical time point ((tv (v ({ a (n'))))))))),
Figure BDA0003819184590000173
RVDSp((sv(v({a(n’)})),
Figure BDA0003819184590000174
RVDPp((pv(v({a(n’)})),
Figure BDA0003819184590000175
the set of non-null associated value set elements corresponding to the associated value with non-null time value, space value and path value is called as a recombination degree of freedom set (RVTpo (tv ({ a (n ') })) of a data factor v ({ a (n') }) at each historical time point,
Figure BDA0003819184590000176
RVDTpo((tv(v({a(n’)})),
Figure BDA0003819184590000177
(RVSpo(sv(v({a(n’)})),
Figure BDA0003819184590000178
Figure BDA0003819184590000179
RVDSpo((sv(v({a(n’)})),
Figure BDA00038191845900001710
(RVPp(pv(v({a(n’)})),
Figure BDA00038191845900001711
RVDPpo((pv(v({a(n’)})),
Figure BDA00038191845900001712
according to the recombination freedom degree set of the data factor v ({ a (n') }) at each historical time point, a numerical analysis method, a data mining method, a factor analysis method and other methods are adopted to extract a historical recombination track function of the data factor, and the freedom degree range of the future potential recombination of the data factor is determined.
(2) Establishing a data factor circulation degree of freedom function
It is determined whether the time cost tc (v ({ a (n ') })), the data age da (v ({ a (n') })), the economic cost ec (v ({ a (n ') })), the resource cost rc (v ({ a (n') })), the data content dc (v ({ a (n ') })), the data reliability dr (v ({ a (n') })), the data technical dt (v ({ a (n ') })), the data category dc (v ({ a (n') })), the data location dl (v ({ a (n ')), the data association da (v ({ a (n') })), the data pathway (v) }) ({ a (n ')), the data source (v) }) or the data cost pv ({ a (n')), at each historical time point, is applicable to a specific user orientation or a specific user orientation. If so, the data factor v ({ a (n') }) is said to have no flow-through freedom. Otherwise, a plurality of user demands and a plurality of application scenarios corresponding to each of time values tv (v ({ a (n ') })), space values sv (v ({ a (n') })), and path values pv (v ({ a (n ') })) of the data factor v ({ a (n') }) at each historical time point are marked, and the user demands and the application scenarios are called as self circulation degrees of freedom DFC (dfct (tv ({ a (n ') }))), DFCs (sv (a ({ n') })), dfcv (pv ({ a (n ') })), t) of the data factor v ({ a (n') }).
Extracting other data factors with recombination freedom degrees at the historical time points with the data factor v ({ a (n ') }) according to the recombination freedom degree set of the data factor v ({ a (n') }) at the historical time points
Figure BDA0003819184590000181
Judging the other data factors according to the previous step
Figure BDA0003819184590000182
Self-circulation freedom at various historical time points. Self-flux freedom degrees matching data factor v ({ a (n ') }) at the same historical time point, self-flux freedom degrees DFC (dfct (tv (v ({ -a (n ') }))) of other data factors v ({ -a (n ') })) with which there is a recombination freedom degree,
Figure BDA0003819184590000183
Figure BDA0003819184590000184
and according to the data factor v ({ a (n') }) and these other data factors
Figure BDA0003819184590000185
Associated values in temporal value RVDTp ((tv (v ({ a (n ') })), tv (v ({ -a (n') }))), t), associated values in spatial value RVDSp ((sv (v ({ a (n ') })), sv (v ({ -a (n') })), t), associated values in path value RVDPp ((pv (v ({ a (n ') })), pv ({ -a (n')))), t) with reorganization freedom. The self-circulation freedom degree of the data factor v ({ a (n ') }) at various historical time points is heavily communicated with the data factor v ({ a (n') }) at various historical time pointsSelf-circulation freedom of other data factors v ({ -a (n ') }) in group-degree-of-freedom relationship, data factor v ({ a (n') }) and other data factors having a reorganization freedom relationship therewith
Figure BDA0003819184590000186
The set is the flow-through freedom set { DFC (dfct (tv ({ a (n ') }))), DFCs (sv (v ({ a (n') }))), dfcv (pv ({ a (n ') }))), t) of the data factor v ({ a (n') }) at various time points in the history,
Figure BDA0003819184590000187
dfcv(pv(v({-a(n’)})))),t),RVDTp((tv(v({a(n’)})),
Figure BDA0003819184590000188
RVDSp((sv(v({a(n’)})),
Figure BDA0003819184590000189
Figure BDA00038191845900001810
RVDPp((pv(v({a(n’)})),
Figure BDA00038191845900001811
and analyzing the circulation freedom degree set of the data factor v ({ a (n') }) at each historical time point by adopting a numerical analysis method, a data mining method, a factor analysis method and other methods, extracting a historical circulation track function of the data factor, and determining the freedom degree range of the future potential circulation of the data factor.
(4) Constructing a user demand and value maximization function as a demand and value target of data factor recombination and circulation
Due to the data resources and the data elements, data products and data assets with different modes, higher value, accordance with different scene requirements and higher circulation can be formed through the recombination of multiple data.
And establishing a recombination and circulation freedom degree function of the data factor v ({ a (n') }), and constructing a user demand and value maximization function as a demand and value target of data factor recombination and circulation on the basis of the two functions.
(1) Building user demand functions
Data products and data assets can be divided into two categories for user needs, namely, buyer and scenario specific customized data products and data assets and buyer and scenario specific non-customized data products and data assets.
The user requirements for the customized data products and data assets may change over time, and thus the user requirement target is time-dependent, labeled (UN (tg, sg, pg), SR, t). UN is the specific requirement of the buyer, tg, sg and pg are the specific requirements of the buyer in three dimensions of time, space and path, and SR is the determined application scenario.
The user demand of the non-customized data products and data assets is also a function of time, and the specific demand UNm (tg (PB (m)), sg (PB (m)), pg (PB (m))) and the potential application scenario SR (c) (c =1,2, 8230; n) of the potential buyers PB (m) (m =1,2, \ 8230;, n) are predicted according to the historical user demand and the overall demand of the market by adopting methods such as data mining, predictive analysis, time series analysis, causal relation prediction and the like.
(2) Constructing a value maximization function
Assuming that the data products and data assets to be automatically generated are DPAs, all the data factors contained therein are assumed to be vDPA ({ a (n ') }), and the association relationship value between these data factors can be expressed according to step 2), so as to obtain the cost functions VT (vDPA ({ a (n ') })) in three dimensions, VS (vDPA ({ a (n ') })), VP (vDPA ({ a (n ') })) in three dimensions of all the data factors vDPA ({ a (n ') }). The associated merit values between the two repeated data factors in each of the components of the merit function are redundantly removed and summed to obtain the three-dimensional sum of the merit values of all the data factors vDPA ({ a (n') }) included in the data product and data asset DPA. This maximum sum is the value maximization objective.
(5) Determining data factor reorganization and circulation scheme, and automatically generating or updating data products and assets
The user demand and value maximization function is equivalent to the constraint conditions of the future potential recombination freedom degree and the future potential circulation freedom degree of the data factor v ({ a (n') }), and further narrows the range of the future potential recombination freedom degree and the future potential circulation freedom degree.
The user demand function is used as a condition for screening the value of three dimensions of each data factor, the data factors meeting the user demand function in three dimensions are reserved according to the actual user demand target, and the data factors meeting the circulation freedom degree requirement are determined for the data factor set meeting the screening of the value of the three dimensions of the data factors.
And taking the value maximization function as a constraint target of the relevance and the relevance value between the data factor meeting the circulation freedom requirement and other data factors. And according to the data factor value map and the relevance value between the data factor and other data factors, solving a data factor recombination scheme which maximizes the value function and meets the requirement of circulation freedom. The data factor reorganization scheme is the automatically generated data product and data asset. When the data factor or the user requirement is changed, the steps are repeated, and the data product and the data asset which are automatically generated after updating can be obtained.
Through the above detailed description of the embodiments, those skilled in the art will clearly understand that the embodiments may be implemented by software plus a necessary general hardware platform, and may also be implemented by hardware. Based on such understanding, the above technical solutions may be embodied in the form of a software product, which may be stored in a computer-readable storage medium, wherein the storage medium includes a Read-Only Memory (ROM), a Random Access Memory (RAM), a Programmable Read-Only Memory (PROM), an Erasable Programmable Read-Only Memory (EPROM), a One-time Programmable Read-Only Memory (OTPROM), an Electrically Erasable Programmable Read-Only Memory (EEPROM), a Compact Disc-Read-Only Memory (CD-ROM) or other Memory capable of storing data, a magnetic tape, or any other computer-readable medium capable of storing data.
Finally, it should be noted that: the method for automatically generating and updating data products and assets disclosed in the embodiments of the present invention is only a preferred embodiment of the present invention, and is only used for illustrating the technical solutions of the present invention, not for limiting the same; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those skilled in the art; the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (10)

1. A method for automatically generating and updating data products and assets, the method comprising:
s1, acquiring a data product and asset data resource library; the data product and asset data repository is composed of data and has value attributes and liquidity;
the value attribute is the actual value of the data product and the data asset in three dimensions of time, space and path, which accords with the interest of the demand side;
the currency is a social currency of data products and data assets for their value to be sought and realized;
s2, dividing the data product and asset data resource library to obtain the smallest irreparable data unit;
combining the minimum data units which cannot be split according to the value attributes to obtain minimum value units;
processing the minimum value unit to obtain a data factor with a recombination degree of freedom, a circulation degree of freedom and a value attribute;
s3, processing the data factor to obtain a data factor dynamic data value map;
s4, processing the data factor dynamic data value map to obtain a data factor recombination freedom function and a data factor circulation freedom function;
s5, processing the data factor recombination freedom function and the data factor circulation freedom function to obtain a user demand function and a value maximization function;
and S6, processing the user demand function and the value maximization function to obtain a data factor recombination scheme and a circulation scheme, and automatically generating and updating data products and assets.
2. The method of claim 1, wherein said processing of said least valued units results in data factors having reorganization degrees of freedom, circulation degrees of freedom, and value attributes, the method comprising:
when the minimum value unit has the condition of forming value by recombining with at least two other minimum value units, the minimum value unit is called to have the recombination freedom degree;
when the value represented by the minimum value unit is not directionally supplied to a certain user and can not be only applied to a certain specific scene, the minimum value unit is called to have circulation freedom;
the least valuable unit with the reorganization freedom and the circulation freedom is a data factor.
3. The method of claim 1 for automatically generating and updating data products and assets, wherein the processing of the data factors to obtain a data factor dynamic data value graph comprises:
s31, carrying out value marking on the data factors from three dimensions of time, space and path;
s32, mining the association value relation between the data factor and other data factors in three dimensions of time, space and path by using a data value association mining method;
s33, processing the time, space and path three-dimensional values of the data factors to obtain the influence coefficients of the correlation value relationship of the data factors on the time, space and path three-dimensional values;
influence coefficients of the correlation value relations of the data factors in the three dimensions of time, space and path on the three dimensional values of the time, space and path are multiplied by the three dimensional values of the time, space and path corresponding to other data factors respectively to obtain the correlation values of the data factors and the other data factors in the three dimensions of time, space and path;
s34, taking the three-dimensional values of time, space and path of each data factor as the three-dimensional coordinates of each data factor, and marking the associated values of each data factor and other data factors in the three dimensions of time, space and path to obtain a data factor value map of the data factor at a certain time point;
and S35, repeating the steps S31, S32, S33 and S34 at each historical time point to obtain a data factor dynamic data value map.
4. The method of claim 3, wherein the time value includes time cost and data age;
the space value comprises economic cost, resource cost, data content, data reliability and data technology;
the path value comprises a data category, a data positioning, a data association, a data path and a data source.
5. The method of claim 3, wherein the computing method comprises the steps of:
the time value, the space value and the path value of each data factor are equal to the correlation value of the data factor and other data factors on the time value, the space value and the path value plus the basic value of the data factor on the time value, the space value and the path value;
the basic value is a common factor of values of the data factors in time value, space value and path value at various historical time points.
6. The method of claim 1, wherein the data factors reorganize the degree of freedom function, the calculation method comprising:
s41, extracting the associated value relation of the same data factor with the time, space and path values of other data factors at each historical time point from the data factor dynamic data value map;
s42, analyzing the correlation value relationship between the same data factor and the three dimensional values of time, space and path of other data factors at each historical time point to obtain all non-null correlation value relationships among the correlation value relationships between the same data factor and the other data factors at the three dimensional values of time, space and path at each historical time point, and marking the non-null correlation value relationships;
s43, combining the data factor and other data factors, and the non-null associated value relation in three dimensions of time, space and path at each historical time point and the corresponding associated value into a non-null associated value set;
s44, regarding all the data factors and other data factors as non-empty associated value sets corresponding to non-empty associated values in the associated values of three dimensions of time, space and path at each historical time point, and referring to the non-empty associated value sets as recombination freedom degree sets of the data factors at each historical time point;
and S45, extracting a historical data factor recombination track function by using the recombination freedom set of the data factor at each historical time point to obtain a data factor recombination freedom function.
7. The method of claim 1, wherein the data factor currency freedom function is calculated by:
s46, judging whether time cost, data timeliness, economic cost, resource cost, data content, data reliability, data technology, data category, data positioning, data association, data path and data source of the data factor in three dimensions of time, space and path at each historical time point are all directionally supplied to a certain user or can only be applied to a certain specific scene;
if so, the data factor has no circulation freedom; if not, marking a plurality of user requirements and a plurality of application scenes corresponding to each item in three dimensions of time, space and path of the data factor at each historical time point to obtain the self circulation freedom degree of the data factor;
s47, extracting other data factors with the recombination freedom degrees of the data factors at the historical time points by using the recombination freedom degree sets of the data factors at the historical time points;
s48, judging the self circulation freedom degree of the data factor at each historical time point, matching the self circulation freedom degree of the data factor at the same historical time point and the self circulation freedom degree of other data factors with recombination freedom degrees;
calculating the correlation value of the data factor and other data factors in the specific dimension with recombination freedom degrees of the data factors and other data factors;
combining the self circulation freedom degree of the data factor at each historical time point, the self circulation freedom degree of other data factors having a recombination freedom degree relation with the self circulation freedom degree, the data factor and the recombination dimension correlation value of other data factors having a recombination freedom degree relation with the data factor into a circulation freedom degree set of the data factor at each historical time point;
and analyzing the circulation freedom set of the data factors at each historical time point, extracting a historical circulation track function of the data factors, and obtaining the circulation freedom function of the data factors.
8. The method of claim 1, wherein the processing of the data factor reorganization degree of freedom function and the data factor currency degree of freedom function to obtain a user demand function comprises:
aiming at user requirements, the data products and data assets can be divided into customized data products and data assets determined by buyers and scenes and non-customized data products and data assets determined by buyers and scenes;
respectively determining user demand functions of different types of data products and data asset characteristics;
the user demand function is used as a demand and value target for data factor recombination and circulation;
customizing user demand functions of data products and data assets, which change over time, including specific demands of buyers on time value, space value, path value and determined application scenarios;
the user demand function of the non-customized data products and data assets changes with time, and the specific demands and the potential application scenes of the potential buyers are predicted according to the historical user demands and the overall market demands.
9. The method of claim 1, wherein the processing of the data factor reorganization degree of freedom function and the data factor currency degree of freedom function to obtain a cost maximization function comprises:
processing the data factor recombination freedom function and the data factor circulation freedom function to construct a value maximization function;
the value maximization function is a demand and value target of data factor recombination and circulation;
if all data factors are contained in the data products and the data assets which need to be automatically generated, processing the values of all the data factors on time value, space value and path value, removing the associated values between two repeated data factors in each subentry of the value, and summing to obtain the value sum of the time value, the space value and the path value of all the data factors contained in the data products and the data assets;
the cost sum maximum is a cost maximization function.
10. The method of claim 1, wherein the user demand function and the cost maximization function are processed to obtain a data factor reorganization scheme and a circulation scheme, and the method of automatically generating and updating data products and assets comprises:
s61, taking the user demand function as a screening condition, screening the time value, the space value and the path value of each data factor, and reserving the data factors which accord with the user demand function in the time value, the space value and the path value;
s62, combining the data factors which accord with the user demand function in the time value, the space value and the path value to obtain the data factors which accord with the circulation freedom degree requirement;
s63, taking the value maximization function as a constraint target of relevance and relevance value between the data factor meeting the circulation freedom degree requirement and other data factors;
processing the data dynamic data value map and the relevance value between the data factor and other data factors to obtain a data factor recombination scheme which accords with a value maximization function and the requirement of circulation freedom;
the data factor reorganization scheme is automatically generated data products and data assets;
and S64, repeating S61, S62 and S63 when the data factor or the user requirement is changed, and obtaining the updated automatically generated data product and data asset.
CN202211036253.5A 2022-08-27 2022-08-27 Automatic generation and update method for data products and assets Active CN115640275B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211036253.5A CN115640275B (en) 2022-08-27 2022-08-27 Automatic generation and update method for data products and assets

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211036253.5A CN115640275B (en) 2022-08-27 2022-08-27 Automatic generation and update method for data products and assets

Publications (2)

Publication Number Publication Date
CN115640275A true CN115640275A (en) 2023-01-24
CN115640275B CN115640275B (en) 2023-04-21

Family

ID=84939870

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211036253.5A Active CN115640275B (en) 2022-08-27 2022-08-27 Automatic generation and update method for data products and assets

Country Status (1)

Country Link
CN (1) CN115640275B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190019119A1 (en) * 2017-07-17 2019-01-17 Theodore J Lucas System for automated resource set multi-factor risk analysis
CN111178940A (en) * 2019-12-11 2020-05-19 北京明略软件系统有限公司 Method and system for automatically generating sales call technology map
US20200242697A1 (en) * 2019-01-25 2020-07-30 Franklin Advisers, Inc. Systems and methods for dynamic fund allocation in goals-based wealth management portfolio
CN112308173A (en) * 2020-12-28 2021-02-02 平安科技(深圳)有限公司 Multi-target object evaluation method based on multi-evaluation factor fusion and related equipment thereof
KR102270683B1 (en) * 2019-12-23 2021-06-29 주식회사 현대케피코 Engine ignition timing efficiency determination method
CN113643078A (en) * 2021-10-14 2021-11-12 北京华宜信科技有限公司 Block chain-based information value marking method, device, equipment and medium
CN113886484A (en) * 2021-10-26 2022-01-04 北京华宜信科技有限公司 Method for automatically generating and updating scientific and technological achievement evaluation report based on block chain
CN114490644A (en) * 2022-01-10 2022-05-13 阿里云计算有限公司 Data storage method, device and storage medium

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190019119A1 (en) * 2017-07-17 2019-01-17 Theodore J Lucas System for automated resource set multi-factor risk analysis
US20200242697A1 (en) * 2019-01-25 2020-07-30 Franklin Advisers, Inc. Systems and methods for dynamic fund allocation in goals-based wealth management portfolio
CN111178940A (en) * 2019-12-11 2020-05-19 北京明略软件系统有限公司 Method and system for automatically generating sales call technology map
KR102270683B1 (en) * 2019-12-23 2021-06-29 주식회사 현대케피코 Engine ignition timing efficiency determination method
CN112308173A (en) * 2020-12-28 2021-02-02 平安科技(深圳)有限公司 Multi-target object evaluation method based on multi-evaluation factor fusion and related equipment thereof
CN113643078A (en) * 2021-10-14 2021-11-12 北京华宜信科技有限公司 Block chain-based information value marking method, device, equipment and medium
CN113886484A (en) * 2021-10-26 2022-01-04 北京华宜信科技有限公司 Method for automatically generating and updating scientific and technological achievement evaluation report based on block chain
CN114490644A (en) * 2022-01-10 2022-05-13 阿里云计算有限公司 Data storage method, device and storage medium

Also Published As

Publication number Publication date
CN115640275B (en) 2023-04-21

Similar Documents

Publication Publication Date Title
US10235425B2 (en) Entity fingerprints
Alfaro‐García et al. Logarithmic aggregation operators and distance measures
Ben-Gal et al. Efficient construction of decision trees by the dual information distance method
CN110570111A (en) Enterprise risk prediction method, model training method, device and equipment
Ahn et al. Multi-attribute decision aid under incomplete information and hierarchical structure
CN101937446A (en) Web site accelerator
Ke A genetic algorithm-based optimizing approach for project time-cost trade-off with uncertain measure
Jun et al. Learning dispatching rules for single machine scheduling with dynamic arrivals based on decision trees and feature construction
Allahviranloo et al. A new distance measure and ranking method for generalized trapezoidal fuzzy numbers
Sarkar et al. Developing an objectives hierarchy for multicriteria decisions on land use options, with a case study of biodiversity conservation and forestry production from Papua, Indonesia
CN113190670A (en) Information display method and system based on big data platform
CN112785005B (en) Multi-objective task assistant decision-making method and device, computer equipment and medium
CN112036483B (en) AutoML-based object prediction classification method, device, computer equipment and storage medium
Noori Developing a CBR system for marketing mix planning and weighting method selection using fuzzy AHP
Ulrich Pareto‐Set Analysis: Biobjective Clustering in Decision and Objective Spaces
Alissa et al. Automated algorithm selection: from feature-based to feature-free approaches
CN112860850B (en) Man-machine interaction method, device, equipment and storage medium
CN110019751A (en) Machine learning model modification and natural language processing
Salehi Latent feature based recommender system for learning materials using genetic algorithm
CN115640275A (en) Automatic generation and updating method for data products and assets
CN114722442B (en) Cabinet layout method and device, computer equipment and storage medium
Hess et al. C-salt: Mining class-specific alterations in boolean matrix factorization
CN116471281A (en) Decentralised service combination method considering node selfiness
Danielson et al. Using surrogate weights for handling preference strength in multi-criteria decisions
Czajkowski et al. An evolutionary algorithm for global induction of regression and model trees

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20230526

Address after: 101127 No. 1-219, Juhe 6th Street, Jufuyuan Industrial Park, economic development zone, Tongzhou District, Beijing

Patentee after: Beijing HUAYIXIN Technology Co.,Ltd.

Patentee after: Lin Shaowei

Address before: 101127 No. 1-219, Juhe 6th Street, Jufuyuan Industrial Park, economic development zone, Tongzhou District, Beijing

Patentee before: Beijing HUAYIXIN Technology Co.,Ltd.

Patentee before: Huajiao Lianchuang (Xiamen) Technology Co.,Ltd.

TR01 Transfer of patent right
OR01 Other related matters
OR01 Other related matters