CN115878637A - Object data processing method and device, electronic equipment and storage medium - Google Patents

Info

Publication number
CN115878637A
Authority
CN
China
Prior art keywords
field
operator
object data
data source
data
Prior art date
Legal status
Pending
Application number
CN202111142195.XA
Other languages
Chinese (zh)
Inventor
杨一帆
卞凯
Current Assignee
Beijing Zhuanzhuan Spiritual Technology Co ltd
Original Assignee
Beijing Zhuanzhuan Spiritual Technology Co ltd

Abstract

The application discloses an object data processing method and apparatus, an electronic device, and a storage medium. The method comprises: receiving a trigger change data source; determining that the trigger change data source corresponds to at least a first field; formulating an object data processing plan corresponding to the first field, wherein the object data processing plan comprises one or more operators, and a downstream operator is triggered to execute after an upstream operator among the one or more operators has finished executing; executing the object data processing plan on the trigger change data source so as to process the data of the first field in the trigger change data source; and updating the first field in the object data with the processed data. Embodiments of the application can quickly unify the format of object data and update it into storage, and are suitable for maintaining business data in various fields.

Description

Object data processing method and device, electronic equipment and storage medium
Technical Field
The present application relates to the field of internet technologies, and in particular, to an object data processing method and apparatus, an electronic device, and a storage medium.
Background
With the continued development of internet technology, e-commerce platforms keep introducing new functions and services to meet the diverse consumption needs of users. For example, a typical large e-commerce platform not only sells its own commodities but also provides sales interfaces for affiliated independent merchants, and some platforms also provide interfaces through which individuals or merchants sell second-hand commodities, forming a unified sales platform that integrates commodity data from multiple parties. Commodity data from different sources usually uses different data formats, and the content of commodity descriptions varies widely; second-hand commodities in particular tend to be described colloquially. Unless the commodity data is cleaned and processed uniformly, the accuracy of users' commodity searches cannot be ensured.
In addition, the commodity library of an e-commerce platform contains a large number of commodities, and a huge number of users (both merchants and individuals) trade commodities over the network every day. To improve the competitiveness of the platform, not only basic information such as inventory and price but also other attribute information needs to be updated in a timely manner, including attributes of the commodities themselves (such as the model, color, and selling price of a mobile phone) and/or attributes of the related users (such as a seller's credit level and activity level), so as to provide meaningful data support for other services of the platform (such as commodity recommendation and marketing promotion).
Under these circumstances, how to rapidly process commodity data from diverse sources and update it into the commodity library, keep the commodity information in the library uniform in form and up to date, and improve the accuracy of users' commodity searches is a problem that requires in-depth study.
Disclosure of Invention
The embodiment of the application provides an object data processing method and device, an electronic device and a storage medium, which are used for solving at least one technical problem.
In a first aspect, an embodiment of the present application provides an object data processing method, where the object data includes a plurality of fields, the method includes:
receiving a trigger change data source;
determining that the trigger change data source corresponds to at least a first field;
formulating an object data processing plan corresponding to the first field, wherein the object data processing plan comprises one or more operators, and after the upstream operator in the one or more operators is executed, the downstream operator is triggered to be executed;
executing the object data processing plan on the trigger change data source so as to process data of a first field in the trigger change data source;
and updating the first field in the object data by using the processed data.
Optionally, the trigger change data source includes at least one of: an object information change data source, a system refresh data source, and an update failure retry data source, wherein
the object information change data source is generated when object information is changed;
the system refresh data source is generated when the system is refreshed;
the update failure retry data source is generated when an update of the object data fails.
Optionally, formulating the object data processing plan corresponding to the first field includes:
determining at least one operator according to the first field, the at least one operator being an operator triggerable by the first field;
merging the output field of the at least one operator with the first field serving as the input field, and determining qualifying next-stage operators based on the merged field set, wherein at least one input field of each next-stage operator belongs to the merged field set;
merging the output field and the input field of the next-stage operator, and again determining qualifying next-stage operators based on the merged field set, until no qualifying operator is found;
and determining the object data processing plan corresponding to the first field based on the qualifying operators at all levels.
Optionally, determining, based on the qualifying operators at all levels, the object data processing plan corresponding to the first field includes: associating the qualifying operators that have upstream and downstream relations into an operator execution chain, wherein a plurality of operators in the same level can be executed in parallel.
Optionally, a single operator comprises the following fields: an input field, a trigger type field, an operator instance field, an output field, and an execution condition field;
wherein the trigger type field is AnyInput or AllInput: AnyInput indicates that the current operator can be triggered when the output fields of the upstream operator contain any one of the input fields of the current operator; AllInput indicates that the current operator can be triggered only when the output fields of the upstream operator contain all of the input fields of the current operator.
Optionally, the object data includes commodity information data.
Optionally, the commodity information data includes attribute data of the commodity itself and/or seller attribute data corresponding to the commodity;
the commodity information data includes at least one of the following fields: brand, category, binding status, item description, item label, price, inventory, presence or absence of video, seller release time, seller ID, seller liveness.
Optionally, the operator comprises at least one of: an operator that converts the unit of the commodity price into a designated unit, and an operator that changes the commodity label into a combination of the user label and the commodity label.
In a second aspect, an embodiment of the present application provides an apparatus for processing object data, where the object data includes multiple fields, and the apparatus includes:
a receiving module, configured to receive a trigger change data source;
a determining module, configured to determine that the trigger change data source corresponds to at least a first field;
a formulating module, configured to formulate an object data processing plan corresponding to the first field, wherein the object data processing plan comprises one or more operators, and a downstream operator is triggered to execute after an upstream operator among the one or more operators has finished executing;
an execution module, configured to execute the object data processing plan on the trigger change data source so as to process the data of the first field in the trigger change data source;
and an updating module, configured to update the first field in the object data with the processed data.
Optionally, the trigger change data source includes at least one of: an object information change data source, a system refresh data source, and an update failure retry data source, wherein the object information change data source is generated when object information is changed; the system refresh data source is generated when the system is refreshed; and the update failure retry data source is generated when an update of the object data fails.
Optionally, the formulating module 130 includes:
a first determining submodule, configured to determine at least one operator according to the first field, the at least one operator being an operator triggerable by the first field;
a first merging processing submodule, configured to merge the output field of the at least one operator with the first field serving as the input field, and to determine qualifying next-stage operators based on the merged field set, wherein at least one input field of each next-stage operator belongs to the merged field set;
a second merging processing submodule, configured to merge the output field and the input field of the next-stage operator, and to again determine qualifying next-stage operators based on the merged field set, until no qualifying operator is found;
and a second determining submodule, configured to determine the object data processing plan corresponding to the first field based on the qualifying operators at all levels.
Optionally, the second determining submodule includes: an association unit, configured to associate the qualifying operators that have upstream and downstream relations into an operator execution chain, wherein a plurality of operators in the same level can be executed in parallel.
Optionally, a single operator comprises the following fields: an input field, a trigger type field, an operator instance field, an output field, and an execution condition field; the trigger type field is AnyInput or AllInput: AnyInput indicates that the current operator can be triggered when the output fields of the upstream operator contain any one of the input fields of the current operator; AllInput indicates that the current operator can be triggered only when the output fields of the upstream operator contain all of the input fields of the current operator.
Optionally, the object data includes commodity information data.
Optionally, the commodity information data includes attribute data of the commodity itself and/or seller attribute data corresponding to the commodity; the commodity information data includes at least one of the following fields: brand, category, binding status, item description, item label, price, inventory, presence or absence of video, seller release time, seller ID, seller liveness.
Optionally, the operator comprises at least one of: an operator that converts the unit of the commodity price into a designated unit, and an operator that changes the commodity label into a combination of the user label and the commodity label.
In a third aspect, an embodiment of the present application provides an electronic device, including: a processor and a memory storing computer program instructions; the electronic device, when executing the computer program instructions, implements the method as described above.
In a fourth aspect, embodiments of the present application provide a computer-readable storage medium having stored thereon computer program instructions, which, when executed by a processor, implement the method as described above.
The embodiments of the application provide a complete mechanism for warehousing object data. When a trigger change data source appears, it triggers an update flow for the object data: a processing plan is formulated and executed for the fields in the trigger change data source. The processing plan comprises operators, different operators implement different transformations of a field, and once an upstream operator has finished executing, the next-stage operator is automatically triggered to start executing, so that format unification and updating and warehousing of the object data are completed quickly. The embodiments are suitable for updating and maintaining business data in various fields and can provide users with near-real-time, highly accurate search services.
Drawings
In order to illustrate the technical solutions of the embodiments of the present application more clearly, the drawings used in the embodiments are briefly described below. The drawings described below are only some embodiments of the present application, and those skilled in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flowchart of an object data processing method according to an embodiment of the present application.
Fig. 2 is a schematic flow chart of formulating the processing plan in the embodiment of the present application.
FIG. 3 is a diagram illustrating an example of a plurality of operators in an operator repository, according to an embodiment of the present application.
Fig. 4 and 5 are schematic diagrams of two operator execution chains based on fig. 3 in the embodiment of the present application.
Fig. 6 is a schematic structural diagram of an object data processing apparatus according to an embodiment of the present application.
Fig. 7 is a hardware configuration diagram of an electronic device for implementing the object data processing method according to the embodiment of the present application.
Detailed Description
Features and exemplary embodiments of various aspects of the present application will be described in detail below, and in order to make objects, technical solutions and advantages of the present application more apparent, the present application will be further described in detail below with reference to the accompanying drawings and specific embodiments. It should be understood that the specific embodiments described herein are intended to be illustrative only and are not intended to be limiting. It will be apparent to one skilled in the art that the present application may be practiced without some of these specific details. The following description of the embodiments is merely intended to provide a better understanding of the present application by illustrating examples thereof.
It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of additional like elements in a process, method, article, or apparatus that comprises the element.
Fig. 1 is a flow chart illustrating an embodiment of an object data processing method provided in the present application, where the method includes:
S102, receiving a trigger change data source;
S104, determining that the trigger change data source corresponds to at least a first field;
S106, formulating an object data processing plan corresponding to the first field, wherein the object data processing plan comprises one or more operators, and a downstream operator is triggered to execute after an upstream operator among the one or more operators has finished executing;
S108, executing the object data processing plan on the trigger change data source so as to process the data of the first field in the trigger change data source;
S110, updating the first field in the object data with the processed data.
According to the embodiment of the application, when a trigger change data source appears, the object data is updated using the trigger change data source. Because the trigger change data source may contain data in a variety of formats, the data in it is processed before the update, as follows: a processing plan (also called a processing flow) is formulated for the fields in the trigger change data source. The processing plan may contain one or more operators; different operators implement different transformations of a field, and the next-stage operator is automatically triggered to start executing once its upstream operator has finished. Thus, when a trigger change data source appears, the corresponding series of operators is automatically triggered and executed, so that format unification and updating and warehousing of the object data are completed quickly. The embodiment of the application provides a complete mechanism for warehousing object data, is suitable for updating and maintaining business data in various fields, and is particularly suitable for scenarios with diverse trigger data sources and high real-time requirements, such as the unified warehousing of second-hand commodity information.
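The flow of S102 to S110 can be sketched as follows. This is a minimal illustration rather than the application's implementation: the dictionary layout of the data source, the `OPERATORS` table, and the function name `process_change` are assumptions introduced for the example, and the plan is reduced to a flat list of operators per field.

```python
from typing import Callable, Dict, List

# Hypothetical operators keyed by the field that triggers them (assumption).
OPERATORS: Dict[str, List[Callable[[dict], dict]]] = {
    "price": [lambda rec: {"price": int(round(rec["price"] * 100))}],  # yuan -> fen
    "title": [lambda rec: {"title": rec["title"].strip()}],
}


def process_change(source: dict, stored_object: dict) -> dict:
    """Apply S102-S110 to one trigger change data source (a plain dict here)."""
    # S104: determine which fields the data source corresponds to.
    fields = [name for name in source if name in OPERATORS]
    working = dict(source)
    for name in fields:
        # S106: formulate the plan for this field (a flat operator list here).
        plan = OPERATORS[name]
        # S108: execute the plan; each operator runs after the previous one.
        for operator in plan:
            working.update(operator(working))
        # S110: update the field in the stored object data.
        stored_object[name] = working[name]
    return stored_object


item = {"id": 1, "price": 0, "title": ""}
process_change({"price": 400, "title": "  iPhone11 "}, item)
```

Fields of the stored object that the data source does not mention (here `id`) are left untouched.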
In some embodiments of the present application, the object data includes commodity information data. The commodity information data includes attribute data of the commodity itself and/or seller attribute data corresponding to the commodity; that is, it may include information about the commodity itself as well as various information about the seller of the commodity. For example, it may include at least one of the following fields: brand_ID (brand), cat_name (category), bind_status (binding status), content (item description), itemtag (item label), now_price (price), has_stock (inventory), whether a video is present, time stock (seller release time), uid (seller ID), user_active_time (seller activity time), salecity (whether in the same city), and so on. Commodity information covers a plurality of types, and a single type covers a plurality of configuration parameters. Taking mobile phones as an example, Apple-brand mobile phones include the iPhone X series, the iPhone11 series, and the iPhone12 series, and the iPhone12 series further includes the iPhone12 mini, iPhone12 Pro, and iPhone12 Pro Max. The system can standardize the commodity description fields according to the common and distinguishing characteristics of the commodities and store the standardized fields in a retrieval library, which facilitates accurate retrieval of commodities and improves retrieval efficiency.
In some embodiments of the present application, the trigger change data source includes at least one of:
an object information change data source, generated when object information is changed;
a system refresh data source, generated when the system is refreshed;
an update failure retry data source, generated when an update of the object data fails.
For example, the object information change data source may arise when a user modifies commodity information, such as the commodity price or the commodity description; uploading the modification triggers a data update. The system refresh data source may be the data at a timed refresh, where the refresh interval can be set as desired, such as 5 minutes, 10 minutes, 32 minutes, 1 hour, or a longer or shorter interval. The update failure retry data source may arise when a retry mechanism is triggered by an exception, including but not limited to an interface call timeout, a data format conversion error, or a failure to write data to the database. The trigger change data source in the present application is not limited to the above three types and may be of other types. Because the embodiment of the application triggers the data update flow as soon as system data changes, it minimizes the time from data change through data processing to data update. It is therefore particularly suitable for scenarios with high requirements on real-time updating of system data, can provide the latest valid data in time, and helps improve the overall competitiveness of the system.
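As an illustration of the three kinds of trigger change data source and of the retry path, the sketch below tags each message with its source type and re-enqueues a failed update as an update failure retry data source. The enum, the message layout, the `handle` function, and the `max_retries` limit are all assumptions made for this example; the text does not specify a retry limit.

```python
from collections import deque
from enum import Enum


class TriggerSource(Enum):
    OBJECT_INFO_CHANGE = "object_info_change"
    SYSTEM_REFRESH = "system_refresh"
    UPDATE_FAILURE_RETRY = "update_failure_retry"


def handle(message: dict, update, queue: deque, max_retries: int = 3) -> bool:
    """Apply `update`; on failure, enqueue an update failure retry data source."""
    try:
        update(message["payload"])
        return True
    except Exception:  # e.g. interface timeout, format conversion error, write failure
        attempts = message.get("attempts", 0) + 1
        if attempts <= max_retries:  # hypothetical cap on the number of retries
            queue.append({
                "source": TriggerSource.UPDATE_FAILURE_RETRY,
                "payload": message["payload"],
                "attempts": attempts,
            })
        return False
```

A consumer would pop messages from `queue` and pass them back to `handle`, so a retried message goes through the same processing path as any other trigger change data source.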
In some embodiments of the present application, the object data processing plan corresponding to the first field is formulated by:
S202, determining at least one operator according to the first field, wherein the at least one operator is an operator that can be triggered by the first field;
S204, merging the output field of the at least one operator with the first field serving as the input field, and determining qualifying next-stage operators based on the merged field set, wherein at least one input field of each next-stage operator belongs to the merged field set;
S206, merging the output field and the input field of the next-stage operator, and again determining qualifying next-stage operators based on the merged field set, until no qualifying operator is found;
S208, determining the object data processing plan corresponding to the first field based on the qualifying operators at all levels.
According to an embodiment of the present application, after the one or more fields corresponding to the trigger change data source are determined, one or more operators triggerable by each field can be determined. An operator can generally be understood as performing some operation (e.g., a predetermined mapping or function) on an input object; in the embodiment of the present application, operators perform the required transformations on the trigger change data source. More specifically, when the next-stage operator is determined, the input and the output of the current-stage operator are merged, and the merged field set serves as the basis for the determination: every operator that has at least one input field belonging to the merged field set is determined to be a next-stage operator of the current operator. The operator of the following stage is then determined in the same way, and so on, until no qualifying operator can be found. Processing in this way finds as many as possible of the operators related to the trigger change data source, so that the data undergoes the necessary transformations and is warehoused in a unified form.
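The level-by-level search of S202 to S208 can be sketched as follows. This is an illustration under stated assumptions, not the application's implementation: the `Op` tuple, the function name `formulate_plan`, and the sample operators are hypothetical; an operator qualifies when any one of its input fields is in the merged field set; and each operator is placed in the plan at most once so that the loop terminates.

```python
from typing import FrozenSet, List, NamedTuple, Set


class Op(NamedTuple):
    name: str
    inputs: FrozenSet[str]
    outputs: FrozenSet[str]


def formulate_plan(first_fields: Set[str], repository: List[Op]) -> List[List[str]]:
    """Return operator names grouped by level (S202-S208)."""
    available = set(first_fields)   # merged field set so far
    chosen: Set[str] = set()        # operators already placed in the plan
    levels: List[List[str]] = []
    while True:
        # An operator qualifies if at least one input field is in the merged set.
        level = [op for op in repository
                 if op.name not in chosen and op.inputs & available]
        if not level:               # no qualifying operator: the search ends
            break
        levels.append([op.name for op in level])
        for op in level:
            chosen.add(op.name)
            available |= op.outputs  # merge this level's outputs into the set
    return levels


# Hypothetical repository: op_c is never reachable from field "A".
repository = [
    Op("op_a", frozenset({"A", "C"}), frozenset({"A", "C", "Z"})),
    Op("op_b", frozenset({"Z"}), frozenset({"Y"})),
    Op("op_c", frozenset({"Q"}), frozenset({"R"})),
]
plan = formulate_plan({"A"}, repository)
```

Each inner list is one level of the plan; operators in the same inner list have no dependency on one another's outputs within that level.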
In some embodiments of the present application, the first field may include various commodity information, such as the brand, price, and condition of the commodity; it may also include user information, such as a user ID, the user's type (individual or merchant), and the user's credit rating. The data source change may arise when a user changes commodity information; the updated description may be non-standard and inconsistent with the descriptions of other commodities. For example, some users describe a commodity as "apple 11" while others describe it as "iPhone11". If the system does not process such descriptions, the inconsistently described commodities cannot be warehoused and managed uniformly and are hard for users to retrieve. Therefore, in the embodiment of the present application, the data is converted before it is updated and warehoused.
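A conversion of this kind can be as simple as a lookup against a table of known aliases. The sketch below is only one possible approach and is not described in the text: the `ALIASES` table and the function name `normalize_model` are hypothetical.

```python
# Hypothetical alias table mapping colloquial names to the standardized name.
ALIASES = {
    "apple 11": "iPhone11",
    "iphone 11": "iPhone11",
}


def normalize_model(description: str) -> str:
    """Map a free-text model name to its standardized form when an alias is known."""
    key = " ".join(description.lower().split())  # lower-case and collapse whitespace
    return ALIASES.get(key, description.strip())
```

Descriptions that match no alias are returned unchanged apart from surrounding whitespace, so the operator never invents a model name.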
In some embodiments of the present application, one or more triggerable operators are stored in an operator repository, and the configuration information of the operators in the repository may be stored, for example, in an Apollo configuration center. When the system starts, the full configuration information is loaded and the operator repository is generated; the repository is matched against the fields of the trigger change data source to find the executable operators that are triggered, and the object data processing plan is obtained from those operators. The operator repository may also be referred to as the operator library for short.
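Loading the full configuration into an operator repository might look like the sketch below. The JSON layout, the key names, and the function `load_repository` are assumptions made for illustration; the text does not give the configuration format, and no Apollo client call is shown. The first entry mirrors the first operator of Fig. 3, and the second entry is invented.

```python
import json
from collections import defaultdict
from typing import Dict, List

# Hypothetical configuration payload; the real format is not given in the text.
CONFIG_JSON = """
[
  {"op": "method_1", "input": ["A", "C"], "type": "AnyInput", "output": ["A", "C", "Z"]},
  {"op": "method_2", "input": ["Z"], "type": "AllInput", "output": ["Y"]}
]
"""


def load_repository(raw: str) -> Dict[str, List[dict]]:
    """Parse the full configuration and index operators by input field."""
    by_field: Dict[str, List[dict]] = defaultdict(list)
    for entry in json.loads(raw):
        for field_name in entry["input"]:
            by_field[field_name].append(entry)
    return dict(by_field)


repository = load_repository(CONFIG_JSON)
# Operators that a change to field "A" could trigger.
candidates = [entry["op"] for entry in repository.get("A", [])]
```

Indexing by input field is a design choice for this sketch: it lets the matching step look up candidate operators for a changed field directly instead of scanning the whole repository.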
In some embodiments of the present application, the available operators include at least one of: an operator that converts the unit of the commodity price into a designated unit, and an operator that changes the commodity label into a combination of the user label and the commodity label. For example, when a user enters a commodity price of 400 yuan, the unit-conversion operator converts it to 40000 fen, and the converted price is stored in the database and associated with the commodity. As another example, when a user publishes a new commodity, the system determines the user's credit value from the user information (such as purchase or sale records), and the credit value is treated as a user label and associated with the published commodity.
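The two example operators can be sketched as plain functions. The function names are hypothetical, the designated unit is assumed to be fen (consistent with 400 becoming 40000), and the de-duplication in `merge_labels` is an added assumption rather than something the text specifies.

```python
from decimal import Decimal
from typing import List


def price_to_fen(price_yuan: str) -> int:
    """Convert a price in yuan to the designated unit, fen (1 yuan = 100 fen)."""
    # Decimal avoids binary floating-point rounding on prices such as "19.99".
    return int(Decimal(price_yuan) * 100)


def merge_labels(user_labels: List[str], item_labels: List[str]) -> List[str]:
    """Combine user labels and item labels, dropping duplicates but keeping order."""
    return list(dict.fromkeys(user_labels + item_labels))
```

For instance, `price_to_fen("400")` gives the 40000 of the example above.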
In some embodiments of the present application, an operator may convert the information in a field to obtain standardized information; an operator may also derive implicit information from the fields, such as a user value score, whether the release is an initial release, the user activity time, and so on. In other embodiments, the function of an operator can be set according to actual needs and is not limited to the functions listed in the examples herein.
In some embodiments of the application, a single operator comprises the following fields: an input field, a trigger type field, an operator instance field, an output field, and an execution condition field. The trigger type field is AnyInput or AllInput: AnyInput indicates that the current operator can be triggered when the output fields of the upstream operator contain any one of the input fields of the current operator; AllInput indicates that the current operator can be triggered only when the output fields of the upstream operator contain all of the input fields of the current operator. The operator instance field represents the concrete work performed by the operator. As the trigger type field shows, a single incoming field may trigger multiple operators at the same time.
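The two trigger types reduce to a set intersection and a subset test. The sketch below is a minimal illustration; the function name `can_trigger` and its signature are assumptions.

```python
from typing import Set


def can_trigger(trigger_type: str, operator_inputs: Set[str],
                upstream_outputs: Set[str]) -> bool:
    """Decide whether an operator is triggered by the upstream output fields."""
    if trigger_type == "AnyInput":
        # At least one input field appears among the upstream outputs.
        return bool(operator_inputs & upstream_outputs)
    if trigger_type == "AllInput":
        # Every input field appears among the upstream outputs.
        return operator_inputs <= upstream_outputs
    raise ValueError(f"unknown trigger type: {trigger_type}")
```

With input fields A and C and upstream output A alone, an AnyInput operator is triggered while an AllInput operator is not.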
In some embodiments of the present application, qualifying operators that have an upstream-downstream hierarchical relationship are associated into an operator execution chain, in which a plurality of operators in the same level can be executed in parallel. Optionally, operators in adjacent levels can be connected using a "next" pointer. Different trigger sources entering the operator library yield different operator execution chains. The operator execution chain makes the dependency relationships among a plurality of fields, including the input fields and output fields of the operators, quick and intuitive to obtain. Furthermore, by using these dependency relationships to adjust the trigger conditions of one or more operators, the execution strategy of the operators can be optimized so that the path of the operator execution chain is as short as possible, improving efficiency.
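Parallel execution within a level, with levels run in order, can be sketched with a thread pool. This is one possible realization and not the application's implementation: the `Operator` callable type, the function `run_chain`, and the choice to merge a level's results only after every operator in that level has finished are assumptions.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List

Operator = Callable[[Dict[str, object]], Dict[str, object]]


def run_chain(levels: List[List[Operator]], record: Dict[str, object]) -> Dict[str, object]:
    """Run levels in order; operators within one level run in parallel."""
    current = dict(record)
    with ThreadPoolExecutor() as pool:
        for level in levels:
            snapshot = dict(current)  # every operator in a level sees the same input
            results = list(pool.map(lambda op: op(snapshot), level))
            for produced in results:  # merge only after the whole level has finished
                current.update(produced)
    return current
```

Because each operator in a level reads the same snapshot, operators of one level cannot observe one another's outputs; only the next level sees the merged result.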
In some embodiments of the present application, an operator may be defined by a "handlerMeta" data structure, which typically contains the following 5 fields:
(1) input: the set of operator input fields;
(2) handleType: the trigger type, which may be AnyInput or AllInput. When it is AnyInput, the operator can be triggered to execute when the upstream output fields contain any one of the operator's input fields; when it is AllInput, the operator can be triggered to execute only when the upstream output fields contain all of the operator's input fields;
(3) op: the operator instance;
(4) output: the set of operator output fields, which defaults to all outputs;
(5) condition: the execution condition of the operator.
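The five fields can be modeled as a small data class. The sketch below is an assumption-laden illustration: the class name `HandlerMeta`, the Python types, the use of `None` to stand for "all outputs", and the `run` helper are not taken from the text.

```python
from dataclasses import dataclass
from typing import Callable, Dict, FrozenSet, Optional

Record = Dict[str, object]


@dataclass(frozen=True)
class HandlerMeta:
    input: FrozenSet[str]                    # (1) operator input field set
    handle_type: str                         # (2) "AnyInput" or "AllInput"
    op: Callable[[Record], Record]           # (3) operator instance
    output: Optional[FrozenSet[str]] = None  # (4) None stands for "all outputs"
    condition: Callable[[Record], bool] = lambda record: True  # (5) execution condition

    def run(self, record: Record) -> Record:
        """Execute the operator if its condition holds, keeping only declared outputs."""
        if not self.condition(record):
            return {}
        produced = self.op(record)
        if self.output is None:
            return produced
        return {k: v for k, v in produced.items() if k in self.output}
```

An operator whose condition is not met returns an empty result, so it contributes no fields to the downstream merge.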
Furthermore, dependency relationships among operators can be generated from the handlerMeta data structure. Such a dependency relationship may be called a "handlechain" operator execution chain, in which the data structure of a node may contain the following 4 fields:
(1) input: the input sets of all operators in the current operator execution chain;
(2) ops: all executable handlerMeta in the current operator execution chain, which can be executed in parallel;
(3) output: the output sets of all operators in the current operator execution chain;
(4) next: a pointer to the next processing set.
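A node with these four fields forms a singly linked list. In the sketch below the class name `ChainNode` and the helpers `link` and `walk` are hypothetical, and operators are represented by their names rather than by full handlerMeta objects to keep the example short.

```python
from dataclasses import dataclass
from typing import List, Optional, Set


@dataclass
class ChainNode:
    input: Set[str]                     # (1) input sets of the operators in this node
    ops: List[str]                      # (2) operators that may run in parallel
    output: Set[str]                    # (3) output sets of the operators in this node
    next: Optional["ChainNode"] = None  # (4) pointer to the next processing set


def link(nodes: List[ChainNode]) -> Optional[ChainNode]:
    """Connect nodes through their next pointers and return the head."""
    for upstream, downstream in zip(nodes, nodes[1:]):
        upstream.next = downstream
    return nodes[0] if nodes else None


def walk(head: Optional[ChainNode]) -> List[List[str]]:
    """Follow next pointers and list the operators level by level."""
    order = []
    while head is not None:
        order.append(head.ops)
        head = head.next
    return order
```

Walking the chain from the head yields the execution order, and comparing each node's input and output sets shows which fields depend on which.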
In the embodiment of the present application, the generation logic of the operator execution chain may be understood as follows: a message from the MQ message queue is parsed to generate the original input set; the original input set is fed into the operator warehouse, and the operators that can currently be triggered are matched against the input set; the outputs of those operators are merged with the input of the current layer to form the merged output; the merged output is then used to search for the set of operators that can be triggered in the next layer, and so on, until no new triggerable operator can be found. The loop then ends, and the operator execution chain is generated from the operators found.
In an embodiment of the application, the operator execution chain may include a number of handlerMeta and is constructed through next pointers. Different original inputs entering the operator warehouse produce different, customized operator execution chains; from these chains, the dependency relationships among the fields can be obtained quickly and intuitively, and the operator execution strategy can be optimized.
In order to more clearly describe the operators and the operator execution chains used in the object data processing plan of the embodiment of the present application, an illustrative example is provided below for detailed description.
FIG. 3 is a diagram illustrating an example of a plurality of operators in an operator repository in an embodiment of the present application, where 6 base operators are shown, including fields that each operator contains.
Specifically, the first operator (the operator at the top left corner in Fig. 3) contains the following fields:
input field input: A, C;
operator instance field op: method_1;
trigger type field type: AnyInput;
output field output: A, C and Z.
Here, method_1 represents the data processing function performed by the operator, such as converting a price from the RMB unit "yuan" to the RMB unit "fen". The remaining 5 operators in Fig. 3 are constructed as shown and will be referred to in the following description.
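A data processing function of this kind could look like the following minimal Python sketch. The function name yuan_to_fen is hypothetical, and the choice to reject prices with more than two decimal places is an assumption of this example rather than behavior described in the application.

```python
from decimal import Decimal


def yuan_to_fen(price_yuan: str) -> int:
    """Convert an RMB price in yuan (decimal string) to an integer number of fen."""
    fen = Decimal(price_yuan) * 100
    # Assumption: reject sub-fen precision instead of silently rounding.
    if fen != fen.to_integral_value():
        raise ValueError(f"price {price_yuan!r} has more than two decimal places")
    return int(fen)


print(yuan_to_fen("12.34"))  # 1234
```

Decimal is used instead of float so that a value such as 12.34 is not affected by binary floating-point rounding before it is scaled.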
Figs. 4 and 5 schematically show two operator execution chains according to the embodiment of the present application, both built from the operators of the embodiment of Fig. 3; they are described in detail below.
Example 1
For the embodiment of Fig. 4, the input field is A. Triggerable operators are retrieved from the operator repository, and the triggerable operator method_1 (i.e., the operator at the top left corner in Fig. 3) is found; the input field A is transformed by the operator method_1 to obtain the output field set A, C and Z; the output fields A, C and Z are merged (or spliced) with the input field A to obtain the field set A, C and Z; here, the operator method_1 can be regarded as the triggerable operator at the first level, level_1, of the operator execution chain;
then, the obtained field set A, C and Z is used as the input fields to search the operator repository for triggerable operators; the level_2 triggerable operator method_2 is retrieved (the input field of the operator method_2 is Z and its trigger type is AllInput, so the trigger condition is met); after the input fields A, C and Z are transformed by the operator method_2, the output field set X and Z is obtained; the output fields X and Z are merged with the input fields A, C and Z to obtain the field set A, C, X and Z; the field set A, C, X and Z is used as the input fields to continue searching the operator repository for triggerable operators; if no operator meeting the trigger condition is found, the fields A, C, X and Z are output directly; here, the operator method_2 can be regarded as the triggerable operator at the second level, level_2, of the operator execution chain;
then, the triggerable operator method_1 of level_1 and the triggerable operator method_2 of level_2 constitute an operator execution chain, and the two operators are connected by a next pointer.
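The level-by-level search traced in Example 1 can be sketched as follows. This is a minimal illustration in Python, not the implementation of the application: the names Op and build_chain are hypothetical, only the two operators whose fields are given in the text are included (the other operators of Fig. 3 are omitted), and it is assumed that an operator already placed in the chain is not selected again.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable, List, Set, Tuple


@dataclass(frozen=True)
class Op:
    name: str
    inputs: FrozenSet[str]
    trigger: str              # "AnyInput" or "AllInput"
    outputs: FrozenSet[str]

    def triggerable(self, fields: Set[str]) -> bool:
        if self.trigger == "AnyInput":
            return bool(self.inputs & fields)
        return self.inputs <= fields


def build_chain(initial: Iterable[str], repository: List[Op]) -> Tuple[List[List[str]], Set[str]]:
    """Return the operator names per level and the final merged field set."""
    fields: Set[str] = set(initial)
    used: Set[str] = set()
    levels: List[List[str]] = []
    while True:
        # Operators not yet in the chain that the current field set can trigger.
        level = [op for op in repository
                 if op.name not in used and op.triggerable(fields)]
        if not level:
            break  # no new triggerable operator: the chain is complete
        levels.append([op.name for op in level])
        used.update(op.name for op in level)
        for op in level:
            fields |= op.outputs  # merge outputs with the inputs of this level
    return levels, fields


REPOSITORY = [
    Op("method_1", frozenset({"A", "C"}), "AnyInput", frozenset({"A", "C", "Z"})),
    Op("method_2", frozenset({"Z"}), "AllInput", frozenset({"X", "Z"})),
]

levels, fields = build_chain({"A"}, REPOSITORY)
print(levels)          # [['method_1'], ['method_2']]
print(sorted(fields))  # ['A', 'C', 'X', 'Z']
```

For the input field A, the sketch reproduces the two levels and the final field set A, C, X and Z described above.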
Example 2
For the embodiment of Fig. 5, the input fields are C and D. Triggerable operators are retrieved from the operator repository, and the level_1 triggerable operator method_1 is found; the input field C is transformed by the operator method_1 to obtain the output fields A, C and Z. The output fields A, C and Z are merged with the input fields C and D to obtain the fields A, C, Z and D, which are then used as the input fields to retrieve triggerable operators from the operator repository. Two level_2 triggerable operators are retrieved: method_2 and method_3. The input fields A, C, Z and D are transformed by the operators method_2 and method_3 to obtain the output fields X and Z and the output fields A, Z, D, E and F, respectively. The output fields X and Z, the output fields A, Z, D, E and F, and the input fields A, C, Z and D are merged to obtain the merged field set A, C, Z, D, X, E and F, based on which triggerable operators are again retrieved from the operator repository. The process continues in the same way until no triggerable operator remains. In this embodiment, the resulting output field set is A, C, Z, D, X, E, F, G, H, X, B and J.
As can be seen from Fig. 3, the operator execution chain in Example 2 includes 5 levels, each level including one or more operators. There is no dependency between the operators method_2 and method_3 in level 2, so they can be executed in parallel. Executing the operators of the same level in parallel allows the changed data source to be processed promptly, improves object data processing efficiency, minimizes the time consumed by data compression, and improves the real-time performance of system data updates.
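Parallel execution within a level and sequential execution across levels could be sketched as follows in Python. The use of a thread pool, the function run_chain, and the three sample operators and their field names are assumptions made for this illustration; each operator is assumed to read the record without modifying it and to return only the fields it produces.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List

Record = Dict[str, object]
Operator = Callable[[Record], Record]


def run_chain(levels: List[List[Operator]], record: Record) -> Record:
    """Run levels in order; operators inside one level run concurrently."""
    current = dict(record)
    with ThreadPoolExecutor() as pool:
        for level in levels:
            snapshot = dict(current)  # every operator of a level sees the same input
            partials = list(pool.map(lambda op: op(snapshot), level))
            for partial in partials:
                current.update(partial)  # merge outputs before the next level starts
    return current


# Hypothetical operators for illustration only.
def to_fen(r: Record) -> Record:
    return {"price_fen": r["price_yuan"] * 100}


def upper_brand(r: Record) -> Record:
    return {"brand": str(r["brand"]).upper()}


def flag_cheap(r: Record) -> Record:
    return {"cheap": r["price_fen"] < 1000}


result = run_chain([[to_fen, upper_brand], [flag_cheap]],
                   {"price_yuan": 5, "brand": "acme"})
print(result)
```

Here flag_cheap is placed in the second level because it depends on price_fen, which is produced in the first level; to_fen and upper_brand do not depend on each other and therefore share a level.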
In addition, in some embodiments of the present application, in order to facilitate troubleshooting of online problems of the system, all written data may be reported to a log, forming a near-real-time log reporting service. Optionally, the result data is encapsulated into an agreed format and sent to the log reporting service via MQ; the log reporting service is isolated from the main service of the system, so that the stability of the main service can be ensured.
In some embodiments of the present application, a failure retry mechanism may be set. For example, abnormal situations such as an interface call timeout, a data format conversion error, or a failure to write data to the ES (Elasticsearch) system may be captured (for example, about 1000 abnormal writes may be captured per hour), and a retry MQ message is then sent so that the data is rewritten to the ES system. In some embodiments of the present application, day-level scheduled data correction may be set: the reported log data is analyzed, and all data written into the system on the previous day is corrected at midnight. In some embodiments of the present application, after a system program is upgraded or repaired, the system can be restored through a "SavePoint" global image; each operator needs to be assigned a uid for the restoration to work normally.
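The failure retry idea can be illustrated with the following minimal Python sketch. An in-memory deque stands in for the retry MQ and a plain callable stands in for the write to the ES system; the names write_or_enqueue and drain_retries, the bounded number of retry rounds, and the simulated flaky writer are all assumptions of this example.

```python
from collections import deque
from typing import Callable, Deque, Dict, List, Set

Record = Dict[str, object]


def write_or_enqueue(record: Record, write: Callable[[Record], None],
                     retry_queue: Deque[Record]) -> bool:
    """Try to write; on any exception, put the record on the retry queue."""
    try:
        write(record)
        return True
    except Exception:
        retry_queue.append(record)
        return False


def drain_retries(write: Callable[[Record], None], retry_queue: Deque[Record],
                  max_rounds: int = 3) -> int:
    """Re-attempt queued writes for a bounded number of rounds; return what is left."""
    for _ in range(max_rounds):
        for _ in range(len(retry_queue)):
            write_or_enqueue(retry_queue.popleft(), write, retry_queue)
        if not retry_queue:
            break
    return len(retry_queue)


# Simulated writer that fails the first time it sees each record id.
stored: List[Record] = []
seen: Set[object] = set()


def flaky_write(record: Record) -> None:
    if record["id"] not in seen:
        seen.add(record["id"])
        raise IOError("simulated write failure")
    stored.append(record)


queue: Deque[Record] = deque()
first_try = write_or_enqueue({"id": 1, "price": 500}, flaky_write, queue)
remaining = drain_retries(flaky_write, queue)
print(first_try, remaining, stored)
```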
According to at least one embodiment of the present application, when maintaining the commodity information data, whether a user (which may be an individual or a merchant, or a buyer or a seller) updates a data source or a system updates a data source (for example, system automatic refresh data or system update failure data), a unified data processing flow may be triggered for the changed data source, and the standardized data is obtained through conversion of a related operator, and is updated and warehoused in time. Therefore, a more accurate recall index can be provided for the user, and the accuracy and the recall speed of the whole system for recalling the commodities are improved.
Corresponding to the object data processing method provided by the application, the application also provides an object data processing apparatus. Fig. 6 is a schematic structural diagram illustrating an embodiment of an object data processing apparatus according to an embodiment of the present application. As shown in Fig. 6, the object data processing apparatus 400 includes:
a receiving module 110, configured to receive a trigger change data source;
a determining module 120, configured to determine that the trigger change data source corresponds to at least a first field;
the formulating module 130 is configured to formulate an object data processing plan corresponding to the first field, where the object data processing plan includes one or more operators, and after the execution of an upstream operator in the one or more operators is completed, the downstream operator is triggered to be executed;
an executing module 140, configured to execute the object data processing plan on the trigger change data source, so as to perform data processing on a first field in the trigger change data source;
and the updating module 150 is used for updating the first field in the object data by using the processed data.
Optionally, the trigger change data source includes at least one of: an object information change data source, a system refresh data source, and an update failure retry data source, wherein the object information change data source is generated when the object information is changed; the system refresh data source is generated when the system is refreshed; and the update failure retry data source is generated when updating the object data fails.
Optionally, the formulating module 130 includes:
a first determining submodule for determining at least one operator from the first field, the at least one operator being an operator triggerable by the first field;
a first merging processing sub-module, configured to merge the output field of the at least one operator with the first field serving as the input field, and determine, based on the merged field set, a next-stage operator that meets a condition, where at least one input field of the next-stage operator belongs to the merged field set;
the second merging processing submodule is used for merging the output field and the input field of the next-stage operator, and determining the next-stage operator which meets the condition based on the merged field set until the operator which meets the condition is empty;
and the second determining submodule is used for determining the object data processing plan corresponding to the first field based on the operators of all levels meeting the conditions.
Optionally, the second determining submodule includes: an association unit, configured to associate the operators that meet the conditions and have upstream and downstream relations into an operator execution chain, wherein a plurality of operators in the same level can be executed in parallel.
Optionally, a single said operator comprises the following fields: an input field, a trigger type field, an operator instance field, an output field and an execution condition field; the trigger type field comprises an AnyInput field or an AllInput field, and the AnyInput field indicates that the current operator can be triggered when the output field of the upstream operator contains any field in the input field of the current operator; the AllInput field indicates that the current operator can be triggered when the output field of the upstream operator contains all fields in the input field of the current operator.
Optionally, the object data comprises commodity information data.
Optionally, the commodity information data includes attribute data of the commodity itself and/or seller attribute data corresponding to the commodity; the goods information data includes at least one of the following fields: brand, category, card bound status, description of the item, item label, price, inventory, presence or absence of video, seller release time, seller ID, seller liveness.
Optionally, the operator comprises at least one of: converting the unit of the commodity price into a designated unit; and changing the commodity label into a combination of the user label and the commodity label.
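As an illustration of the second kind of operator, the following minimal Python sketch combines user labels and commodity labels. The function name merge_labels, the field names user_labels and item_labels, and the choice to remove duplicates while keeping first-seen order are assumptions of this example.

```python
from typing import Dict, List


def merge_labels(record: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """Return a copy whose item_labels combine user labels and commodity labels."""
    combined = record.get("user_labels", []) + record.get("item_labels", [])
    # dict.fromkeys drops duplicates while preserving first-seen order.
    merged = list(dict.fromkeys(combined))
    return {**record, "item_labels": merged}


updated = merge_labels({"user_labels": ["verified", "fast-ship"],
                        "item_labels": ["like-new", "fast-ship"]})
print(updated["item_labels"])  # ['verified', 'fast-ship', 'like-new']
```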
The present application further provides an electronic device, which includes: a processor and a memory storing computer program instructions; the object data processing method of any one of the above embodiments is implemented when the processor executes the computer program instructions.
Fig. 7 shows a hardware structure diagram of an embodiment of the electronic device provided in the present application.
As shown in fig. 7, the electronic device may include a processor 601 and a memory 602 that stores computer program instructions.
Specifically, the processor 601 may include a Central Processing Unit (CPU), or an Application Specific Integrated Circuit (ASIC), or may be configured to implement one or more integrated circuits of the embodiments of the present application.
Memory 602 may include mass storage for data or instructions. By way of example, and not limitation, memory 602 may include a Hard Disk Drive (HDD), floppy disk drive, flash memory, optical disk, magneto-optical disk, tape, or Universal Serial Bus (USB) drive or a combination of two or more of these. Memory 602 may include removable or non-removable (or fixed) media, where appropriate. The memory 602 may be internal or external to the electronic device, where appropriate. In a particular embodiment, the memory 602 is a non-volatile solid-state memory.
The memory may include Read Only Memory (ROM), Random Access Memory (RAM), magnetic disk storage media devices, optical storage media devices, flash memory devices, electrical, optical, or other physical/tangible memory storage devices. Thus, in general, the memory includes one or more tangible (non-transitory) computer-readable storage media (e.g., memory devices) encoded with software comprising computer-executable instructions and when the software is executed (e.g., by one or more processors), it is operable to perform operations described with reference to the methods according to an aspect of the present disclosure.
The processor 601 implements any one of the object data processing methods in the above embodiments by reading and executing the computer program instructions stored in the memory 602.
In one example, the electronic device may also include a communication interface 603 and a bus 610. As shown in fig. 7, the processor 601, the memory 602, and the communication interface 603 are connected via a bus 610 to complete communication therebetween. The electronic device in the embodiment of the application can be a server or other computing devices, and can also be a cloud server.
The communication interface 603 is mainly used for implementing communication between modules, apparatuses, units and/or devices in this embodiment.
Bus 610 includes hardware, software, or both to couple the components of the electronic device to each other. By way of example, and not limitation, a bus may include an Accelerated Graphics Port (AGP) or other graphics bus, an Extended Industry Standard Architecture (EISA) bus, a Front Side Bus (FSB), a HyperTransport (HT) interconnect, an Industry Standard Architecture (ISA) bus, an InfiniBand interconnect, a Low Pin Count (LPC) bus, a memory bus, a Micro Channel Architecture (MCA) bus, a Peripheral Component Interconnect (PCI) bus, a PCI-Express (PCIe) bus, a Serial Advanced Technology Attachment (SATA) bus, a Video Electronics Standards Association Local Bus (VLB), or other suitable bus or a combination of two or more of these. Bus 610 may include one or more buses, where appropriate. Although specific buses are described and shown in the embodiments of the application, any suitable buses or interconnects are contemplated by the application.
In addition, in combination with the object data processing method in the above embodiments, the embodiments of the present application may provide a computer storage medium for implementation. The computer storage medium has computer program instructions stored thereon; the computer program instructions, when executed by a processor, implement any of the object data processing methods in the above embodiments.
It is to be understood that the present application is not limited to the particular arrangements and instrumentality described above and shown in the attached drawings. A detailed description of known methods is omitted herein for the sake of brevity. In the above embodiments, several specific steps are described and shown as examples. However, the method processes of the present application are not limited to the specific steps described and illustrated, and those skilled in the art can make various changes, modifications, and additions or change the order between the steps after comprehending the spirit of the present application.
The functional blocks shown in the above-described structural block diagrams may be implemented as hardware, software, firmware, or a combination thereof. When implemented in hardware, it may be, for example, an electronic circuit, an Application Specific Integrated Circuit (ASIC), suitable firmware, plug-in, function card, or the like. When implemented in software, the elements of the present application are the programs or code segments used to perform the required tasks. The program or code segments can be stored in a machine-readable medium or transmitted by a data signal carried in a carrier wave over a transmission medium or a communication link. A "machine-readable medium" may include any medium that can store or transfer information. The machine-readable medium may include non-transitory computer-readable storage media such as electronic circuits, semiconductor memory devices, ROMs, flash memories, erasable ROMs (EROMs), floppy disks, CD-ROMs, optical disks, hard disks, fiber optic media, Radio Frequency (RF) links, and so forth. The code segments may be downloaded via computer networks such as the Internet, an intranet, etc.
It should also be noted that the exemplary embodiments mentioned in this application describe some methods or systems based on a series of steps or devices. However, the present application is not limited to the order of the above-described steps, that is, the steps may be performed in the order mentioned in the embodiments, may be performed in an order different from the order in the embodiments, or may be performed simultaneously.
Aspects of the present disclosure are described above with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, implement the functions/acts specified in the flowchart and/or block diagram block or blocks. Such a processor may be, but is not limited to, a general purpose processor, a special purpose processor, an application specific processor, or a field programmable logic circuit. It will also be understood that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware for performing the specified functions or acts, or combinations of special purpose hardware and computer instructions.
As described above, only the specific embodiments of the present application are provided, and it can be clearly understood by those skilled in the art that, for convenience and brevity of description, the specific working processes of the system, the module and the unit described above may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again. It should be understood that the scope of the present application is not limited thereto, and any person skilled in the art can easily conceive various equivalent modifications or substitutions within the technical scope of the present application, and these modifications or substitutions should be covered within the scope of the present application.

Claims (11)

1. A method of processing object data, the object data comprising a plurality of fields, the method comprising:
receiving a triggering change data source;
determining that the trigger change data source corresponds to at least a first field;
formulating an object data processing plan corresponding to the first field, wherein the object data processing plan comprises one or more operators, and the downstream operator is triggered and executed after the upstream operator in the one or more operators is executed;
executing the object data processing plan on the trigger change data source so as to process data of a first field in the trigger change data source;
and updating the first field in the object data by using the processed data.
2. The method of claim 1, wherein,
the trigger change data source includes at least one of: object information change data source, system refresh data source, update failure retry data source, wherein,
generating an object information change data source when the object information is changed;
generating a system refresh data source when a system is refreshed;
generating an update failure retry data source when updating the object data fails.
3. The method of claim 1, wherein the formulating the object data processing plan corresponding to the first field comprises:
determining at least one operator from the first field, the at least one operator being an operator triggerable by the first field;
merging the output field of the at least one operator with the first field serving as the input field, and determining a next-stage operator meeting conditions based on the merged field set, wherein at least one input field of the next-stage operator belongs to the merged field set;
merging the output field and the input field of the next-stage operator, and determining the next-stage operator which meets the condition based on the merged field set until the operator which meets the condition is empty;
and determining the object data processing plan corresponding to the first field based on operators of all levels meeting the conditions.
4. The method of claim 3, wherein determining the object data processing plan corresponding to the first field based on the operators at the respective stages that meet the condition comprises:
and associating operators which meet the conditions and have upstream and downstream relations into an operator execution chain, wherein a plurality of operators in the same level can be executed in parallel.
5. The method of claim 1, wherein,
a single one of the operators comprises the following fields: an input field, a trigger type field, an operator instance field, an output field and an execution condition field;
wherein the trigger type field comprises an AnyInput field or an AllInput field, and the AnyInput field indicates that the current operator can be triggered when the output field of the upstream operator contains any one of the input fields of the current operator; the AllInput field indicates that the current operator can be triggered when the output field of the upstream operator contains all fields in the input field of the current operator.
6. The method according to any one of claims 1 to 5, wherein,
the object data includes commodity information data.
7. The method of claim 6, wherein,
the commodity information data comprises attribute data of the commodity and/or seller attribute data corresponding to the commodity;
the goods information data includes at least one of the following fields: brand, category, binding status, item description, item label, price, inventory, presence of video, seller release time, seller ID, seller liveness.
8. The method according to any one of claims 1 to 5, wherein,
the operator comprises at least one of: converting the unit of the commodity price into a designated unit; and changing the commodity label into a combination of the user label and the commodity label.
9. An object data processing apparatus, said object data including a plurality of fields, comprising:
the receiving module is used for receiving a triggering change data source;
a determining module, configured to determine that the trigger change data source corresponds to at least a first field;
a formulating module, configured to formulate an object data processing plan corresponding to the first field, wherein the object data processing plan comprises one or more operators, and a downstream operator is triggered to be executed after execution of an upstream operator among the one or more operators is completed;
the execution module is used for executing the object data processing plan on the trigger change data source so as to process data of a first field in the trigger change data source;
and the updating module is used for updating the first field in the object data by using the processed data.
10. An electronic device, characterized in that the electronic device comprises: a processor and a memory storing computer program instructions;
the electronic device, when executing the computer program instructions, implements the method of any of claims 1-8.
11. A computer-readable storage medium having computer program instructions stored thereon which, when executed by a processor, implement the method of any one of claims 1-8.
CN202111142195.XA 2021-09-28 2021-09-28 Object data processing method and device, electronic equipment and storage medium Pending CN115878637A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111142195.XA CN115878637A (en) 2021-09-28 2021-09-28 Object data processing method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111142195.XA CN115878637A (en) 2021-09-28 2021-09-28 Object data processing method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN115878637A true CN115878637A (en) 2023-03-31

Family

ID=85763386

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111142195.XA Pending CN115878637A (en) 2021-09-28 2021-09-28 Object data processing method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN115878637A (en)

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination