CN112732788A - Data processing method and device, electronic equipment and storage medium - Google Patents

Data processing method and device, electronic equipment and storage medium Download PDF

Info

Publication number
CN112732788A
CN112732788A CN202110032338.5A CN202110032338A CN112732788A CN 112732788 A CN112732788 A CN 112732788A CN 202110032338 A CN202110032338 A CN 202110032338A CN 112732788 A CN112732788 A CN 112732788A
Authority
CN
China
Prior art keywords
data
measurement
control data
data processing
time sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110032338.5A
Other languages
Chinese (zh)
Inventor
唐卿
周倜
刘晓辉
欧阳柳
王磊
张祖丽
王尧
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Unit 63920 Of Pla
Original Assignee
Unit 63920 Of Pla
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Unit 63920 Of Pla filed Critical Unit 63920 Of Pla
Priority to CN202110032338.5A priority Critical patent/CN112732788A/en
Publication of CN112732788A publication Critical patent/CN112732788A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2474Sequence data queries, e.g. querying versioned data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification

Abstract

The invention discloses a data processing method, a data processing device, electronic equipment and a storage medium, wherein the method comprises the following steps: acquiring measurement and control data from a plurality of systems; analyzing the measurement and control data according to preset configuration information so as to identify the dependency relationship in the measurement and control data and acquire the time sequence information of the measurement and control data; clustering the measurement and control data according to the dependency relationship of the measurement and control data to generate data without coupling relationship; based on a dynamic load balancing strategy, sending data without coupling relation to different data processing nodes so that the data processing nodes process the data in parallel; and receiving the data processing results of the data processing nodes, and outputting the data processing results in sequence according to the time sequence information. The invention can improve the accuracy and the high efficiency of data parallel processing.

Description

Data processing method and device, electronic equipment and storage medium
Technical Field
The present invention relates to the field of data processing, and in particular, to a data processing method and apparatus, an electronic device, and a storage medium.
Background
With the continuous advance of aerospace engineering in China, the density of aerospace tasks is increasingly higher, flight measurement and control tasks are increasingly complex, and the data downlink code rate is increased by orders of magnitude. In order to ensure that the measurement and control system can quickly judge and accurately control the flight process state and provide high requirements for the real-time performance of data processing, the measurement and control system is limited by the performance and resources of computer hardware, and in order to improve the processing performance, a parallel processing technology is generally adopted.
The system composition scale of modern aerospace engineering is very large, the interior of the system is divided into a plurality of subsystems according to services, the subsystems are composed of a plurality of subsystems, and the subsystems are closely linked when performing cooperative work, so that the downlink measurement and control data of the systems have strong coupling and time sequence characteristics.
At present, when data parallel processing is designed in a large-scale space measurement and control application software system, in order to solve the problems of time sequence and data coupling, the association relationship between data sources is often manually identified, a static data distribution strategy is established, and then parallel processing of load balancing is performed.
However, the manual maintenance cost is high, the difficulty is high, the required period is long, and the efficiency is low; in addition, due to human difference, the professional nature of the personnel and other factors, the identification errors and the missing items can be caused, and wrong input is generated for later static load balancing information configuration; in addition, due to human factors and limitations, in static data allocation, the granularity of data allocation is coarse, and the processing performance of a hardware system is difficult to give full play.
Disclosure of Invention
In view of the above, the present invention provides a data processing method, an apparatus, an electronic device and a storage medium to solve at least one of the above-mentioned problems.
According to a first aspect of the present invention, there is provided a data processing method, the method comprising:
acquiring measurement and control data from a plurality of systems;
analyzing the measurement and control data according to preset configuration information so as to identify the dependency relationship in the measurement and control data and acquire the time sequence information of the measurement and control data;
clustering the measurement and control data according to the dependency relationship of the measurement and control data to generate data without coupling relationship;
based on a dynamic load balancing strategy, sending data without coupling relation to different data processing nodes so that the data processing nodes process the data in parallel;
and receiving the data processing results of the data processing nodes, and outputting the data processing results in sequence according to the time sequence information.
According to a second aspect of the present invention, there is provided a data processing apparatus, the apparatus comprising:
the data acquisition unit is used for acquiring measurement and control data from a plurality of systems;
the analysis unit is used for analyzing the measurement and control data according to preset configuration information so as to identify the dependency relationship in the measurement and control data and acquire the time sequence information of the measurement and control data;
the clustering unit is used for clustering the measurement and control data according to the dependency relationship of the measurement and control data to generate data without coupling relationship;
the data sending unit is used for sending the data without the coupling relation to different data processing nodes based on a dynamic load balancing strategy so as to enable the data processing nodes to process the data in parallel;
and the data output unit is used for receiving the data processing results of the data processing nodes and outputting the data processing results in sequence according to the time sequence information.
According to a third aspect of the present invention, there is provided an electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the steps of the method when executing the program.
According to a fourth aspect of the invention, a computer-readable storage medium is provided, on which a computer program is stored which, when being executed by a processor, carries out the steps of the above-mentioned method.
According to the technical scheme, the acquired measurement and control data are analyzed according to the preset configuration information, the dependency relationship in the measurement and control data is identified, the time sequence information of the measurement and control data is acquired, then the measurement and control data are clustered according to the dependency relationship of the measurement and control data, data without coupling relationship are generated, then the data without coupling relationship are sent to different data processing nodes based on a dynamic load balancing strategy, the data processing nodes process the data in parallel, and when the data processing results of the data processing nodes are received, the data processing results can be output in sequence according to the time sequence information.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
FIG. 1 is a flow diagram of a data processing method according to an embodiment of the invention;
FIG. 2 is a schematic diagram of data clustering according to an embodiment of the present invention;
FIG. 3 is a diagram illustrating a process for writing parsing results based on index numbers according to an embodiment of the present invention;
FIG. 4 is a diagram illustrating an output parsing result according to an embodiment of the present invention;
FIG. 5 is a block diagram of a data processing apparatus according to an embodiment of the present invention;
FIG. 6 is an exemplary block diagram of a data processing apparatus according to an embodiment of the present invention;
fig. 7 is a schematic block diagram of a system configuration of an electronic apparatus 600 according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
At present, a static data distribution strategy is established by manually identifying the incidence relation between data sources, and then parallel processing of load balancing is carried out. However, the efficiency and accuracy of manual identification are low, and more errors are generated in the later data processing. Based on the above, the embodiment of the invention provides a data processing scheme, which fully considers the characteristics of data dependence and time sequence constraint, does not need manual identification, and can ensure the correctness and the high efficiency of data parallel processing. Embodiments of the present invention are described in detail below with reference to the accompanying drawings.
Fig. 1 is a flowchart of a data processing method according to an embodiment of the present invention, as shown in fig. 1, the method including:
step 101, acquiring measurement and control data from a plurality of systems.
And 102, analyzing the measurement and control data according to preset configuration information so as to identify the dependency relationship in the measurement and control data and acquire the time sequence information of the measurement and control data.
103, clustering the measurement and control data according to the dependency relationship of the measurement and control data to generate data without coupling relationship.
And 104, sending the data without the coupling relation to different data processing nodes based on the dynamic load balancing strategy so that the data processing nodes process the data in parallel.
And 105, receiving the data processing results of the data processing nodes, and outputting the data processing results in sequence according to the time sequence information.
The method comprises the steps of analyzing obtained measurement and control data according to preset configuration information, identifying the dependency relationship in the measurement and control data and obtaining time sequence information of the measurement and control data, clustering the measurement and control data according to the dependency relationship of the measurement and control data to generate data without coupling relationship, sending the data without coupling relationship to different data processing nodes based on a dynamic load balancing strategy, processing the data by the data processing nodes in parallel, outputting the data processing results in sequence according to the time sequence information when the data processing results of the data processing nodes are received, and compared with the prior art, the method provided by the embodiment of the invention can identify the dependency relationship in the measurement and control data without manual identification, so that the accuracy and the efficiency of data parallel processing can be improved.
In an embodiment, for step 103, a topological directed graph of the measurement and control data may be generated according to the dependency relationship of the measurement and control data; and then, clustering the data with the dependency relationship based on a topological sorting algorithm and a topological directed graph to generate data without coupling relationship.
In the specific implementation process, the data dependency-based topological sorting algorithm can ensure that the depended data and the depended source packet are processed firstly, the data and source packet with dependency precondition are processed later, the source packet with dependency constraint is processed serially, and the source packet without dependency constraint is processed parallelly, and the main process comprises the following steps: relationship description, data clustering and topological sorting, and the three processes are respectively described below.
(1) Description of relationships
And mapping the data and the dependency relationship thereof to the graph to form a directed graph, wherein the data is mapped to a vertex in the graph, and the dependency relationship is mapped to an edge on the graph.
Defining (dependency), and assuming the data set as D, the data dependency R is defined as follows:
Figure BDA0002891885020000041
if and only if data a depends on data b.
Defining (dependency graph), assuming that D is a data set and R is a data dependency, G ═ V, E > is defined as follows:
Figure BDA0002891885020000051
Figure BDA0002891885020000052
and is
Figure BDA0002891885020000053
Where V represents the set of vertices of the graph, E represents the set of edges,
Figure BDA0002891885020000054
indicating that there is one edge e that is present,
Figure BDA0002891885020000055
representing elements of the data set D other than a,
Figure BDA0002891885020000056
representing elements in the representation data set D other than b.
(2) Data clustering
According to the dependency relationship shown in fig. 2, the data with dependency are merged, and then classified according to the data merging process, wherein the classification process is shown in fig. 2.
(3) Topological ordering
From the perspective of fig. 2, if the vertex a has an outgoing edge e ═ a, b >, the data corresponding to the point a needs to be calculated after the data corresponding to the point b is calculated, that is, the vertex with an incoupling degree of not 0 is calculated before the vertex with an incoupling degree of 0. Therefore, the dependency sorting problem is converted into a topological sorting problem on the directed graph, and each time a vertex on the graph is scanned, the vertex with the in-degree of 0 is moved into the parallel queue.
And the sorting algorithm decides whether to move the vertex into the parallel set of the calculation according to whether the in-degree of the vertex is 0 or not, and if so, the vertex is moved out of the current graph until no vertex exists in the graph (sorting is finished) or the out-degrees of the vertices are all >0 (with a ring, sorting is unsuccessful). The specific algorithm is as follows:
Figure BDA0002891885020000057
in a specific implementation process, after the time sequence information of the measurement and control data is acquired in step 102, a data index number may be added to each data in the measurement and control data according to the time sequence information based on a time sequence indexing technology.
For step 105, the data processing results can be output in order according to the data index number.
To ensure data consistency, in step 102, the measurement and control data are analyzed sequentially according to a time sequence during real-time analysis (i.e., analysis). By a time sequence indexing technology, a data index number is established, and the problem of parallel processing time sequence is solved by adopting an analysis result fast writing and reading mechanism.
(1) Defining data index numbers
Three elements defining the data index number as shown below: data source identification, virtual channel, frame number:
DATA:{SOURCE,VCID,FRAMEID}
in the data preprocessing step (i.e., the analysis operation in step 102), an index number is added to each data frame according to the data sequence.
(2) Fast writing and reading of the parsed result based on the index number (i.e., the data processing result of step 105)
Fig. 3 shows a fast writing process based on the index number parsing result, when the parsing result is written, a data index tree is established according to the index number, the parsing result is inserted in the leaf node of the index tree by using an insertion sorting method, and the data writing process supports concurrent writing.
(3) Resolution result readout support concurrent readout
Fig. 4 is a schematic diagram of reading out all the analysis results after two times of concurrent reading, and as shown in fig. 4, first, an initial reading flag is set for each reading thread, and each time a flag position result is read out, the reading flag is deleted from the index tree and updated.
The parallel processing scheme for the strongly-coupled data fully considers the characteristics of data dependence and time sequence constraint, and ensures the correctness and the high efficiency of the data parallel processing. The topological sorting algorithm based on the data dependency relationship solves the problem of parallel processing node distribution during data dependency, processes depended data and source packets firstly, processes the data and source packets with dependency preconditions and then processes the data and source packets with dependency relationship restrictions, serially processes the source packets with dependency relationship restrictions, and processes the source packets without dependency relationship restrictions in parallel, thereby improving the processing performance. Meanwhile, in order to ensure data consistency, the measurement and control data are analyzed in sequence according to time sequence during real-time analysis, a data index number is established through a time sequence indexing technology, and the problem of parallel processing time sequence is solved by adopting an analysis result fast writing and reading mechanism.
Based on similar inventive concepts, the embodiment of the present invention further provides a data processing apparatus, which is preferably used for implementing the flow of the data processing method.
Fig. 5 is a block diagram of the data processing apparatus, and as shown in fig. 5, the data processing apparatus includes: data acquisition unit 1, analysis unit 2, clustering unit 3, data transmission unit 4 and data output unit 5, wherein:
the data acquisition unit 1 is used for acquiring measurement and control data from a plurality of systems;
the analysis unit 2 is used for analyzing the measurement and control data according to preset configuration information so as to identify the dependency relationship in the measurement and control data and acquire the time sequence information of the measurement and control data;
the clustering unit 3 is used for clustering the measurement and control data according to the dependency relationship of the measurement and control data to generate data without coupling relationship;
the data sending unit 4 is used for sending the data without the coupling relation to different data processing nodes based on a dynamic load balancing strategy so as to enable the data processing nodes to process the data in parallel;
and the data output unit 5 is used for receiving the data processing results of the data processing nodes and outputting the data processing results in sequence according to the time sequence information.
The analysis unit 2 analyzes the acquired measurement and control data according to the preset configuration information, identifies the dependency relationship in the measurement and control data and acquires the time sequence information of the measurement and control data, and then, the clustering unit 3 carries out clustering operation on the measurement and control data according to the dependency relationship of the measurement and control data to generate data without coupling relationship, then the data sending unit 4 sends the data without coupling relationship to different data processing nodes based on a dynamic load balancing strategy, each data processing node processes the data in parallel, when receiving the data processing results of the respective data processing nodes, the data output unit 5 may output the data processing results in order according to the timing information, compared to the prior art, according to the embodiment of the invention, the dependency relationship in the measurement and control data can be identified without manual identification, so that the accuracy and the efficiency of data parallel processing can be improved.
Specifically, the clustering unit includes: the directed graph generation module and the clustering module: the directed graph generating module is used for generating a topological directed graph of the measurement and control data according to the dependency relationship of the measurement and control data; and the clustering module is used for clustering the data with the dependency relationship based on a topological sorting algorithm and a topological directed graph so as to generate data without coupling relationship.
In one embodiment, the apparatus further includes: and the data index number adding unit is used for adding data index numbers to each piece of data in the measurement and control data according to the time sequence information based on a time sequence index technology.
Correspondingly, the data output unit is specifically configured to: and outputting the data processing results in sequence according to the data index numbers.
For specific execution processes of the units and the modules, reference may be made to the description in the foregoing method embodiments, and details are not described here again.
In practical operation, the units and the modules may be combined or may be singly arranged, and the present invention is not limited thereto.
For a better understanding of embodiments of the present invention, an example is given below.
FIG. 6 is an exemplary architecture diagram of a data processing apparatus, as shown in FIG. 6, comprising: data analysis module, data distribution module, parallel processing module, order preserving module, wherein:
and the data analysis module is used for automatically finishing the extraction of the strong coupling dependency relationship and the time sequence information in the measurement and control data according to the preset configuration information and describing the extracted information into a data structure which can be identified by a computer.
The configuration information here includes: the parameter description information, the calculation formula, the precondition and the like required by the processing of the space measurement and control data.
In a specific implementation process, when the data analysis module processes the temperature type parameter a, a reference voltage parameter B needs to be used for calculation, and in configuration information, a calculation formula is described by using the parameter B; for another example, when the data analysis module processes the parameter C, the parameter C is processed only in a certain flight phase of the space measurement and control, the flight phase is usually described by using a certain parameter D, and the precondition in the configuration information of the parameter C includes a certain condition of the parameter D. When a certain parameter X has the above two conditions, the data analysis module needs to perform dependency analysis, and a general method is to perform regular expression matching on each parameter information to be processed, match whether processing conditions of other parameters Y exist in configuration information, if so, X depends on Y, and during actual processing, the processing of X is completed after Y is required to be processed based on timing requirements.
And the data distribution module is used for sequencing, clustering and distributing the data in sequence by using a topological sequencing algorithm according to the analyzed data dependency relationship, so that the accuracy of parallel processing on the premise of ensuring the data dependency relationship is ensured.
And the parallel processing module is used for distributing the data which are subjected to clustering distribution to different nodes for processing by using a dynamic load balancing strategy, so that the data processing performance is improved.
And the order-preserving module is used for inputting the processed data in a mode of parallel writing and sequential reading of results according to the extracted time sequence information, so that the data is ensured to enter first.
The example device of the embodiment of the invention provides a parallel processing architecture for strong coupling data, after external measurement and control data are received, the dependency relationship in the data can be analyzed according to preset configuration information, time sequence index information in the data is extracted, after the data are analyzed and extracted, a data distribution module carries out topological sorting and clustering division on the data to form a data block which can be independent and has no coupling relationship, then a parallel processing module carries out dynamic distribution on the processed data block according to the processing load condition to complete data parallel processing, an order preserving module writes data results which are processed in parallel into a data index tree structure in parallel and is responsible for data to be issued outwards, and the time sequence is guaranteed according to the index tree structure when the data are issued.
The present embodiment also provides an electronic device, which may be a desktop computer, a tablet computer, a mobile terminal, and the like, but is not limited thereto. In this embodiment, the electronic device may be implemented by referring to the above method embodiment and the data processing apparatus embodiment, and the contents thereof are incorporated herein, and repeated descriptions are omitted.
Fig. 7 is a schematic block diagram of a system configuration of an electronic apparatus 600 according to an embodiment of the present invention. As shown in fig. 7, the electronic device 600 may include a central processor 100 and a memory 140; the memory 140 is coupled to the central processor 100. Notably, this diagram is exemplary; other types of structures may also be used in addition to or in place of the structure to implement telecommunications or other functions.
In one embodiment, the data processing functions may be integrated into the central processor 100. The central processor 100 may be configured to control as follows:
step 101, acquiring measurement and control data from a plurality of systems.
And 102, analyzing the measurement and control data according to preset configuration information so as to identify the dependency relationship in the measurement and control data and acquire the time sequence information of the measurement and control data.
103, clustering the measurement and control data according to the dependency relationship of the measurement and control data to generate data without coupling relationship.
And 104, sending the data without the coupling relation to different data processing nodes based on the dynamic load balancing strategy so that the data processing nodes process the data in parallel.
And 105, receiving the data processing results of the data processing nodes, and outputting the data processing results in sequence according to the time sequence information.
As can be seen from the above description, in the electronic device provided in the embodiment of the present application, the acquired measurement and control data is analyzed according to the preset configuration information, the dependency relationship in the measurement and control data is identified, and the time sequence information of the measurement and control data is acquired, and then, clustering the measurement and control data according to the dependency relationship of the measurement and control data to generate data without coupling relationship, then sending the data without coupling relationship to different data processing nodes based on a dynamic load balancing strategy, processing the data by the data processing nodes in parallel, when receiving the data processing result of each data processing node, compared with the prior art, the data processing method and the device can identify the dependency relationship in the measurement and control data without manual identification, thereby improving the accuracy and the efficiency of data parallel processing.
In another embodiment, the data processing apparatus may be configured separately from the central processor 100, for example, the data processing apparatus may be configured as a chip connected to the central processor 100, and the data processing function is realized by the control of the central processor.
As shown in fig. 7, the electronic device 600 may further include: communication module 110, input unit 120, audio processing unit 130, display 160, power supply 170. It is noted that the electronic device 600 does not necessarily include all of the components shown in fig. 7; furthermore, the electronic device 600 may also comprise components not shown in fig. 7, which may be referred to in the prior art.
As shown in fig. 7, the central processor 100, sometimes referred to as a controller or operational control, may include a microprocessor or other processor device and/or logic device, the central processor 100 receiving input and controlling the operation of the various components of the electronic device 600.
The memory 140 may be, for example, one or more of a buffer, a flash memory, a hard drive, a removable media, a volatile memory, a non-volatile memory, or other suitable device. The information relating to the failure may be stored, and a program for executing the information may be stored. And the central processing unit 100 may execute the program stored in the memory 140 to realize information storage or processing, etc.
The input unit 120 provides input to the cpu 100. The input unit 120 is, for example, a key or a touch input device. The power supply 170 is used to provide power to the electronic device 600. The display 160 is used to display an object to be displayed, such as an image or a character. The display may be, for example, an LCD display, but is not limited thereto.
The memory 140 may be a solid state memory such as Read Only Memory (ROM), Random Access Memory (RAM), a SIM card, or the like. There may also be a memory that holds information even when power is off, can be selectively erased, and is provided with more data, an example of which is sometimes called an EPROM or the like. The memory 140 may also be some other type of device. Memory 140 includes buffer memory 141 (sometimes referred to as a buffer). The memory 140 may include an application/function storage section 142, and the application/function storage section 142 is used to store application programs and function programs or a flow for executing the operation of the electronic device 600 by the central processing unit 100.
The memory 140 may also include a data store 143, the data store 143 for storing data, such as contacts, digital data, pictures, sounds, and/or any other data used by the electronic device. The driver storage portion 144 of the memory 140 may include various drivers of the electronic device for communication functions and/or for performing other functions of the electronic device (e.g., messaging application, address book application, etc.).
The communication module 110 is a transmitter/receiver 110 that transmits and receives signals via an antenna 111. The communication module (transmitter/receiver) 110 is coupled to the central processor 100 to provide an input signal and receive an output signal, which may be the same as in the case of a conventional mobile communication terminal.
Based on different communication technologies, a plurality of communication modules 110, such as a cellular network module, a bluetooth module, and/or a wireless local area network module, may be provided in the same electronic device. The communication module (transmitter/receiver) 110 is also coupled to a speaker 131 and a microphone 132 via an audio processor 130 to provide audio output via the speaker 131 and receive audio input from the microphone 132 to implement general telecommunications functions. Audio processor 130 may include any suitable buffers, decoders, amplifiers and so forth. In addition, an audio processor 130 is also coupled to the central processor 100, so that recording on the local can be enabled through a microphone 132, and so that sound stored on the local can be played through a speaker 131.
Embodiments of the present invention further provide a computer-readable storage medium, on which a computer program is stored, where the computer program is executed by a processor to implement the steps of the data processing method.
In summary, the embodiment of the present invention provides an order-preserving scheme for parallel processing of measurement and control data, and aims to solve the problems of result disorder and processing errors caused by strong data source coupling and time sequence characteristics during parallel processing of data. The embodiment of the invention designs a serial-parallel combined processing architecture, realizes data serial processing with dependency relationship, does not have dependent source packet parallel processing, and exerts parallel processing performance to the maximum extent. Meanwhile, the embodiment of the invention provides a dynamic allocation strategy for data dependence, measurement and control data with a coupling relation are subjected to relevance division, dependent data are arranged in dependence data for pretreatment, and data without the dependence relation can be processed in parallel. In addition, the embodiment of the invention designs and processes the order-preserving output of the result, and realizes the consistency of the result and the input sequence.
The preferred embodiments of the present invention have been described above with reference to the accompanying drawings. The many features and advantages of the embodiments are apparent from the detailed specification, and thus, it is intended by the appended claims to cover all such features and advantages of the embodiments which fall within the true spirit and scope thereof. Further, since numerous modifications and changes will readily occur to those skilled in the art, it is not desired to limit the embodiments of the invention to the exact construction and operation illustrated and described, and accordingly, all suitable modifications and equivalents may be resorted to, falling within the scope thereof.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The principle and the implementation mode of the invention are explained by applying specific embodiments in the invention, and the description of the embodiments is only used for helping to understand the method and the core idea of the invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present invention.

Claims (10)

1. A method of data processing, the method comprising:
acquiring measurement and control data from a plurality of systems;
analyzing the measurement and control data according to preset configuration information so as to identify the dependency relationship in the measurement and control data and acquire the time sequence information of the measurement and control data;
clustering the measurement and control data according to the dependency relationship of the measurement and control data to generate data without coupling relationship;
based on a dynamic load balancing strategy, sending data without coupling relation to different data processing nodes so that the data processing nodes process the data in parallel;
and receiving the data processing results of the data processing nodes, and outputting the data processing results in sequence according to the time sequence information.
2. The method of claim 1, wherein clustering the measurement and control data according to the dependency relationship of the measurement and control data to generate data without coupling relationship comprises:
generating a topological directed graph of the measurement and control data according to the dependency relationship of the measurement and control data;
and clustering the data with the dependency relationship based on a topological sorting algorithm and a topological directed graph to generate data without coupling relationship.
3. The method of claim 1, wherein after obtaining timing information of the measurement and control data, the method further comprises:
and based on a time sequence indexing technology, adding a data index number to each piece of data in the measurement and control data according to the time sequence information.
4. The method of claim 3, wherein outputting data processing results in sequence according to the timing information comprises:
and outputting the data processing results in sequence according to the data index numbers.
5. A data processing apparatus, characterized in that the apparatus comprises:
the data acquisition unit is used for acquiring measurement and control data from a plurality of systems;
the analysis unit is used for analyzing the measurement and control data according to preset configuration information so as to identify the dependency relationship in the measurement and control data and acquire the time sequence information of the measurement and control data;
the clustering unit is used for clustering the measurement and control data according to the dependency relationship of the measurement and control data to generate data without coupling relationship;
the data sending unit is used for sending the data without the coupling relation to different data processing nodes based on a dynamic load balancing strategy so as to enable the data processing nodes to process the data in parallel;
and the data output unit is used for receiving the data processing results of the data processing nodes and outputting the data processing results in sequence according to the time sequence information.
6. The apparatus of claim 5, wherein the clustering unit comprises:
the directed graph generating module is used for generating a topological directed graph of the measurement and control data according to the dependency relationship of the measurement and control data;
and the clustering module is used for clustering the data with the dependency relationship based on a topological sorting algorithm and a topological directed graph so as to generate data without coupling relationship.
7. The apparatus of claim 5, further comprising:
and the data index number adding unit is used for adding data index numbers to each piece of data in the measurement and control data according to the time sequence information based on a time sequence index technology.
8. The apparatus of claim 7, wherein the data output unit is specifically configured to:
and outputting the data processing results in sequence according to the data index numbers.
9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the steps of the method of any of claims 1 to 4 are implemented when the processor executes the program.
10. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method of any one of claims 1 to 4.
CN202110032338.5A 2021-01-11 2021-01-11 Data processing method and device, electronic equipment and storage medium Pending CN112732788A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110032338.5A CN112732788A (en) 2021-01-11 2021-01-11 Data processing method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110032338.5A CN112732788A (en) 2021-01-11 2021-01-11 Data processing method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN112732788A true CN112732788A (en) 2021-04-30

Family

ID=75590291

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110032338.5A Pending CN112732788A (en) 2021-01-11 2021-01-11 Data processing method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN112732788A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102156782A (en) * 2011-04-14 2011-08-17 国电南瑞科技股份有限公司 Graph theory-based formula parallel calculation management method for electric power system
CN102902590A (en) * 2012-09-03 2013-01-30 南京师范大学 Parallel digital terrain analysis-oriented massive DEM (Digital Elevation Model) deploying and scheduling method
CN105426410A (en) * 2015-11-02 2016-03-23 东软集团股份有限公司 Data acquisition system and analytic method for same
CN110245010A (en) * 2019-06-21 2019-09-17 广州小鹏汽车科技有限公司 A kind of data processing method and device
US20200073712A1 (en) * 2018-08-30 2020-03-05 Baidu Online Network Technology (Beijing) Co., Ltd. Method, apparatus, device and medium for processing topological relation of tasks

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102156782A (en) * 2011-04-14 2011-08-17 国电南瑞科技股份有限公司 Graph theory-based formula parallel calculation management method for electric power system
CN102902590A (en) * 2012-09-03 2013-01-30 南京师范大学 Parallel digital terrain analysis-oriented massive DEM (Digital Elevation Model) deploying and scheduling method
CN105426410A (en) * 2015-11-02 2016-03-23 东软集团股份有限公司 Data acquisition system and analytic method for same
US20200073712A1 (en) * 2018-08-30 2020-03-05 Baidu Online Network Technology (Beijing) Co., Ltd. Method, apparatus, device and medium for processing topological relation of tasks
CN110245010A (en) * 2019-06-21 2019-09-17 广州小鹏汽车科技有限公司 A kind of data processing method and device

Similar Documents

Publication Publication Date Title
CN109769141B (en) Video generation method and device, electronic equipment and storage medium
CN109947426B (en) Application program generation method and device and electronic equipment
CN110377462B (en) Interface testing method and device and terminal equipment
CN109815448B (en) Slide generation method and device
CN110543301B (en) Method and device for generating jenkins code file
CN109408468A (en) Document handling method and device calculate equipment and storage medium
CN115048254B (en) Simulation test method, system, equipment and readable medium for data distribution strategy
CN112463634A (en) Software testing method and device under micro-service architecture
CN102426567A (en) Graphical editing and debugging system of automatic answer system
CN111367895A (en) Data migration method and device
CN110888972A (en) Sensitive content identification method and device based on Spark Streaming
CN111078573A (en) Test message generation method and device
CN112860264B (en) Method and device for reconstructing abstract syntax tree
CN113746840A (en) JSON format message generation method and device
CN112732788A (en) Data processing method and device, electronic equipment and storage medium
WO2023071566A1 (en) Data processing method and apparatus, computer device, computer-readable storage medium, and computer program product
CN109857576A (en) RPC interface call method and device
CN113419957A (en) Rule-based big data offline batch processing performance capacity scanning method and device
CN113360411A (en) Code coverage rate detection method and device based on parallel variation test
CN112905461A (en) Method and device for executing automatic interface test case
CN113760680A (en) Method and device for testing system pressure performance
CN108415981B (en) Data dimension generation method, device, equipment and computer readable storage medium
CN115208960B (en) Protocol switching method and related equipment
CN111526034A (en) Route centralized management method, device, electronic device and storage medium
CN111552479A (en) Mirror image file compiling method and device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination